digger/.gitignore
Randa 5cc8c99109 docs: research findings and v1 design (IR contract, index, ADRs)
Design phase output for digger — no implementation yet.

Research (docs/research/): six findings docs (Meilisearch, local-model
tooling incl. the existing arabic-ocr setup, Office/legacy, audio/video,
frontend/UX, Forgejo CI + Windows runner) plus SYNTHESIS.md.

Design (docs/decisions/): the Canonical Document IR JSON Schema v1.0
(the contract) with worked examples, the concrete Meilisearch settings,
and ADRs 0001–0010 covering architecture/layering, the IR, index design
(single index, chunk-granularity collapsed by parent_id), chunking,
model backends + Ollama deployment, conversion routing, the read-side
SearchProvider + HTMX UI, dedup/StateStore/incremental/reindex,
Docker-Compose packaging, and layered CI with a native Windows runner.

Confirmed decisions baked in: Arabic+English; one document per path;
chunk long docs in v1; vectors designed-for but switched off; Ollama as
a host service; Windows CI on a KVM VM.

Also adds project README, CLAUDE.md, the brief, and .gitignore.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
2026-07-01 03:14:06 +04:00

39 lines
400 B
Text

# --- Editors / IDEs ---
.idea/
.vscode/
*.iml
.obsidian/
**/.obsidian/
# --- Python ---
__pycache__/
*.py[cod]
*.egg-info/
.eggs/
build/
dist/
.venv/
venv/
*_env/
.mypy_cache/
.ruff_cache/
.pytest_cache/
.coverage
coverage.xml
htmlcov/
# --- Secrets / local config ---
.env
.env.*
!.env.example
# --- Runtime / data ---
*.log
/data/
ir-output/
*.ms/
meili_data/
# --- OS ---
.DS_Store
Thumbs.db