Design phase output for digger — no implementation yet. Research (docs/research/): six findings docs (Meilisearch, local-model tooling incl. the existing arabic-ocr setup, Office/legacy, audio/video, frontend/UX, Forgejo CI + Windows runner) plus SYNTHESIS.md. Design (docs/decisions/): the Canonical Document IR JSON Schema v1.0 (the contract) with worked examples, the concrete Meilisearch settings, and ADRs 0001–0010 covering architecture/layering, the IR, index design (single index, chunk-granularity collapsed by parent_id), chunking, model backends + Ollama deployment, conversion routing, the read-side SearchProvider + HTMX UI, dedup/StateStore/incremental/reindex, Docker-Compose packaging, and layered CI with a native Windows runner. Confirmed decisions baked in: Arabic+English; one document per path; chunk long docs in v1; vectors designed-for but switched off; Ollama as a host service; Windows CI on a KVM VM. Also adds project README, CLAUDE.md, the brief, and .gitignore. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
39 lines
400 B
Text
39 lines
400 B
Text
# --- Editors / IDEs ---
|
|
.idea/
|
|
.vscode/
|
|
*.iml
|
|
.obsidian/
|
|
**/.obsidian/
|
|
|
|
# --- Python ---
|
|
__pycache__/
|
|
*.py[cod]
|
|
*.egg-info/
|
|
.eggs/
|
|
build/
|
|
dist/
|
|
.venv/
|
|
venv/
|
|
*_env/
|
|
.mypy_cache/
|
|
.ruff_cache/
|
|
.pytest_cache/
|
|
.coverage
|
|
coverage.xml
|
|
htmlcov/
|
|
|
|
# --- Secrets / local config ---
|
|
.env
|
|
.env.*
|
|
!.env.example
|
|
|
|
# --- Runtime / data ---
|
|
*.log
|
|
/data/
|
|
ir-output/
|
|
*.ms/
|
|
meili_data/
|
|
|
|
# --- OS ---
|
|
.DS_Store
|
|
Thumbs.db
|