Skip to main content

M13: Multi-Translation & Schema Evolution — ingest/v0.2.0-alpha, content/v0.2.0-alpha

Version tag: ingest/v0.2.0-alpha, content/v0.2.0-alpha Phase: P0: Foundation (Phases 1--2), P2: Enhancement (Phases 3--4) Target: Weeks 5--26 Sprints: S2--S12


Phase Context

Goal: Establish symmetric Translation-node schema for multi-translation support, fix critical data corruption (wrong translation text on 31K Bible passages), recover ~350K silently-discarded data points, and progressively enrich the graph with word alignment, lexicon, and citation data.

Key constraint: This work addresses two CRITICAL bugs blocking all downstream features -- every Bible passage shows wrong text, and 9 translations have no storage mechanism. Must complete Phase 1--2 before M03 Scripture Reader can ship correct data.


ZenHub Configuration

FieldValue
MilestoneM13: Multi-Translation & Schema Evolution
Due Date2026-09-06
Default PipelineProduct Backlog
Primary Epic(s)Schema Foundation & Critical Fixes, Content Service Translation Support, Word Alignment & Lexicon Enrichment, Citation Extraction

Prerequisites

  • M01: Data Pipeline -- ingest pipeline must exist before modification (Stages 0--6 operational)
  • M00: Tech Prep -- FalkorDB local instance, structured logging

Epic: Schema Foundation & Critical Fixes (Phase 1)

Translation schema foundation, ID generators, slug registry, JST versification, pipeline rework.

Issues

IssueTitleStatusNotes
M13-001Translation Node Cypher Templates✅ DonePR #959
M13-002Translation ID Generators✅ DonePR #959
M13-003Translation Slug Registry Module✅ DonePR #959
M13-004Translation Slug Discovery✅ DonePR #959
M13-005JST Versification Support✅ DonePR #959
M13-006Scripture Text Pipeline Translation Support✅ DonePR #968
M13-007Dictionary Gloss Resolution✅ DonePR #972
M13-008Scholarly Metadata Fix✅ DonePR #972
M13-009Missing Secondary Indexes✅ DonePR #972
M13-010PendingPassage Monitoring✅ DonePR #972
M13-011Phase 1 Unit Tests✅ DonePR #978
M13-012Phase 1 Integration Tests✅ DonePR #978

Epic: Content Service Translation Support (Phase 2)

Content query migration, translation routes, Book/Chapter hierarchy, note anchor positions.

Issues

IssueTitleStatusNotes
M13-013Content Service Translation Queries✅ DonePR #982
M13-014Translation Registry Routes & Models✅ DonePR #982
M13-015Book/Chapter Hierarchy Pipeline✅ DonePR #982
M13-016Note Anchor Positions✅ DonePR #982
M13-017Phase 2 Unit Tests✅ DonePR #983
M13-018Phase 2 Integration Tests✅ DonePR #983

Epic: Word Alignment & Lexicon Enrichment (Phase 3)

TranslationWord pipeline, lexicon JSON properties, DefinitionSense and TranslationOccurrence graph nodes.

Issues

IssueTitleStatusNotes
M13-019TranslationWord Pipeline✅ DonePR #1344 (batch A1.6)
M13-020Lexicon Enrichment -- JSON Properties✅ DonePR #1344 (batch A1.6)
M13-021Lexicon Enrichment -- Graph Nodes✅ DonePR #1344 (batch A1.6)
M13-022Phase 3 Tests✅ DonePR A2.4

Epic: Citation Extraction (Phase 4)

Scripture reference parser, BD mentions, section citations extraction.

Issues

IssueTitleStatusNotes
M13-023Scripture Reference Parser✅ DonePR #1337 (batch A0.2)
M13-024BD Mentions Extraction Pipeline✅ DonePR #1345 (batch A1.5)
M13-025Section Citations Extraction Pipeline✅ DonePR #1345 (batch A1.5)
M13-026Phase 4 Tests✅ DonePR A2.4

Progress: 26 Done · 0 Partial · 0 To Do (100%)


Summary

MetricCount
Total Issues26
Sub-Issues4
Total Estimate (pts)~112
SprintsS2--S12
Dependencies (blocking)32
Dependencies (blocked by)32