MorphGNT SBLGNT
- Repository: morphgnt/sblgnt
- Maintainer: MorphGNT project (James Tauber)
- License: CC-BY-SA 3.0 — allows sharing and adaptation with attribution and share-alike. The share-alike clause means derivative works must use the same or compatible license. Compatible with open-source distribution but imposes share-alike on derived datasets.
- Suitability Score: ⭐⭐⭐⭐ (4/5)
Coverage
Format: Space-separated text files. One file per NT book (27 files). Each line represents one word with 7 columns:
bcvwp— book/chapter/verse/word/position codeccat-pos— part-of-speech codeccat-parse— full morphological parsing coderobinson— Robinson morphological codetext— text as it appearsword— word formnormalized— normalized formlemma— dictionary lemma
All 27 New Testament books. Version 6.12. Complete word-level morphological analysis.
Quality
High. The SBLGNT (Society of Biblical Literature Greek New Testament) is a well-regarded critical text. Morphological parsing has been refined through multiple versions. Includes both CCAT and Robinson coding systems.
Gaps Filled
- ✅ Morphological tags (Greek NT) — comprehensive per-word analysis
- ✅ Source tokens (Greek NT) — individual word forms with lemmas
- 🔶 Transliteration — not included but derivable from normalized forms via algorithm
Integration Notes
- The CC-BY-SA 3.0 share-alike clause is the main consideration — need to verify compatibility with GospeLib's eventual license
- Data maps directly to enriching existing
:InterlinearWordnodes for Greek NT — addpos,parsing,lemmaproperties - Dual morphological coding (CCAT + Robinson) provides redundancy and cross-validation
- Word position codes (
bcvwp) provide precise alignment to verse text - Can serve as a cross-check against STEPBible TAGNT data
- Fits as an enrichment sub-stage in the existing interlinear pipeline stage