Skip to main content

MorphGNT SBLGNT

  • Repository: morphgnt/sblgnt
  • Maintainer: MorphGNT project (James Tauber)
  • License: CC-BY-SA 3.0 — allows sharing and adaptation with attribution and share-alike. The share-alike clause means derivative works must use the same or compatible license. Compatible with open-source distribution but imposes share-alike on derived datasets.
  • Suitability Score: ⭐⭐⭐⭐ (4/5)

Coverage

Format: Space-separated text files. One file per NT book (27 files). Each line represents one word with 7 columns:

  1. bcvwp — book/chapter/verse/word/position code
  2. ccat-pos — part-of-speech code
  3. ccat-parse — full morphological parsing code
  4. robinson — Robinson morphological code
  5. text — text as it appears
  6. word — word form
  7. normalized — normalized form
  8. lemma — dictionary lemma

All 27 New Testament books. Version 6.12. Complete word-level morphological analysis.

Quality

High. The SBLGNT (Society of Biblical Literature Greek New Testament) is a well-regarded critical text. Morphological parsing has been refined through multiple versions. Includes both CCAT and Robinson coding systems.

Gaps Filled

  • ✅ Morphological tags (Greek NT) — comprehensive per-word analysis
  • ✅ Source tokens (Greek NT) — individual word forms with lemmas
  • 🔶 Transliteration — not included but derivable from normalized forms via algorithm

Integration Notes

  • The CC-BY-SA 3.0 share-alike clause is the main consideration — need to verify compatibility with GospeLib's eventual license
  • Data maps directly to enriching existing :InterlinearWord nodes for Greek NT — add pos, parsing, lemma properties
  • Dual morphological coding (CCAT + Robinson) provides redundancy and cross-validation
  • Word position codes (bcvwp) provide precise alignment to verse text
  • Can serve as a cross-check against STEPBible TAGNT data
  • Fits as an enrichment sub-stage in the existing interlinear pipeline stage