Gap Inventory
This page catalogs the 16 identified data gaps in GospeLib's scripture content layer, ranked by severity.
Gap Table
| # | Gap | Severity | Rationale |
|---|---|---|---|
| 1 | Morphological tags (interlinear) | Critical | Existing interlinear data is 100% missing morphology — the primary scholarly feature |
| 2 | Source tokens (interlinear) | Critical | Existing interlinear data is 100% missing source tokens — renders interlinear incomplete |
| 3 | Transliteration (interlinear) | High | No transliteration data in existing interlinear — blocks accessibility for non-readers of Hebrew/Greek |
| 4 | Cross-references | High | No cross-reference data ingested — core navigation feature for scripture study |
| 5 | Septuagint (LXX) | High | Greek OT text not available — essential for scholarly OT study |
| 6 | Versification mapping | High | No mapping between English, Hebrew, Greek, Latin verse numbering systems |
| 7 | Person names database | Medium | No structured people data — needed for graph navigation and search |
| 8 | Place names / geocoding | Medium | No geographic data — needed for map features and place-based navigation |
| 9 | Vulgate (Latin) | Medium | Latin Bible text not available — relevant for historical/scholarly completeness |
| 10 | Additional translations | Medium | Only 9 translations — competitors offer dozens to hundreds |
| 11 | Dead Sea Scrolls | Low | ETCBC/dss provides word-level transcriptions under MIT license — viable for Phase 3 scholarly features |
| 12 | Aramaic Lexicon | Low | Composite approach via SEDRA IV (Apache 2.0) + Sefaria Jastrow (CC-BY-NC) provides substantial coverage |
| 13 | Extended Commentary | Low | CrossWire SWORD modules provide ~10 public-domain verse-aligned commentaries via SWORD→OSIS→JSON pipeline |
| 14 | Syntax / discourse analysis | Medium | No syntactic structure data — needed for advanced Greek/Hebrew study features |
| 15 | Synoptic Gospel parallels | Low | No structured pericope-level parallel mapping across Synoptic Gospels |
| 16 | OT quotations in NT | Low | No structured mapping of Old Testament passages quoted in the New Testament |
BLB Translation Inventory (Commercial Reference)
The following translations are available on BLB but not accessible for ingestion due to BLB's no-scraping policy. This table documents the scope of what exists commercially, informing our open-source sourcing priorities for Gap #10.
| Translation | License Status |
|---|---|
| KJV — King James Version | Public domain |
| ASV — American Standard Version | Public domain |
| YLT — Young's Literal Translation | Public domain |
| DBY — Darby Translation | Public domain |
| WEB — World English Bible | Public domain |
| HNV — Hebrew Names Version | Public domain |
| WLC — Westminster Leningrad Codex | Public domain |
| TR — Textus Receptus | Public domain |
| BES — Brenton's English Septuagint | Public domain |
| VUL — Latin Vulgate | Public domain |
| SVD — Smith & Van Dyck Arabic | Public domain |
| LXX — Septuagint (Rahlfs) | Non-commercial only |
| mGNT — Morphological Greek NT | Restricted |
| NKJV — New King James Version | Copyrighted (500-verse limit) |
| NLT — New Living Translation | Copyrighted (500-verse limit) |
| NIV — New International Version | Copyrighted (500-verse limit) |
| ESV — English Standard Version | Copyrighted (500-verse limit) |
| CSB — Christian Standard Bible | Copyrighted |
| NASB20 — NASB 2020 | Copyrighted (500-verse limit) |
| NASB95 — NASB 1995 | Copyrighted (500-verse limit) |
| LSB — Legacy Standard Bible | Copyrighted |
| AMP — Amplified Bible | Copyrighted (500-verse limit) |
| NET — New English Translation | Copyrighted |
| RSV — Revised Standard Version | Copyrighted (500-verse limit) |
| RVR60 — Reina-Valera 1960 (Spanish) | Copyrighted |
| NAV — Arabic New Arabic Version | Copyrighted |
Takeaway: Of BLB's 26+ translations, ~11 are public domain and available from open-source repositories (scrollmapper, ebible.org). The copyrighted translations require direct publisher licensing.
BLB Commentary Inventory (Commercial Reference)
BLB hosts 50+ commentary authors — most are individually copyrighted. Public-domain authors marked with ✅ may be obtainable from CCEL, Project Gutenberg, or similar archives.
- ✅ Matthew Henry (public domain)
- ✅ Jamieson, Fausset & Brown (public domain)
- ✅ John Calvin (public domain)
- ✅ John Wesley (public domain)
- ✅ C.H. Spurgeon (public domain)
- ✅ Martin Luther (public domain)
- ✅ Jonathan Edwards (public domain)
- ✅ John Trapp (public domain)
- ✅ John Bunyan (public domain)
- ✅ Alexander Maclaren (public domain)
- ✅ R.A. Torrey (public domain)
- ✅ Scofield Reference Bible Notes (public domain)
- ❌ David Guzik (copyrighted)
- ❌ Chuck Smith (copyrighted)
- ❌ John MacArthur (copyrighted)
- ❌ J. Vernon McGee (copyrighted)
- ❌ John Walvoord (copyrighted)
- ❌ …and 35+ additional copyrighted authors
Takeaway: ~12 commentary authors are public domain and could be sourced from open archives. Creating structured, verse-aligned datasets from these would require significant curation effort (see Gap #13).