Text-Fabric dataset for Greek New Testament based upon Nestle 1904 (Low Fat tree dataset)
About Text-Fabric

Feature group | Feature type | Data type | Available for node types | Feature status |
---|---|---|---|---|
Orthographic | Node | string | word | ✅ |
Normalized Greek text (accents and diacritics changed to their standard forms). Trailing punctuation is also removed.
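The effect of this kind of normalization can be illustrated with a minimal Python sketch. It is not the procedure used to build the dataset (the feature values are taken directly from the source XML). It assumes, for illustration only, that "standard form" means replacing a grave accent with an acute one and that trailing punctuation is limited to a few common marks. The function name `normalize_word` and the constant `TRAILING_PUNCTUATION` are hypothetical.

```python
import unicodedata

GRAVE = "\u0300"  # combining grave accent (varia)
ACUTE = "\u0301"  # combining acute accent (oxia/tonos)

# Assumed set of trailing marks: period, comma, Greek question mark,
# middle dot and ano teleia. The dataset's real set may differ.
TRAILING_PUNCTUATION = ".,;\u037e\u00b7\u0387"


def normalize_word(word: str) -> str:
    """Approximate a normalized word form (illustrative assumption only)."""
    # Drop punctuation attached to the end of the word.
    stripped = word.rstrip(TRAILING_PUNCTUATION)
    # Decompose so that accents become separate combining characters.
    decomposed = unicodedata.normalize("NFD", stripped)
    # Swap grave for acute, then recompose into precomposed characters.
    return unicodedata.normalize("NFC", decomposed.replace(GRAVE, ACUTE))


print(normalize_word("Ἀρχὴ"))   # first word of Mark 1:1, written with a grave accent
print(normalize_word("Θεοῦ."))  # last word of Mark 1:1, with trailing period
```

Working on the decomposed (NFD) form keeps the sketch short, because the accent can be replaced without a lookup table of precomposed Greek characters.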
See also the following related features:
The following table shows the difference between these features, using Mark 1:1 as an example:
Use the option fmt='text-normalized' to print results in normalized format. The following example prints Mark 1:1 in normalized format:
T.text(139200,fmt='text-normalized')
Ἀρχή τοῦ εὐαγγελίου Ἰησοῦ Χριστοῦ Υἱοῦ Θεοῦ.
See this Jupyter notebook for usage examples.
Taken from the XML attribute normalized of tag w (word).