Additional features for the N1904-TF, the syntactic annotated Text-Fabric dataset of the Greek New Testament.
About this datasetThis N1904addons repository provides a collection of supplementary features for the N1904-TF dataset that were created during my research projects. N1904-TF is a syntactically annotated version of the Nestle 1904 edition of the Greek New Testament.
These features are provided as is. Please take special note to the flags provided under the header ‘Feature status’, as some features may not be stable yet.
Morpheus: Incorporates both summary and detailed morphological data derived from the Morpheus analyzer, offering detailed morphological information.
Statistical: Includes features such as entropy calculations to assess the mapping from word to phrase function.
Cross-Referencing: Provides identifiers like OSIS IDs to facilitate cross-referencing with other biblical texts and resources.
Miscellaneous: Adds features like Penn Treebank-style display of sentences (this is experimental).
Text-Fabric is a powerful Python library and framework designed to facilitate the analysis and manipulation of large-scale textual data, particularly in the context of ancient languages and biblical texts. It provides a comprehensive set of tools for processing and querying structured text data efficiently. Text-Fabric was developed by Dirk Roorda. The software package is accessible at github.com/annotation/text-fabric.
This dataset is released under the Creative Commons Attribution 4.0 International (CC BY 4.0) license, allowing for broad use and adaptation with appropriate attribution.