Additional features for the N1904-TF, the syntactic annotated Text-Fabric dataset of the Greek New Testament.
About this datasetThis repository offers a set of features built specifically for use with the N1904 Text-Fabric dataset. All features are provided “as is” under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. Please take special notice to the flags provided under the header ‘Feature status’, as some features may not be stable yet.
The dataset N1904addons contains the following feature groups which are included in the ‘standard’ set:
Feature group | Description |
---|---|
Morpheus (Analyses, meta and summary) | Morpheus morphological analyser summary, meta and analytic features. |
Statistic | Statistic features like entropy and TTR |
Crossreference | Cross reference features like OSIS ID |
Miscellaneous | Miscellaneous features like Penn Tree |
An additional set of features from the ‘detailed’ set can optionaly be loaded:
Feature group | Description |
---|---|
Morpheus (Details) | Extensive set of features (more than 600) providing access to information elements in each Morpheus analytic block. |
First you need to load the Text-Fabric package and make use
available:
# Loading the Text-Fabric code
from tf.fabric import Fabric
from tf.app import use
When loading N1904-TF, by default, any additional features are not loaded. To include them, use the mod
option during invocation, as shown below:
# Load the N1904-TF app and data with the additional features
A = use ("CenterBLC/N1904", version="1.0.0", mod="tonyjurg/N1904addons/tf/", hoist=globals())
To use this functionality, the Text-Fabric package must support downloading files from GitHub. If Text-Fabric was installed without GitHub functionality, you might encounter errors like the following when trying to load the additional features:
The requested data is not available offline
~/text-fabric-data/github/tonyjurg/N1904addons/tf/1.0.0 not found
Backend provider github not supported.
Cannot reach online data on github
Try installing text-fabric one of the following:
pip install text-fabric[github]
pip install text-fabric[all]
To resolve this issue, ensure that GitHub support is added to Text-Fabric by running:
!pip install text-fabric[github]
Since GitHub implemented a API rate limit of 60 requests per hour, it is recommended to use a personal access token to increase this rate limit. Refer to the following resources for guidance:
The standard set of tools (Python documentation, tech sites like stackoverflow, and Python syntax checkers like Pythonium) were used to create this package. Furthermore, for the creation of a subset of features, also the Anaconda Assistant (using OpenAI as backend) and/or GitHub Copilot in Visual Studio were used to debug and/or optimize parts of the code. See the individual notebooks for details.
This software and data are provided ‘as-is’ under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. It comes without any express or implied warranty. In no event shall the authors be liable for any claim, damages or other liability, whether in an action of contract, tort or otherwise, arising from, out of or in connection with the software or the use or other dealings in the software.
See also LICENSE.md on the GitHub repository.
When citing this dataset, please use following BibTeX:
@software{Jurg_N1904addons_-_Additional_2025,
author = {Jurg, Tony},
doi = {TBD},
month = apr,
title = ,
url = {https://github.com/tonyjurg/N1904addons},
version = {0.1},
year = {2025}
}