N1904addons

Additional features for the N1904-TF, the syntactic annotated Text-Fabric dataset of the Greek New Testament.

About this dataset
Featureset
Loading the dataset
Use cases
Latest release

Project Status: Active – The project has reached a stable, usable state and is being actively developed. License: CC BY 4.0

N1904addons

This repository offers a set of features built specifically for use with the N1904 Text-Fabric dataset. All features are provided “as is” under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. Please take special notice to the flags provided under the header ‘Feature status’, as some features may not be stable yet.

Features

The dataset N1904addons contains the following feature groups which are included in the ‘standard’ set:

Feature group Description
Morpheus (Analyses, meta and summary) Morpheus morphological analyser summary, meta and analytic features.
Statistic Statistic features like entropy and TTR
Crossreference Cross reference features like OSIS ID
Miscellaneous Miscellaneous features like Penn Tree

An additional set of features from the ‘detailed’ set can optionaly be loaded:

Feature group Description
Morpheus (Details) Extensive set of features (more than 600) providing access to information elements in each Morpheus analytic block.

Adding the features

First you need to load the Text-Fabric package and make use available:

# Loading the Text-Fabric code
from tf.fabric import Fabric
from tf.app import use

When loading N1904-TF, by default, any additional features are not loaded. To include them, use the mod option during invocation, as shown below:

# Load the N1904-TF app and data with the additional features
A = use ("CenterBLC/N1904", version="1.0.0", mod="tonyjurg/N1904addons/tf/", hoist=globals())

To use this functionality, the Text-Fabric package must support downloading files from GitHub. If Text-Fabric was installed without GitHub functionality, you might encounter errors like the following when trying to load the additional features:

The requested data is not available offline
	~/text-fabric-data/github/tonyjurg/N1904addons/tf/1.0.0 not found
Backend provider github not supported.
Cannot reach online data on github
Try installing text-fabric one of the following:
pip install text-fabric[github]
pip install text-fabric[all]

To resolve this issue, ensure that GitHub support is added to Text-Fabric by running:

!pip install text-fabric[github]

Since GitHub implemented a API rate limit of 60 requests per hour, it is recommended to use a personal access token to increase this rate limit. Refer to the following resources for guidance:

Tools used

The standard set of tools (Python documentation, tech sites like stackoverflow, and Python syntax checkers like Pythonium) were used to create this package. Furthermore, for the creation of a subset of features, also the Anaconda Assistant (using OpenAI as backend) and/or GitHub Copilot in Visual Studio were used to debug and/or optimize parts of the code. See the individual notebooks for details.

License

This software and data are provided ‘as-is’ under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. It comes without any express or implied warranty. In no event shall the authors be liable for any claim, damages or other liability, whether in an action of contract, tort or otherwise, arising from, out of or in connection with the software or the use or other dealings in the software.

See also LICENSE.md on the GitHub repository.

Citation

When citing this dataset, please use following BibTeX:

@software{Jurg_N1904addons_-_Additional_2025,
author = {Jurg, Tony},
doi = {TBD},
month = apr,
title = ,
url = {https://github.com/tonyjurg/N1904addons},
version = {0.1},
year = {2025}
}