Create-TF-stat-features

Project Status: WIP – Initial development is in progress, but there has not yet been a stable, usable release suitable for the public. License: CC BY 4.0

Create TF statistic features

This repository contains the Jupyter Notebook used to create the following Text-Fabric features:

More statistic features may be added in the future.

The final feature files will be added to the package available at the tonyjurg/N1904addons repository.

Production notebook

You can view the production notebook on nbviewer.org.

Alternative, you can also download it from the GitHub repository.

About Text-Fabric

The Text-Fabric framework is designed to facilitate the analysis and manipulation of large-scale textual data, particularly in the context of ancient languages and biblical texts. The engine of Text-Fabric is its powerful Python library, which provides a comprehensive set of tools for processing and querying structured text data efficiently. The software package is accessible at https://github.com/annotation/text-fabric.

Attribution and footnotes

The Greek base text is from Nestle1904 Greek New Testament, edited by Eberhard Nestle, published in 1904 by the British and Foreign Bible Society:

Nestle, Eberhard. Η Καινή Διαθήκη Novum Testamentum Graece (New York: Fleming H. Revell Company, 1904). The 1913 reprint is available here, which was transcribed by Diego Santos. All this material is in Public domain.

The N1904-TF dataset available under MIT license. Formal reference:

Tony Jurg, Saulo de Oliveira Cantanhêde, & Oliver Glanz. (2024). CenterBLC/N1904: Nestle 1904 Text-Fabric data. Zenodo. DOI: 10.5281/zenodo.13117911.

License

This notebook is released under the Creative Commons Attribution 4.0 International (CC BY 4.0)

Citation

If you use this repository in your academic work, please cite it.