Create-TF-entropy-features

Project Status: Active – The project has reached a stable, usable state and is being actively developed. License: CC BY 4.0

Create TF entropy features

This repository contains the Jupyter Notebook used to create three new Text-Fabric features.

The final feature files will be added to the package available at the tonyjurg/N1904addons repository.
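The production notebook defines how the features are actually computed. As a rough illustration only of what an entropy-based feature measures, the sketch below computes the Shannon entropy (in bits) of the labels observed for a single item. The function name `shannon_entropy` and the toy labels are hypothetical and are not taken from the notebook. Only the Python standard library is used.

```python
import math
from collections import Counter


def shannon_entropy(labels):
    """Return the Shannon entropy, in bits, of the empirical label distribution."""
    counts = Counter(labels)
    total = sum(counts.values())
    if total == 0:
        # An empty observation list carries no uncertainty by convention here.
        return 0.0
    # H = sum over labels of p * log2(1 / p), with p = count / total.
    return sum((n / total) * math.log2(total / n) for n in counts.values())


# Hypothetical toy data: labels observed for one word form in a corpus.
observations = ["Subject", "Subject", "Object", "Complement"]
print(shannon_entropy(observations))
```

An entropy of 0 means the item always carries the same label. Higher values mean the labels are spread more evenly, so the item on its own predicts the label less well.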

Production notebook

You can view the production notebook on nbviewer.org.

Alternatively, you can download it from the GitHub repository.

About Text-Fabric

The Text-Fabric framework is designed to facilitate the analysis and manipulation of large-scale textual data, particularly in the context of ancient languages and biblical texts. The engine of Text-Fabric is its powerful Python library, which provides a comprehensive set of tools for processing and querying structured text data efficiently. The software package is accessible at https://github.com/annotation/text-fabric.

Documentation

See the individual features.

Attribution and footnotes

The Greek base text is from the Nestle1904 Greek New Testament, edited by Eberhard Nestle and published in 1904 by the British and Foreign Bible Society:

Nestle, Eberhard. Η Καινή Διαθήκη Novum Testamentum Graece (New York: Fleming H. Revell Company, 1904). The 1913 reprint, transcribed by Diego Santos, is available here. All of this material is in the public domain.

The N1904-TF dataset is available under the MIT license. Formal reference:

Tony Jurg, Saulo de Oliveira Cantanhêde, & Oliver Glanz. (2024). CenterBLC/N1904: Nestle 1904 Text-Fabric data. Zenodo. DOI: 10.5281/zenodo.13117911.

License

This repository is released under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.

Citation

If you use this repository in your academic work, please cite it.