This repository contains the Jupyter Notebook used to create a new Text-Fabric feature betacode.
The final feature file (betacode.tf) was added to the package available at the tonyjurg/N1904addons repository.
You can view the production notebook on nbviewer.org.
Alternatively, you can download it from the GitHub repository.
Text-Fabric is a framework for analyzing and manipulating large text corpora, particularly ancient-language and biblical texts. Its core is a Python library that provides tools for processing and querying structured text data efficiently. The software is available at https://github.com/annotation/text-fabric.
Background information on betacode can be found in the following resources:
The Greek base text is the Nestle 1904 Greek New Testament, edited by Eberhard Nestle and published in 1904 by the British and Foreign Bible Society:
Nestle, Eberhard. Η Καινή Διαθήκη Novum Testamentum Graece (New York: Fleming H. Revell Company, 1904). The 1913 reprint, transcribed by Diego Santos, is available here. All of this material is in the public domain.
Beta‑Code syntax follows the TLG/Perseus convention: Thesaurus Linguae Graecae® / Perseus Project spec.
The code for converting between Unicode and Beta Code is available in the GitHub repository perseids-tools/beta-code-py. The library is released under the MIT license.
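To give a feel for what the Unicode-to-Beta-Code mapping looks like, here is a minimal sketch using only the Python standard library. It is not the implementation used by beta-code-py or by the notebook: the function name `to_betacode_sketch` and both lookup tables are hypothetical, and the sketch assumes lowercase input, maps final sigma to a plain `s`, and ignores capitals, diaeresis, and punctuation conventions.

```python
import unicodedata

# Hypothetical lookup table: lowercase Greek base letters only.
# Final sigma is mapped to plain "s" (an assumption of this sketch).
LETTERS = {
    "α": "a", "β": "b", "γ": "g", "δ": "d", "ε": "e", "ζ": "z",
    "η": "h", "θ": "q", "ι": "i", "κ": "k", "λ": "l", "μ": "m",
    "ν": "n", "ξ": "c", "ο": "o", "π": "p", "ρ": "r", "σ": "s",
    "ς": "s", "τ": "t", "υ": "u", "φ": "f", "χ": "x", "ψ": "y",
    "ω": "w",
}

# Combining marks that appear after NFD decomposition of polytonic Greek.
MARKS = {
    "\u0313": ")",   # smooth breathing
    "\u0314": "(",   # rough breathing
    "\u0301": "/",   # acute accent
    "\u0300": "\\",  # grave accent
    "\u0342": "=",   # circumflex
    "\u0345": "|",   # iota subscript
}


def to_betacode_sketch(text):
    """Convert lowercase polytonic Greek to Beta Code (simplified subset)."""
    # NFD splits each precomposed character into base letter + combining marks,
    # so every code point can be translated independently.
    decomposed = unicodedata.normalize("NFD", text)
    # Characters outside both tables (spaces, punctuation) pass through unchanged.
    return "".join(LETTERS.get(ch, MARKS.get(ch, ch)) for ch in decomposed)


print(to_betacode_sketch("ἐν ἀρχῇ"))
```

For real data, a maintained converter such as beta-code-py is the better choice, since it covers the parts of the convention this sketch leaves out.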
The N1904-TF dataset is available under the MIT license. Formal reference:
Tony Jurg, Saulo de Oliveira Cantanhêde, & Oliver Glanz. (2024). CenterBLC/N1904: Nestle 1904 Text-Fabric data. Zenodo. DOI: 10.5281/zenodo.13117911.
This notebook is released under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.
If you use this repository in your academic work, please cite it.