Text-Fabric dataset for Greek New Testament based upon Nestle 1904 (Low Fat tree dataset)
About Text-FabricFeature group | Feature type | Data type | Available for node types | Feature status |
---|---|---|---|---|
Grid |
Node |
configuration |
word wg Sentence verse chapter Book |
✅ |
Types for text objects as they are represented by nodes.
type | kind | description |
---|---|---|
word |
slot | single word, fills a slot; |
wg |
functional | word groups, maybe with gaps |
Sentence |
functional | |
verse |
section | numbered unit of a chapter |
chapter |
section | numbered unit of a book |
Book |
section | named part of the Greek New Testament |
All objects have a type, which is just a label. Objects and their slots are represented in Text-Fabric as nodes. The information which object occupies which slot is stored in the edge feature oslots.
type | description |
---|---|
Section types | division in books, chapters, etc. |
Word type | all about the individual words |
Linguistic types | wordgroups (wg) that can represent phrases and clauses. |
The section types correspond to the various divisional units in the Bible.
The Greek New Testament is divided in books, books are divided in chapters, chapters are divided in verses.
The sectional types book
, chapter
, verse
specify features which indicate which book, chapter, verse their objects refer to.
A book
object carries the book feature, which contains the name of the book.
A chapter
object carries the chapter feature, which contains the number of the chapter.
It carries also the book feature to indicate the book of which it is a chapter.
Analogously, the verse
object carries the verse feature, which contains the number of the chapter, and the book and chapter features.
There is only one type for words, the word
type.
Word objects correspond to the smallest divisional units in the Greek New Testament dataset.
They are also identified with slots, because each slot is filled by a word and each word fills a slot.
Words are not identified with strings, because there are various
string representations of the words, none of which is canonical. All word occurrences are numbered
with a slot number.
Word occurrences corresponds to lexemes, i.e. dictionary entries, for which we have a separate object type. For the textual representation of lexemes we have a variety of features, in order to get their
Linguistic types correspond to syntactical entities such as sentences, clauses and phrases.
The functional object types in this dataset are sentence
and wg
; where a wordgroup can function either as a phrase or a clause.