TextFabric dataset for Greek New Testament based upon Nestle 1904 (GBI Nodes dataset)
About Text-FabricFeature group | Feature type | Data type | Available for node types |
---|---|---|---|
Grid |
configuration | see below |
Definition of object types.
type | kind | description |
---|---|---|
word |
slot | single word, fills a slot; sometimes words are not separated by a space |
phrase |
functional | phrase, maybe with gaps |
clause |
functional | clause, maybe with gaps |
sentence |
functional | clause, maybe with gaps |
verse |
section | numbered unit of a chapter |
chapter |
section | numbered unit of a book |
book |
section | named part of the Bible |
All objects have a type, which is just a label. Objects and their slots are represented in Text-Fabric as nodes. The information which object occupies which slot is stored in the edge feature oslots.
type | description |
---|---|
Section types | division in books, chapters, etc |
Word type | all about the individual words |
Linguistic types | phrases, clauses, etc |
The section types correspond to the various divisional units in the Bible.
The Greek New Testament is divided in books, books are divided in chapters, chapters are divided in verses, and verses in half-verses.
The sectional types
book
, chapter
, verse
, and half_verse
specify features which indicate which book, chapter, verse, half-verse their objects refer to.
A book
object carries the book feature, which contains the name of the book.
A chapter
object carries the chapter feature, which contains the number of the chapter.
It carries also the book feature to indicate the book of which it is a chapter.
Analogously, the verse
object carries the verse feature, which contains the number of the chapter,
and the book and chapter features.
There is only one type for words, the word
type.
Word objects correspond to the smallest divisional units in the BHSA dataset.
They are also identified with slots, because each slot is filled by a word and each word fills a slot.
Words are not identified with strings, because there are various
string representations of the words, none of which is canonical. All word occurrences are numbered
with a slot number.
Linguistic types correspond to syntactical entities such as sentences, clauses and phrases.