Text-Fabric dataset for Greek New Testament based upon Nestle 1904 (Low Fat tree dataset)
This Text-Fabric dataset contains the following node types:

- book nodes
- chapter nodes
- verse nodes
- sentence nodes
- clause nodes
- wg (wordgroup) nodes
- word nodes

All node features are listed below:

Feature | Feature group | Data type | Description | Examples |
---|---|---|---|---|
book | Sectional | string | Book name (fully spelled out) | Luke Jude |
bookshort | Sectional | string | Book name (abbreviated) | MAT MAR … REV |
booknumber | Sectional | integer | NT book number (Matthew=1, …, Revelation=27) | 3 8 |
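
The `booknumber` feature follows the canonical order of the New Testament books. The mapping can be illustrated with a minimal sketch. The list `NT_BOOKS` and the helpers `book_number` and `book_name` are hypothetical names introduced here, and the spellings of the book names are an assumption: the values actually stored in the `book` feature may be spelled differently.

```python
# Canonical order of the 27 New Testament books.
# ASSUMPTION: these spellings are for illustration only; the dataset's
# `book` feature may use a different spelling convention.
NT_BOOKS = [
    "Matthew", "Mark", "Luke", "John", "Acts", "Romans",
    "1 Corinthians", "2 Corinthians", "Galatians", "Ephesians",
    "Philippians", "Colossians", "1 Thessalonians", "2 Thessalonians",
    "1 Timothy", "2 Timothy", "Titus", "Philemon", "Hebrews", "James",
    "1 Peter", "2 Peter", "1 John", "2 John", "3 John", "Jude",
    "Revelation",
]


def book_number(name: str) -> int:
    """Return the 1-based book number (Matthew=1, ..., Revelation=27)."""
    return NT_BOOKS.index(name) + 1


def book_name(number: int) -> str:
    """Return the book name for a 1-based book number."""
    if not 1 <= number <= len(NT_BOOKS):
        raise ValueError(f"book number out of range: {number}")
    return NT_BOOKS[number - 1]


if __name__ == "__main__":
    print(book_number("Luke"))  # 3
    print(book_name(27))        # Revelation
```

With this ordering, the example values `3` and `8` in the table correspond to Luke and 2 Corinthians.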

Feature | Feature group | Data type | Description | Examples |
---|---|---|---|---|
book | Sectional | string | Book name (fully spelled out) | Luke Jude |
chapter | Sectional | string | Chapter | 1 28 |

Feature | Feature group | Data type | Short description | Examples |
---|---|---|---|---|
book | Sectional | string | Book name (fully spelled out) | Luke Jude |
chapter | Sectional | string | Chapter | 1 28 |
verse | Sectional | string | Verse | 2 26 |

Feature | Feature group | Data type | Short description | Examples |
---|---|---|---|---|
book | Sectional | string | Book name (fully spelled out) | Luke Jude |
chapter | Sectional | string | Chapter | 1 28 |
headverse | Sectional | string | Verse where the sentence starts | 2 26 |
sentence | Sectional | integer | Sentence number (counted per chapter) | 9 |

Feature | Feature group | Data type | Description | Examples |
---|---|---|---|---|
junction | Syntactic | string | Junction information | apposition ` ` |
wgclass | Syntactic | string | Wordgroup class | np cl |
wglevel | Syntactic | string | Wordgroup level | |
wgnum | Sectional | string | Wordgroup number | |
wgrole | Syntactic | string | Wordgroup role | s o apposition |
wgrule | Syntactic | string | Wordgroup rule | PrepNp Conj-CL |
wgtype | Syntactic | string | Wordgroup type | group apposition |

Feature | Feature group | Data type | Description | Examples |
---|---|---|---|---|
after | Orthographic | string | Space or punctuation after word | |
book | Sectional | string | Short book abbreviation | |
bookshort | Sectional | string | Book name (abbreviated) | |
booknumber | Sectional | integer | NT book number (Matthew=1, …, Revelation=27) | |
case | Morphological | string | Grammatical case | nominative accusative |
chapter | Sectional | integer | Chapter number inside book | |
degree | Morphological | string | Degree of an adjective | ` ` comparative superlative |
morph | Morphological | string | Morphological tag (Sandborg-Petersen morphology) | |
gloss | Lexical | string | English gloss | |
gn | Morphological | string | Grammatical gender | masculine feminine neuter |
lemma | Lexical | string | Lexeme (lemma) | |
lex_dom | Lexical | string | Lexical domain according to the Semantic Dictionary of Biblical Greek (SDBG) | 092004 |
ln | Lexical | string | Louw-Nida lexical classification | 93.169a |
markafter | Orthographic | string | Text-critical marker after word | - , ) , ] , ]] |
markbefore | Orthographic | string | Text-critical marker before word | - , ( , [ , [[ |
markorder | Orthographic | string | Order of punctuation and text-critical marker | |
monad | Sectional | integer | Monad (word order in corpus) | 1 .. 137779 |
mood | Morphological | string | Grammatical mood of a verb | indicative optative |
nodeID | Sectional | integer | Node ID (as in the XML source data) | |
normalized | Lexical | string | Surface word stripped of punctuation | |
nu | Morphological | string | Grammatical number | singular plural |
number | Morphological | string | Grammatical number of the verb | plural |
orig_order | Sectional | integer | Original order (word order in XML file) | 1 .. 137779 |
person | Morphological | string | Grammatical person of the verb | first second third |
sentence | Sectional | integer | Sentence number (counted per chapter) | |
sp | Morphological | string | Part of speech (abbreviated) | |
sp_full | Syntactic | string | Part of speech (long description) | |
strongs | Syntactic | string | Strong's number | |
subj_ref | Relational | string | Subject reference (to nodeID in XML source data) | |
tense | Morphological | string | Grammatical tense of the verb | present aorist |
type | Morphological | string | Grammatical type of noun or pronoun | common personal |
unicode | Orthographic | string | Word as it appears in the text (in Unicode) | λόγος |
verse | Sectional | integer | Verse number inside chapter | |
voice | Morphological | string | Grammatical voice of the verb | active passive |
word | Orthographic | string | Word as it appears in the text | λόγος |
wordrole | Syntactic | string | Word role in the text (abbreviated) | |
wordrolelong | Syntactic | string | Word role in the text | Adverbial Subject Verbal |
word | Node | word | Word as it appears in the text | λόγος |
wordtranslit | Orthographic | string | Word transliterated to Latin characters | logos |
wordunacc | Orthographic | string | Word without accents | λογος |
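
The relationship between the `unicode` and `wordunacc` examples (λόγος and λογος) can be illustrated with Unicode normalization. This is only a sketch of one way to remove diacritics and is not necessarily how the dataset's values were produced. The helper name `strip_accents` is hypothetical, and treating breathing marks and other combining marks the same way as accents is an assumption.

```python
import unicodedata


def strip_accents(word: str) -> str:
    """Remove all combining diacritics from a Greek word.

    ASSUMPTION: every combining mark (accents, breathings, diaeresis,
    iota subscript) is dropped; the dataset's `wordunacc` feature may
    follow different rules.
    """
    # NFD splits each precomposed letter into base letter + combining marks.
    decomposed = unicodedata.normalize("NFD", word)
    # combining() returns a non-zero class for combining marks only.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Recompose so the result is in the usual NFC form.
    return unicodedata.normalize("NFC", stripped)


if __name__ == "__main__":
    print(strip_accents("λόγος"))  # λογος
```

Decomposing first matters because accented Greek letters are usually stored as single precomposed code points, so the accent cannot be filtered out until the letter has been split into its parts.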