|
|
|
The file benson-glosses.xml
defines Professor Larry Benson's glosses for the Chaucer corpus. The standard NU file is:
<?xml version="1.0" encoding="utf-8"?> <WordHoardBensonGlosses> <bensonPartsOfSpeech> <pos id="0" tag="?">?</pos> <pos id="1" tag="abbrev">abbreviation</pos> ... <pos id="169" tag="for">for in 'for to'</pos> <pos id="171" tag="adv#of">adverbial use of 'of'</pos> </bensonPartsOfSpeech> <bensonLemmas> <lemma id="1" homonym="0"> <word>+</word> <wordClass>plussign</wordClass> <definition></definition> <comment></comment> <oedLemma></oedLemma> </lemma> <lemma id="2" homonym="0"> <word>--</word> <wordClass>dash</wordClass> <definition></definition> <comment></comment> <oedLemma></oedLemma> </lemma> ... <lemma id="10702" homonym="0"> <word>zodiak</word> <wordClass>n</wordClass> <definition>zodiac,</definition> <comment>s.v. zodiac sb. (a.) OED.</comment> <oedLemma>zodiac</oedLemma> </lemma> <lemma id="11001" homonym="1"> <word>Dite</word> <wordClass>n</wordClass> <definition>Dictys Cretensis, historian of the Trojan War,</definition> <comment>proper n.; not in MED.</comment> <oedLemma></oedLemma> </lemma> </bensonLemmas> <bensonLemPos> <lemPos id="1" lemma="1" pos="97"/> <lemPos id="2" lemma="4" pos="88"/> ... <lemPos id="17607" lemma="10702" pos="68"/> <lemPos id="17608" lemma="11001" pos="74"/> </bensonLemPos> </WordHoardBensonGlosses>
The Benson gloss definition file has the following elements:
WordHoardBensonGlosses
. The root element.
Children:
bensonPartsOfSpeech
(1). Benson's parts of speech.bensonLemmas
(1). Benson's lemmas.bensonLemPos
(1). Benson's lemma and part of speech combinations.bensonPartsOfSpeech
. Benson's parts of speech.
Children:
pos
(0..n). Parts of speech.pos
. A part of speech.
Attributes:
id
. Required unique numeric id.tag
. Required tag.Children:
TEXT
(0..1). Optional description.bensonLemmas
. Benson's lemmas.
Children:
lemma
(0..n). Lemmas.lemma
. A lemma.
Attributes:
id
. Required unique numeric id.homonym
. An optional homonym number.Children:
word
(1). The lemma word.wordClass
(1). The word class.definition
(0..1). The definition.comment
(0..1). A comment.oedLemma
(0..1). The OED lemma.word
. A lemma word.
Children:
TEXT
(1). The lemma word.wordClass
. A word class.
Children:
TEXT
(1). The word class.definition
. A definition.
Children:
TEXT
(0..1). The definition.comment
. A comment.
Children:
TEXT
(0..1). The comment.oedLemma
. An OED lemma.
Children:
TEXT
(0..1). The OED lemma.bensonLemPos
. Benson's lemma and part of speech combinations.
Children:
lemPos
(0..n). Lemma and part of speech combinations.lemPos
. A lemma and part of speech combination.
Attributes:
id
. Required unique numeric id.lemma
. Required lemma id.pos
. Required part of speech id.
|
|
|