|
|
|
The file benson-glosses.xml defines Professor Larry Benson's glosses for the Chaucer corpus. The standard NU file is:
<?xml version="1.0" encoding="utf-8"?>
<WordHoardBensonGlosses>
<bensonPartsOfSpeech>
<pos id="0" tag="?">?</pos>
<pos id="1" tag="abbrev">abbreviation</pos>
...
<pos id="169" tag="for">for in 'for to'</pos>
<pos id="171" tag="adv#of">adverbial use of 'of'</pos>
</bensonPartsOfSpeech>
<bensonLemmas>
<lemma id="1" homonym="0">
<word>+</word>
<wordClass>plussign</wordClass>
<definition></definition>
<comment></comment>
<oedLemma></oedLemma>
</lemma>
<lemma id="2" homonym="0">
<word>--</word>
<wordClass>dash</wordClass>
<definition></definition>
<comment></comment>
<oedLemma></oedLemma>
</lemma>
...
<lemma id="10702" homonym="0">
<word>zodiak</word>
<wordClass>n</wordClass>
<definition>zodiac,</definition>
<comment>s.v. zodiac sb. (a.) OED.</comment>
<oedLemma>zodiac</oedLemma>
</lemma>
<lemma id="11001" homonym="1">
<word>Dite</word>
<wordClass>n</wordClass>
<definition>Dictys Cretensis, historian of the Trojan War,</definition>
<comment>proper n.; not in MED.</comment>
<oedLemma></oedLemma>
</lemma>
</bensonLemmas>
<bensonLemPos>
<lemPos id="1" lemma="1" pos="97"/>
<lemPos id="2" lemma="4" pos="88"/>
...
<lemPos id="17607" lemma="10702" pos="68"/>
<lemPos id="17608" lemma="11001" pos="74"/>
</bensonLemPos>
</WordHoardBensonGlosses>
The Benson gloss definition file has the following elements:
WordHoardBensonGlosses. The root element.
Children:
bensonPartsOfSpeech (1). Benson's parts of speech.bensonLemmas (1). Benson's lemmas.bensonLemPos (1). Benson's lemma and part of speech combinations.bensonPartsOfSpeech. Benson's parts of speech.
Children:
pos (0..n). Parts of speech.pos. A part of speech.
Attributes:
id. Required unique numeric id.tag. Required tag.Children:
TEXT (0..1). Optional description.bensonLemmas. Benson's lemmas.
Children:
lemma (0..n). Lemmas.lemma. A lemma.
Attributes:
id. Required unique numeric id.homonym. An optional homonym number.Children:
word (1). The lemma word.wordClass (1). The word class.definition (0..1). The definition.comment (0..1). A comment.oedLemma (0..1). The OED lemma.word. A lemma word.
Children:
TEXT (1). The lemma word.wordClass. A word class.
Children:
TEXT (1). The word class.definition. A definition.
Children:
TEXT (0..1). The definition.comment. A comment.
Children:
TEXT (0..1). The comment.oedLemma. An OED lemma.
Children:
TEXT (0..1). The OED lemma.bensonLemPos. Benson's lemma and part of speech combinations.
Children:
lemPos (0..n). Lemma and part of speech combinations.lemPos. A lemma and part of speech combination.
Attributes:
id. Required unique numeric id.lemma. Required lemma id.pos. Required part of speech id.
|
|
|