http://www.unlweb.net/wiki/index.php?title=BRUNO&feed=atom&action=historyBRUNO - Revision history2024-03-28T21:30:29ZRevision history for this page on the wikiMediaWiki 1.18.1http://www.unlweb.net/wiki/index.php?title=BRUNO&diff=7721&oldid=prevMartins: /* Repository */2014-05-28T08:47:33Z<p><span class="autocomment">Repository</span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 08:47, 28 May 2014</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 30:</td>
<td colspan="2" class="diff-lineno">Line 30:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>|-</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>|-</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>|align="center"|BRUNO-C1</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>|align="center"|BRUNO-C1</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div>|align="center"|<del class="diffchange diffchange-inline">5</del>,000</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>|align="center"|<ins class="diffchange diffchange-inline">10</ins>,000</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>|-</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>|-</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>|align="center"|BRUNO-C2</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>|align="center"|BRUNO-C2</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div>|align="center"|<del class="diffchange diffchange-inline">5</del>,000</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>|align="center"|<ins class="diffchange diffchange-inline">10</ins>,000</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>|}</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>|}</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
</table>Martinshttp://www.unlweb.net/wiki/index.php?title=BRUNO&diff=7420&oldid=prevMartins at 15:28, 5 February 20142014-02-05T15:28:42Z<p></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 15:28, 5 February 2014</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 46:</td>
<td colspan="2" class="diff-lineno">Line 46:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>*In all cases, the language must contain a reasonable amount of [[inflectional paradigms]] and [[subcategorization frames]] already registered in the [[UNLarium]].</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>*In all cases, the language must contain a reasonable amount of [[inflectional paradigms]] and [[subcategorization frames]] already registered in the [[UNLarium]].</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div>== <del class="diffchange diffchange-inline">Methodology </del>==</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>== <ins class="diffchange diffchange-inline">Preparing the list of entries </ins>==</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#List of entries</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#List of entries</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:Participants are expected to provide a list of the entries according to the following criteria:</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:Participants are expected to provide a list of the entries according to the following criteria:</div></td></tr>
<tr><td colspan="2" class="diff-lineno">Line 59:</td>
<td colspan="2" class="diff-lineno">Line 59:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#Dictionary</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#Dictionary</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:Entries become available, in the UNLarium, for all the registered users of a given language, in case of open projects, or for the approved candidates, in case of closed projects. Users are expected to provide all the morphological, syntactic and semantic information for each entry</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:Entries become available, in the UNLarium, for all the registered users of a given language, in case of open projects, or for the approved candidates, in case of closed projects. Users are expected to provide all the morphological, syntactic and semantic information for each entry</div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;"></ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">== Instructions ==</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">;Lexical Category</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">:Whenever the lexical category for a given lemma is provided, check whether it is correct. If it is not correct, decline the entry and report the problem by clicking over the yellow triangle at the right of the main entry. If the lexical category is not provided, select the most likely category. Do not worry about homonyms: provide one single category for a given main entry.</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">;Lemma</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">:Do not change the lemma. If it is not correct (i.e., if it is misspelled or cannot be considered to be a lexical unit), decline the entry and report the problem by clicking over the yellow triangle at the right of the main entry.</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">;Provide as many UW's as necessary to each lemma, but do not include very rare or unusual cases. And check the order: the most likely senses must appear first.</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">;Base Form</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">:You have to worry about the base form only in case of multiword expressions 1) whose inflections cannot be formed by simple affixation or 2) which are discontinuous. In these cases, provide the corresponding composition rules.</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">;Inflection</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">:Select AND TEST the inflectional paradigm that generates the inflections of the base form. Any errors here will be propagated to the dictionary, so be careful. And pay attention to the cases below:</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">:*LOCALIZED IRREGULARITY: if the word is mostly regular and its irregularity is localized in some few and specific rules (more than one possible plural for nouns, or defective verbs that are not used in a given person, for instance, but follow the general rules for all the others), assign the word to the corresponding paradigm and list, in the box "inflectional rules", its irregularities;</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">:*NON-EXISTING PARADIGM: if the word is regular or semi-regular (in the sense that there are several other words in the same case), and cannot be associated to any existing paradigm, press the button REQUEST A NEW PARADIGM and provide the corresponding details;</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">:*IRREGULAR WORDS: if the word is irregular (i.e., it has a quite unusual and specific morphological behavior), choose the option IRREGULAR and provide the corresponding inflectional rules.</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">;Subcategorization</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">:Subcategorization is only required when the word REQUIRES a complement or a specifier (indirect transitive verbs that select an specific preposition, for instance). In this case, you have to inform the corresponding subcategorization frame. If the subcategorization frame is not available, press the button REQUEST A NEW SUBCATEGORIZATION FRAME and provide the corresponding details.</ins></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Notes ==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Notes ==</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div><references /></div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div><references /></div></td></tr>
</table>Martinshttp://www.unlweb.net/wiki/index.php?title=BRUNO&diff=7159&oldid=prevMartins: /* Methodology */2013-09-26T12:07:13Z<p><span class="autocomment">Methodology</span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 12:07, 26 September 2013</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 49:</td>
<td colspan="2" class="diff-lineno">Line 49:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#List of entries</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#List of entries</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:Participants are expected to provide a list of the entries according to the following criteria:</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:Participants are expected to provide a list of the entries according to the following criteria:</div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">#:*The list of entries must include the most frequent lemmas of the language, including articles, prepositions, conjunctions, nouns, verbs, adjectives, adverbs, etc.</ins></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries can be extracted from prestigious monolingual dictionaries or from a corpus considered to be representative of the standard written language<ref>This corpus can be either an existing reputable corpus or a new corpus compiled according to the criteria defined at [[NC]].</ref>.</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries can be extracted from prestigious monolingual dictionaries or from a corpus considered to be representative of the standard written language<ref>This corpus can be either an existing reputable corpus or a new corpus compiled according to the criteria defined at [[NC]].</ref>.</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries must be ordered according to the frequency of occurrence (the most frequent entries must come first)<ref>The frequency of use is not often informed by ordinary dictionaries but may be inferred from the several distributions of the same dictionary: basic, intermediate or advanced, for instance.</ref>.</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries must be ordered according to the frequency of occurrence (the most frequent entries must come first)<ref>The frequency of use is not often informed by ordinary dictionaries but may be inferred from the several distributions of the same dictionary: basic, intermediate or advanced, for instance.</ref>.</div></td></tr>
</table>Martinshttp://www.unlweb.net/wiki/index.php?title=BRUNO&diff=7143&oldid=prevMartins: /* Repository */2013-09-24T15:41:22Z<p><span class="autocomment">Repository</span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 15:41, 24 September 2013</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 8:</td>
<td colspan="2" class="diff-lineno">Line 8:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Repository ==</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>== Repository ==</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>BRUNO is language dependent. Every language has its own set of entries to be addressed. The repository is divided into 6 different subprojects according to the frequency of use of the lemmas.  </div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>BRUNO is language dependent. Every language has its own set of entries to be addressed. The repository is divided into 6 different subprojects according to the frequency of use of the lemmas.  </div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">*BRUNO-A1 contains the list of the 2,000 most frequent lemmas of the language (including articles, prepositions, conjunctions, auxiliary verbs, etc.);</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">*BRUNO-A2 contains the next 3,000 most frequent lemmas of the language;</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">*BRUNO-B1 contains the next 5,000 most frequent lemmas of the language;</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;">And so on.</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins style="color: red; font-weight: bold; text-decoration: none;"></ins></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>{|border="1" align="center" cellpadding="2"</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>{|border="1" align="center" cellpadding="2"</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>!Repository</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>!Repository</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div>!# of lemmas<del class="diffchange diffchange-inline"><ref>The lemmas must be ordered according to the frequency of use. In that sense, BRUNO-A1 deals with the most frequent lemmas from 1 to 2,000. BRUNO-A2 deals with the most frequent lemmas from 2,001 to 5,000. BRUNO-B1 deals with the most frequent lemmas from 5,001 to 10,000. And so on.</ref> </del></div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>!# of lemmas  </div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>|-</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>|-</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>|align="center"|BRUNO-A1</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>|align="center"|BRUNO-A1</div></td></tr>
</table>Martinshttp://www.unlweb.net/wiki/index.php?title=BRUNO&diff=7142&oldid=prevMartins: /* Methodology */2013-09-24T15:38:52Z<p><span class="autocomment">Methodology</span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 15:38, 24 September 2013</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 46:</td>
<td colspan="2" class="diff-lineno">Line 46:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries can be extracted from prestigious monolingual dictionaries or from a corpus considered to be representative of the standard written language<ref>This corpus can be either an existing reputable corpus or a new corpus compiled according to the criteria defined at [[NC]].</ref>.</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries can be extracted from prestigious monolingual dictionaries or from a corpus considered to be representative of the standard written language<ref>This corpus can be either an existing reputable corpus or a new corpus compiled according to the criteria defined at [[NC]].</ref>.</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries must be ordered according to the frequency of occurrence (the most frequent entries must come first)<ref>The frequency of use is not often informed by ordinary dictionaries but may be inferred from the several distributions of the same dictionary: basic, intermediate or advanced, for instance.</ref>.</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries must be ordered according to the frequency of occurrence (the most frequent entries must come first)<ref>The frequency of use is not often informed by ordinary dictionaries but may be inferred from the several distributions of the same dictionary: basic, intermediate or advanced, for instance.</ref>.</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div>#:*The list of entries must be lemmatized<ref>There should be as many lemmas as different '''morphological behavior''' (part-of-speech, gender, number, inflections, etc.). The word "book", in English, should correspond to two lemmas: "book" as a noun, and "book" as a verb. Note that the many different meanings of "book" as a noun do not lead to different lemmas, because all them have the same morphological behavior, i.e., are singular and make plural in -s. On the other hand, the noun "livre", in French, should correspond to two lemmas: "livre" as a noun masculine (= "book"), and "livre" as a noun feminine (= "pound"). This difference is not derived from the different meanings, but from the different morphological behavior: one is masculine and the other is feminine.</ref></div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>#:*The list of entries must be lemmatized<ref>There should be as many lemmas as different '''morphological behavior''' (part-of-speech, gender, number, inflections, etc.). The word "book", in English, should correspond to two lemmas: "book" as a noun, and "book" as a verb. Note that the many different meanings of "book" as a noun do not lead to different lemmas, because all <ins class="diffchange diffchange-inline">of </ins>them have the same morphological behavior, i.e., are singular and make plural in -s. On the other hand, the noun "livre", in French, should correspond to two lemmas: "livre" as a noun masculine (= "book"), and "livre" as a noun feminine (= "pound"). This difference is not derived from the different meanings, but from the different morphological behavior: one is masculine and the other is feminine.</ref></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*Entries must be provided in a plain text file (.txt) with UTF-8 encoding, with one entry per line, along with the corresponding value of the lexical category [[LEX]], in the following format:</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*Entries must be provided in a plain text file (.txt) with UTF-8 encoding, with one entry per line, along with the corresponding value of the lexical category [[LEX]], in the following format:</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#::lemma:LEX<ref>See an example at [http://www.unlweb.net/resources/bruno/hu_a1.txt]</ref></div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#::lemma:LEX<ref>See an example at [http://www.unlweb.net/resources/bruno/hu_a1.txt]</ref></div></td></tr>
</table>Martinshttp://www.unlweb.net/wiki/index.php?title=BRUNO&diff=7141&oldid=prevMartins: /* Methodology */2013-09-24T15:38:03Z<p><span class="autocomment">Methodology</span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 15:38, 24 September 2013</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 46:</td>
<td colspan="2" class="diff-lineno">Line 46:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries can be extracted from prestigious monolingual dictionaries or from a corpus considered to be representative of the standard written language<ref>This corpus can be either an existing reputable corpus or a new corpus compiled according to the criteria defined at [[NC]].</ref>.</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries can be extracted from prestigious monolingual dictionaries or from a corpus considered to be representative of the standard written language<ref>This corpus can be either an existing reputable corpus or a new corpus compiled according to the criteria defined at [[NC]].</ref>.</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries must be ordered according to the frequency of occurrence (the most frequent entries must come first)<ref>The frequency of use is not often informed by ordinary dictionaries but may be inferred from the several distributions of the same dictionary: basic, intermediate or advanced, for instance.</ref>.</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries must be ordered according to the frequency of occurrence (the most frequent entries must come first)<ref>The frequency of use is not often informed by ordinary dictionaries but may be inferred from the several distributions of the same dictionary: basic, intermediate or advanced, for instance.</ref>.</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div>#:*The list of entries must be lemmatized<ref>There should be as many lemmas as different '''morphological behavior''' (part-of-speech, gender, number, inflections, etc.). The word "book", in English, should correspond to two lemmas: "book" as a noun, and "book" as a verb. <del class="diffchange diffchange-inline">The </del>noun "livre", in French, should correspond to two lemmas: "livre" as a noun masculine (="book"), and "livre" as a noun feminine (="pound"). <del class="diffchange diffchange-inline">The verb "haver"</del>, <del class="diffchange diffchange-inline">in Portuguese, should correspond to two lemmas</del>: <del class="diffchange diffchange-inline">"haver" (auxiliary verb inflected in all verb forms) </del>and <del class="diffchange diffchange-inline">"haver" (main verb inflected only in </del>the <del class="diffchange diffchange-inline">3rd person, i.e., defective)</del>.</ref></div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>#:*The list of entries must be lemmatized<ref>There should be as many lemmas as different '''morphological behavior''' (part-of-speech, gender, number, inflections, etc.). The word "book", in English, should correspond to two lemmas: "book" as a noun, and "book" as a verb. <ins class="diffchange diffchange-inline">Note that the many different meanings of "book" as a noun do not lead to different lemmas, because all them have the same morphological behavior, i.e., are singular and make plural in -s. On the other hand, the </ins>noun "livre", in French, should correspond to two lemmas: "livre" as a noun masculine (= "book"), and "livre" as a noun feminine (= "pound"). <ins class="diffchange diffchange-inline">This difference is not derived from the different meanings</ins>, <ins class="diffchange diffchange-inline">but from the different morphological behavior</ins>: <ins class="diffchange diffchange-inline">one is masculine </ins>and the <ins class="diffchange diffchange-inline">other is feminine</ins>.</ref></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*Entries must be provided in a plain text file (.txt) with UTF-8 encoding, with one entry per line, along with the corresponding value of the lexical category [[LEX]], in the following format:</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*Entries must be provided in a plain text file (.txt) with UTF-8 encoding, with one entry per line, along with the corresponding value of the lexical category [[LEX]], in the following format:</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#::lemma:LEX<ref>See an example at [http://www.unlweb.net/resources/bruno/hu_a1.txt]</ref></div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#::lemma:LEX<ref>See an example at [http://www.unlweb.net/resources/bruno/hu_a1.txt]</ref></div></td></tr>
</table>Martinshttp://www.unlweb.net/wiki/index.php?title=BRUNO&diff=7140&oldid=prevMartins: /* Methodology */2013-09-24T15:33:50Z<p><span class="autocomment">Methodology</span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 15:33, 24 September 2013</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 48:</td>
<td colspan="2" class="diff-lineno">Line 48:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries must be lemmatized<ref>There should be as many lemmas as different '''morphological behavior''' (part-of-speech, gender, number, inflections, etc.). The word "book", in English, should correspond to two lemmas: "book" as a noun, and "book" as a verb. The noun "livre", in French, should correspond to two lemmas: "livre" as a noun masculine (="book"), and "livre" as a noun feminine (="pound"). The verb "haver", in Portuguese, should correspond to two lemmas: "haver" (auxiliary verb inflected in all verb forms) and "haver" (main verb inflected only in the 3rd person, i.e., defective).</ref></div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries must be lemmatized<ref>There should be as many lemmas as different '''morphological behavior''' (part-of-speech, gender, number, inflections, etc.). The word "book", in English, should correspond to two lemmas: "book" as a noun, and "book" as a verb. The noun "livre", in French, should correspond to two lemmas: "livre" as a noun masculine (="book"), and "livre" as a noun feminine (="pound"). The verb "haver", in Portuguese, should correspond to two lemmas: "haver" (auxiliary verb inflected in all verb forms) and "haver" (main verb inflected only in the 3rd person, i.e., defective).</ref></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*Entries must be provided in a plain text file (.txt) with UTF-8 encoding, with one entry per line, along with the corresponding value of the lexical category [[LEX]], in the following format:</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*Entries must be provided in a plain text file (.txt) with UTF-8 encoding, with one entry per line, along with the corresponding value of the lexical category [[LEX]], in the following format:</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div>#::lemma:LEX<ref>See an example at [http://www.unlweb.net/resources/bruno/<del class="diffchange diffchange-inline">bg_a1</del>.txt]</ref></div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>#::lemma:LEX<ref>See an example at [http://www.unlweb.net/resources/bruno/<ins class="diffchange diffchange-inline">hu_a1</ins>.txt]</ref></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#Verification</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#Verification</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:The list of entries is verified by a language manager or, in case there is no language manager for the target language, by the Language Resources Manager of the UNDL Foundation. If approved, it is uploaded to the UNLarium, and the corresponding BRUNO project is open.</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:The list of entries is verified by a language manager or, in case there is no language manager for the target language, by the Language Resources Manager of the UNDL Foundation. If approved, it is uploaded to the UNLarium, and the corresponding BRUNO project is open.</div></td></tr>
</table>Martinshttp://www.unlweb.net/wiki/index.php?title=BRUNO&diff=7139&oldid=prevMartins: /* Methodology */2013-09-24T15:29:35Z<p><span class="autocomment">Methodology</span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 15:29, 24 September 2013</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 48:</td>
<td colspan="2" class="diff-lineno">Line 48:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries must be lemmatized<ref>There should be as many lemmas as different '''morphological behavior''' (part-of-speech, gender, number, inflections, etc.). The word "book", in English, should correspond to two lemmas: "book" as a noun, and "book" as a verb. The noun "livre", in French, should correspond to two lemmas: "livre" as a noun masculine (="book"), and "livre" as a noun feminine (="pound"). The verb "haver", in Portuguese, should correspond to two lemmas: "haver" (auxiliary verb inflected in all verb forms) and "haver" (main verb inflected only in the 3rd person, i.e., defective).</ref></div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries must be lemmatized<ref>There should be as many lemmas as different '''morphological behavior''' (part-of-speech, gender, number, inflections, etc.). The word "book", in English, should correspond to two lemmas: "book" as a noun, and "book" as a verb. The noun "livre", in French, should correspond to two lemmas: "livre" as a noun masculine (="book"), and "livre" as a noun feminine (="pound"). The verb "haver", in Portuguese, should correspond to two lemmas: "haver" (auxiliary verb inflected in all verb forms) and "haver" (main verb inflected only in the 3rd person, i.e., defective).</ref></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*Entries must be provided in a plain text file (.txt) with UTF-8 encoding, with one entry per line, along with the corresponding value of the lexical category [[LEX]], in the following format:</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*Entries must be provided in a plain text file (.txt) with UTF-8 encoding, with one entry per line, along with the corresponding value of the lexical category [[LEX]], in the following format:</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div>#::lemma:LEX<ref>See an example at <del class="diffchange diffchange-inline">[</del>[http://www.unlweb.net/resources/bruno/bg_a1.txt]</ref></div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>#::lemma:LEX<ref>See an example at [http://www.unlweb.net/resources/bruno/bg_a1.txt]</ref></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#Verification</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#Verification</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:The list of entries is verified by a language manager or, in case there is no language manager for the target language, by the Language Resources Manager of the UNDL Foundation. If approved, it is uploaded to the UNLarium, and the corresponding BRUNO project is open.</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:The list of entries is verified by a language manager or, in case there is no language manager for the target language, by the Language Resources Manager of the UNDL Foundation. If approved, it is uploaded to the UNLarium, and the corresponding BRUNO project is open.</div></td></tr>
</table>Martinshttp://www.unlweb.net/wiki/index.php?title=BRUNO&diff=7138&oldid=prevMartins: /* Methodology */2013-09-24T15:29:08Z<p><span class="autocomment">Methodology</span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 15:29, 24 September 2013</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 48:</td>
<td colspan="2" class="diff-lineno">Line 48:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries must be lemmatized<ref>There should be as many lemmas as different '''morphological behavior''' (part-of-speech, gender, number, inflections, etc.). The word "book", in English, should correspond to two lemmas: "book" as a noun, and "book" as a verb. The noun "livre", in French, should correspond to two lemmas: "livre" as a noun masculine (="book"), and "livre" as a noun feminine (="pound"). The verb "haver", in Portuguese, should correspond to two lemmas: "haver" (auxiliary verb inflected in all verb forms) and "haver" (main verb inflected only in the 3rd person, i.e., defective).</ref></div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries must be lemmatized<ref>There should be as many lemmas as different '''morphological behavior''' (part-of-speech, gender, number, inflections, etc.). The word "book", in English, should correspond to two lemmas: "book" as a noun, and "book" as a verb. The noun "livre", in French, should correspond to two lemmas: "livre" as a noun masculine (="book"), and "livre" as a noun feminine (="pound"). The verb "haver", in Portuguese, should correspond to two lemmas: "haver" (auxiliary verb inflected in all verb forms) and "haver" (main verb inflected only in the 3rd person, i.e., defective).</ref></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*Entries must be provided in a plain text file (.txt) with UTF-8 encoding, with one entry per line, along with the corresponding value of the lexical category [[LEX]], in the following format:</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*Entries must be provided in a plain text file (.txt) with UTF-8 encoding, with one entry per line, along with the corresponding value of the lexical category [[LEX]], in the following format:</div></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div>#::lemma:LEX</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>#::lemma:LEX<ins class="diffchange diffchange-inline"><ref>See </ins>an example at [[http://www.unlweb.net/resources/bruno/bg_a1.txt]<ins class="diffchange diffchange-inline"></ref></ins></div></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div><del class="diffchange diffchange-inline">#:(see </del>an example at [[http://www.unlweb.net/resources/bruno/bg_a1.txt]<del class="diffchange diffchange-inline">])</del></div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#Verification</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#Verification</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:The list of entries is verified by a language manager or, in case there is no language manager for the target language, by the Language Resources Manager of the UNDL Foundation. If approved, it is uploaded to the UNLarium, and the corresponding BRUNO project is open.</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:The list of entries is verified by a language manager or, in case there is no language manager for the target language, by the Language Resources Manager of the UNDL Foundation. If approved, it is uploaded to the UNLarium, and the corresponding BRUNO project is open.</div></td></tr>
</table>Martinshttp://www.unlweb.net/wiki/index.php?title=BRUNO&diff=7137&oldid=prevMartins: /* Methodology */2013-09-24T15:28:38Z<p><span class="autocomment">Methodology</span></p>
<table class='diff diff-contentalign-left'>
<col class='diff-marker' />
<col class='diff-content' />
<col class='diff-marker' />
<col class='diff-content' />
<tr valign='top'>
<td colspan='2' style="background-color: white; color:black;">← Older revision</td>
<td colspan='2' style="background-color: white; color:black;">Revision as of 15:28, 24 September 2013</td>
</tr><tr><td colspan="2" class="diff-lineno">Line 47:</td>
<td colspan="2" class="diff-lineno">Line 47:</td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries must be ordered according to the frequency of occurrence (the most frequent entries must come first)<ref>The frequency of use is not often informed by ordinary dictionaries but may be inferred from the several distributions of the same dictionary: basic, intermediate or advanced, for instance.</ref>.</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries must be ordered according to the frequency of occurrence (the most frequent entries must come first)<ref>The frequency of use is not often informed by ordinary dictionaries but may be inferred from the several distributions of the same dictionary: basic, intermediate or advanced, for instance.</ref>.</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries must be lemmatized<ref>There should be as many lemmas as different '''morphological behavior''' (part-of-speech, gender, number, inflections, etc.). The word "book", in English, should correspond to two lemmas: "book" as a noun, and "book" as a verb. The noun "livre", in French, should correspond to two lemmas: "livre" as a noun masculine (="book"), and "livre" as a noun feminine (="pound"). The verb "haver", in Portuguese, should correspond to two lemmas: "haver" (auxiliary verb inflected in all verb forms) and "haver" (main verb inflected only in the 3rd person, i.e., defective).</ref></div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:*The list of entries must be lemmatized<ref>There should be as many lemmas as different '''morphological behavior''' (part-of-speech, gender, number, inflections, etc.). The word "book", in English, should correspond to two lemmas: "book" as a noun, and "book" as a verb. The noun "livre", in French, should correspond to two lemmas: "livre" as a noun masculine (="book"), and "livre" as a noun feminine (="pound"). The verb "haver", in Portuguese, should correspond to two lemmas: "haver" (auxiliary verb inflected in all verb forms) and "haver" (main verb inflected only in the 3rd person, i.e., defective).</ref></div></td></tr>
<tr><td class='diff-marker'>−</td><td style="background: #ffa; color:black; font-size: smaller;"><div>#:*Entries must be provided in a plain text file (.txt) with UTF-8 encoding, with one entry per line, along with the corresponding value of the lexical category [[LEX]], <del class="diffchange diffchange-inline">as follows</del>:</div></td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div>#:*Entries must be provided in a plain text file (.txt) with UTF-8 encoding, with one entry per line, along with the corresponding value of the lexical category [[LEX]], <ins class="diffchange diffchange-inline">in the following format</ins>:</div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins class="diffchange diffchange-inline">#::lemma:LEX</ins></div></td></tr>
<tr><td colspan="2"> </td><td class='diff-marker'>+</td><td style="background: #cfc; color:black; font-size: smaller;"><div><ins class="diffchange diffchange-inline">#:(see an example at [[http://www.unlweb.net/resources/bruno/bg_a1.txt]])</ins></div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#Verification</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#Verification</div></td></tr>
<tr><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:The list of entries is verified by a language manager or, in case there is no language manager for the target language, by the Language Resources Manager of the UNDL Foundation. If approved, it is uploaded to the UNLarium, and the corresponding BRUNO project is open.</div></td><td class='diff-marker'> </td><td style="background: #eee; color:black; font-size: smaller;"><div>#:The list of entries is verified by a language manager or, in case there is no language manager for the target language, by the Language Resources Manager of the UNDL Foundation. If approved, it is uploaded to the UNLarium, and the corresponding BRUNO project is open.</div></td></tr>
</table>Martins