FRIDA

From UNL Wiki
(Difference between revisions)
Jump to: navigation, search
(Repository)
Line 7: Line 7:
  
 
== Repository ==
 
== Repository ==
The whole FRIDA contains, for each language, 30,000 lemmas, and is divided into 6 different repositories according to the frequency of use of the lemmas.  
+
The whole FRIDA contains, for each language, 30,000 lemmas, and is divided into 6 different repositories according to the frequency of use of lemmas.  
 
*FRIDA-A1 contains the list of the 5,000 most frequent lemmas of the language (including articles, prepositions, conjunctions, auxiliary verbs, etc.);
 
*FRIDA-A1 contains the list of the 5,000 most frequent lemmas of the language (including articles, prepositions, conjunctions, auxiliary verbs, etc.);
 
*FRIDA-A2 contains the list of the following 5,000 most frequent lemmas of the language (including articles, prepositions, conjunctions, auxiliary verbs, etc.);
 
*FRIDA-A2 contains the list of the following 5,000 most frequent lemmas of the language (including articles, prepositions, conjunctions, auxiliary verbs, etc.);

Revision as of 15:53, 5 February 2014

The project FRIDA (Français, Rumantsch, Italiano and Deutsch for Analysis) is devoted to the creation of NL-UNL (analysis) dictionaries for the official languages of Switzerland.

Goal

The project FRIDA has two main goals:

  1. To provide several word-to-concept monolingual databases (i.e., encoding or reader's dictionaries). These dictionaries are expected to be used in UNLization, i.e., in generating UNL graphs out of natural language documents, especially through IAN.
  2. To find concepts that are not enclosed in the WordNet3.0 and should be incorporated to the UNL Dictionary.

Repository

The whole FRIDA contains, for each language, 30,000 lemmas, and is divided into 6 different repositories according to the frequency of use of lemmas.

  • FRIDA-A1 contains the list of the 5,000 most frequent lemmas of the language (including articles, prepositions, conjunctions, auxiliary verbs, etc.);
  • FRIDA-A2 contains the list of the following 5,000 most frequent lemmas of the language (including articles, prepositions, conjunctions, auxiliary verbs, etc.);

And so on up to FRIDA-C2. The entries have been extracted mostly from the nominata of acknowledged monolingual dictionaries.

Repository # of lemmas
FRIDA-A1 5,000
FRIDA-A2 5,000
FRIDA-B1 5,000
FRIDA-B2 5,000
FRIDA-C1 5,000
FRIDA-C2 5,000
Software