Projects

From UNL Wiki
(Difference between revisions)
Jump to: navigation, search
(BRUNO)
(LACE)
Line 12: Line 12:
  
 
== [[LACE]] ==  
 
== [[LACE]] ==  
The main goal of the project LACE is to build language modules out of data automatically extracted from comparable corpora. The results are expected to be incorporated in the architecture of UNL-based systems as supplementary resources for natural language disambiguation, both in analysis and generation, and will be used for improving the performance of applications in machine translation, summarization, information retrieval and semantic reasoning. The project has been developed under the CADMOS consortium (University of Geneva, University of Lausanne and École Politechnique Fédérale de Lausanne), and is supported by the Wilsdorf Foundation.
+
The main goal of the project LACE (Lexical Acquisition from Comparable tExts) is to build language modules out of data automatically extracted from comparable corpora. The results are expected to be incorporated in the architecture of UNL-based systems as supplementary resources for natural language disambiguation, both in analysis and generation, and will be used for improving the performance of applications in machine translation, summarization, information retrieval and semantic reasoning. The project has been developed under the CADMOS consortium (University of Geneva, University of Lausanne and École Politechnique Fédérale de Lausanne), and is supported by the Wilsdorf Foundation.
  
 
== [[LPP|LE PETIT PRINCE]] ==
 
== [[LPP|LE PETIT PRINCE]] ==

Revision as of 10:07, 7 August 2013

Contents

BRUNO

The project BRUNO (Basic Resources for UNlizatiOn) aims at providing NL->UNL (analysis) dictionaries based in the frequency of occurrence of lemmas in the source language.

CRATYLUS

The project Cratylus aims at UNLizing the integral text of Cratylus (360 BC), written by the Greek philosopher Plato (427? BC-347? BC). Cratylus is one of the most well-known Platonic dialogues, and an outstanding cornerstone in the history of language studies. The text was used mainly to provide some standards for UNLization.

EOLSS

The project EOLSS aims at multilingualizing, via UNL, the content of 30 articles of the Encyclopedia of Water, one of the many encyclopedias of the Encyclopedia of Life Support Systems (EOLSS), an integrated compendium of several encyclopaedias, which attempts to forge pathways between disciplines and to foster the transdisciplinary relations between subjects especially related to the life supporting systems.

IGLU

The project IGLU intends to map WordNet glosses from English into UNL. The project is divided into two main phases: the first one (iGLU#1) addresses a subset of 27,255 synsets and is supposed to be carried out in a predominantly human basis; the second one (iGLU#2) focuses on the remaining 90,404 synsets and it is expected to be mainly automatic. In iGLU#1, linguists are supposed to UNL-ize WordNet definitions through the UNL Editor, a graph-based UNL authoring tool available at the UNLdev. Decisions are stored in a UNL-ization memory, which comprises mappings between lexical items of English and Universal Words. Information on attributes and relations are also encoded. These data will be used in the second phase, when the UNL-ization process is expected to be performed by IAN - the UNDL Foundation Interactive ANalyzer -, under development. IAN requires much less human intervention than the UNL Editor, and it is a first step towards a fully-automatic natural language analysis system. Results of the project iGLU are expected to be used not only in compiling the UNL-ization memory, but also in populating the UNL Knowledge Base, which is an essential part of the architecture of the UNL system. It will improve the quality of word sense disambiguation and enhance the capability of information retrieval and extraction through UNL.

LACE

The main goal of the project LACE (Lexical Acquisition from Comparable tExts) is to build language modules out of data automatically extracted from comparable corpora. The results are expected to be incorporated in the architecture of UNL-based systems as supplementary resources for natural language disambiguation, both in analysis and generation, and will be used for improving the performance of applications in machine translation, summarization, information retrieval and semantic reasoning. The project has been developed under the CADMOS consortium (University of Geneva, University of Lausanne and École Politechnique Fédérale de Lausanne), and is supported by the Wilsdorf Foundation.

LE PETIT PRINCE

The project Le Petit Prince (or LPP) aims at UNLizing the integral text of Le Petit Prince, a French novel published by Antoine de Saint-Exupéry in 1943. The main goal is to set standards and guidelines for human UNLization, and to test several tools that have been developed at the UNDL Foundation. The resulting UNL document is also planned to be used in the evaluation of UNL-based translations, and as a training material for VALERIE, the Virtual Learning Environment for UNL.

LEWIS & SHORT

The project Lewis & Short aims at mapping lemmas extracted from the Lewis & Short Latin Dictionary (1879) into UNL. The project is coordinated by the UNL Center at the University of Patras, in Greece, under the supervision of Dr. Olga Vartzioti.

LIS

The Library Information System (LIS) is an information retrieval system that aims at performing multilingual search over bibliographical metadata. The main goal of the project is to UNLize a small set of MARC21 records and to provide the resources necessary to generate it into at least five different languages other than Arabic. The project has been developed by the UNL Center at the Library of Alexandria.

MIR

The project MIR (Multilingual InfrastRucture) aims at creating UNL->NL (generation) dictionaries based in the WordNet3.0.

Software