Projects

From UNL Wiki
Revision as of 18:48, 7 August 2013 by Martins (Talk | contribs)
Jump to: navigation, search

Contents

Types of Projects

The UNLweb hosts three different types of projects:

  • Dictionary projects aims at proving entries to the UNL dictionaries
  • Corpus projects aims at providing corpora for assessing UNL grammars
  • Memory projects aims at providing further lexical resources for UNL-based systems

Dictionary Projects

There are four types of dictionary projects:

  • UNL->NL (Generation) Dictionary (or simply GD) projects aims at mapping UWs into natural language lexical items
  • NL->UNL (Analysis) Dictionary (or simply AD) projects aims at mapping natural language lexical items into UWs
  • NL Dictionary (or simply ND) projects aims at treating entries resulting from GD Dictionary projects
  • UNL Dictionary (or simply UD) projects aims at analyzing, defining and exemplifying UWs

Corpus Projects

There are two types of corpus projects:

  • UNL->NL (Generation) Corpus projects aims at NL-izing a UNL document
  • NL->UNL (Analysis) Corpus projects aims at UNL-izing a natural language document

Memory Projects

There are five types of memory projects:

  • Knowledge Base projects aims at providing entries for the UNL Knowledge Base
  • UNL Memory projects aims at providing entries for the UNL Memory
  • NL Memory projects aims at providing entries for the NL Memory
  • NL->UNL (Analysis) Memory projects aims at mapping translation units into UNL
  • UNL->NL (Generation) Memory projects aims at UNL segments into natural language expressions

Requisites

List of projects

Title Type Description
BRUNO AD The project BRUNO (Basic Resources for UNlizatiOn) aims at providing NL->UNL (analysis) dictionaries based in the frequency of occurrence of lemmas in the source language.
CRATYLUS UC The project Cratylus aims at UNLizing the integral text of Cratylus (360 BC), written by the Greek philosopher Plato (427? BC-347? BC). Cratylus is one of the most well-known Platonic dialogues, and an outstanding cornerstone in the history of language studies. The text was used mainly to provide some standards for UNLization.
EOLSS UC and GD The project EOLSS aims at multilingualizing, via UNL, the content of 30 articles of the Encyclopedia of Water, one of the many encyclopedias of the Encyclopedia of Life Support Systems (EOLSS), an integrated compendium of several encyclopaedias, which attempts to forge pathways between disciplines and to foster the transdisciplinary relations between subjects especially related to the life supporting systems.
IGLU UC The project IGLU (from GLosses to Unl) intends to map WordNet glosses from English into UNL.
LE PETIT PRINCE UC and GD The project Le Petit Prince (or LPP) aims at UNLizing the integral text of Le Petit Prince, a French novel published by Antoine de Saint-Exupéry in 1943. The main goal is to set standards and guidelines for human UNLization, and to test several tools that have been developed at the UNDL Foundation.
LEWIS & SHORT AD The project Lewis & Short aims at mapping lemmas extracted from the Lewis & Short Latin Dictionary (1879) into UNL. The project is coordinated by the UNL Center at the University of Patras, in Greece, under the supervision of Dr. Olga Vartzioti.
LIS The Library Information System (LIS) is an information retrieval system that aims at performing multilingual search over bibliographical metadata. The main goal of the project is to UNLize a small set of MARC21 records and to provide the resources necessary to generate it into at least five different languages other than Arabic. The project has been developed by the UNL Center at the Library of Alexandria.
MIR GD The project MIR (Multilingual InfrastRucture) aims at creating UNL->NL (generation) dictionaries based in the WordNet3.0.

Other projects

LACE

The main goal of the project LACE (Lexical Acquisition from Comparable tExts) is to build language modules out of data automatically extracted from comparable corpora. The results are expected to be incorporated in the architecture of UNL-based systems as supplementary resources for natural language disambiguation, both in analysis and generation, and will be used for improving the performance of applications in machine translation, summarization, information retrieval and semantic reasoning. The project has been developed under the CADMOS consortium (University of Geneva, University of Lausanne and École Politechnique Fédérale de Lausanne), and is supported by the Wilsdorf Foundation.

Software