UNL Dictionary

From UNL Wiki
(Difference between revisions)
Jump to: navigation, search
(<)
Line 9: Line 9:
 
== Structure ==
 
== Structure ==
  
<`UWID` int(11) NOT NULL AUTO_INCREMENT COMMENT 'UWID',
+
<pre>
                  `UW` varchar(255) COLLATE utf8_unicode_ci NOT NULL COMMENT 'Universal Word',
+
<dic_unl>
                  `HEADWORD` varchar(255) COLLATE utf8_unicode_ci DEFAULT NULL COMMENT 'Headword',
+
        <UWID>unique index (integer auto-increment)</UWID>
                  `SYNSET` varchar(255) COLLATE utf8_unicode_ci NOT NULL,
+
        <UW>lemma</UW>
                  `ROOT` varchar(255) COLLATE utf8_unicode_ci NOT NULL,
+
        <HEADWORD>UW in English</UW>
                  `SEMSTR` varchar(255) COLLATE utf8_unicode_ci NOT NULL,
+
        <SYNSET>synset in English</SYNSET>
                  `HYPERNYM` varchar(255) COLLATE utf8_unicode_ci NOT NULL,
+
        <ROOT>root of the UW</ROOT>
                  `DEFINITION` text COLLATE utf8_unicode_ci COMMENT 'Definition',
+
        <SEMSTR>semantic string</SEMSTR>
                  `EXAMPLE` text COLLATE utf8_unicode_ci COMMENT 'Example',
+
        <HYPERNYM>hypernym</HYPERNYM>
                  `LEX` char(1) COLLATE utf8_unicode_ci NOT NULL COMMENT 'Lexical Category',
+
        <DEFINITION>definition of the UW</DEFINITION>
                  `ABN` varchar(4) COLLATE utf8_unicode_ci DEFAULT NULL COMMENT 'Abstractness',
+
        <EXAMPLE>example of the UW</EXAMPLE>
                  `ALY` varchar(4) COLLATE utf8_unicode_ci DEFAULT NULL COMMENT 'Alienability',
+
        <LEX>lexical category</LEX>
                  `ANI` varchar(4) COLLATE utf8_unicode_ci DEFAULT NULL COMMENT 'Animacy',
+
        <ABN>abstractness</ABN>
                  `CAR` varchar(4) COLLATE utf8_unicode_ci DEFAULT NULL COMMENT 'Cardinality',
+
        <ALY>alienability</ALY>
                  `GEN` char(3) COLLATE utf8_unicode_ci DEFAULT NULL COMMENT 'Gender',
+
        <ANI>animacy</ANI>
                  `POL` char(3) COLLATE utf8_unicode_ci DEFAULT NULL COMMENT 'Polarity',
+
        <CAR>cardinality</CAR>
                  `SEM` char(3) COLLATE utf8_unicode_ci DEFAULT NULL COMMENT 'Semantic Category',
+
        <GEN>gender</GEN>
                  `SFR` varchar(255) COLLATE utf8_unicode_ci DEFAULT NULL COMMENT 'Semantic Frame',
+
        <POL>polarity</POL>
                  `SLANGUAGE` char(2) COLLATE utf8_unicode_ci DEFAULT NULL COMMENT 'Source Language',
+
        <SEM>semantic class</SEM>
                  `PROJECT` varchar(255) COLLATE utf8_unicode_ci DEFAULT NULL COMMENT 'Project',
+
        <SFR>semantic Frame</SFR>
                  `AUTHOR` varchar(255) COLLATE utf8_unicode_ci DEFAULT NULL COMMENT 'Author',
+
        <SLANGUAGE>source language</SLANGUAGE>
                  `EDITOR` varchar(255) COLLATE utf8_unicode_ci DEFAULT NULL COMMENT 'Editor',
+
        <PROJECT>project</PROJECT>
                  `REVISOR` varchar(255) COLLATE utf8_unicode_ci DEFAULT NULL COMMENT 'Revisor',
+
        <AUTHOR>creator of the entry</AUTHOR>
                  `CREATED` date DEFAULT NULL COMMENT 'Created',
+
        <EDITOR>editor of the entry</EDITOR>
                  `EDITED` date DEFAULT NULL COMMENT 'Edited',
+
        <REVISOR>revisor of the entry</REVISOR>
                  `REVISED` date DEFAULT NULL COMMENT 'Revised',
+
        <CREATED>date of creation</CREATED>
                  `STATUS` int(1) DEFAULT NULL COMMENT 'Status',
+
        <EDITED>date of edition</EDITED>
                  `PROBLEM` text COLLATE utf8_unicode_ci COMMENT 'Problem',
+
        <REVISED>date of revision</REVISED>
                  `BUG_REPORTER` text COLLATE utf8_unicode_ci COMMENT 'Bug Reporter',
+
        <DATE>last update</DATE>
                  `IMAGE` text COLLATE utf8_unicode_ci COMMENT 'Image',
+
        <STATUS>current status</STATUS>
                  `COMMENTS` text COLLATE utf8_unicode_ci COMMENT 'Comments',
+
        <ASSIGNMENT>number of the assignment</ASSIGNMENT>
                  `DECLINED` int(11) DEFAULT NULL,
+
        <PROBLEM>problem, if any</PROBLEM>
                  `REFUSED` int(11) NOT NULL DEFAULT '0',
+
        <BUG_REPORTER>user who reported the problem</BUG_REPORTER>
                  `FREQUENCY` int(11) DEFAULT NULL COMMENT 'Frequency',
+
        <DECLINED>number of times the entry was declined</DECLINED>
                  `ASSIGNMENT` int(11) DEFAULT NULL,
+
        <REFUSED>number of times the entry was refused</REFUSED>
                  `CORE` int(1) DEFAULT NULL,
+
        <FREQUENCY>frequency of the entry</FREQUENCY>
                  `UNIVERSALITY` tinyint(1) NOT NULL,
+
        <CORE>position in the core dictionary</CORE>
                  `DATE` timestamp NULL DEFAULT CURRENT_TIMESTAMP ON UPDATE CURRENT_TIMESTAMP COMMENT 'Updated',
+
        <UNIVERSALITY>degree of universality</UNIVERSALITY>
                  PRIMARY KEY (`UWID`),
+
        <COMMENTS>comments</COMMENTS>
 +
</dic_unl>
 +
</pre>

Revision as of 20:15, 27 June 2013

The UNL Dictionary, or UD, is the inventory of the permanent Universal Words. It is a flat list of UWs in alphabetical order with their corresponding semantic features, but without any further organization or structure, which is expected to be provided by the other UNL lexical databases (the UNL Knowledge Base and the UNL Memory, namely).

Subdivisions

The UNL Dictionary is divided into three nested repositories:

  • The UNL Core Dictionary contains only permanent simple UWs that are (presumably) shared by all languages
  • The UNL Abridged Dictionary contains all permanent UWs (simple, compound or complex) that are shared by at least two different language families
  • The UNL Unabridged Dictionary contains all permanent UWs (simple, compound or complex) that are lexicalized in at least one language

Structure

<dic_unl>
        <UWID>unique index (integer auto-increment)</UWID>
        <UW>lemma</UW>
        <HEADWORD>UW in English</UW>
        <SYNSET>synset in English</SYNSET>
        <ROOT>root of the UW</ROOT>
        <SEMSTR>semantic string</SEMSTR>
        <HYPERNYM>hypernym</HYPERNYM>
        <DEFINITION>definition of the UW</DEFINITION>
        <EXAMPLE>example of the UW</EXAMPLE>
        <LEX>lexical category</LEX>
        <ABN>abstractness</ABN>
        <ALY>alienability</ALY>
        <ANI>animacy</ANI>
        <CAR>cardinality</CAR>
        <GEN>gender</GEN>
        <POL>polarity</POL>
        <SEM>semantic class</SEM>
        <SFR>semantic Frame</SFR>
        <SLANGUAGE>source language</SLANGUAGE>
        <PROJECT>project</PROJECT>
        <AUTHOR>creator of the entry</AUTHOR>
        <EDITOR>editor of the entry</EDITOR>
        <REVISOR>revisor of the entry</REVISOR>
        <CREATED>date of creation</CREATED>
        <EDITED>date of edition</EDITED>
        <REVISED>date of revision</REVISED>
        <DATE>last update</DATE>
        <STATUS>current status</STATUS>
        <ASSIGNMENT>number of the assignment</ASSIGNMENT>
        <PROBLEM>problem, if any</PROBLEM>
        <BUG_REPORTER>user who reported the problem</BUG_REPORTER>
        <DECLINED>number of times the entry was declined</DECLINED>
        <REFUSED>number of times the entry was refused</REFUSED>
        <FREQUENCY>frequency of the entry</FREQUENCY>
        <CORE>position in the core dictionary</CORE>
        <UNIVERSALITY>degree of universality</UNIVERSALITY>
        <COMMENTS>comments</COMMENTS>
</dic_unl>
Software