XI UNL School

From UNL Wiki

Revision as of 20:19, 25 February 2013 by Martins (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

The XI UNL School will be dedicated to the development and improvement of the grammatical resources for the languages below. It will take place at the University of Macau, in Macau, from March 11th to 15th, 2013.

Call for Participation

The CFP is available at [1].

Important dates

~~03/Feb/2013: Deadline for the applications~~
~~10/Feb/2013: Notification of accepted candidates~~
11-15/Mar/2013: XI UNL School

Languages

The workshop is dedicated to the development of the grammatical resources for the processing of the languages below:

Bahasa Indonesia
Bengali
Burmese
Cantonese
Chinese
Japanese
Khmer
Korean
Laotian
Malay
Mongolian
Nepalese
Portuguese (Macau)
Sinhalese
Tagalog
Tetum
Thai
Vietnamese

Participants

Angela Manukian
Bal Krishna Bal
Balaram Prasain
Chittaphone Chanylilath
Chunlei Yang
Jeanette Tan
Kim Sokphyrum
Lilit Paremuzyan
Minh Tien Nguyen
Muhammad Zulhelmy bin Mohd Rosman
Siyou Liu
Somdev Kar
Suos Samak
Teng Wei Min
Vinh Dang
Viraj Karunananda
Viraj Welgama

Pre-workshop tasks (to be sent before March 4th)

Corpus
1. Translate (manually) the 100 sentences of the corpus UC-A1 from English into your native language. Be as close as possible to the original, and provide one single translation for each sentence. This will be your input document file, and your goal will be to provide (automatically, through IAN) the UNL graphs for each sentence
2. Save the translated text (without the English original) in a plain text (.txt) file with UTF-8 encoding and name it corpus_???.txt where ??? must be replaced by the corresponding ISO 639-2 language code.
Analysis Dictionary
1. Localize the dictionary available at eng_unl_dic.txt. Note that "localization" is not the same as "translation". You may need other features (in English, for instance, nouns do not have gender or case) or other entries. In any case, the resulting dictionary should fit your translated version of the corpus (i.e., all the entries appearing in your translated version of the corpus should appear in the dictionary). For further information on localization, see Localization. For information on the dictionary structure, see Dictionary Specs. For an explanation of the structure of the English dictionary, see English Dictionary. In case you need additional features, use only the tags available at the tagset.
2. Save the NL-UNL dictionary in a plain text (.txt) file with UTF-8 encoding and name it ???_unl_dic.txt where ??? must be replaced by the corresponding ISO 639-2 language code.
Generation Dictionary
1. Localize the dictionary available at unl_eng_dic.txt. Follow the same procedures described above.
2. Save the UNL-NL dictionary in a plain text (.txt) file with UTF-8 encoding and name it unl_???_dic.txt where ??? must be replaced by the corresponding ISO 639-2 language code.
Send the 3 files to r.martins@undlfoundation.org before March 4th

Material

Recommended readings
Before starting the activity, and in order to fully understand what is expected to be done, it is important for you to be acquainted with the following documentation:
- Tagset, because you are expected to use only the tags included in the tagset
- UNL Dictionary Specs, which is essential to understand the dictionary structure
- UNL Grammar Specs, which is essential to understand the grammar structure
- X-bar
- Default Dictionary
- English dictionary
- Default grammar
- English grammar

It is also interesting to make a test drive with IAN and EUGENE.

Venue

University of Macau, Macau

Local Organization

Prof. Dr. Ana Luísa Varani Leal (University of Macau)

XI UNL School

Contents

Call for Participation

Important dates

Languages

Participants

Pre-workshop tasks (to be sent before March 4th)

Material

Venue

Local Organization

Views

Personal tools

Search

UNL

Lingware

Software

UNL Program

Navigation

Toolbox

Print/export