VIII UNL School

From UNL Wiki
(Difference between revisions)
Jump to: navigation, search
(Deliverables)
(Schedule)
 
(17 intermediate revisions by one user not shown)
Line 1: Line 1:
 +
The VIII UNL School, formerly UNL Grammar Workshop, was held in Geneva, Switzerland, from Feb 6-10, 2012.
 +
== Languages ==
 +
*Armenian
 +
*Bulgarian
 +
*Estonian
 +
*German
 +
*Greek
 +
*Polish
 +
*Portuguese
 +
*Romanian
 +
*Russian
 +
*Swedish
 +
 
== Goals ==
 
== Goals ==
 
*To build the basic modules of a NL-UNL (analysis) grammar
 
*To build the basic modules of a NL-UNL (analysis) grammar
Line 4: Line 17:
  
 
== Corpus ==
 
== Corpus ==
*[http://www.unlweb.net/resources/geneva2012/corpus_eng.txt Reference corpus in English (500 sentences)], to be manually translated to the target languages, in order to be used as the input for IAN
+
*[[Corpus500]]
*[http://www.unlweb.net/resources/geneva2012/corpus_unl.txt Reference corpus in UNL (500 graphs)], to be used as the input for EUGENE
+
*Reference corpus according to the complexity of the graphs (the same as above, but split in different files)
+
{| border="1" cellpadding="2" align=center
+
|+Corpus
+
!Order
+
!Description
+
!Analysis (English original)
+
!Generation (UNL)
+
!Word list (English original)
+
|-
+
|1
+
|Temporary entries
+
|[http://www.unlweb.net/resources/geneva2012/temp_org.txt temp_org.txt]
+
|[http://www.unlweb.net/resources/geneva2012/temp_unl.txt temp_unl.txt]
+
|[http://www.unlweb.net/resources/geneva2012/temp_dic.txt temp_dic.txt]
+
|-
+
|2
+
|Entries with no attribute or relation
+
|[http://www.unlweb.net/resources/geneva2012/attribute0_org.txt attribute0_org.txt]
+
|[http://www.unlweb.net/resources/geneva2012/attribute0_unl.txt attribute0_unl.txt]
+
|[http://www.unlweb.net/resources/geneva2012/attribute0_dic.txt attribute0_dic.txt]
+
|-
+
|3
+
|one-attribute entries
+
|[http://www.unlweb.net/resources/geneva2012/attribute1_org.txt attribute1_org.txt]
+
|[http://www.unlweb.net/resources/geneva2012/attribute1_unl.txt attribute1_unl.txt]
+
|[http://www.unlweb.net/resources/geneva2012/attribute1_dic.txt attribute1_dic.txt]
+
|-
+
|4
+
|two-attribute entries
+
|[http://www.unlweb.net/resources/geneva2012/attribute2_org.txt attribute2_org.txt]
+
|[http://www.unlweb.net/resources/geneva2012/attribute2_unl.txt attribute2_unl.txt]
+
|[http://www.unlweb.net/resources/geneva2012/attribute2_dic.txt attribute2_dic.txt]
+
|-
+
|5
+
|three-attribute entries
+
|[http://www.unlweb.net/resources/geneva2012/attribute3_org.txt attribute3_org.txt]
+
|[http://www.unlweb.net/resources/geneva2012/attribute3_unl.txt attribute3_unl.txt]
+
|[http://www.unlweb.net/resources/geneva2012/attribute3_dic.txt attribute3_dic.txt]
+
|-
+
|6
+
|one-relation entries
+
|[http://www.unlweb.net/resources/geneva2012/relation1_org.txt relation1_org.txt]
+
|[http://www.unlweb.net/resources/geneva2012/relation1_unl.txt relation1_unl.txt]
+
|[http://www.unlweb.net/resources/geneva2012/relation1_dic.txt relation1_dic.txt]
+
|-
+
|7
+
|two-relation entries
+
|[http://www.unlweb.net/resources/geneva2012/relation2_org.txt relation2_org.txt]
+
|[http://www.unlweb.net/resources/geneva2012/relation2_unl.txt relation2_unl.txt]
+
|[http://www.unlweb.net/resources/geneva2012/relation2_dic.txt relation2_dic.txt]
+
|-
+
|8
+
|three-relation entries
+
|[http://www.unlweb.net/resources/geneva2012/relation3_org.txt relation3_org.txt]
+
|[http://www.unlweb.net/resources/geneva2012/relation3_unl.txt relation3_unl.txt]
+
|[http://www.unlweb.net/resources/geneva2012/relation3_dic.txt relation3_dic.txt]
+
|-
+
|9
+
|four-relation entries
+
|[http://www.unlweb.net/resources/geneva2012/relation4_org.txt relation4_org.txt]
+
|[http://www.unlweb.net/resources/geneva2012/relation4_unl.txt relation4_unl.txt]
+
|[http://www.unlweb.net/resources/geneva2012/relation4_dic.txt relation4_dic.txt]
+
|-
+
|10
+
|five-relation entries
+
|[http://www.unlweb.net/resources/geneva2012/relation5_org.txt relation5_org.txt]
+
|[http://www.unlweb.net/resources/geneva2012/relation5_unl.txt relation5_unl.txt]
+
|[http://www.unlweb.net/resources/geneva2012/relation5_dic.txt relation5_dic.txt]
+
|-
+
|11
+
|six-relation entries
+
|[http://www.unlweb.net/resources/geneva2012/relation6_org.txt relation6_org.txt]
+
|[http://www.unlweb.net/resources/geneva2012/relation6_unl.txt relation6_unl.txt]
+
|[http://www.unlweb.net/resources/geneva2012/relation6_dic.txt relation6_dic.txt]
+
|-
+
|12
+
|numbers and numerals
+
|[http://www.unlweb.net/resources/geneva2012/numbers.txt numbers_org.txt]
+
|[http://www.unlweb.net/resources/geneva2012/numbers.txt numbers_unl.txt]
+
|[http://www.unlweb.net/resources/geneva2012/numbers.txt numbers_dic.txt]
+
|-
+
|13
+
|expressions of time
+
|[http://www.unlweb.net/resources/geneva2012/time.txt time_org.txt]
+
|[http://www.unlweb.net/resources/geneva2012/time.txt time_unl.txt]
+
|[http://www.unlweb.net/resources/geneva2012/time.txt time_dic.txt]
+
|-
+
|14
+
|relative clauses
+
|[http://www.unlweb.net/resources/geneva2012/relatives.txt relatives_org.txt]
+
|[http://www.unlweb.net/resources/geneva2012/relatives.txt relatives_unl.txt]
+
|[http://www.unlweb.net/resources/geneva2012/relatives.txt relatives_dic.txt]
+
|-
+
|15
+
|special issues
+
|[http://www.unlweb.net/resources/geneva2012/problems.txt problems_org.txt]
+
|[http://www.unlweb.net/resources/geneva2012/problems.txt problems_unl.txt]
+
|[http://www.unlweb.net/resources/geneva2012/problems.txt problems_dic.txt]
+
|}
+
  
 
== Deliverables ==
 
== Deliverables ==
;DEADLINE: 15 MAY 2012
+
;DEADLINE: 01 SEP 2012
 
;ANALYSIS (IAN)
 
;ANALYSIS (IAN)
 
*The manual translated version of the 500 sentences of the reference corpus (corpus_LID.txt)
 
*The manual translated version of the 500 sentences of the reference corpus (corpus_LID.txt)
Line 117: Line 30:
 
*The generation dictionary used to generate the reference corpus onto natural language (gen_dic_LID.txt)
 
*The generation dictionary used to generate the reference corpus onto natural language (gen_dic_LID.txt)
 
*The generation grammar, including inflectional paradigms, used to generate the reference corpus onto natural language (gen_gra_LID.txt)
 
*The generation grammar, including inflectional paradigms, used to generate the reference corpus onto natural language (gen_gra_LID.txt)
*The generation disambiguation grammar used to generate the reference corpus onto natural language (gen_dis_LID.txt)
+
*The generation disambiguation grammar, if any, used to generate the reference corpus onto natural language (gen_dis_LID.txt)
 
*The natural language output generated from the dictionary and grammars above (gen_out_LID.txt)
 
*The natural language output generated from the dictionary and grammars above (gen_out_LID.txt)
 
:'''LID''' is to be replaced by the ISO639-2 two-character code of the language (en = English, el = Greek, etc.). Accordingly, the Greek corpus is to be provided as corpus_el.txt; the Greek analysis dictionary, as ana_dic_el.txt; and so on. All the files must be shared with "martins" in EUGENE and IAN.
 
:'''LID''' is to be replaced by the ISO639-2 two-character code of the language (en = English, el = Greek, etc.). Accordingly, the Greek corpus is to be provided as corpus_el.txt; the Greek analysis dictionary, as ana_dic_el.txt; and so on. All the files must be shared with "martins" in EUGENE and IAN.
  
 
== Presentations ==
 
== Presentations ==
*[http://www.unlweb.net/resources/geneva2012/Corpus.pdf Corpus]
+
{| border="1" cellpadding="2" align=center
*[http://www.unlweb.net/resources/geneva2012/Dictionary1.pdf NL-UNL Dictionary]
+
!Title
*[http://www.unlweb.net/resources/geneva2012/Dictionary2.pdf UNL-NL Dictionary]
+
!pdf
*[http://www.unlweb.net/resources/geneva2012/Grammar.pdf Grammar]
+
!pptx
 +
|-
 +
|Corpus
 +
|[http://www.unlweb.net/resources/geneva2012/Corpus.pdf pdf]
 +
|[http://www.unlweb.net/resources/geneva2012/Corpus.pptx pptx]
 +
|-
 +
|NL-UNL Dictionary
 +
|[http://www.unlweb.net/resources/geneva2012/Dictionary1.pdf pdf]
 +
|[http://www.unlweb.net/resources/geneva2012/Dictionary1.pptx pptx]
 +
|-
 +
|UNL-NL Dictionary
 +
|[http://www.unlweb.net/resources/geneva2012/Dictionary2.pdf pdf]
 +
|[http://www.unlweb.net/resources/geneva2012/Dictionary2.pptx pptx]
 +
|-
 +
|Grammar
 +
|[http://www.unlweb.net/resources/geneva2012/Grammar.pdf pdf]
 +
|[http://www.unlweb.net/resources/geneva2012/Grammar.pptx pptx]
 +
|}
  
 
== English examples and templates ==
 
== English examples and templates ==
 
;ANALYSIS (IAN)
 
;ANALYSIS (IAN)
*[http://www.unlweb.net/resources/geneva2012/corpus_dic.txt English analysis dictionary (for the reference corpus)]
+
*Dictionaries
*English analysis grammar (to be provided soon)
+
**[http://www.unlweb.net/resources/geneva2012/ana_dic_eng.txt English analysis dictionary (for the reference corpus)]
*English disambiguation grammar (for analysis) (to be provided soon)
+
**[http://www.unlweb.net/resources/geneva2012/ana_dic.txt Default dictionary (punctuation and regular expressions)]
 +
*Grammars
 +
**[http://www.unlweb.net/resources/geneva2012/ana_gra_eng.pdf English analysis grammar (simplified)]
 +
**[http://www.unlweb.net/resources/geneva2012/ana_gra_eng.txt English analysis grammar (full)]
 +
**[http://www.unlweb.net/resources/geneva2012/ana_gra.txt Default analysis grammar]
 +
**[http://www.unlweb.net/resources/geneva2012/ana_dis_eng.txt English disambiguation grammar]
 
;GENERATION (EUGENE)
 
;GENERATION (EUGENE)
 
*English generation dictionary (to be provided soon)
 
*English generation dictionary (to be provided soon)
Line 152: Line 87:
 
*Yordanka STANCHEVA (Bulgarian)
 
*Yordanka STANCHEVA (Bulgarian)
  
== Schedule ==
+
== Program ==
 
;Feb 06th, 2012 - Monday
 
;Feb 06th, 2012 - Monday
 
:09:00-10:00 Introduction
 
:09:00-10:00 Introduction

Latest revision as of 15:52, 10 September 2012

The VIII UNL School, formerly UNL Grammar Workshop, was held in Geneva, Switzerland, from Feb 6-10, 2012.

Contents

Languages

  • Armenian
  • Bulgarian
  • Estonian
  • German
  • Greek
  • Polish
  • Portuguese
  • Romanian
  • Russian
  • Swedish

Goals

  • To build the basic modules of a NL-UNL (analysis) grammar
  • To build the basic modules of a UNL-UNL (generation) grammar

Corpus

Deliverables

DEADLINE
01 SEP 2012
ANALYSIS (IAN)
  • The manual translated version of the 500 sentences of the reference corpus (corpus_LID.txt)
  • The analysis dictionary used to analyze those 500 sentences (ana_dic_LID.txt)
  • The analysis grammar used to analyze those 500 sentences (ana_gra_LID.txt)
  • The analysis disambiguation grammar, if any, used to analyze those 500 sentences (ana_dis_LID.txt)
  • The UNL output for those 500 sentences generated from the dictionary and grammars above (ana_out_LID.txt)
GENERATION (EUGENE)
  • The generation dictionary used to generate the reference corpus onto natural language (gen_dic_LID.txt)
  • The generation grammar, including inflectional paradigms, used to generate the reference corpus onto natural language (gen_gra_LID.txt)
  • The generation disambiguation grammar, if any, used to generate the reference corpus onto natural language (gen_dis_LID.txt)
  • The natural language output generated from the dictionary and grammars above (gen_out_LID.txt)
LID is to be replaced by the ISO639-2 two-character code of the language (en = English, el = Greek, etc.). Accordingly, the Greek corpus is to be provided as corpus_el.txt; the Greek analysis dictionary, as ana_dic_el.txt; and so on. All the files must be shared with "martins" in EUGENE and IAN.

Presentations

Title pdf pptx
Corpus pdf pptx
NL-UNL Dictionary pdf pptx
UNL-NL Dictionary pdf pptx
Grammar pdf pptx

English examples and templates

ANALYSIS (IAN)
GENERATION (EUGENE)
  • English generation dictionary (to be provided soon)
  • English generation grammar (to be provided soon)
  • English disambiguation grammar (for generation) (to be provided soon)

Participants

  • Carolin ARNOLD (German)
  • Ewa CZAJKOWSKA (Polish)
  • Grega MILHARCIC (Slovenian)
  • Luisa GOUVEIA (Portuguese)
  • Martin LUTS (Estonian)
  • Mihaela ILIOAIA (Romanian)
  • Ofelia HOVHANNISYAN (Armenian)
  • Olga VARTZIOTI (Greek)
  • Polina LENKOVA (Russian)
  • Ronaldo MARTINS (UNL)
  • Sameh ALANSARY (Arabic)
  • Sara STYMNE (Swedish)
  • Yordanka STANCHEVA (Bulgarian)

Program

Feb 06th, 2012 - Monday
09:00-10:00 Introduction
10:00-12:00 I – Corpus
14:00-17:00 II – UNL-NL dictionary
Feb 07th, 2012 - Tuesday
09:00-12:00 III – Morphology (inflectional paradigms)
14:00-17:00 IV – NL dictionary
Feb 08th, 2012- Wednesday
09:00-12:00 V – UNL-NL grammar (I)
14:00-17:00 V – UNL-NL grammar (II)
Feb 09th, 2012 - Thursday
09:00-12:00 VI – NL-UNL grammar (I)
14:00-17:00 VI – NL-UNL grammar (II)
Feb 10th, 2012 - Friday
09:00-12:00 Evaluation
14:00-17:00 Discussion
Software