II UNL Panel

From UNL Wiki
(Difference between revisions)
Jump to: navigation, search
(Issues)
Line 1: Line 1:
The II UNL Panel, an associated event to [http://lrec2014.lrec-conf.org/en/ LREC 2014], will be devoted to the nature and role of [[relation]]s and [[attribute]]s in the UNL.  
+
The II UNL Panel will be devoted to the nature and role of [[relation]]s and [[attribute]]s in the UNL framework.  
  
 
== Goal ==
 
== Goal ==
Line 10: Line 10:
 
*Document structure (format, encoding, schema and validation)
 
*Document structure (format, encoding, schema and validation)
  
== Issues ==  
+
== Questions ==  
The questions below illustrate some theoretical and practical issues concerning relations and attributes and have been receiving several different possible answers within the UNL framework. The main goal of II UNL Panel is to discuss which answers would be more appropriate and feasible, considering the state of the art of the theory and technology on natural language processing. We would ask participants to use them as starting points for their presentations, but we would expect them to suggest some general procedures to be adopted in similar cases.
+
 
+
 
'''Considering the commitments, assumptions and properties defined at the [[Introduction to UNL]]''', how would you represent, as a language-independent semantic graph, the following English sentences?
 
'''Considering the commitments, assumptions and properties defined at the [[Introduction to UNL]]''', how would you represent, as a language-independent semantic graph, the following English sentences?
  
;1. The book is on the table
+
#The disappointment killed Mary.
*a) place(book,table), i.e., as a general "place" relation between "book" and "table", without any reference to the idea that the book is "on" (and not "in", "above", "under" etc.) the table;
+
#The book is right under the table.
*b) on(book,table), i.e., as a specific "on" relation between "book" and "table", without any reference to the idea that "on" is actually a possible value of "place";
+
#Peter is between John and Mary.
*c) rel1(book,on)rel2(on,table), i.e., as two different relations "rel1" and "rel2" (which ones?), as if there is no direct relation between "book" and "table";
+
#John took a very long nap.
*d) place(book, table.@on), i.e., as a general "place" relation between "book" and "table" and by an attribute "@on" assigned to "table", in order to specify its role;
+
#Mary should have not come so early.
*e) place.@on(book, table), i.e., as a general "place" relation between "book" and "table"  and by an attribute "@on" assigned to the relation itself;
+
*f) other
+
:In order to answer this question, consider also the following:
+
:The resulting semantic graph must be suitable for languages that do not lexicalize the copula, i.e., where "the book is on the table" is translated as "the book on the table";
+
:The resulting semantic graph must be suitable for languages that do not lexicalize the difference between "on" and "above", i.e., where "the book is on the table" is translated in the same way as "the book is above the table";
+
:The resulting semantic graph must be suitable for languages that do not lexicalize place relations, i.e., where "the book is on the table" is translated as "the book is tableon", where "on" is a locative case marker and not an adposition
+
 
+
;2.
+
 
+
 
+
 
+
 
+
 
+
*Your representation
+
 
+
 
+
In English, some prepositions (such as "between") may have valence equal to 3 ("John is between Mary and Peter"). How should we represent that?
+
*In English, prepositions can be modified by adverbs (such as in "the book is right on the table"). How these modifications should be represented?
+
*Several languages do not use adpositions to represent place relations.
+
 
+
 
+
 
+
 
+
 
+
 
+
;1) "The book is on the table"
+
:How should we represent the relation between "book" and "table" in the sentence "the book is on the table"?
+
In a semantic network, semantic relations may have different levels of granularity. The sentence "Peter killed Mary" can be represented either as kill(Peter,Mary) or as agent(kill,Peter)patient(kill,Mary). The latter approach - which is associated to the idea of semantic case, or semantic role - is said to be more productive (i.e., generalizable) than the former one, but poses some practical problems. Consider, for instance, the case of "the book is on the table". How should we represent the relation between "book" and "table" in the sentence "the book is on the table"?
+
 
+
 
+
 
+
 
+
 
+
 
+
1) How many UW's should be recognized in the sentence below?
+
"Charles Dickens is generally regarded as the most important English novelist of the Victorian period"
+
The basic assumption of the UNL approach is that the information conveyed by natural languages can be formally and usefully represented through semantic networks composed of three different types of discrete semantic entities: UW's, relations and attributes. UW's are nodes in the UNL graph; relations are arcs between nodes; and attributes are specifiers that restrict the extension of nodes. This three-layered representation poses several problems to the UNLization as the distinction between these three entities is not always clear. Consider, for instance, the sentence above. How many UW's (either permanent or temporary) should be recognized in this sentence?
+
"Victorian period" should be represented as single UW ("Victorian period") or as two different UW's ("Victorian" and "period")?
+
The verb "to be" should be represented as a UW or as a relation between "Charles Dickens" and "the most important English novelist of the Victorian period"? (Consider also the options "was" and "has been" in the same context)
+
The preposition "of" should be represented as a UW or as a relation between "the most important novelist" and "the Victorian period"? (Consider also the options "since", "from ... on", "in" or "during" instead of "of")
+
"generally regarded as" should be represented by UW's ("generally", "regarded", "as", for instance) or as an attribute (a downtoner, which lowers the truth effect of the declaration) to be assigned to the whole proposition "Charles Dickens is the most important English novelist of the Victorian period"?
+
The adverb "most" should be represented as a UW or as a superlative marker (to be represented as an attribute to be assigned to the adjective "important"?) (Consider also "greatest English novelist" instead of "most important English novelist")
+
 
+
 
+
 
+
  
1) Is there a set of semantic relations that can be said to be shared by all human languages?
+
== Current answers ==
Semantic networks have been used in language description at least since Charles S. Peirce (Manuscript 514, 1909), and as an interlingua for machine translation since 1956 (cf. Sowa 1987). In a semantic network, information is represented as a graph structure composed of nodes (concepts) and arcs between nodes (binary semantic relations between concepts). The idea of semantic binary relations has been proposed in numerous linguistic approaches, the most famous ones being the “structural syntax”, developed by Lucien Tesnière in the 1930’s, and the “semantic case”, presented by Charles Fillmore in 1968. The set of semantic relations, however, is rather controversial, even though involving some basic common concepts, such as “agent”, “patient”, “instrument”, “place”, “time”, etc.  Consider, as a matter of example, the semantic relation between "make" and "cake" in a sentence like "John made a cake".  In different approaches, this relation has been described as "complement", "patient" or "result". Considering the nature and the role of natural language processing, which is the most productive way for representing such relations, if any? Is there a uniform set of relations that may be used to describe the semantic dependencies within any natural language?
+
The sentences above illustrate some theoretical and practical issues concerning relations and attributes that have been receiving several different possible answers within the current UNL framework. The main goal of II UNL Panel is to discuss which answers would be more appropriate and feasible, considering the state of the art of the theory and technology on natural language processing. We would ask participants to use them as starting points for their presentations, but we would expect them to suggest some general procedures to be adopted in similar cases. The current answers are the following:
  
 +
;1. The disappointment killed Mary.
 +
:*a) agt(killed,disappointment), i.e., as an agent relation between "killed" and "disappointment", because "disappointment" caused Mary to die;
 +
:*b) agt(killled,disappointment.@metaphor), i.e., as an agent relation between "killed" and "disappointment" and an attribute .@metaphor to be assigned to "disappointment" (or to "killed"? or to "agt"?), in order to indicate that a "disappointment" cannot actually "cause" the death of anyone;
 +
:*c) man(killed,disappointment), i.e., as a manner relation between "killed" and "disappointment", because a disappointment is rather a manner or a state in which Mary died;
 +
:*d) ???(killed,disappointment), i.e., as a different relation between "killed" and "disappointment" (in this case, which relation?);
 +
:*e) killed(disappointment, Mary), i.e., not as a semantic case, but as a content relation (in this case, how to handle languages where the verb "to kill" is lexicalized as "to cause to die"?).
 +
:*f) other (please specify).
  
 +
;2. The book is right under the table.
 +
:*a) place(book,table), i.e., as a general relation "place"  between "book" and "table", without any reference to the idea that the book is "under" (and not "in", "above", "near" etc.);
 +
:*b) under(book,table), i.e., as a specific relation "under" between "book" and "table", without any reference to the idea that "under" is actually a possible value of "place";
 +
:*c) relation1(book,under)relation2(under,table), i.e., as two different relations, as if there is no direct relation between "book" and "table" (in this case, which would be labels of relation1 and relation2?);
 +
:*d) place(book,table.@under), i.e., as a general "place" relation between "book" and "table" and by an attribute "@under" assigned to "table", in order to specify its role;
 +
:*e) place.@under(book,table), i.e., as a general "place" relation between "book" and "table"  and by an attribute "@under" assigned to the relation itself;
 +
:*f) other (because no alternative above represents the fact that the book is "right under" the table)<ref>Consider, please, that the resulting semantic graph must be suitable for languages that do not lexicalize place relations, i.e., where "the book is right under the table" is translated as "the book is tableunder", where "under" is a locative case marker and not an adposition.</ref>
  
 +
;3. Peter is between John and Mary.
 +
:*a) place(Peter,:01)between:01(John,Mary), i.e., as a general relation "place" between "Peter" and the relation "between(John, Mary)";
 +
:*b) place(Peter,:01.@between)and:01(John,Mary), i.e., as a general relation "place" between the relation "and(John,Mary)" with the attribute .@between;
 +
:*c) between(Peter,John,Mary), i.e., as one single ternary relation "between" between "Peter", "John" and "Mary" (in this case, consider the implications, to the model, of relations with three arguments);
 +
:*d) relation1(Peter,John)relation2(Peter,Mary), i.e., as two (different?) relations between "Peter" and "John", and "Peter" and "Mary", because both "John" and "Mary" are spatial referents for "John" (in this case, please specify which relations);
 +
:*e) place(Peter,John.@attribute1)place(Peter,Mary.@attribute2), i.e., as a general relation "place" between "Peter and "John", and "Peter" and "Mary", modified by the corresponding attributes (in this case, please specify which attributes);
 +
:*f) other (please specify).
  
How should we represent the relation between "make" and "cake" in "John made a cake"?
+
;4. John took a very long nap.
  
2)
+
;5. Mary should have not come so early.
  
 
== Notes ==
 
== Notes ==
 
<references />
 
<references />

Revision as of 15:24, 31 July 2014

The II UNL Panel will be devoted to the nature and role of relations and attributes in the UNL framework.

Contents

Goal

The main purpose of the UNL Panel is to collect the opinion of specialists, from inside and outside the UNL Community, about technical issues of the UNL, as to prepare the ground for an in-depth revision of the current specifications.

Rationale

Originally proposed more than 15 years ago, the UNL Specs have not escaped from the action of time and have not incorporated yet several recent advances in the domain of natural language processing. Additionally, there has been a claim for better standardization practices in the UNL framework, especially after the results of the large-scale development inside the UNLweb. In order to organize this discussion, the UNDL Foundation divided the subjects into three chapters, to be addressed in three different meetings:

  • Universal Words (the set, notation and properties of UWs), which have been already addressed at the I UNL Panel (COLING 2012), and whose results are available at MARTINS, R. (ed). (2013). Lexical Issues of UNL: Universal Networking Language 2012 Panel. Cambridge: Cambridge Scholars Publishing.
  • Relations and Attributes (the set, notation and properties of relations and attributes), which is the object of this II UNL Panel; and
  • Document structure (format, encoding, schema and validation)

Questions

Considering the commitments, assumptions and properties defined at the Introduction to UNL, how would you represent, as a language-independent semantic graph, the following English sentences?

  1. The disappointment killed Mary.
  2. The book is right under the table.
  3. Peter is between John and Mary.
  4. John took a very long nap.
  5. Mary should have not come so early.

Current answers

The sentences above illustrate some theoretical and practical issues concerning relations and attributes that have been receiving several different possible answers within the current UNL framework. The main goal of II UNL Panel is to discuss which answers would be more appropriate and feasible, considering the state of the art of the theory and technology on natural language processing. We would ask participants to use them as starting points for their presentations, but we would expect them to suggest some general procedures to be adopted in similar cases. The current answers are the following:

1. The disappointment killed Mary.
  • a) agt(killed,disappointment), i.e., as an agent relation between "killed" and "disappointment", because "disappointment" caused Mary to die;
  • b) agt(killled,disappointment.@metaphor), i.e., as an agent relation between "killed" and "disappointment" and an attribute .@metaphor to be assigned to "disappointment" (or to "killed"? or to "agt"?), in order to indicate that a "disappointment" cannot actually "cause" the death of anyone;
  • c) man(killed,disappointment), i.e., as a manner relation between "killed" and "disappointment", because a disappointment is rather a manner or a state in which Mary died;
  • d) ???(killed,disappointment), i.e., as a different relation between "killed" and "disappointment" (in this case, which relation?);
  • e) killed(disappointment, Mary), i.e., not as a semantic case, but as a content relation (in this case, how to handle languages where the verb "to kill" is lexicalized as "to cause to die"?).
  • f) other (please specify).
2. The book is right under the table.
  • a) place(book,table), i.e., as a general relation "place" between "book" and "table", without any reference to the idea that the book is "under" (and not "in", "above", "near" etc.);
  • b) under(book,table), i.e., as a specific relation "under" between "book" and "table", without any reference to the idea that "under" is actually a possible value of "place";
  • c) relation1(book,under)relation2(under,table), i.e., as two different relations, as if there is no direct relation between "book" and "table" (in this case, which would be labels of relation1 and relation2?);
  • d) place(book,table.@under), i.e., as a general "place" relation between "book" and "table" and by an attribute "@under" assigned to "table", in order to specify its role;
  • e) place.@under(book,table), i.e., as a general "place" relation between "book" and "table" and by an attribute "@under" assigned to the relation itself;
  • f) other (because no alternative above represents the fact that the book is "right under" the table)[1]
3. Peter is between John and Mary.
  • a) place(Peter,:01)between:01(John,Mary), i.e., as a general relation "place" between "Peter" and the relation "between(John, Mary)";
  • b) place(Peter,:01.@between)and:01(John,Mary), i.e., as a general relation "place" between the relation "and(John,Mary)" with the attribute .@between;
  • c) between(Peter,John,Mary), i.e., as one single ternary relation "between" between "Peter", "John" and "Mary" (in this case, consider the implications, to the model, of relations with three arguments);
  • d) relation1(Peter,John)relation2(Peter,Mary), i.e., as two (different?) relations between "Peter" and "John", and "Peter" and "Mary", because both "John" and "Mary" are spatial referents for "John" (in this case, please specify which relations);
  • e) place(Peter,John.@attribute1)place(Peter,Mary.@attribute2), i.e., as a general relation "place" between "Peter and "John", and "Peter" and "Mary", modified by the corresponding attributes (in this case, please specify which attributes);
  • f) other (please specify).
4. John took a very long nap.
5. Mary should have not come so early.

Notes

  1. Consider, please, that the resulting semantic graph must be suitable for languages that do not lexicalize place relations, i.e., where "the book is right under the table" is translated as "the book is tableunder", where "under" is a locative case marker and not an adposition.
Software