|
|
Line 1: |
Line 1: |
− | | + | #REDIRECT [[Grammar Specs]] |
− | == Basic concepts ==
| + | |
− | ;[[Node]]
| + | |
− | :A node is the most elementary unit in the graph. It is the result of the [[tokenization]] process, and corresponds to the notion of "lexical item". At the surface level, a natural language sentence is considered a list of nodes, and a UNL graph a set of relations between nodes.
| + | |
− | ;[[Relation]]
| + | |
− | :In order to form a natural language sentence or a UNL graph, nodes are inter-related by relations. In the UNL framework, there can be three different types of relations: linear, syntactic or semantic.
| + | |
− | ;[[Hyper-Node]]
| + | |
− | :A hyper-node is a sub-graph, i.e., a node containing relations between nodes.
| + | |
− | ;[[Hyper-Relation]]
| + | |
− | :A hyper-relation is a relation between relations.
| + | |
− | | + | |
− | | + | |
− | | + | |
− | | + | |
− | | + | |
− | == Basic symbols ==
| + | |
− | | + | |
− | {| border="1" cellpadding="2" align=center
| + | |
− | |+Basic symbols used in UNL grammar rules
| + | |
− | !Symbol
| + | |
− | !Definition
| + | |
− | !Example
| + | |
− | |-
| + | |
− | |align=center|<nowiki>^</nowiki>
| + | |
− | |not
| + | |
− | |^a = not a
| + | |
− | |-
| + | |
− | |align=center|{ | }
| + | |
− | |or
| + | |
− | |<nowiki>{a|b}</nowiki> = a or b
| + | |
− | |-
| + | |
− | |align=center|%
| + | |
− | |index for nodes, attributes and values
| + | |
− | |%x (see [[#Indexes|below]])
| + | |
− | |-
| + | |
− | |align=center|#
| + | |
− | |index for sub-NLWs
| + | |
− | |#01 (see [[#Indexes|below]])
| + | |
− | |-
| + | |
− | |align=center|=
| + | |
− | |attribute-value assignment
| + | |
− | |POS=NOU
| + | |
− | |-
| + | |
− | |align=center|!
| + | |
− | |rule trigger
| + | |
− | |!PLR
| + | |
− | |-
| + | |
− | |align=center|&
| + | |
− | |merge operator
| + | |
− | |%x&%y
| + | |
− | |-
| + | |
− | |align=center|?
| + | |
− | |dictionary lookup operator
| + | |
− | |?[a]
| + | |
− | |-
| + | |
− | |align=center|“ “
| + | |
− | |string
| + | |
− | |"went"
| + | |
− | |-
| + | |
− | |align=center|[ ]
| + | |
− | |natural language entry (headword)
| + | |
− | |[go]
| + | |
− | |-
| + | |
− | |align=center|[[ ]]
| + | |
− | |UW
| + | |
− | |[[to go(icl>to move)]]
| + | |
− | |-
| + | |
− | |align=center|( )
| + | |
− | |node
| + | |
− | |(a)
| + | |
− | |-
| + | |
− | |align=center|//
| + | |
− | |regular expression
| + | |
− | |/a{2,3}/ = aa,aaa
| + | |
− | |}
| + | |
− | | + | |
− | ;The differences between "", [] and [[]]
| + | |
− | :Double quotes are always used to represent strings: "a" will match only the string "a"
| + | |
− | :Simple square brackets are always used to represent natural language entries (headwords) in the dictionary: [a] will match the node associated to the entry [a] retrieved from the dictionary, no matter its current realization, which may be affected by other rules (the original [a] may have been replaced, for instance, by "b", but will still be indexed to the entry [a])
| + | |
− | :Double square brackets are always used to represent UWs: <nowiki>[[a]]</nowiki> will match the node associated to the UW <nowiki>[[a]]</nowiki>
| + | |
− | | + | |
− | ;Predefined values (assigned by default)
| + | |
− | :SCOPE - Scope
| + | |
− | :SHEAD - Sentence head (the beginning of a sentence)
| + | |
− | :STAIL - Sentence tail (the end of a sentence)
| + | |
− | :CHEAD - Scope head (the beginning of a scope)
| + | |
− | :CTAIL - Scope tail (the end of a scope)
| + | |
− | :TEMP - Temporary entry (entry not found in the dictionary)
| + | |
− | :DIGIT - Any sequence of digits (i.e.: 0,1,2,3,4,5,6,7,8,9)
| + | |