Changes between Version 2 and Version 3 of Taskforces/FCS/FCS-QL
- Timestamp:
- 03/30/15 08:40:48 (9 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
Taskforces/FCS/FCS-QL
v2 v3 6 6 }}} 7 7 {{{ 8 [1] query ::= main-query within-part? sort-part?8 [1] query ::= main-query within-part? 9 9 10 10 [2] main-query ::= simple-query … … 22 22 23 23 [6] within-part ::= simple-within-part 24 | complex-within-part 24 | complex-within-part /* REMOVE complex within */ 25 25 26 26 [7] simple-within-part ::= "within" simple-within-scope 27 27 28 [8] simple-within-scope::= "sentence"28 [8] simple-within-scope ::= "sentence" 29 29 | "s" 30 30 | "utterance" … … 34 34 | "turn" 35 35 | "t" 36 | " article"36 | "text" 37 37 | "session" 38 39 [9] compex-within-part ::= "within" "[" expression "]" /* TBD: allow more complex stuff? */40 41 [10] sort-part ::= /* TBD: do we want sorting */42 38 43 39 [11] expression ::= basic-expression … … 51 47 [13] operator ::= "=" /* equals */ 52 48 | "!=" /* non-equals */ 53 | "~" /* TBD: fuzzy match? */54 | "!~" /* TBD: fuzzy not? */55 49 56 50 [14] quantifier ::= "+" /* one-or-more */ … … 70 64 | "l" /* literal matching, CQP compat*/ 71 65 | "d" /* diacritic agnostic matching, CQP compat */ 72 /* TBD: more? */73 66 74 [17] regexp ::= string67 [17] regexp ::= quoted-string 75 68 76 69 [18] attribute ::= simple-attribute … … 85 78 [22] identifier-char ::= [a-zA-Z0-9\-] 86 79 87 [23] string ::= plain-string88 | quoted-string89 90 80 [24] integer ::= [0-9]+ 91 92 [25] plain-string ::= char*93 81 94 82 [26] quoted-string ::= "'" (char | ws)* "'" /* single-quotes */ 95 83 | """ (char | ws)* """ /* double-quotes */ 96 84 97 [27] char ::= <any unicode codepoint ex pluding whitespace codepoints>85 [27] char ::= <any unicode codepoint excluding whitespace codepoints> 98 86 | "\" escaped-char 99 87 … … 113 101 == Notes == 114 102 * based on Poliqarp with inspiration from others 115 * contains some "TBD"s (to be determined), e.g. do we want to add a sort-clause (or a meta-clause)?116 103 * "attribute": the annotation layer to be used, e.g. "word", "lemma", "pos" or qualified "pos:stts" the supported values for this construct are beyond the grammar and need to be defined in supplementary documents 117 104 * "simple-within-scope": possible values for scope … … 120 107 * "article" | "session": something like a whole document 121 108 * {{{[27]}}} and {{{[28]}}} "any $SOMETING codepoint" are a pain to get easily done in at least ANTLR and JavaCC. Especially in combination with {{{[29]}}} :/ 122 * regex are not defined/guarded by grammar :/ 109 * regex are not defined/guarded by this grammar :/ 110 * non-continuous rule numbers are currently intended; we've already removed some. Rules will be renumbered, when grammar is fixed.