Changes between Version 9 and Version 10 of FCS-Specification-ScrapBook
- Timestamp: 02/03/14 15:40:34
FCS-Specification-ScrapBook
--- v9
+++ v10
@@ -26,6 +26,4 @@
 == Introduction ==
 The main goal of CLARIN federated content search (CLARIN-FCS) is to introduce an ''interface specification'' that decouples the ''search engine'' functionality from its ''exploitation'', i.e. user interfaces and third-party applications, and allows services to access search engines in a uniform way.
-
-The CLARIN-FCS interface specification is built upon the SRU/CQL standard and additional functionality required for CLARIN-FCS is added through SRU/CQL's extension mechanisms.
 
 
@@ -118,4 +116,37 @@
 
 
+== CLARIN-FCS Interface Specification ==
+
+The CLARIN-FCS interface specification defines two profiles, an extensible result format and a set of required operations. CLARIN-FCS is built on the SRU/CQL standard; additional functionality required for CLARIN-FCS is added through SRU/CQL's extension mechanisms.
+
+The following sections describe the profiles, how SRU/CQL is used in the context of CLARIN-FCS, and the CLARIN-FCS specific extensions to SRU.
+
+
+=== Profiles ===
+
+CLARIN-FCS supports two profiles:
+ ''Basic profile''::
+   Endpoints `MUST` support ''term-only'' queries. \\
+   Endpoints `SHOULD` support queries that combine ''terms'' with boolean operators (''AND'' and ''OR''), including subqueries. Endpoints `MAY` support the ''NOT'' or ''PROX'' operators. If an endpoint does not support such a query, it `MUST` return an error message using the appropriate SRU diagnostic. \\
+   Examples of valid CQL queries:
+   {{{
+cat
+"cat"
+cat AND dog
+"grumpy cat"
+"grumpy cat" AND dog
+"grumpy cat" OR "lazy dog"
+cat AND (mouse OR "lazy dog")
+   }}}
+   The endpoint `MUST` perform the query on the annotation tier that makes the most sense for the user, i.e. the textual content of a text corpus resource or the orthographic transcription of a spoken language corpus. Endpoints are `RECOMMENDED` to perform the query case-sensitively.\\
+   Endpoints `MUST NOT` silently accept queries that include CQL features beyond ''term-only'' queries and ''terms'' combined with boolean operators, e.g. queries involving context sets.
+
+ ''Extended profile''::
+   This profile will support more sophisticated queries, such as selecting annotation tiers, expanding tags, or mapping data categories. \\
+   '''NOTE''': the extended profile is not yet defined and will be part of a future CLARIN-FCS specification.
+
+Endpoints and Clients `MUST` support the ''basic profile''. Endpoints and Clients `MUST NOT` claim to support the ''extended profile''.
+
+
 === SRU/CQL ===
 SRU (!Search/Retrieve via URL) specifies a general communication protocol for searching and retrieving records, and CQL (Contextual Query Language) specifies an extensible query language. CLARIN-FCS is built on SRU 1.2; subsequent specifications may build on SRU 2.0.
@@ -135,14 +166,7 @@
 CLARIN-FCS deviates from the OASIS specification [#REF_SRU_Overview OASIS-SRU-Overview] and [#REF_SRU_12 OASIS-SRU-12] to ensure backwards compatibility with SRU 1.2 services as they were defined by the [#REF_LOC_SRU_12 LOC-SRU12].
 
-Endpoints or Clients `MUST` support CQL conformance ''Level 2'' (as defined in [#REF_OASIS_CQL OASIA-CQL, section 6]), i.e. abto ''parse'' (Endpoints) or ''serialize'' (Clients) all of CQL and respond with appropriate error messages to the search/retrieve protocol interface.
+Endpoints or Clients `MUST` support CQL conformance ''Level 2'' (as defined in [#REF_OASIS_CQL OASIS-CQL, section 6]), i.e. be able to ''parse'' (Endpoints) or ''serialize'' (Clients) all of CQL and respond with appropriate error messages to the search/retrieve protocol interface.
 
-'''NOTE''': this does '''not imply''', that endpoints are ''required'' support for all of CQL, but rather that they are able to ''parse'' all of CQL and generate the appropriate error message, if a query includes a feature they do not support.
-
-
-== CLARIN-FCS Interface Specification ==
-
-=== Profiles ===
-
-Yada yada yada ...
+'''NOTE''': this does ''not imply'' that Endpoints are ''required'' to support all of CQL, but rather that they are able to ''parse'' all of CQL and generate the appropriate error message if a query includes a feature they do not support.
 
 
@@ -152,5 +176,5 @@
 
 
-=== Operations ===
+=== Operations ===#identify
 
 Yada yada yada ...
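
As a rough illustration of the ''basic profile'' rules introduced in this change, the following Python sketch checks whether a query stays within ''term-only'' queries and ''terms'' combined with ''AND''/''OR'', and reports a diagnostic otherwise instead of silently accepting the query. Everything named here is hypothetical: `check_basic_profile`, `Diagnostic` and the tokenizer are illustrative only, the diagnostic numbers (10, 37, 48) are an assumed mapping because the text only asks for "the appropriate SRU diagnostic", and the simplified grammar is not a full ''Level 2'' CQL parser. The sketch also assumes an endpoint that opts out of the optional ''NOT'' and ''PROX'' operators.

```python
import re

# Assumed mapping to SRU diagnostic numbers (not prescribed by the text above).
QUERY_SYNTAX_ERROR = 10
UNSUPPORTED_BOOLEAN = 37
FEATURE_UNSUPPORTED = 48

# One token: a quoted string, a parenthesis, or a bare word.
TOKEN = re.compile(r'\s*(?:("(?:[^"\\]|\\.)*")|([()])|([^\s()"]+))')


class Diagnostic(Exception):
    """Hypothetical carrier for a diagnostic number and a detail string."""

    def __init__(self, code, detail):
        super().__init__(f"diagnostic {code}: {detail}")
        self.code = code
        self.detail = detail


def tokenize(query):
    query = query.strip()
    tokens, pos = [], 0
    while pos < len(query):
        m = TOKEN.match(query, pos)
        if m is None:  # e.g. an unterminated quoted string
            raise Diagnostic(QUERY_SYNTAX_ERROR, query[pos:])
        quoted, paren, word = m.groups()
        if quoted:
            tokens.append(("term", quoted))
        elif paren:
            tokens.append((paren, paren))
        elif word.upper() in ("AND", "OR", "NOT", "PROX"):
            tokens.append(("bool", word.upper()))
        elif re.search(r"[=<>/]", word):
            tokens.append(("other", word))  # relation or modifier syntax
        else:
            tokens.append(("term", word))
        pos = m.end()
    return tokens


def parse_clause(tokens, pos):
    if pos >= len(tokens):
        raise Diagnostic(QUERY_SYNTAX_ERROR, "unexpected end of query")
    kind, text = tokens[pos]
    if kind == "(":
        pos = parse_expr(tokens, pos + 1)
        if pos >= len(tokens) or tokens[pos][0] != ")":
            raise Diagnostic(QUERY_SYNTAX_ERROR, "missing ')'")
        return pos + 1
    if kind == "term":
        # A term directly followed by another term or a relation is read as
        # an index/relation clause, which lies outside the basic profile.
        if pos + 1 < len(tokens) and tokens[pos + 1][0] in ("term", "other"):
            raise Diagnostic(FEATURE_UNSUPPORTED, tokens[pos + 1][1])
        return pos + 1
    if kind == "other":
        raise Diagnostic(FEATURE_UNSUPPORTED, text)
    raise Diagnostic(QUERY_SYNTAX_ERROR, text)


def parse_expr(tokens, pos):
    pos = parse_clause(tokens, pos)
    while pos < len(tokens) and tokens[pos][0] == "bool":
        if tokens[pos][1] in ("NOT", "PROX"):
            raise Diagnostic(UNSUPPORTED_BOOLEAN, tokens[pos][1])
        pos = parse_clause(tokens, pos + 1)
    return pos


def check_basic_profile(query):
    """Return None if the query fits the basic profile, else a Diagnostic."""
    try:
        tokens = tokenize(query)
        pos = parse_expr(tokens, 0)
        if pos != len(tokens):
            raise Diagnostic(QUERY_SYNTAX_ERROR, tokens[pos][1])
    except Diagnostic as diagnostic:
        return diagnostic
    return None
```

Under these assumptions, `check_basic_profile('cat AND (mouse OR "lazy dog")')` returns `None`, while a query such as `dc.title = cat` yields a diagnostic object the endpoint could translate into an SRU error response.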