Changes between Version 20 and Version 21 of FCS-Specification-ScrapBook


Ignore:
Timestamp:
02/05/14 12:15:52 (10 years ago)
Author:
oschonef
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • FCS-Specification-ScrapBook

    v20 v21  
    112112
    113113 LOC-SRU12[=#REF_LOC_SRU_12]::
    114     SRU VERSION 1.2: SRU !Search/Retrieve Operation, Library of Congress,\\
     114    SRU Version 1.2: SRU !Search/Retrieve Operation, Library of Congress,\\
    115115    [http://www.loc.gov/standards/sru/sru-1-2.html]
    116116
    117117 LOC-DIAG[=#REF_LOC_DIAG]::
    118     SRU VERSION 1.2: SRU Diagnostics List, Library of Congress,\\
     118    SRU Version 1.2: SRU Diagnostics List, Library of Congress,\\
    119119    [http://www.loc.gov/standards/sru/diagnostics/diagnosticsList.html]
    120120
     
    124124The CLARIN-FCS interface specification defined two profiles, an extensible result format and a set of required operations. CLARIN-FCS is built on the SRU/CQL standard and additional functionality required for CLARIN-FCS is added through SRU/CQL's extension mechanisms.
    125125
    126 The following sections describe the profiles, how SRU/CQL is used in the context of CLARIN-FCS and the CLARIN-FCS specific extensions to SRU.
     126Generally, CLARIN-FCS Interface Specification consists of two components, a set of ''formats'' and a ''transport protocol''. The ''Endpoint'' component is a software component that acts as a bridge between the Formats, that are send by a ''Client'' using the ''Transport Protocol'', and a ''Search Engine''. The ''Search Engine'' is a custom software component, that allows searching in the language resources of a CLARIN center. The ''Endpoint'' basically implements the ''transport protocol''  and acts as an mediator between the CLRAIN-FCS speceific formats and the idiosyncrasies of ''Search Engines''. The following figure illustrates the overall architecture.
     127{{{
     128                 +---------+
     129                 |  Client |
     130                 +---------+
     131                     /|\
     132                      |
     133          -------------------------
     134         |        SRU / CQL        |
     135         | w/CLARIN-FCS extensions |
     136          -------------------------
     137                      |
     138                     \|/
     139 +-----------------------------------------+
     140 |        |      Endpoint     /|\          |
     141 |        |                    |           |
     142 |  ---------------    ------------------  |
     143 | | Translate CQL |  | Translate Result | |
     144 |  ---------------    ------------------  |
     145 |        |                    |           |
     146 |       \|/                   |           |
     147 +-----------------------------------------+
     148                     /|\
     149                      |
     150                     \|/
     151        +---------------------------+
     152        |       Search Engine       |
     153        +---------------------------+
     154}}}
     155
     156The following sections describe the CLARIN-FCS profiles and query and result formats, how SRU/CQL is used as a transport protocol in the context of CLARIN-FCS and the required CLARIN-FCS specific extensions to SRU.
    127157
    128158
    129159=== Profiles ===
    130 
    131 CLARIN-FCS supports two profiles:
     160CLARIN-FCS defines two profiles:
    132161 ''Basic profile''::
    133162   Endpoints `MUST` support ''term-only'' queries. \\
     
    151180
    152181Endpoints and Clients `MUST` support the ''basic profile''. Endpoints and Clients `MUST NOT` claim to support the ''extended profile''.
    153 
    154 
    155 === SRU/CQL ===
    156 SRU (!Search/Retrieve via URL) specifies a general communication protocol for searching and retrieving records and the CQL (Contextual Query Language) specifies a extensible query language. CLARIN-FCS is built on SRU 1.2. A subsequent specification may be built on SRU 2.0.
    157 
    158 Endpoints and Clients `MUST` implement the SRU/CQL protocol suite as defined in [#REF_SRU_Overview OASIS-SRU-Overview], [#REF_SRU_APD OASIS-SRU-APD], [#REF_CQL OASIS-CQL], [#REF_Explain SRU-Explain], [#REF_Scan SRU-Scan], especially with respect to:
    159  * Data Model,
    160  * Query Model,
    161  * Processing Model,
    162  * Result Set Model, and
    163  * Diagnostics Model   
    164 
    165 Endpoints and Clients `MUST` use the implement the APD Binding for SRU 1.2, as defined in [#REF_SRU_12 OASIS-SRU-12]. Endpoints and Clients `MAY` implement APD binding for version 1.1 or version 2.0.
    166 
    167 Endpoints and Clients `MUST` use the following namespace URIs for serializing responses:
    168  * `http://www.loc.gov/zing/srw/` for SRU response documents, and
    169  * `http://www.loc.gov/zing/srw/diagnostic/` for diagnostics within SRU response documents.
    170 CLARIN-FCS deviates from the OASIS specification [#REF_SRU_Overview OASIS-SRU-Overview] and [#REF_SRU_12 OASIS-SRU-12] to ensure backwards comparability with SRU 1.2 services as they where defined by the [#REF_LOC_SRU_12 LOC-SRU12].
    171 
    172 Endpoints or Clients `MUST` support CQL conformance ''Level 2'' (as defined in [#REF_OASIS_CQL OASIS-CQL, section 6]), i.e. be able to ''parse'' (Endpoints) or ''serialize'' (Clients) all of CQL and respond with appropriate error messages to the search/retrieve protocol interface.
    173 
    174 '''NOTE''': this does ''not imply'', that Endpoints are ''required'' support for all of CQL, but rather that they are able to ''parse'' all of CQL and generate the appropriate error message, if a query includes a feature they do not support.
    175 
    176 Endpoints `MUST` generate diagnostics according to [#REF_SRU_12 OASIS-SRU-12, Appendix C] for error conditions or to indicate unsupported features. Unfortunately, the OASIS specification does not provides a comprehensive list of diagnostics for CQL related errors. Therefore, Endpoints `MUST` use diagnostics from [#REF_LOC_DIAG LOC-DIAG, section "Diagnostics Relating to CQL"] for CQL related errors.
    177182
    178183
     
    263268
    264269
     270=== SRU/CQL ===
     271SRU (!Search/Retrieve via URL) specifies a general communication protocol for searching and retrieving records and the CQL (Contextual Query Language) specifies a extensible query language. CLARIN-FCS is built on SRU 1.2. A subsequent specification may be built on SRU 2.0.
     272
     273Endpoints and Clients `MUST` implement the SRU/CQL protocol suite as defined in [#REF_SRU_Overview OASIS-SRU-Overview], [#REF_SRU_APD OASIS-SRU-APD], [#REF_CQL OASIS-CQL], [#REF_Explain SRU-Explain], [#REF_Scan SRU-Scan], especially with respect to:
     274 * Data Model,
     275 * Query Model,
     276 * Processing Model,
     277 * Result Set Model, and
     278 * Diagnostics Model   
     279
     280Endpoints and Clients `MUST` use the implement the APD Binding for SRU 1.2, as defined in [#REF_SRU_12 OASIS-SRU-12]. Endpoints and Clients `MAY` implement APD binding for version 1.1 or version 2.0.
     281
     282Endpoints and Clients `MUST` use the following namespace URIs for serializing responses:
     283 * `http://www.loc.gov/zing/srw/` for SRU response documents, and
     284 * `http://www.loc.gov/zing/srw/diagnostic/` for diagnostics within SRU response documents.
     285CLARIN-FCS deviates from the OASIS specification [#REF_SRU_Overview OASIS-SRU-Overview] and [#REF_SRU_12 OASIS-SRU-12] to ensure backwards comparability with SRU 1.2 services as they where defined by the [#REF_LOC_SRU_12 LOC-SRU12].
     286
     287Endpoints or Clients `MUST` support CQL conformance ''Level 2'' (as defined in [#REF_OASIS_CQL OASIS-CQL, section 6]), i.e. be able to ''parse'' (Endpoints) or ''serialize'' (Clients) all of CQL and respond with appropriate error messages to the search/retrieve protocol interface.
     288
     289'''NOTE''': this does ''not imply'', that Endpoints are ''required'' support for all of CQL, but rather that they are able to ''parse'' all of CQL and generate the appropriate error message, if a query includes a feature they do not support.
     290
     291Endpoints `MUST` generate diagnostics according to [#REF_SRU_12 OASIS-SRU-12, Appendix C] for error conditions or to indicate unsupported features. Unfortunately, the OASIS specification does not provides a comprehensive list of diagnostics for CQL related errors. Therefore, Endpoints `MUST` use diagnostics from [#REF_LOC_DIAG LOC-DIAG, section "Diagnostics Relating to CQL"] for CQL related errors.
     292
     293
    265294== CLARIN-FCS to SRU/CQL binding ==
    266295