Changes between Version 7 and Version 8 of Taskforces/FCS/FCS-Specification-Draft


Ignore:
Timestamp:
10/21/15 12:51:21 (9 years ago)
Author:
Oliver Schonefeld
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • Taskforces/FCS/FCS-Specification-Draft

    v7 v8  
    123123    SRU Version 1.2: SRU Diagnostics List, Library of Congress,\\
    124124    [http://www.loc.gov/standards/sru/diagnostics/diagnosticsList.html]
     125
     126 UD-POS[=#REF_UD_POS]::
     127    Universal Dependencies, Universal POS tags, \\
     128    [https://universaldependencies.github.io/docs/u/pos/index.html]
     129
     130 SAMPA[=#REF_SAMPA]::
     131    Dafydd Gibbon, Inge Mertins, Roger Moore (Eds.): Handbook of Multimodal and Spoken Language Systems. Resources, Terminology and Product Evaluation, Kluwer Academic Publishers, Boston MA, 2000, ISBN 0-7923-7904-7
    125132
    126133 CLARIN-FCS-!DataViews[=#REF_FCS_DataViews]::
     
    220227In the ''Searching'' step the Client performs the actual search request to a to previously [#Discovery discovered] Endpoint.
    221228
    222 === Basic Search
     229=== Basic Search #basicSearch
    223230The ''Basic Search'' capability provides simple full-text search. Queries in Basic Search `MUST` be performed in the ''Contextual Query Language'' ([#REF_CQL OASIS-CQL]). The Endpoint `MUST` support ''term-only''  queries.  The Endpoint `SHOULD` support ''terms'' combined with boolean operator queries (''AND'' and ''OR''), including sub-queries. An Endpoint `MAY` also support ''NOT'' or ''PROX'' operator queries. If an Endpoint does not support a query, i.e. the used operators are not supported by the Endpoint, it `MUST` return an appropriate error message using the appropriate SRU diagnostic ([#REF_LOC_DIAG LOC-DIAG]).
    224231
     
    242249About advanced search
    243250==== Layers
    244 About available layers
     251||= Identifier =||= Annotation Tier Description                                           =||= Syntax =||= Examples (without quotes) =||
     252|| `token`      || Appropriate tokenisation of resource, i.e. words                        || String       || "Dog", "cat", "walked" ||
     253|| `lemma`      || Lemmatisation of tokens                                                 || String   || "good", "walking", "dog" ||
     254|| `pos`        || Part-of-Speech annotations                                              || [#REF_UD_POS Universal POS tags] || "NOUN", "VERB", "ADJ" ||
     255|| `orth`       || Orthographic transcription of (mostly) spoken resources                 || String || "dug", "cat", "wolking" ||
     256|| `norm`       || Orthographic normalization of (mostly) spoken resources                 || String || "dog", "cat", "walking" ||
     257|| `phonetic`   || Phonetic transcription                              || [#REF_SAMPA Speech Assessment Methods Phonetic Alphabet (SAM-PA)] || "'du:", "'vi:-d6 'ha:-b@n" ||
     258|| `ne`         || Named entities  || String || "Utrecht", "Poland", "Felix the Cat" ||
     259|| `text`       || Annotation tier that is used in [#basicSearch Basic Search]             || String || "Dog", "cat" "walked"                ||
     260
    245261==== FCS-QL
    246262About available layers