Changes between Version 29 and Version 30 of Taskforces/FCS/FCS-Specification-Draft


Timestamp:
11/02/15 14:20:38 (9 years ago)
Author:
Oliver Schonefeld
Comment:

--

  • Taskforces/FCS/FCS-Specification-Draft

    v29 v30  
    2020    A module or service to dispatch queries to repositories and collect results.
    2121
     22 [=#REF_Annotation_Layer Annotation Layer]::
     23    An annotation layer is the sum of possible annotations for a language resource, such as part of speech or orthographic transcription. It is usually related to a given annotation task or topic. Within the scope of this specification, the term is used as a synonym for annotation tier.
     24
    2225 CLARIN-FCS, FCS::
    2326    CLARIN federated content search, an interface specification to allow searching within resource content of repositories.
     
    4649 Interface Specification::
    4750    Common harmonized interface and suite of protocols that repositories need to implement.
     51
     52 Layer::
     53    See [#REF_Annotation_Layer ''Annotation Layer'']
    4854
    4955 PID::
     
    419425
    420426==== Layers #layers
    421 ||=Identifier =||=Annotation Tier Description                                =||=Syntax                          =||=Examples (without quotes)           =||
     427||=Identifier =||=Annotation Layer Description                                =||=Syntax                          =||=Examples (without quotes)           =||
    422428|| `token`     || Appropriate tokenisation of resource, i.e. words            || ''String''                       || "Dog", "cat", "walking", "better"               ||
    423429|| `lemma`     || Lemmatisation of tokens                                     || ''String''                       || "good", "walk", "dog"             ||
     
    426432|| `norm`      || Orthographic normalization of (mostly) spoken resources     || ''String''                       || "dog", "cat", "walking", "best"              ||
    427433|| `phonetic`  || Phonetic transcription                                      || [#REF_SAMPA SAMPA]               || "'du:", "'vi:-d6 'ha:-b@n"           ||
    428 || `text`      || Annotation tier that is used in [#basicSearch Basic Search] || ''String''                       || "Dog", "cat", "walking", "better"                ||
     434|| `text`      || Annotation layer that is used in [#basicSearch Basic Search] || ''String''                       || "Dog", "cat", "walking", "better"                ||
    429435
    430436The column Syntax describes the inventory of symbols that a Client `MUST` use with the corresponding annotation layer; the value ''String'' denotes that symbols are arbitrary Unicode strings, i.e. no fixed inventory of symbols is defined. An Endpoint `SHOULD` return an appropriate error if a Client uses an invalid value.
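
The rule above can be illustrated with a minimal sketch of how an Endpoint might check a value against a layer's symbol inventory. This is not part of the specification: the function name `check_value`, the `INVENTORIES` mapping, the error strings, and the `demo-fixed` layer with its symbols are all hypothetical. Only the ''String'' layers visible in the table are listed; `phonetic` is left out because the SAMPA inventory is not reproduced here.

```python
# Layer identifiers taken from the table above. None stands for the Syntax
# value ''String'': any Unicode string is acceptable.
# "demo-fixed" and its symbols are hypothetical; they only illustrate
# a layer with a closed inventory.
INVENTORIES = {
    "token": None,
    "lemma": None,
    "norm": None,
    "text": None,
    "demo-fixed": frozenset({"A", "B", "C"}),
}


def check_value(layer, value):
    """Return None if the value is acceptable, otherwise an error message."""
    if layer not in INVENTORIES:
        return f"unknown layer: {layer!r}"
    inventory = INVENTORIES[layer]
    if inventory is None:
        # ''String'' layer: no fixed inventory, so nothing to reject.
        return None
    if value not in inventory:
        return f"invalid value {value!r} for layer {layer!r}"
    return None
```

How such an error is reported to the Client is up to the Endpoint; the strings returned here are placeholders.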
     
    568574||=Layer ''lemma''    =|| _  || dat  || zijn || de  || enig  || echt  || hoop || voor || ons  || mens   ||
    569575||=Layer ''phonetic'' =|| t@ || dAz  || dAz  || d@  || en@G@ || Ext@  || hop  || for  || Ons  || mEns@  ||
     576
     577 - alignable (and referable)
     578
     579{{{#!xml
     580<Advanced>
     581    <Segments unit="items">
     582        <Segment id="s1"  start="1"  end="1"
     583            ref="http://hdl.handle.net/4711/123456789?urlappend=%3Fplay=0:173"/>
     584        <Segment id="s2"  start="3"  end="4"
     585            ref="http://hdl.handle.net/4711/123456789?urlappend=%3Fplay=173:304"/>
     586        <Segment id="s3"  start="6"  end="7"
     587            ref="http://hdl.handle.net/4711/123456789?urlappend=%3Fplay=173:304"/>
     588        <Segment id="s4"  start="9"  end="10"
     589            ref="http://hdl.handle.net/4711/123456789?urlappend=%3Fplay=304:480"/>
     590        <Segment id="s5"  start="12" end="16"
     591            ref="http://hdl.handle.net/4711/123456789?urlappend=%3Fplay=480:1119"/>
     592        <Segment id="s6"  start="18" end="22"
     593            ref="http://hdl.handle.net/4711/123456789?urlappend=%3Fplay=1339:1901"/>
     594        <Segment id="s7"  start="24" end="27"
     595            ref="http://hdl.handle.net/4711/123456789?urlappend=%3Fplay=1901:2427"/>
     596        <Segment id="s8"  start="29" end="32"
     597            ref="http://hdl.handle.net/4711/123456789?urlappend=%3Fplay=3084:3493"/>
     598        <Segment id="s9"  start="34" end="36"
     599            ref="http://hdl.handle.net/4711/123456789?urlappend=%3Fplay=3493:3754"/>
     600        <Segment id="s10" start="38" end="43"
     601            ref="http://hdl.handle.net/4711/123456789?urlappend=%3Fplay=3754:4274"/>
     602    </Segments>
     603
     604    <Layers>
     605        <Layer id="http://endpoint.example.org/Layers/orth1">
     606            <Span ref="s1">t</Span>
     607            <Span ref="s2">da</Span>
     608            <Span ref="s3">'s</Span>
     609            <Span ref="s4">de</Span>
     610            <Span ref="s5">enige</Span>
     611            <Span ref="s6">echte</Span>
     612            <Span ref="s7">hoop</Span>
     613            <Span ref="s8">voor</Span>
     614            <Span ref="s9">ons</Span>
     615            <Span ref="s10">mensen</Span>
     616        </Layer>
     617
     618        <Layer id="http://endpoint.example.org/Layers/pos1">
     619            <Span ref="s1"  alt-value="SPEC(afgebr)">X</Span>
     620            <Span ref="s2"  alt-value="VNW(aanw,pron,stan,vol,3o,ev)">PRON</Span>
     621            <Span ref="s3"  alt-value="WW(pv,tgw,ev)">VERB</Span>
     622            <Span ref="s4"  alt-value="LID(bep,stan,rest)">DET</Span>
     623            <Span ref="s5"  alt-value="VNW(onbep,det,stan,prenom,met-e,rest)">DET</Span>
     624            <Span ref="s6"  alt-value="ADJ(prenom,basis,met-e,stan)">ADJ</Span>
     625            <Span ref="s7"  alt-value="N(soort,ev,basis,zijd,stan)">NOUN</Span>
     626            <Span ref="s8"  alt-value="VZ(init)">ADP</Span>
     627            <Span ref="s9"  alt-value="VNW(pr,pron,obl,vol,1,mv)">PRON</Span>
     628            <Span ref="s10" alt-value="N(soort,mv,basis)">NOUN</Span>
     629        </Layer>
     630
     631        <Layer id="http://endpoint.example.org/Layers/lemma1">
     632            <Span ref="s1">_</Span>
     633            <Span ref="s2">dat</Span>
     634            <Span ref="s3">zijn</Span>
     635            <Span ref="s4">de</Span>
     636            <Span ref="s5">enig</Span>
     637            <Span ref="s6" highlight="h1">echt</Span>
     638            <Span ref="s7" highlight="h1">hoop</Span>
     639            <Span ref="s8">voor</Span>
     640            <Span ref="s9">ons</Span>
     641            <Span ref="s10">mens</Span>
     642        </Layer>
     643
     644        <Layer id="http://endpoint.example.org/Layers/phon">
     645            <Span ref="s1">t@</Span>
     646            <Span ref="s2" highlight="h2">dAz</Span>
     647            <Span ref="s3">dAz</Span>
     648            <Span ref="s4">d@</Span>
     649            <Span ref="s5">en@G@</Span>
     650            <Span ref="s6">Ext@</Span>
     651            <Span ref="s7">hop</Span>
     652            <Span ref="s8">for</Span>
     653            <Span ref="s9">Ons</Span>
     654            <Span ref="s10">mEns@</Span>
     655        </Layer>
     656    </Layers>
     657</Advanced>
     658}}}
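
To make the alignment in the example above more concrete, the following sketch shows one way a Client could resolve each `<Span>` to the offsets of the `<Segment>` it references, and then collect the highlighted spans. It assumes the element and attribute names exactly as they appear in the example, without XML namespaces; the function names `align_layers` and `highlighted` are hypothetical, and `SAMPLE` is an abbreviated copy of the example with the `ref` attributes on segments left out.

```python
import xml.etree.ElementTree as ET

# Abbreviated copy of the example above: three segments, two layers.
SAMPLE = """\
<Advanced>
    <Segments unit="items">
        <Segment id="s5" start="12" end="16"/>
        <Segment id="s6" start="18" end="22"/>
        <Segment id="s7" start="24" end="27"/>
    </Segments>
    <Layers>
        <Layer id="http://endpoint.example.org/Layers/orth1">
            <Span ref="s5">enige</Span>
            <Span ref="s6">echte</Span>
            <Span ref="s7">hoop</Span>
        </Layer>
        <Layer id="http://endpoint.example.org/Layers/lemma1">
            <Span ref="s5">enig</Span>
            <Span ref="s6" highlight="h1">echt</Span>
            <Span ref="s7" highlight="h1">hoop</Span>
        </Layer>
    </Layers>
</Advanced>
"""


def align_layers(xml_text):
    """Map each layer id to a list of (start, end, value, highlight) tuples."""
    root = ET.fromstring(xml_text)
    # Index segments by id so that spans can be resolved through their ref.
    segments = {
        seg.get("id"): (int(seg.get("start")), int(seg.get("end")))
        for seg in root.iterfind("./Segments/Segment")
    }
    aligned = {}
    for layer in root.iterfind("./Layers/Layer"):
        rows = []
        for span in layer.iterfind("Span"):
            start, end = segments[span.get("ref")]
            rows.append((start, end, span.text, span.get("highlight")))
        aligned[layer.get("id")] = rows
    return aligned


def highlighted(aligned):
    """Return (layer id, highlight id, start, end, value) for highlighted spans."""
    hits = []
    for layer_id, rows in aligned.items():
        for start, end, value, highlight in rows:
            if highlight is not None:
                hits.append((layer_id, highlight, start, end, value))
    return hits
```

Because every layer refers to the same segment ids, spans from different layers that share a `ref` cover the same stretch of the resource, which is what makes the layers alignable.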
    570659
    571660=== Versioning and Extensions