Changes between Version 2 and Version 3 of FCS-specification
- Timestamp:
- 04/17/12 11:57:29 (12 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
FCS-specification
v2 v3 15 15 16 16 In general each CLARIN-center participating will provide at least the following services: 17 •Provide one or more resources18 •Support Content-search within those resources19 •Return search-hits in the agreed-upon format20 •Support query-expansion if possible21 •Support the selection of a sub-part of the offered resources to perform content-search on that sup-part22 •Provide support for the sub-part selection by providing CMDI metadata at the same, reasonable , granularity17 * Provide one or more resources 18 * Support Content-search within those resources 19 * Return search-hits in the agreed-upon format 20 * Support query-expansion if possible 21 * Support the selection of a sub-part of the offered resources to perform content-search on that sup-part 22 * Provide support for the sub-part selection by providing CMDI metadata at the same, reasonable , granularity 23 23 24 24 = Global Design Thoughts / The Aggregator = … … 74 74 This basic request serves to announce server's capabilities and should allow the client to configure itself automatically. The explain response should, ideally, provide a list of ISOcatted indexes as possible search indexes. If there is no ISOcat equivalent the CCS-context* set is to be used. We provide a telling example (as seen within the context of the explain response as defined on the SRU/CQL website): 75 75 76 {{{ 76 {{{#!xml 77 77 <indexInfo> 78 78 <set identifier="isocat.org/datcat" name="isocat"/> … … 105 105 === Scan === 106 106 107 We foresee the scan operation as a way of signaling to the calling program/user/aggregator the available resources available for searching at the endpoint. This in contrast to the definition in SRU, where scan is a way to browse a list of keywords. The value of the scanClause parameter should be cmd.collection.107 We foresee the scan operation as a way of signaling to the calling program/user/aggregator the available resources available for searching at the endpoint. This in contrast to the definition in SRU, where scan is a way to browse a list of keywords. The value of the scanClause parameter should be '''fcs.resource'''. 108 108 109 109 To this the endpoint will return a list of terms, which are searchable collections. Their identifiers can than be used to restrict the search by passing one (or more) as parameters in x-cmd-context in the searchRetrieve operation. 110 110 111 111 Again, we provide a telling example: 112 {{{ 112 {{{#!xml 113 113 <sru:scanResponse xmlns:sru="http://www.loc.gov/zing/srw/" > 114 114 <sru:version>1.2</sru:version> 115 115 <sru:terms> 116 116 <sru:term> 117 <sru:value> MPI86949#</sru:value>117 <sru:value>hdl:1839/00-0000-0000-0001-53A5-2</sru:value> 118 118 <sru:numberOfRecords>12098</sru:numberOfRecords> 119 119 <sru:displayTerm>The CGN-Corpus (Corpus Gesproken Nederlands)</sru:displayTerm> 120 120 </sru:term> 121 121 <sru:term> 122 <sru:value> MPI1296694#</sru:value>122 <sru:value>http://corpus1.mpi.nl/qfs1/media-archive/mirrored_corpora/childes/Corpusstructure/childes.imdi</sru:value> 123 123 <sru:numberOfRecords>42</sru:numberOfRecords> 124 124 <sru:displayTerm>Childes corpus</sru:displayTerm> … … 127 127 <sru:echoedScanRequest> 128 128 <sru:version>1.2</sru:version> 129 <sru:scanClause> cmd.collections</sru:scanClause>129 <sru:scanClause>fcs.resource</sru:scanClause> 130 130 <sru:responsePosition></sru:responsePosition> 131 131 <sru:maximumTerms>42</sru:maximumTerms> … … 134 134 }}} 135 135 136 Note that the values in the sru:value elements should be valid PID. These PIDs are ideally also available from within the matching CMDI metadata file . (see also below under “Restricting the search”. 136 Note that the values in the sru:value elements should be valid [http://www.clarin.eu/faq/3460 MdSelfLink]. These MdSelfLinks should also be available from within the matching CMDI metadata file (via a reference in the Header section - see also below under "Restricting the search"). 137 138 Additionally it is possible (but not obligatory) to perform extra Scan operations to retrieve subcollections, as in a [http://en.wikipedia.org/wiki/Tree_traversal tree traversal] algorithm. 139 140 E.g. to find out the subcollections of the CGN-Corpus in the example above one would perform the following scan operation: http://clarin_srucql_endpoint?operation=Scan&version=1.2&scanClause=fcs.resource=hdl:1839/00-0000-0000-0001-53A5-2 141 142 {{{#!xml 143 <sru:scanResponse xmlns:sru="http://www.loc.gov/zing/srw/" > 144 <sru:version>1.2</sru:version> 145 <sru:terms> 146 <sru:term> 147 <sru:value>hdl:1839/00-0000-0000-0003-467E-9</sru:value> 148 <sru:numberOfRecords>300</sru:numberOfRecords> 149 <sru:displayTerm>Annotation types</sru:displayTerm> 150 </sru:term> 151 <sru:term> 152 <sru:value>hdl:1839/00-0000-0000-0003-4682-F</sru:value> 153 <sru:numberOfRecords>400</sru:numberOfRecords> 154 <sru:displayTerm>Components</sru:displayTerm> 155 </sru:term> 156 <sru:term> 157 <sru:value>hdl:1839/00-0000-0000-0003-4692-D</sru:value> 158 <sru:numberOfRecords>350</sru:numberOfRecords> 159 <sru:displayTerm>Regions</sru:displayTerm> 160 </sru:term> 161 </sru:terms> 162 <sru:echoedScanRequest> 163 <sru:version>1.2</sru:version> 164 <sru:scanClause>fcs.resource=hdl:1839/00-0000-0000-0001-53A5-2</sru:scanClause> 165 <sru:responsePosition></sru:responsePosition> 166 <sru:maximumTerms>42</sru:maximumTerms> 167 </sru:echoedScanRequest> 168 </sru:scanResponse> 169 }}} 170 137 171 138 172 === SearchRetrieve === … … 154 188 There are several dataviews agreed upon. Each dataView will have an attribute “type”, which has as value the type of dataView contained. It is possible to, in the future, add different dataviews if required. It is mandatody to support the KWIC dataview (as this type is fairly straightforward to show as a list of results). 155 189 Our KWIC dataView looks as follows: 156 {{{ 190 {{{#!xml 157 191 <ccs:DataView type="kwic"> 158 192 <c type="left">Some text with </c>