Changes between Version 4 and Version 5 of VLO-Taskforce/RecommendationsForFacets
- Timestamp:
- 11/03/15 11:49:23 (9 years ago)
VLO-Taskforce/RecommendationsForFacets
v4 v5
39 39 Definition:: The country of origin of the source material of the resource, i.e. not the country in which the resource was created, but where e.g. original texts were written or speech recordings made.
40 40 Tooltip:: The country of origin of the source material of the resource
41 Recommended Data Categories:: http://www.isocat.org/datcat/DC-2532: location country; → Nota bene: This data category has a broader definition than the VLO facet `Country`. Decisive for the interpretation by the VLO is the (stricter) facet definition.
42 Problematic Data Categories: http://www.isocat.org/datcat/DC-3792: country name, http://www.isocat.org/datcat/DC-2092: country coding → Both these data categories are not sufficiently concrete themselves for further interpretation within the VLO. If used, it is necessary that they are embedded in a context (of other data categories) which narrows down the possible readings of these categories to the one given in the facet definition.
41 Recommended Data Categories:: http://hdl.handle.net/11459/CCR_C-2532_d004b0a6-fd1d-3ca3-abf1-1e6aeb3e37b2: location country; → Nota bene: This data category has a broader definition than the VLO facet `Country`. Decisive for the interpretation by the VLO is the (stricter) facet definition.
42 Problematic Data Categories: http://hdl.handle.net/11459/CCR_C-3792_68c770a4-d58c-46dd-d429-5609ce5f81c3: country name, http://hdl.handle.net/11459/CCR_C-2092_36cd7ca8-e412-9f29-7ea7-4a3ba4ba2c91: country coding → Both these data categories are not sufficiently concrete themselves for further interpretation within the VLO. If used, it is necessary that they are embedded in a context (of other data categories) which narrows down the possible readings of these categories to the one given in the facet definition.
43 43 Recommended Vocabulary:: Codes or Names according to ISO 3166-1 (Alpha-2, Alpha-3, or Numerical); cf. http://en.wikipedia.org/wiki/ISO_3166-1, http://www.geonames.org/countries/.
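As a rough illustration of the vocabulary recommendation for the `Country` facet, the following Python sketch classifies a value by its shape only: two letters, three letters, three digits, or anything else (treated as a name). It does not look the value up in the actual ISO 3166-1 code list, and the helper name `classify_country_value` is hypothetical, not part of the VLO.

```python
import re

# Shape-only patterns for the three ISO 3166-1 code forms. This is an
# assumption-laden sketch: a real check would also consult the official code list.
_PATTERNS = {
    "alpha-2": re.compile(r"[A-Za-z]{2}"),
    "alpha-3": re.compile(r"[A-Za-z]{3}"),
    "numeric": re.compile(r"[0-9]{3}"),
}


def classify_country_value(value: str) -> str:
    """Return 'alpha-2', 'alpha-3', 'numeric', or 'name' (hypothetical helper)."""
    text = value.strip()
    for label, pattern in _PATTERNS.items():
        if pattern.fullmatch(text):
            return label
    # Anything that does not look like a code is treated as a country name.
    return "name"
```

A harvester could use such a shape check to decide which lookup table to consult before mapping the value onto the facet.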
44 44
… …
53 53 Definition:: The mime types of the files in the resource or consumed/produced by the tool.
54 54 Tooltip:: The mime types used in the resource or by the tool
55 Recommended Data Categories:: http://www.isocat.org/datcat/DC-2571: mime type
55 Recommended Data Categories:: http://hdl.handle.net/11459/CCR_C-2571_2be2e583-e5af-34c2-3673-93359ec1f7df: mime type
56 56 Recommended Vocabulary:: The Template expressions under: http://www.iana.org/assignments/media-types/media-types.xhtml
57 57 Open Issues:: Maybe create a more tight vocabulary for linguistic resources based on https://docs.google.com/spreadsheets/d/1Tjmp_sEZDHIqnFAU1erx2VtzyYdjbKdM7RQGNIUbWhs/edit#gid=884722196 ?
… …
60 60 Definition:: The conventionalized discourse or text type of the content of the resource, consistently applied within the collection.
61 61 Tooltip:: The genre of the content of the resource
62 Recommended Data Categories:: http://www.isocat.org/datcat/DC-2470: genre, http://www.isocat.org/datcat/DC-3899: subGenre
62 Recommended Data Categories:: http://hdl.handle.net/11459/CCR_C-2470_d191f2b2-6339-f031-b534-70d526b28357: genre, http://hdl.handle.net/11459/CCR_C-3899_c6c608e7-cb2e-1832-09ff-aee36e1f2ed4: subGenre
63 63 Recommendations on the Vocabulary:: The usage of a somehow controlled vocabulary is recommended which is homogeneous in itself, sufficiently documented, linked to via some reference within the respective element.
64 64
… …
66 66 Definition:: Keywords containing relevant information on the resource or tool not stated in other VLO metadata facets, consistently applied within the collection.
67 67 Tooltip:: Keywords describing the resource or tool
68 Recommended Data Categories:: http://www.isocat.org/datcat/DC-5436: metadata tag
68 Recommended Data Categories:: http://hdl.handle.net/11459/CCR_C-5436_6ab57c2c-5f8d-3561-6db6-d75da23d2637: metadata tag
69 69 Recommendations on the Vocabulary:: The usage of a somehow controlled vocabulary is recommended which is homogeneous in itself, sufficiently documented,
70 70 linked to via some reference within the respective element.
… …
73 73 Definition:: The object language relevant for the resource or tool, i.e. the language of the source material of a resource, the object language of a language description, or the language supported by a linguistic tool.
74 74 Tooltip:: The object language relevant for the resource or tool
75 Recommended Data Categories:: http://www.isocat.org/rest/dc/2482: languageID, http://www.isocat.org/rest/dc/2484: languageName → N.B.: If possible, the languageName-category should only be used in combination with a language ID corresponding to ISO 639-3 (see: Recommendations on the Vocabulary). http://www.isocat.org/rest/dc/5361: langUsage with http://www.isocat.org/rest/dc/5358: language → N.B.: The langUsage and language categories are modeled analogous to the corresponding TEI Header elements. They should only be used in combination with one another.
75 Recommended Data Categories:: http://hdl.handle.net/11459/CCR_C-2482_08eded24-4086-7e3f-88e5-e0807fb01e17: languageID, http://hdl.handle.net/11459/CCR_C-2484_669684e7-cb9e-ea96-59cb-a25fe89b9b9d: languageName → N.B.: If possible, the languageName-category should only be used in combination with a language ID corresponding to ISO 639-3 (see: Recommendations on the Vocabulary).
http://hdl.handle.net/11459/CCR_C-5361_ba085ec1-9746-52bf-8cc1-3c300ce16eb8: langUsage with http://hdl.handle.net/11459/CCR_C-5358_3cd089fe-ad03-6181-b20c-635ea41ed818: language → N.B.: The langUsage and language categories are modeled analogous to the corresponding TEI Header elements. They should only be used in combination with one another.
76 76 Recommendations on the Vocabulary:: Language Codes according to ISO 639-3 (Languages) or ISO 639-5 (Language Families); ISO 639-3 `und` (undetermined) for resources which cannot be assigned one exact language (e.g. language independent tools).
77 77
… …
79 79 Definition:: A rough description of the conditions under which the resource or tool can be used.
80 80 Tooltip:: The usage conditions for the resource or tool
81 Recommended Data Categories:: http://www.isocat.org/datcat/DC-2453: i.e. availability, such as “free; free for academic use; restricted use; request required; user licence required; registration required; unknown”; http://www.isocat.org/datcat/DC-6846: i.e. rights, meaning “Any rights information for this resource.” (hence: very broad definition); http://www.isocat.org/datcat/DC-5439: i.e. license type
81 Recommended Data Categories:: http://hdl.handle.net/11459/CCR_C-2453_1f0c3ea5-7966-ae11-d3c6-448424d4e6e8: i.e. availability, such as “free; free for academic use; restricted use; request required; user licence required; registration required; unknown”; http://www.isocat.org/datcat/DC-6846: i.e. rights, meaning “Any rights information for this resource.” (hence: very broad definition); http://hdl.handle.net/11459/CCR_C-5439_98bb103d-476a-7f62-54b4-bf9de24d2229: i.e. license type
82 82 Recommended Vocabulary:: '''free''' (The resource is available for re-use under a free license, e.g. CC, LGPL. There may still be usage constraints within the scope of the respective licenses, e.g. concerning attribution or conditions of further sharing).
'''free for academic use''' (The resource is available for re-use in an academic context.) '''restricted''' (The resource is not available for free re-use. Restrictions may concern the availability or functionality of the resource or its parts. Restrictions might be loosened by purchase of a license.) '''upon-request''' (The resource is freely available upon a user’s request. The terms of usage will be negotiated based on the intended usage scenarios.)
83 83 Open Issues:: Test implementation available under: http://aspra11.informatik.uni-leipzig.de:8080/vlo/search?0 This facet should be accompanied by a display facet `License`.
… …
86 86 Definition:: The status in the life cycle of the resource or tool.
87 87 Tooltip:: The status in the life cycle of the resource or tool
88 Recommended Data Categories:: (not discussed yet!) http://www.isocat.org/datcat/DC-3818: life cycle status, such as: “.”
88 Recommended Data Categories:: (not discussed yet!) http://hdl.handle.net/11459/CCR_C-3818_8c4aec73-1654-7565-9575-c4a17425ee29: life cycle status, such as: “.”
89 89 Recommended Vocabulary:: As far as possible use the vocabulary proposed in DC-3818, i.e.: '''planned, development, released, production, withdrawn, retired, superseded, archived'''
90 90
… …
92 92 Definition:: The channel by which the signs in the content of the resource were transmitted or the modality for which a tool is intended, e.g. to recognize speech, gestures or entities of a text.
93 93 Tooltip:: The modality of the content of the resource or intended for the tool
94 Recommended Data Categories:: http://www.isocat.org/datcat/DC-2490: modalities
95 Recommended Vocabulary:: (based on the examples under http://www.isocat.org/datcat/DC-2490 and the data within the VLO’s modality facet)
94 Recommended Data Categories:: http://hdl.handle.net/11459/CCR_C-2490_44bc38a3-1799-4149-c791-40ac0176f0ff: modalities
95 Recommended Vocabulary:: (based on the examples under http://hdl.handle.net/11459/CCR_C-2490_44bc38a3-1799-4149-c791-40ac0176f0ff and the data within the VLO’s modality facet)
96 96
97 97 ||= Type =||= Subtype =||=Examples and/or notes||
… …
128 128 Definition:: The name of the organisation currently responsible for the resource or tool, i.e. to be contacted with any questions or requests regarding the metadata or access to the resource/tool.
129 129 Tooltip:: The organisation currently responsible for the resource or tool
130 Recommended Data Categories:: www.isocat.org/datcat/DC-2459: organization
131 Problematic Data Categories:: www.isocat.org/datcat/DC-2979: Organisation → Underspecified, i.e. this DC doesn’t specify the purpose of the organisation named here in the context of the resource.
132 Eliminated/Ignored Data Categories:: www.isocat.org/datcat/DC-6134: publisher (from TEI), http://purl.org/dc/terms/publisher: publisher (Dublin Core) → Both underspecified, i.e. these DCs specify neither the entity named here (organisation, person, etc.) nor the purpose of the entity in the context of the resource.
130 Recommended Data Categories:: http://hdl.handle.net/11459/CCR_C-2459_fc4e74d6-84de-c8cd-1ae8-2c2be5ee90b1: organization
131 Problematic Data Categories:: http://hdl.handle.net/11459/CCR_C-2979_8030473e-bbcb-6b87-3fd2-90554429ec50: Organisation → Underspecified, i.e. this DC doesn’t specify the purpose of the organisation named here in the context of the resource.
132 Eliminated/Ignored Data Categories:: http://hdl.handle.net/11459/CCR_C-6134_72c22724-2615-fd70-2eff-8cd3cb59e91d: publisher (from TEI), http://purl.org/dc/terms/publisher: publisher (Dublin Core) → Both underspecified, i.e. these DCs specify neither the entity named here (organisation, person, etc.) nor the purpose of the entity in the context of the resource.
133 133 Recommendations on the Vocabulary:: (1) There should be an English reading provided for each institution name which will be represented within the VLO. The English name should be marked as such by usage of a language-defining attribute and an ISO 639-3 language code (e.g. `@xml:lang="eng"`). (2) There should only be one variant of an institution name used within all metadata records provided by this respective institution.
134 134
… …
136 136 Definition:: The name of the projects originally involved in the creation of the resource or tool. These projects may no longer exist and are usually not the ones to be contacted regarding the resource/tool.
137 137 Tooltip:: The project within which the resource was created
138 Recommended Data Categories:: http://www.isocat.org/datcat/DC-2536: project name, http://www.isocat.org/datcat/DC-2537: project title
138 Recommended Data Categories:: http://hdl.handle.net/11459/CCR_C-2536_13fc5f10-c14a-1f64-a669-32736f6d3ef5: project name, http://hdl.handle.net/11459/CCR_C-2537_fa206273-223a-f4fa-dde3-ba59b965701f: project title
139 139 Recommendations on the Vocabulary:: TODO
140 140
… …
142 142 Definition:: The type of the resource or tool (e.g. corpus, lexicon, grammar, tool, …)
143 143 Tooltip:: The type of the resource or tool (e.g.
corpus, lexicon, grammar, tool, …)
144 Recommended Data Categories:: http://www.isocat.org/datcat/DC-3806: resource class, http://www.isocat.org/datcat/DC-5424: type → based on the TEI Header definition of type
144 Recommended Data Categories:: http://hdl.handle.net/11459/CCR_C-3806_e55e9ed6-b099-c21d-a634-3c7f4d22a215: resource class, http://hdl.handle.net/11459/CCR_C-5424_3200a38b-344e-41de-e539-f71f80c38df8: type → based on the TEI Header definition of type
145 145 Problematic Data Categories:: http://purl.org/dc/terms/type, http://purl.org/dc/elements/1.1/type → This DC is underspecified since “genre” is included in the definition as well. If used, it is necessary that it is embedded in a context (of other data categories) disambiguating the possible readings of this category.
146 Recommended Vocabulary:: (based on the examples under http://www.isocat.org/datcat/DC-3806 and the data within the VLO’s Resource Type facet)
146 Recommended Vocabulary:: (based on the examples under http://hdl.handle.net/11459/CCR_C-3806_e55e9ed6-b099-c21d-a634-3c7f4d22a215 and the data within the VLO’s Resource Type facet)
147 147
148 148 ||= Value =||Example||
… …
181 181 Definition:: The subject or topic of the content of the resource, consistently applied within the collection.
182 182 Tooltip:: The subject or topic of the content of the resource
183 Recommended Data Categories:: http://purl.org/dc/terms/subject: subject, http://purl.org/dc/elements/1.1/subject: subject, http://www.isocat.org/datcat/DC-6147: domain of use, http://www.isocat.org/datcat/DC-5316: classification code → This DC is designed analogous to the TEI-Header element classCode. It is thus underspecified for the subject facet.
Hence, when used the classification scheme has to be determined and the usage of this element according to the definition of the subject facet should be specified by the context somehow.
183 Recommended Data Categories:: http://purl.org/dc/terms/subject: subject, http://purl.org/dc/elements/1.1/subject: subject, http://hdl.handle.net/11459/CCR_C-6147_ebed915e-f911-f128-cddc-466aa41c9c73: domain of use, http://hdl.handle.net/11459/CCR_C-5316_2c6244b4-4f10-5e8e-49b6-26fbf7004791: classification code → This DC is designed analogous to the TEI-Header element classCode. It is thus underspecified for the subject facet. Hence, when used the classification scheme has to be determined and the usage of this element according to the definition of the subject facet should be specified by the context somehow.
184 184 Recommended Vocabulary:: The usage of a somehow controlled vocabulary is recommended which is homogeneous in itself, and preferably sufficiently documented.
185 185
… …
188 188 Definition:: The temporal coverage of the source material of the resource, i.e. not the time within which the resource was created, but when e.g. original texts were written or speech recordings made.
189 189 Tooltip:: The temporal coverage of the source material of the resource
190 Possible Data Categories:: http://www.isocat.org/datcat/DC-3664: Time coverage, http://www.isocat.org/datcat/DC-3654: Start range → This data category has to be used together with DC-3655, http://www.isocat.org/datcat/DC-3655: End range → This data category has to be used together with DC-3654
191 Problematic Data Categories:: http://www.isocat.org/datcat/DC-4343: interval → Underspecified (“a : a space of time between events or states”): could be filled with values like “200 years”, “a decade”, etc.; http://www.isocat.org/datcat/DC-5742: End time → incomplete; corresponding start time category missing; http://www.isocat.org/datcat/DC-2502: time coverage → Definition (“The time period that the content of a resource is about.”) does not suit the focus of this facet (cf. 17.1)
192 Proposed vocabulary:: Open Date Range format www.ukoln.ac.uk/metadata/dcmi/date-dccd-odrf for http://www.isocat.org/datcat/DC-3664; W3C DateTime http://www.w3.org/TR/NOTE-datetime for http://www.isocat.org/datcat/DC-3654 and http://www.isocat.org/datcat/DC-3655
190 Possible Data Categories:: http://hdl.handle.net/11459/CCR_C-3664_eb600f47-5123-efbe-251b-d952c65fc847: Time coverage, http://hdl.handle.net/11459/CCR_C-3654_f1608e88-95e6-4233-5d21-5312e76de32d: Start range → This data category has to be used together with DC-3655, http://hdl.handle.net/11459/CCR_C-3655_bc4c2656-2946-0be9-49f0-021a811e531b: End range → This data category has to be used together with DC-3654
191 Problematic Data Categories:: http://hdl.handle.net/11459/CCR_C-4343_0d6e80e7-6f6c-5497-eac6-29b95a5fa9ec: interval → Underspecified (“a : a space of time between events or states”): could be filled with values like “200 years”, “a decade”, etc.; http://hdl.handle.net/11459/CCR_C-5742_66aaccdd-f1d0-a6a6-a4fb-efa704e06d8b: End time → incomplete; corresponding start time category missing;
http://hdl.handle.net/11459/CCR_C-2502_747eb0cd-03e9-cffb-34cc-d0c8c77e4c5a: time coverage → Definition (“The time period that the content of a resource is about.”) does not suit the focus of this facet (cf. 17.1)
192 Proposed vocabulary:: Open Date Range format www.ukoln.ac.uk/metadata/dcmi/date-dccd-odrf for http://hdl.handle.net/11459/CCR_C-3664_eb600f47-5123-efbe-251b-d952c65fc847; W3C DateTime http://www.w3.org/TR/NOTE-datetime for http://hdl.handle.net/11459/CCR_C-3654_f1608e88-95e6-4233-5d21-5312e76de32d and http://hdl.handle.net/11459/CCR_C-3655_bc4c2656-2946-0be9-49f0-021a811e531b
193 193
194 194
… …
212 212 Definition:: The name of the license or a very brief description of the licensing conditions under which the resource or tool can be used.
213 213 Tooltip:: The licensing conditions for the resource or tool
214 Recommended Data Categories:: http://www.isocat.org/datcat/DC-2453: i.e. availability, such as “free; free for academic use; restricted use; request required; user licence required; registration required; unknown”; http://www.isocat.org/datcat/DC-6846: i.e. rights, meaning “Any rights information for this resource.” (hence: very broad definition)
214 Recommended Data Categories:: http://hdl.handle.net/11459/CCR_C-2453_1f0c3ea5-7966-ae11-d3c6-448424d4e6e8: i.e. availability, such as “free; free for academic use; restricted use; request required; user licence required; registration required; unknown”; http://www.isocat.org/datcat/DC-6846: i.e. rights, meaning “Any rights information for this resource.” (hence: very broad definition)
215 215 Recommendations on the Vocabulary:: XXX
216 216 General Recommendations:: In case of more than one license per resource (e.g. a resource containing several parts which are published under different licenses) there may be multiple selections of licenses within one metadata record.
… …
222 222 Definition:: The person or organization holding the full rights for the resource or tool.
223 223 Tooltip:: The rights holder of the resource or tool
224 Recommended Data Categories:: http://www.isocat.org/datcat/DC-6709, http://purl.org/dc/terms/rightsHolder, www.isocat.org/datcat/DC-2956?
224 Recommended Data Categories:: http://hdl.handle.net/11459/CCR_C-6709_cb3572ed-ffd3-04f1-c145-b9c1f26bfc82, http://purl.org/dc/terms/rightsHolder, http://hdl.handle.net/11459/CCR_C-2956_519a4aab-2f76-0fd3-090e-f0d6b81a7dbb ?
225 225 Recommendations on the Vocabulary:: XXX
226 226
… …
231 231 Definition:: Information on the participant (actor/author) that produced the source material of the resource
232 232 Tooltip:: Actor or author information
233 Recommended Data Categories:: http://www.isocat.org/datcat/DC-2550 (actorAge/participantAge), http://www.isocat.org/datcat/DC-2484 and/or http://www.isocat.org/datcat/DC-2482 (actorLanguages) - if within DC-4146 (Container DC for Actor) or maybe within other Actor components, http://www.isocat.org/datcat/DC-2560 (actorSex/participantSex), http://www.isocat.org/datcat/DC-4578 (actorCountry, the birth country)
233 Recommended Data Categories:: http://hdl.handle.net/11459/CCR_C-2550_cdb6b956-9997-0923-68ef-09de017f24ef (actorAge/participantAge), http://hdl.handle.net/11459/CCR_C-2484_669684e7-cb9e-ea96-59cb-a25fe89b9b9d and/or http://hdl.handle.net/11459/CCR_C-4160_192be757-0d8f-f4fe-b10b-d3d50de92482 (actorLanguages) - if within http://hdl.handle.net/11459/CCR_C-4146_5ccc45c8-d729-c180-2bf1-fccc56dde24d (Container DC for Actor) or maybe within other Actor components, http://hdl.handle.net/11459/CCR_C-2560_ebb466e1-2b6b-6701-4e64-94618b4b455b (actorSex/participantSex), http://hdl.handle.net/11459/CCR_C-4578_c4d97a82-4ba4-d5ff-a9f9-fbd43f121e33 (actorCountry, the birth country)
234 234 Recommendations on the Vocabulary:: XXX
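This revision replaces ISOcat references with CCR handles throughout the page. The following is a minimal sketch, assuming only the URL shapes that appear on this page (`isocat.org/datcat/DC-<n>`, `isocat.org/rest/dc/<n>`, and `hdl.handle.net/11459/CCR_C-<n>_<uuid>`), of how the numeric data category ID could be extracted to check that an old and a new reference denote the same category. The helper name `datcat_id` is hypothetical.

```python
import re
from typing import Optional

# Patterns derived from the URL shapes seen on this page; other shapes are
# not handled (assumption of this sketch).
_ID_PATTERNS = [
    re.compile(r"isocat\.org/datcat/DC-(\d+)"),
    re.compile(r"isocat\.org/rest/dc/(\d+)"),
    re.compile(r"hdl\.handle\.net/11459/CCR_C-(\d+)_"),
]


def datcat_id(url: str) -> Optional[int]:
    """Return the numeric data category ID in a reference, or None (hypothetical helper)."""
    for pattern in _ID_PATTERNS:
        match = pattern.search(url)
        if match:
            return int(match.group(1))
    return None
```

Comparing the IDs of paired old and new references would confirm most replacements as pure URL migrations, while flagging the actorLanguages entry, where DC-2482 was replaced by CCR_C-4160.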