| 1 | {{{ |
| 2 | These are the xPaths of our CMDI elements and the facette to be mapped. If no xPath is provided, we explain very briefly why. One can find also comments preceeded by # |
| 3 | |
| 4 | Facettes are taken after checking automatic mapping of our CMDI profile with Menzo's tool. |
| 5 | |
| 6 | http://lux13.mpi.nl/isocat/clarin/vlo/mapping/check?prof=clarin.eu:cr1:p_1288172614026 |
| 7 | |
| 8 | Facet: id |
| 9 | /CMD/Header/MdSelfLink |
| 10 | |
| 11 | Facet: collection |
| 12 | /CMD/Header/MdCollectionDisplayName |
| 13 | |
| 14 | Facet: projectName |
| 15 | # We had that annotated as creator http://dublincore.org/documents/2012/06/14/dcmi-terms/?v=terms#terms-creator |
| 16 | # We also use creator element for the institution where the resource was produced. As it is, dissambiguation has to be done manually to provide a correct mapping |
| 17 | |
| 18 | Facet: name |
| 19 | /CMD/Components/OLAC-DcmiTerms/title |
| 20 | |
| 21 | Facet: year |
| 22 | /CMD/Components/OLAC-DcmiTerms/date |
| 23 | # We provide sometimes instead of a exact date, a range. |
| 24 | |
| 25 | Facet: continent |
| 26 | # we don't have this information annotated in our metadata |
| 27 | # maybe it can be inferred from coverage information |
| 28 | |
| 29 | Facet: country |
| 30 | # We use coverage http://dublincore.org/documents/2012/06/14/dcmi-terms/?v=terms#terms-coverage |
| 31 | # But under coverage, we currently annotate geographical and temporal information. Therefore, disambiguation has to be done by postprocessing (matching TGN entries). |
| 32 | # See also: https://trac.clarin.eu/ticket/388 |
| 33 | |
| 34 | Facet: language |
| 35 | /CMD/Components/OLAC-DcmiTerms/language |
| 36 | |
| 37 | Facet: languages |
| 38 | /CMD/Components/OLAC-DcmiTerms/language |
| 39 | # What is the difference with "language"? |
| 40 | |
| 41 | Facet: organisation |
| 42 | /CMD/Components/OLAC-DcmiTerms/publisher |
| 43 | # http://dublincore.org/documents/2012/06/14/dcmi-terms/?v=terms#terms-publisher |
| 44 | # According to the ISOcat categories, organisation can be "The organization that was leading the creation project or that is responsible for accessing the resource and the contact person is affiliated with.". Do we need some kind of distinction here, or is it OK as it is? |
| 45 | |
| 46 | Facet: genre |
| 47 | # We don't have this information in our CMDI metadata. What does happen if more than a genre/register is included in a corpus? Can we have more than a genre element in the metadata? This would be the case of the CroCo corpus. Genre is a bit tricky notion. Wouldn't it be better to split this into different aspects describing the communicative situation within the text was produced? Like modality, channel... |
| 48 | |
| 49 | Facet: subject |
| 50 | /CMD/Components/OLAC-DcmiTerms/subject |
| 51 | # We have several subject elements, but in the facets, only one per record is showed. |
| 52 | |
| 53 | Facet: description |
| 54 | /CMD/Components/OLAC-DcmiTerms/description |
| 55 | # We have descriptions both in English and German. Shall we always provide descriptions in English? Can the VLO show only one version? |
| 56 | |
| 57 | Facet: resourceType |
| 58 | /CMD/Components/OLAC-DcmiTerms/type |
| 59 | # We have several instances of type for a record, like "collection" and "corpus" although only one is showed. |
| 60 | |
| 61 | Facet: nationalProject |
| 62 | # We don't have this explicitly annotated, but the information already appears properly mapped. |
| 63 | |
| 64 | Facet: text |
| 65 | # What does it mean? |
| 66 | |
| 67 | Facet: _componentProfile |
| 68 | /CMD/Header/MdProfile |
| 69 | |
| 70 | Facet: tag |
| 71 | # We don't use it |
| 72 | }}} |