| 6 | |
| 7 | == Exploiting ISOcat data categories == |
| 8 | |
| 9 | Currently (January 2014) the VLO relies on two separate strategies for MD selection to populate its facets: |
| 10 | |
| 11 | 1. ISOcat DCs associated with a MD field in the underlying CMDI profile |
| 12 | 2. a set of centre specific XPaths for CMDI instances to |
| 13 | * explicitly select MD fields for inclusion in a facet that was not matched by a ISOcat DC (white list) |
| 14 | * explicitly discard MD fields that were selected via their ISOcat DC (black list) |
| 15 | |
| 16 | Ideally, strategy 1 should suffice for proper MD selection for the facets. To fix the current state of the MD instances and the VLO we will have to answer the questions: |
| 17 | |
| 18 | 1. Does VLO rely on appropriate, i.e. sufficiently concrete defined ISOcat DCs? Obviously DCs like http://www.isocat.org/datcat/DC-2482 (language ID) or http://www.isocat.org/datcat/DC-2484 (language name) are semantically too vague and don't allow for the differentiation between the language a resource is written in or the language of an actor in case of transcribed recordings and so on. |
| 19 | * Solution: Only use narrowly defined DCs for VLO facets. |
| 20 | 2. Do CMDI profiles use sufficiently concrete ISOcat DCs? |
| 21 | * Task: Create an overview of the DCs actually used by the centres. |
| 22 | * Task: Evaluate the whitelist/blackclist XPaths with respect to why they are needed (to select vaguely defined DCs? to select MD fields that don't have a DC associated? anything else?) |
| 23 | * Task: Propose the adoption of container DCs for profiles that rely on vaguely defined DCs |