| 165 | === IMS Stuttgart === |
| 166 | |
| 167 | This is a list of the relations which we would like to represent in the CMDI records, and how we deal with them so far. |
| 168 | Due to the fact, that we were not sure about recommendations how to use the 'Resources' component, we have not applied it yet, but are of course interested in using it, so that some of the relations might also be exploited in the VLO: |
| 169 | |
| 170 | a. |
| 171 | * Relation type: tool -- modular tool component (parameter file, language model, lexicon, ...) |
| 172 | * Current approach: Using the CMDI component 'Prereqisites' (clarin.eu:cr1:c_1290431694521) to state that a tool needs an additional component. The tool components are described in their own CMDI record (profile not in public section yet). |
| 173 | * Examples: |
| 174 | * !TreeTagger (Metadata: http://hdl.handle.net/11858/00-247C-0000-0022-C698-E) --> Italian parameter file (Achim Stein) |
| 175 | * !TreeTagger (Metadata: http://hdl.handle.net/11858/00-247C-0000-0022-C698-E) --> Italian parameter file (Marco Baroni) |
| 176 | * !TreeTagger (Metadata: http://hdl.handle.net/11858/00-247C-0000-0022-C698-E) --> English parameter file |
| 177 | * Mate Tools Parser (Metadata: http://hdl.handle.net/11858/00-247C-0000-0022-C697-0) --> French model |
| 178 | * Mate Tools Parser (Metadata: http://hdl.handle.net/11858/00-247C-0000-0022-C697-0) --> English model |
| 179 | * !BitPar (Metadata http://hdl.handle.net/11858/00-247C-0000-0022-F7AF-1)--> Lexicon DE |
| 180 | * !BitPar (Metadata http://hdl.handle.net/11858/00-247C-0000-0022-F7AF-1)--> Grammar EN |
| 181 | b. |
| 182 | * Relation type: trained model -- data set on which the model was trained |
| 183 | * Current approach: Optional CMDI component '!BasedOn' as part of the profile to describe tool components (CMDI component not in public section yet). |
| 184 | * Examples: |
| 185 | * Dutch parameter file for !TreeTagger --> trained on Eindhoven corpus |
| 186 | * German grammar for !BitPar --> extracted from Tiger treebank |
| 187 | c. |
| 188 | * Relation type: versioning |
| 189 | * Current approach: Element 'Version' in CMDI component '!GeneralInfo' (clarin.eu:cr1:c_1290431694495) and where applicable a common part in the '!ResourceName' to implicitly relate different versions. |
| 190 | * No examples yet, but this will change in future. Affects all types of resources: corpora, lexicons, tools, web services, ... |
| 191 | d. |
| 192 | * Relation type: tool -- web service |
| 193 | * We keep getting mails asking for information and publications on the tools 'behind' the web services. This also affects the action item regarding the CMDI profiles for web services. |
| 194 | |
| 195 | Further notes: |
| 196 | * Making these relations explicit will also mean more maintenance effort to keep the md-records up to date, thus we should discuss where this will make sense and where it creates too much overhead. |
| 197 | * Some relationships are between 'clearly distinct' resources: 'Source' was mentioned in the Tischvorlage, others might include: |
| 198 | * 'A is based on B' e.g. when a trainable component (parameterfile, language model, ...) has been trained on a specific corpus, or a collection of technical terms has been extracted based on the frequencies of a specific corpus resource (see relation type b.) |
| 199 | * 'A can be used with B', e.g. when a tool can use an (additional) knowledge base (lexicon, language model, ...), or a corpus is encoded for a specific query engine (see relation type a.) |
| 200 | * 'A was used for the creation of B', e.g. when manual annotations were added with the help of a specific system, ... |
| 201 | At the moment some of these relationships can be expressed in the component section of the CMDI-records (e.g. 'Derivation', 'Source') but are not exploited in the VLO. |
| 202 | |
| 203 | |