Ignore:
Timestamp:
10/02/13 19:52:31 (11 years ago)
Author:
vronk
Message:

rework of Results, Definitions, appendix, added Conclusion,
smaller changes to Design, Data

File:
1 edited

Legend:

Unmodified
Added
Removed
  • SMC4LRT/chapters/Design_SMCinstance.tex

    r3638 r3665  
    1 \chapter{Mapping on instance level, CMD as LOD}
     1\chapter{Mapping on instance level,\\ CMD as LOD}
    22\label{ch:design-instance}
    33
     
    1515\end{quotation}
    1616
    17 As described in previous chapters (\ref{ch:infrastructure},\ref{ch:design_schema}), semantic interoperability is one of the main motivations for the CMD infrastructure. However, this machinery pertains mostly to the schema level, the actual values in the fields of CMD instances reman ``just strings''. This is the case even though the problem of different labels for semantically equivalent or even identical entities is even more so virulent on the instance level. While for a number of metadata fields the value domain can be enforced through schema validation, some important fields (like \concept{organization} or \concept{resource type})  have a constrained value domain that yet cannot be explicitly exhaustively enumerated. This leads to a chronically inconsistent use of labels for referring to entities (as the instance data shows, some organizations are referred to by more than 20 different labels, or spelling variants.) prompting an urgent need for better means for harmonizing the constrained-field values.
     17As described in previous chapters (\ref{ch:infra},\ref{ch:design}), semantic interoperability is one of the main motivations for the CMD infrastructure. However, the established machinery pertains mostly to the schema level, the actual values in the fields of CMD instances remain ``just strings''. This is the case even though the problem of different labels for semantically equivalent or even identical entities is even more so virulent on the instance level. While for a number of metadata fields the value domain can be enforced through schema validation, some important fields (like \concept{organization} or \concept{resource type})  have a constrained value domain that yet cannot be explicitly exhaustively enumerated. This leads to a chronically inconsistent use of labels for referring to entities (as the instance data shows, some organizations are referred to by more than 20 different labels, or spelling variants.) prompting an urgent need for better means for harmonizing the constrained-field values.
    1818
    1919One potential remedy is the use of reference datasets -- controlled vocabularies, taxonomies, ontologies and such. In fact, this is a very common approach, be it the authority files in libraries world, or domain-specific reference vocabularies maintained by practically every research community. Not as strict as schema definitions, they cannot be used for validation, but still help to harmonize the data, by offering preferred labels and identifiers for entities.
     
    312312\end{figure*}
    313313
    314 \subsubsection{Identify vocabularies  – CLAVAS}
     314\subsubsection{Identify vocabularies}
    315315
    316316\todoin{Identify related ontologies, vocabularies? - see DARIAH:CV}
     
    403403\label{semantic-search}
    404404
    405 With the new enhanced dataset, as detailed in section \ref{ch:cmd2rdf}, the groundwork is laid for the full-blown semantic search as proposed in the original goals, i.e. the possibility for ontology-driven or at least `semantic resources assisted' exploration of the dataset.
     405With the new enhanced dataset, as detailed in section \ref{sec:cmd2rdf}, the groundwork is laid for the full-blown semantic search as proposed in the original goals, i.e. the possibility for ontology-driven or at least `semantic resources assisted' exploration of the dataset.
    406406
    407407Namely to enhance it by employing ontological resources.
Note: See TracChangeset for help on using the changeset viewer.