Changeset 3665 for SMC4LRT/chapters/Data.tex
Timestamp: 10/02/13 19:52:31 (11 years ago)
File: 1 edited
--- SMC4LRT/chapters/Data.tex (r3638)
+++ SMC4LRT/chapters/Data.tex (r3665)
@@ -11,9 +11,11 @@
 \label{def:CMD}
 
-The \emph{Component Metadata Framework} (CMD) is the data model of the CLARIN metadata infrastructure. (See \ref{CMDI} for information about the infrastructure. The XML-schema of CMD -- the general-component-schema -- is featured in appendix \ref{lst:general-component-schema}.)
+The \emph{Component Metadata Framework} (CMD) is the data model of the CLARIN Component Metadata Infrastructure. (See \ref{def:CMDI} for information about the infrastructure. The XML-schema defining CMD -- the \xne{general-component-schema} -- is featured in appendix \ref{lst:cmd-schema}.)
 CMD is used to define the so-called \var{profiles} being constructed out of reusable \var{components} -- collections of metadata fields. The components can contain other components and they can be reused in multiple profiles. Profile itself is just a special kind of a component (a sub class), with some additional administrative information.
 The actual core provision for semantic interoperability is the requirement, that each CMD element (i.e. metadata field) refers ``via a PID to exactly one data category\footnote{persistently referenceable concept definition} (cf. \ref{def:DCR}), thus
 indicating unambiguously how the content of the field in a metadata description should be interpreted'' \cite{Broeder+2010}.
 
+This approach of integrating prerequisites for semantic interoperability directly into the process of metadata creation is fundamentally different from the traditional methods of schema matching that try to establish pairwise alignments between already existing schemas -- be it algorithm-based or by means of explicit manually defined crosswalks\cite{Shvaiko2005}.
+
 While the primary registry for data categories used in CMD is the \xne{ISOcat} Data Category Registry (cf. \ref{def:DCR}), other authoritative sources are accepted (so-called ``trusted registries''), especially the set of terms maintained by the Dublin Core Metadata Initiative \cite{DCMI:2005}.
 
@@ -32,5 +34,7 @@
 \caption{The development of defined profiles and DCs over time}
 \label{table:dev_profiles}
-\begin{tabular}{ l | r | r | r | r }
+% \begin{tabular}{ l | r | r | r | r }
+\begin{tabular}{ l r r r r }
+
 \hline
 date & 2011-01 & 2012-06 & 2013-01 & 2013-06 \\
@@ -51,8 +55,8 @@
 
 
-\subsection{Instance Data}
-
-
-\todoin{ add historical perspective on data - list overall}
+\subsubsection{Instance Data}
+
+
+%\todoin{ add historical perspective on data - list overall}
 
 The main CLARIN OAI-PMH harvester\footnote{\url{http://catalog.clarin.eu/oai-harvester/}}
@@ -65,5 +69,6 @@
 \caption{Top 20 profiles, with the respective number of records}
 \begin{center}
-\begin{tabular}{ r | l }
+\begin{tabular}{ r l }
+\hline
 \# records & profile \\
 \hline
@@ -96,5 +101,6 @@
 \caption{Top 20 collections, with the respective number of records}
 \begin{center}
-\begin{tabular}{ r | l }
+\begin{tabular}{ r l }
+\hline
 \# records & colleciton \\
 \hline
@@ -154,5 +160,8 @@
 
 \subsection{TEI / teiHeader}
+\label{tei}
+
 TEI/teiHeader/ODD,
+
 
 \subsection{ISLE/IMDI}