1 | \chapter{Definitions} |
---|
2 | \label{ch:def} |
---|
3 | |
---|
4 | \section {Namespaces} |
---|
5 | Namespaces mentioned through this document listed: |
---|
6 | |
---|
7 | \begin{description} |
---|
8 | \item[dcif] |
---|
9 | \item[skos] |
---|
10 | \end{description} |
---|
11 | |
---|
12 | \section {Abbreviations} |
---|
13 | |
---|
14 | \begin{description} |
---|
15 | \item[CLARIN] \textit{Common Language Resources and Technology Infrastructure} \ref{def:CLARIN} |
---|
16 | \item[CLAVAS] \textit{Vocabulary Alignement Service for CLARIN} \ref{def:CLAVAS} |
---|
17 | \item[CMD] \textit{Component Metadata} \ref{def:CMD} |
---|
18 | \item[CMDI] \textit{Component Metadata Infrastructure} \ref{def:CMDI} |
---|
19 | \item[ERIC] \textit{European Research Infrastructure Consortium} - a legal entity for long-term research infrastructure initiatives |
---|
20 | \item[DARIAH] \textit{Digital Research Infrastructure for Arts and Humanities} |
---|
21 | \item[DC] data category |
---|
22 | \item[DCR] data category registry \cite{ISO12620:2009} |
---|
23 | \item[DH] Digital Humanities, also eHumanities |
---|
24 | \item[LINDAT] czech national infrastructure for LRT\furl{http://lindat.ufal.cuni.cz} |
---|
25 | \item[OLAC] \textit{Open Language Archive Community}\furl{http://www.language-archives.org/}\ref{def:OLAC} |
---|
26 | \item[PID] persistend identifier \todocite{PID} |
---|
27 | \item[PURL] persistent uniform resource locator \todocite{PURL} |
---|
28 | \item[RDF] \textit{Resource Description Framework} \todocite{RDF} |
---|
29 | \item[RR] Relation Registry\ref{def:rr} |
---|
30 | \item[TEI] \textit{Text Encoding Initiative} |
---|
31 | \end{description} |
---|
32 | |
---|
33 | \section {Terms} |
---|
34 | |
---|
35 | In the following, the terms used in this work are explained. |
---|
36 | |
---|
37 | \begin{description} |
---|
38 | \item[Concept] Basic "entity" in an ontology? that of what an ontology is build |
---|
39 | \item[Ontology] ``an explicit specification of a conceptualization'' \todocite {Ontology!}, but for us mainly a collection of concepts as opposed to lexicon, which is a collection of words. |
---|
40 | \item[Word] a lexical unit, a word in a language, something that has a surface realization (writtenForm) and is a carrier of sense. so a relation holds: hasSense(Word, Concept) |
---|
41 | \item[Lexicon] a collection of words, a (lexical) vocabulary |
---|
42 | \item[Vocabulary] an index providing mapping from Word (string) to Concept (uri) |
---|
43 | \item[(Data)Category] (almost) the same as Concept; Things like \concept{Topic}, \concept{Genre}, \concept{Organization}, \concept{ResourceType} are instantiations of Category |
---|
44 | \item[ConceptualDomain] the Class of entities a Concept/Category denotes. For Organization it would be all (existing) organizations, CD(ResourceType)={Corpus, Lexicon, Document, Image, Video, ...}. Entities of the domain can itself be Categories (\concept{ResourceType:Image}), but it can be also individuals |
---|
45 | (\concept{Organization University of Vienna}) |
---|
46 | \todoin{Is it synonymous to value domain, range} |
---|
47 | \item[Entity] |
---|
48 | \item[Resource] informational resource, in the context of CLARIN-Project mainly Language Resources (Corpus, Lexicon, Multimedia) |
---|
49 | \item[Metadata Description] description of some properties of a resource. MD-Record |
---|
50 | \item[Schema] - CMD-Profile |
---|
51 | \item[Annotation] |
---|
52 | \end{description} |
---|
53 | |
---|
54 | |
---|
55 | Lexicon vs. Ontology |
---|
56 | Lexicon is a linguistic object an ontology is not.\cite{Hirst2009} We don't need to be that strict, but it shall be a guiding principle in this work to consider things (Datasets, Vocabularies, Resources) also along this dichotomy/polarity: Conceptual vs. Lexical. |
---|
57 | And while every Ontology has to have a lexical representation (canonically: rdfs:label, rdfs:comment, skos:*label), if we don't try to force observed objects into a binary classification, but consider a bias spectrum, we should be able to locate these along this spectrum. |
---|
58 | So the main focus of a typical ontology are the concepts (``conceptualization''), primarily language-independent. |
---|
59 | |
---|
60 | |
---|
61 | Another special case are Controlled Vocabularies or Taxonomies/Classification Systems, let alone folksonomies, in that they identify terms and concepts/meanings, ie there is no explicit mapping between the language represenation and the concept, but rather the term is implicit carrier of the meaning/concept. |
---|
62 | So for example in the LCSH the surface realization of each subject-heading at the same time identifies the Concept ~. |
---|
63 | |
---|
64 | ontologicky vs. semaziologicky (Semanticke priznaky: kategoriálne/archysémy, difernciacne, specifikacne) |
---|