Changes between Version 9 and Version 10 of Taskforces/Curation/ValueNormalization/ResourceType


Ignore:
Timestamp:
01/28/18 20:53:48 (6 years ago)
Author:
matej.durco@oeaw.ac.at
Comment:

reworked proposed vocabulary, added definitions, added note on tools/services

Legend:

Unmodified
Added
Removed
Modified
  • Taskforces/Curation/ValueNormalization/ResourceType

    v9 v10  
    11= Resource Type
    22
     3This page collects info about the facet `Resource Type` (or `Resource Class`) as used in VLO, concentrating especially on the controlled vocabulary to be used for normalization.
    34
    4 == proposed vocabulary
     5
     6== Proposed vocabulary
    57
    68basic principle: to decompose the values and rather allow the use of a combination of values to describe one resource.
     
    810{{{
    911AnnotatedTextCorpus = collection text annotation
    10 Audio-recording with transcription = audio annotation
     12Audio recording with transcription = audio annotation
    1113}}}
    1214
    13 [https://docs.google.com/spreadsheets/d/1KcLgbHjmC6UP1gm2DAfpe2MVYWvrmgSy0TLLEsC8xkw/edit#gid=0 Collecting possible definitions from external sources in a gsheet]
     15* [https://docs.google.com/spreadsheets/d/1KcLgbHjmC6UP1gm2DAfpe2MVYWvrmgSy0TLLEsC8xkw/edit#gid=0 Collecting possible definitions from external sources in a gsheet]
     16* [[https://github.com/acdh-oeaw/VLO-mapping/blob/master/vocabs/resourcetype.csv|proposed vocabulary on github]] as common place to maintain the vocabulary. (Still being published/released via CLAVAS.)
    1417
    15 === annotation
     18 collection :: An aggregation of items. The term collection means that the resource is described as a group; its parts may be separately described and navigated. (source: DCMI)
     19 text :: written sequence of human language (source: Durco)
     20 lexicalResource :: formalized list of items describing/defining various aspects of lexical units (words, multi-word expressions) (source: Durco)
     21 grammar :: formalized description of the structure of a language (source: Durco)
     22 database :: structured dataset ?? (source: Durco)
     23 annotation :: additional/secondary structured information explicating certain aspects of the original resource. (Which could be of any type: text, audio, video, image, ...) (source: Durco)
     24 image :: static digital visual representation of something (source: Durco)
     25 audio | audioRecording :: Resource whose content is primarily intended to be perceived acoustically. (source: based on: DCMI, adapted: Durco)
     26 video | videoRecording :: Resource consisting of a series of images  imparting an impression of motion when shown in succession, mostly accompanied by an synchronized audio signal; primarily intended to be perceived visually and acoustically. (source: based on: DCMI, adapted: Durco)
     27 speech :: ??)
     28 software :: an artefact that can be executed on a computer to perform specific operations (source: CRMdig:D14Software (source: Durco)
     29 software / source code :: algorithmic processing instructions in human readable form in one or multiple programming languages (source: Durco)
     30 software / binaries :: algorithmic processing instructions in machine readable form executable on a computer (source: Durco)
     31 interactiveResource :: A resource requiring interaction from the user to be understood, executed, or experienced. (source: DCMI)
     32 interactiveResource / website :: interactive resource meant for human interaction, available online, accessible via web browser (source: Durco)
     33 interactiveResource / webservice :: meant for machine interaction, available online, accessible via defined protocol (source: Durco)
     34 interactiveResource / clientApplication :: meant for human interaction, has to be executed on local computer (source: Durco)
     35 physicalObject ::  All persistent physical items with a relatively stable form, man-made or natural (source: CIDOC-CRM: Physical Thing)
    1636
    17 === audio / audioRecording
    18 === collection
    1937
    20 === structuredData
    21 === grammar
    22 === image
    23 === lexicalResource
    24 === physicalObject
    25 === text
    26 === video / videoRecording
    27 === software
     38=== Note on tool/software/service/web application/... complex
    2839
    29 set of instructions / code that need to be executed / run.
    30 (May need download and installation)
    31 As opposed to service that can be used directly
     40The term `tool` seems to be too ambiguous and its use should be discouraged.
    3241
    33 === service
     42There is a clear distinction between a '''software''' and a '''service''': `software` is a set of instructions / code that need to be executed / run, `(web) service` is provided (as a running computer process) by another party (service provider) and can be used without installation. There is obviously always a software needed to run a service, but this is invisible, irrelevant for the user.
     43As a special case (and beset practice), a service provider can offer both the service and the underlying software, but this needs to be decomposed and considered as two distinct resources (even if they are represented by one CMDI record).
    3444
    35 a software system available online that provides one or more functions usable by humans and other systems (based on DCMI-type)
     45In the current proposal, the terms `software` (esp. `binaries`) and `clientApplication` overlap (are to a certain extent synonymous). => this should be resolved still.
    3646
    37 ===  (interactiveResource)
    38 
    39 could be further divided into:
    40 * standalone application = meant for human interaction, has to be executed on local computer
    41 * web application = meant for human interaction, available online, accessible via web browser
    42 * webservice = api = meant for machine interaction, available online, accessible via defined protocol
    43 
    44 "tool" seems to be too ambiguous and it's use should be discourages (could be altLabel for interactiveResource)
     47While `(web)service` seems clearly distinguishable from other web resources, `website`, `web application` (and maybe others) seem synonymous (available online, for human consumption). One could distinguish between ''static content'' (`website`) and ''dynamic content'' (`web application`), but that could create more confusion than clarity.
    4548
    4649
    4750== Comparison of different vocabularies
    4851
    49 propositions from within CLARIN, but also from external sources.
     52Following is an attempt to align the terms from various vocabularies, proposed by people from within CLARIN, but also from external sources.
    5053
    51 still check:
    52 http://schema.org/MediaObject
     54[[https://schema.datacite.org/meta/kernel-4.0/|DataCite metadata schema 4.0]]
     55Defined resource types only listed in the schema (XSD or PDF). ACDH-OEAW has published them as separate vocabulary in acdh-vocabs: https://vocabs.acdh.oeaw.ac.at/archecategory/Schema
     56(In below (not mapped: Event, Model)
    5357
    54 DataCite
    55 DataCite metadata schema 4.0. (not mapped: Event, Model)
     58There are also [[http://dictionary.casrai.org/Output_Types|CASRAI Output Types]] but these seem too detailed and more bibliographic style, so we don't consider them here and now.
     59
     60Furthermore there is a number of classes defined in [[http://schema.org/|schema.org]], e.g. [[http://schema.org/MediaObject|MediaObject]], and although there is some potential for mapping, overall this schema is both too broad and not sufficiently specific for our purposes, thus it's also not considered for the moment.
     61
    5662
    5763||= VLO Taskforce ||= DCMI Type Vocabulary ||= DataCite 4.0 ||= Odijk Resource Type ||= Odijk Hierarchy ||= Menzo ||= "current final" vocabulary ||