wiki:Taskforces/Curation/ValueNormalization/ResourceType

Resource Type

This page collects info about the facet Resource Type (or Resource Class) as used in VLO, concentrating especially on the controlled vocabulary to be used for normalization.

Proposed vocabulary

basic principle: to decompose the values and rather allow the use of a combination of values to describe one resource. Example:

AnnotatedTextCorpus = collection text annotation 
Audio recording with transcription = audio annotation
collection
An aggregation of items. The term collection means that the resource is described as a group; its parts may be separately described and navigated. (source: DCMI)
text
written sequence of human language (source: Durco)
lexicalResource
formalized list of items describing/defining various aspects of lexical units (words, multi-word expressions) (source: Durco)
grammar
formalized description of the structure of a language (source: Durco)
database
structured dataset ?? (source: Durco)
annotation
additional/secondary structured information explicating certain aspects of the original resource. (Which could be of any type: text, audio, video, image, ...) (source: Durco)
image
static digital visual representation of something (source: Durco)
audio | audioRecording
Resource whose content is primarily intended to be perceived acoustically. (source: based on: DCMI, adapted: Durco)
video | videoRecording
Resource consisting of a series of images imparting an impression of motion when shown in succession, mostly accompanied by an synchronized audio signal; primarily intended to be perceived visually and acoustically. (source: based on: DCMI, adapted: Durco)
speech
??)
software
an artefact that can be executed on a computer to perform specific operations (source: CRMdig:D14Software (source: Durco)
software / source code
algorithmic processing instructions in human readable form in one or multiple programming languages (source: Durco)
software / binaries
algorithmic processing instructions in machine readable form executable on a computer (source: Durco)
interactiveResource
A resource requiring interaction from the user to be understood, executed, or experienced. (source: DCMI)
interactiveResource / website
interactive resource meant for human interaction, available online, accessible via web browser (source: Durco)
interactiveResource / webservice
meant for machine interaction, available online, accessible via defined protocol (source: Durco)
interactiveResource / clientApplication
meant for human interaction, has to be executed on local computer (source: Durco)
physicalObject
All persistent physical items with a relatively stable form, man-made or natural (source: CIDOC-CRM: Physical Thing)

Note on tool/software/service/web application/... complex

The term tool seems to be too ambiguous and its use should be discouraged.

There is a clear distinction between a software and a service: software is a set of instructions / code that need to be executed / run, (web) service is provided (as a running computer process) by another party (service provider) and can be used without installation. There is obviously always a software needed to run a service, but this is invisible, irrelevant for the user. As a special case (and beset practice), a service provider can offer both the service and the underlying software, but this needs to be decomposed and considered as two distinct resources (even if they are represented by one CMDI record).

In the current proposal, the terms software (esp. binaries) and clientApplication overlap (are to a certain extent synonymous). => this should be resolved still.

While (web)service seems clearly distinguishable from other web resources, website, web application (and maybe others) seem synonymous (available online, for human consumption). One could distinguish between static content (website) and dynamic content (web application), but that could create more confusion than clarity.

Comparison of different vocabularies

Following is an attempt to align the terms from various vocabularies, proposed by people from within CLARIN, but also from external sources.

DataCite metadata schema 4.0 Defined resource types only listed in the schema (XSD or PDF). ACDH-OEAW has published them as separate vocabulary in acdh-vocabs: https://vocabs.acdh.oeaw.ac.at/archecategory/Schema (In below (not mapped: Event, Model)

There are also CASRAI Output Types but these seem too detailed and more bibliographic style, so we don't consider them here and now.

Furthermore there is a number of classes defined in schema.org, e.g. MediaObject, and although there is some potential for mapping, overall this schema is both too broad and not sufficiently specific for our purposes, thus it's also not considered for the moment.

VLO Taskforce DCMI Type Vocabulary DataCite? 4.0 Odijk Resource Type Odijk Hierarchy Menzo "current final" vocabulary
annotatedText Text Text textAnnotation annotation annotation + text
audioRecording Sound Sound audio data audio audioRecording
collection Collection Collection collection data collection collection
corpus Collection Text? collection data collection collection (+ text)
database Dataset Dataset structered data data structured data structuredData
dataset:experimentalData Dataset Dataset structured data data
dataset:fieldworkMaterial Dataset Dataset structured data data
dataset:surveyData Dataset Dataset structured data data
dataset:testData Dataset Dataset structured data data
grammar lexicalResource ?? data grammar structuredData/grammar
image (Image, StillImage?) Image image data image image
lexicalResource Text|Dataset? Text|Dataset? (lexicalResource, lexicalResource/monolingual, lexicalResource/valency lexicon, Semantic lexicalResource) data lexicalResource lexicalResource
physicalObject PhysicalObject? PhysicalObject? physicalObject analogue object physicalObject physicalObject
plainText Text Text Running Natural Language Text data text / running natural language text text
session (Text, Sound, MovingImage?)* (Text,Sound,Audiovisual)* audio data ?? ??
teachingMaterial * * * data - -
tool Service Service service software tool (interactiveResource, clientApplication, software, webservice) ?
toolChain Service Workflow service chain software tool (interactiveResource, webservice) ?
videoRecording MovingImage? Audiovisual (video, audio+video) data video videoRecording
webApplication InteractiveResource? InteractiveResource? application software tool+service interactiveResource / web application
webService Service Service service software service interactiveResource / service
unspecified unspecified other
other Other other other
Last modified 6 years ago Last modified on 01/28/18 20:53:48