wiki:Taskforces/Curation/ValueNormalization/ResourceType

Version 9 (modified by matej.durco@oeaw.ac.at, 7 years ago) (diff)

minor corrections to the mapping table

Resource Type

proposed vocabulary

basic principle: to decompose the values and rather allow the use of a combination of values to describe one resource. Example:

AnnotatedTextCorpus = collection text annotation 
Audio-recording with transcription = audio annotation

Collecting possible definitions from external sources in a gsheet

annotation

audio / audioRecording

collection

structuredData

grammar

image

lexicalResource

physicalObject

text

video / videoRecording

software

set of instructions / code that need to be executed / run. (May need download and installation) As opposed to service that can be used directly

service

a software system available online that provides one or more functions usable by humans and other systems (based on DCMI-type)

(interactiveResource)

could be further divided into:

  • standalone application = meant for human interaction, has to be executed on local computer
  • web application = meant for human interaction, available online, accessible via web browser
  • webservice = api = meant for machine interaction, available online, accessible via defined protocol

"tool" seems to be too ambiguous and it's use should be discourages (could be altLabel for interactiveResource)

Comparison of different vocabularies

propositions from within CLARIN, but also from external sources.

still check: http://schema.org/MediaObject

DataCite? DataCite? metadata schema 4.0. (not mapped: Event, Model)

VLO Taskforce DCMI Type Vocabulary DataCite? 4.0 Odijk Resource Type Odijk Hierarchy Menzo "current final" vocabulary
annotatedText Text Text textAnnotation annotation annotation + text
audioRecording Sound Sound audio data audio audioRecording
collection Collection Collection collection data collection collection
corpus Collection Text? collection data collection collection (+ text)
database Dataset Dataset structered data data structured data structuredData
dataset:experimentalData Dataset Dataset structured data data
dataset:fieldworkMaterial Dataset Dataset structured data data
dataset:surveyData Dataset Dataset structured data data
dataset:testData Dataset Dataset structured data data
grammar lexicalResource ?? data grammar structuredData/grammar
image (Image, StillImage?) Image image data image image
lexicalResource Text|Dataset? Text|Dataset? (lexicalResource, lexicalResource/monolingual, lexicalResource/valency lexicon, Semantic lexicalResource) data lexicalResource lexicalResource
physicalObject PhysicalObject? PhysicalObject? physicalObject analogue object physicalObject physicalObject
plainText Text Text Running Natural Language Text data text / running natural language text text
session (Text, Sound, MovingImage?)* (Text,Sound,Audiovisual)* audio data ?? ??
teachingMaterial * * * data - -
tool Service Service service software tool (interactiveResource, clientApplication, software, webservice) ?
toolChain Service Workflow service chain software tool (interactiveResource, webservice) ?
videoRecording MovingImage? Audiovisual (video, audio+video) data video videoRecording
webApplication InteractiveResource? InteractiveResource? application software tool+service interactiveResource / web application
webService Service Service service software service interactiveResource / service
unspecified unspecified other
other Other other other