Version 6 (modified by matej.durco@oeaw.ac.at, 9 years ago)

added link to gsheet for collecting definitions

Resource Type

proposed vocabulary

Basic principle: decompose composite values, so that one resource is described by a combination of base values rather than by a single compound value. Example:

AnnotatedTextCorpus = collection + text + annotation
Audio-recording with transcription = audio + annotation
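The decomposition principle can be sketched in a few lines of Python. This is only an illustration, not an agreed implementation: the names `RESOURCE_TYPES`, `DECOMPOSITION` and `decompose` are hypothetical, and the lookup table contains only the two examples given above.

```python
# Base values of the proposed vocabulary on this page.
RESOURCE_TYPES = frozenset({
    "annotation", "audio", "collection", "structuredData", "grammar",
    "image", "lexicalResource", "physicalObject", "text", "video",
    "interactiveResource",
})

# Hypothetical lookup table: composite value -> combination of base values.
# Only the two examples from this page are included.
DECOMPOSITION = {
    "AnnotatedTextCorpus": {"collection", "text", "annotation"},
    "Audio-recording with transcription": {"audio", "annotation"},
}


def decompose(value: str) -> frozenset:
    """Return the base values for a composite value (empty set if unknown)."""
    parts = frozenset(DECOMPOSITION.get(value, ()))
    # Guard against typos in the lookup table itself.
    unknown = parts - RESOURCE_TYPES
    if unknown:
        raise ValueError(f"not in the proposed vocabulary: {sorted(unknown)}")
    return parts
```

A value that is not in the table yields an empty set, so unmapped values can be collected and reviewed instead of being silently assigned a type.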

Possible definitions from external sources are being collected in a Google Sheet.

annotation

audio

collection

structuredData

grammar

image

lexicalResource

physicalObject

text

video

interactiveResource

interactiveResource could be further divided into:

  • standalone application = meant for human interaction, has to be executed on a local computer
  • web application = meant for human interaction, available online, accessible via a web browser
  • webservice = api = meant for machine interaction, available online, accessible via a defined protocol

"tool" seems to be too ambiguous and its use should be discouraged (it could be an altLabel for interactiveResource)
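The subdivision of interactiveResource and the altLabel suggestion for "tool" can be sketched as a small SKOS-like structure. This is a minimal illustration under assumptions: the `Concept` class, its field names and the `resolve` helper are hypothetical, and the labels are taken verbatim from the list above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Concept:
    """A vocabulary entry with a preferred label and optional alternatives."""
    pref_label: str
    alt_labels: tuple = ()
    narrower: tuple = ()
    note: str = ""


INTERACTIVE_RESOURCE = Concept(
    pref_label="interactiveResource",
    alt_labels=("tool",),  # discouraged label, kept only as an alias
    narrower=(
        Concept("standalone application",
                note="human interaction, executed on a local computer"),
        Concept("web application",
                note="human interaction, online, via a web browser"),
        Concept("webservice", alt_labels=("api",),
                note="machine interaction, online, via a defined protocol"),
    ),
)


def resolve(label, concept=INTERACTIVE_RESOURCE):
    """Return the preferred label matching `label`, or None if not found."""
    if label == concept.pref_label or label in concept.alt_labels:
        return concept.pref_label
    for child in concept.narrower:
        found = resolve(label, child)
        if found is not None:
            return found
    return None
```

With this layout, records that still say "tool" resolve to interactiveResource, while "api" resolves to the narrower webservice entry.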

Comparison of different vocabularies

Proposals from within CLARIN as well as from external sources.

Still to be checked: http://schema.org/MediaObject

VLO Taskforce DCMI Type Vocabulary Odijk Resource Type Odijk Hierarchy Menzo "current final" vocabulary
annotatedText Text textAnnotation annotation annotation
audioRecording Sound audio data audio audio
collection Collection collection data collection collection
corpus Collection collection data collection
database Dataset structured data data structured data structuredData
dataset:experimentalData Dataset structured data data
dataset:fieldworkMaterial Dataset structured data data
dataset:surveyData Dataset structured data data
dataset:testData Dataset structured data data
grammar lexicalResource ?? data grammar grammar
image (Image, StillImage?) image data image image
lexicalResource Text (lexicalResource, lexicalResource/monolingual, lexicalResource/valency lexicon, Semantic lexicalResource) data lexicalResource lexicalResource
physicalObject PhysicalObject? physicalObject analogue object physicalObject physicalObject
plainText Text Running Natural Language Text data text / running natural language text text
session Event audio data ??
teachingMaterial (Text, Sound, MovingImage?)* (audio, video, Running Language Text)* data -
tool Service service software tool interactive resource tool
toolChain Service service chain software tool interactive resource
videoRecording MovingImage? (video, audio+video) data video video
webApplication Interactive Resource application software tool+service interactive resource interactiveResource
webService Service service software service interactive resource
unspecified unspecified other app
other other other tool
webservice = api