= Resource Type This page collects info about the facet `Resource Type` (or `Resource Class`) as used in VLO, concentrating especially on the controlled vocabulary to be used for normalization. == Proposed vocabulary basic principle: to decompose the values and rather allow the use of a combination of values to describe one resource. Example: {{{ AnnotatedTextCorpus = collection text annotation Audio recording with transcription = audio annotation }}} * [https://docs.google.com/spreadsheets/d/1KcLgbHjmC6UP1gm2DAfpe2MVYWvrmgSy0TLLEsC8xkw/edit#gid=0 Collecting possible definitions from external sources in a gsheet] * [[https://github.com/acdh-oeaw/VLO-mapping/blob/master/vocabs/resourcetype.csv|proposed vocabulary on github]] as common place to maintain the vocabulary. (Still being published/released via CLAVAS.) collection :: An aggregation of items. The term collection means that the resource is described as a group; its parts may be separately described and navigated. (source: DCMI) text :: written sequence of human language (source: Durco) lexicalResource :: formalized list of items describing/defining various aspects of lexical units (words, multi-word expressions) (source: Durco) grammar :: formalized description of the structure of a language (source: Durco) database :: structured dataset ?? (source: Durco) annotation :: additional/secondary structured information explicating certain aspects of the original resource. (Which could be of any type: text, audio, video, image, ...) (source: Durco) image :: static digital visual representation of something (source: Durco) audio | audioRecording :: Resource whose content is primarily intended to be perceived acoustically. (source: based on: DCMI, adapted: Durco) video | videoRecording :: Resource consisting of a series of images imparting an impression of motion when shown in succession, mostly accompanied by an synchronized audio signal; primarily intended to be perceived visually and acoustically. (source: based on: DCMI, adapted: Durco) speech :: ??) software :: an artefact that can be executed on a computer to perform specific operations (source: CRMdig:D14Software (source: Durco) software / source code :: algorithmic processing instructions in human readable form in one or multiple programming languages (source: Durco) software / binaries :: algorithmic processing instructions in machine readable form executable on a computer (source: Durco) interactiveResource :: A resource requiring interaction from the user to be understood, executed, or experienced. (source: DCMI) interactiveResource / website :: interactive resource meant for human interaction, available online, accessible via web browser (source: Durco) interactiveResource / webservice :: meant for machine interaction, available online, accessible via defined protocol (source: Durco) interactiveResource / clientApplication :: meant for human interaction, has to be executed on local computer (source: Durco) physicalObject :: All persistent physical items with a relatively stable form, man-made or natural (source: CIDOC-CRM: Physical Thing) === Note on tool/software/service/web application/... complex The term `tool` seems to be too ambiguous and its use should be discouraged. There is a clear distinction between a '''software''' and a '''service''': `software` is a set of instructions / code that need to be executed / run, `(web) service` is provided (as a running computer process) by another party (service provider) and can be used without installation. There is obviously always a software needed to run a service, but this is invisible, irrelevant for the user. As a special case (and beset practice), a service provider can offer both the service and the underlying software, but this needs to be decomposed and considered as two distinct resources (even if they are represented by one CMDI record). In the current proposal, the terms `software` (esp. `binaries`) and `clientApplication` overlap (are to a certain extent synonymous). => this should be resolved still. While `(web)service` seems clearly distinguishable from other web resources, `website`, `web application` (and maybe others) seem synonymous (available online, for human consumption). One could distinguish between ''static content'' (`website`) and ''dynamic content'' (`web application`), but that could create more confusion than clarity. == Comparison of different vocabularies Following is an attempt to align the terms from various vocabularies, proposed by people from within CLARIN, but also from external sources. [[https://schema.datacite.org/meta/kernel-4.0/|DataCite metadata schema 4.0]] Defined resource types only listed in the schema (XSD or PDF). ACDH-OEAW has published them as separate vocabulary in acdh-vocabs: https://vocabs.acdh.oeaw.ac.at/archecategory/Schema (In below (not mapped: Event, Model) There are also [[http://dictionary.casrai.org/Output_Types|CASRAI Output Types]] but these seem too detailed and more bibliographic style, so we don't consider them here and now. Furthermore there is a number of classes defined in [[http://schema.org/|schema.org]], e.g. [[http://schema.org/MediaObject|MediaObject]], and although there is some potential for mapping, overall this schema is both too broad and not sufficiently specific for our purposes, thus it's also not considered for the moment. ||= VLO Taskforce ||= DCMI Type Vocabulary ||= DataCite 4.0 ||= Odijk Resource Type ||= Odijk Hierarchy ||= Menzo ||= "current final" vocabulary || || annotatedText || Text || Text || textAnnotation || || annotation || annotation + text || || audioRecording || Sound || Sound || audio || data || audio || audioRecording || || collection || Collection || Collection || collection || data || collection || collection || || corpus || Collection || Text? || collection || data || collection || collection (+ text) || || database || Dataset || Dataset || structered data || data || structured data || structuredData || || dataset:experimentalData || Dataset || Dataset || structured data || data || || || || dataset:fieldworkMaterial || Dataset || Dataset || structured data || data || || || || dataset:surveyData || Dataset || Dataset || structured data || data || || || || dataset:testData || Dataset || Dataset || structured data || data || || || || grammar || || lexicalResource ?? || data || grammar || || structuredData/grammar || || image || (Image, StillImage) || Image || image || data || image || image || || lexicalResource || Text|Dataset? || Text|Dataset? || (lexicalResource, lexicalResource/monolingual, lexicalResource/valency lexicon, Semantic lexicalResource) || data || lexicalResource || lexicalResource || || physicalObject || PhysicalObject || PhysicalObject || physicalObject || analogue object || physicalObject || physicalObject || || plainText || Text || Text || Running Natural Language Text || data || text / running natural language text || text || || session || (Text, Sound, MovingImage)* || (Text,Sound,Audiovisual)* || audio || data || ?? || ?? || || teachingMaterial || * || * || * || data || - || - || || tool || Service || Service || service || software || tool || (interactiveResource, clientApplication, software, webservice) ? || || toolChain || Service || Workflow || service chain || software || tool || (interactiveResource, webservice) ? || || videoRecording || MovingImage || Audiovisual || (video, audio+video) || data || video || videoRecording || || webApplication || InteractiveResource || InteractiveResource || application || software || tool+service || interactiveResource / web application || || webService || Service || Service || service || software || service || interactiveResource / service || || unspecified || || unspecified || other || || || || || other || || Other || other || other || || || ||