= CMDI specific task force meeting 2020-01-15 = ---- * What? * Kick-off and initial (planning) discussion of tasks on 1) core metadata components and profiles + extensions; 2) (investigation into) use of FAIR vocabularies in CLARIN metadata * Who? * Those partcipating in the relevant specific [[Taskforces/CMDI|CMDI]] taskforces * Wednesday 15 January 2020, 14:00 - 15:00 CET (on basis of [https://doodle.com/poll/wupu4h7a4evnae32 doodle]) * Where? * Zoom: https://clarin.zoom.us/j/534115968 * Note: If using Zoom for the first time, **allow for some setup time before the meeting**. The above link should get you started. See [https://support.zoom.us/hc/en-us/categories/200101697-Getting-Started documentation] for more information. == Documents == - [https://docs.google.com/document/d/12xBfAzrBChY1HT-aQTzdcGv1yxtCgYpDxGZRNcqFmoc/edit?usp=sharing Proposal/working document for these two related tasks] - [https://docs.google.com/document/d/1GEmvkK89BD0ezc8nS2kgkVTPpNd_ueEwJtGUNuiNUtc/edit?usp=sharing Earlier draft document collecting some thouhgts and preliminary work on use of external vocabularies and concepts] - [https://docs.google.com/document/d/1cRJKiise8HFS16gqG4gC3TsI0W-TnLSnebh_YS6mZ3U/edit?usp=sharing CMDI strategy] == Preparation == * Suggest/think about more specific task definitions * Collect relevant use cases * Prepare a summary of any ongoing, completed or abandoned work you are or have been involved in that can contribute to these tasks == Agenda == (Tentative) 1. Definition of tasks 1. Overall objective 1. Use cases 1. Task definition: CMDI core components & profiles 1. Task definition: FAIR vocabularies for concepts and controlled vocabularies in CLARIN metadata 1. Current state 1. Related activities & existing work 1. General info/recommended component activities as part of CMDI Best Practices task (Marcin, Oddrun, Twan, ...) 1. Common use cases (Andreas, Twan) 1. ISO 24622 - Part 3: "Recommended components" (Penny) 1. META-SHARE schema work (Penny) 1. Quest project (Felix) 1. CMDI <> DDI, !DataCite conversions (Twan, Dieter) 1. Work on vocabularies, ontologies in European projects (Dieter, Matej) * [https://sshopencloud.eu/sshoc-wp4-workshop-developing-sshoc-reference-ontology SSHOC WP4 "Developing the SSHOC Reference Ontology"] * [https://www.dariah.eu/activities/working-groups/thesaurus-maintenance/ DARIAH-EU Thesaurus Maintenance] 1. [https://www.rd-alliance.org/groups/research-metadata-schemas-wg RDA Research Metadata Schemas] (anyone?) 1. Planning 1. Approach/actions, distribution of work 1. Timeline 1. Next meeting == Notes == === Definition of tasks === Agreement on overall objective. Some rephrasing of specific task descriptions: __Core metadata and extensions__: Define a set of general purpose components, together forming a basis for (more or less) use case specific profiles. The components must implement CMDI best practices, enable adherence to FAIR principles, and facilitate interoperability with the broader research infrastructure for research resources and technologies. Aim is not to define a 'most generic/minimal metadata' profile but rather keep the approach modular and create components to be combined into more use case specific profiles. However, it would be nice if we could implement a (non-instantiable) template for such profiles. __FAIR vocabularies__: Evaluate which broadly supported/used vocabularies can be used '''[..in two ways - at two levels (value domain in instance vs semantics for schema constituents..]*''' in the CMDI core metadata definitions (see above), and possibly also replace concepts currently in use. FAIR criteria for vocabularies need to be taken into account. On the nature of FAIR vocabularies: essential property is that vocabulary items have a globally unique identifier assigned to them. '''*''' ''this phrasing needs to be finalised; important to have consistent terminology to distinguish between two uses of vocabularies (semantic annotaiton at component/profile/schema level vs value domain at instance level)'' === Current state: Related activities & existing work === __Common use cases:__ besides the [https://docs.google.com/document/d/1yIH2d12Z5c0JJEobsNntefsuUCRxDdMe1qkKsfI7yzw/edit?usp=sharing report document] (work in progress), there is also a new [https://docs.google.com/spreadsheets/d/1-GkqT3W1adp-dYL7OZr8IoYB8xgwbwD_8bf5SuPAJd0/edit#gid=308554633 spreadsheet] that quantifies and aggregates the survey responses in terms of information aspects per resource type. __ISO 24622 - Part 3: "Recommended components":__ Although frozen as an activitiy, quite a bit of preliminary output of analysis and modelling work is available that can be used as input in our activities. See public [https://drive.google.com/drive/folders/12imnL4n_yOwu_YSZAHGuQQTw3ATG_ftQ Google Drive share]. __META-SHARE schema work:__ Work in progress in context of European Language Grid, based on Metashare ontology and aiming to reuse existing vocabularies and ontologies where possible (prioritise on specific ones like DCAT). Penny will share documents. === Planning === Until the next meeting: work on the [https://docs.google.com/spreadsheets/d/1a8rCRqAbCMm_XO2-oV1oWAqZY2-JfUGc7faa3uTlKvY/edit?usp=sharing inventory spreadsheets]. Discussion and other related communication continues on the [https://clarineric.slack.com/archives/CSAKYPUFL #tf-cmdi-core-and-vocabs] [[Slack]] channel. If the need for a meeting before the [[../Meeting20200212|next general CMDI meeting (2020-02-12)]] arises, this can be discussed and organised in that channel as well.