= Metadata Curation taskforce = == Goals (non-exhaustive) == Primary concern of this taskforce is the quality of the CMD records (i.e. instances), especially wrt to their purpose of resource discovery. However the quality of the schemas/profiles obviously plays a role. Towards this goal, next to regularly '''manually checking''' the harvested metadata (in the VLO) and analyzing problems, work is planned on '''automatic checks''' of the md records (working towards a Metadata Quality Assessement Service) and '''curation of the values''' in selected facets/fields/data categories (e.g. organization name). == Public website == (none) == Documents and wiki pages == * [[MDCuration]] - general notes on the issue * report for tracking curation issues {18} * #676 - a ticket to "Create a metadata curation module" * !Proposal/Specification for the [[Curation Module]] * #688 - implementation of metadata curation inside the VLO importer (-> to be deprecated in favor of #676) * [[Taskforces/Curation/ValueNormalization]] - discussion of the issue of value curation * [wiki:MDQAS] - draft proposal of a Metadata Quality Assessment Service (deprecated by [[Curation Module]]) * [wiki:CmdiBestPracticeGuide#qualitycheck-list CMDI Best practice document including quality indications] * Some valuable suggestions from Data Cite on the semantics of publication date: http://www.datacite.org/node/130 * existing curation approaches: * [https://github.com/DASISH/jmd-scripts/tree/master/workflow-scripts by DASISH] * [https://github.com/ekoi/cmd2rdf/blob/dev-refactoring/src/main/resources/xsl/addOrganisationEntity.xsl by the CMD2RDF team] (includes Menzo) * Analysis of the VLO proposed by ACDH (see [https://docs.google.com/document/d/1rjNUqwr9KgUY4XLQiuvzpWIvb3zEIvza8PGvX93Bm8Y/edit# working gdoc]) * Accompanying [[https://docs.google.com/presentation/d/1Cd53jzu9iXybF6cirFUyIGj7bO5up5m4toXUgKA0iM8/edit#slide=id.p3|presentation]] * [[https://clarin.oeaw.ac.at/vlo|Developer/experimental instance of VLO at ACDH-OEAW "minerva"]] == Meetings == * 2014-10-24: [[Meeting20141024|meeting in Soesterberg (CAC 2014)]] * 2015-07-03: [[Taskforces/Curation/Meeting20150703|meeting in Vienna]] * 2015-10-15: [[Taskforces/Curation/Meeting20151015|meeting in Wroclaw (CAC 2015)]] * 2016-05-11: [[Taskforces/Curation/Meeting20160511|meeting in Utrecht (Centre Meeting)]] * 2017-06-28: [[Taskforces/Curation/Meeting20170628|telco]] * 2018-01-30: [[Taskforces/Curation/Meetings/2018-01-30|meeting in Vienna]] * 2018-02-21: [[Taskforces/Curation/Meetings/2018-02-21|telco]] * 2018-06-05: [[Taskforces/Curation/Meetings/2018-06-05|meeting in Utrecht (Centre Meeting)]] * 2018-07-04: [[Taskforces/Curation/Meetings/2018-07-04|virtual]] * 2018-12-06: [[Taskforces/Curation/Meetings/2018-12-06|meeting and hands-on curation session in Vienna]] == Members == * Coordinator: Matej Ďurčo * Uwe Reichel, Florian Schiel * Axel Herold * Neeme Kahusk * Lene Offersgaard * Hanna Hedeland * Kerstin Eckart / Jens Stegmann * Jozef Misutka / Pavel Straňák * Jussi Piitulainen * Susanne Haaf * Alexander König