Opened 9 years ago

Last modified 9 years ago

#806 accepted defect

OTA OLAC records mingle multi-values (keyword to dc:type field)

Reported by: matej.durco@oeaw.ac.at
Owned by: matej.durco@oeaw.ac.at
Priority: major
Milestone:
Component: MetadataCuration
Version:
Keywords:
Cc: herold@bbaw.de

Description

In the data from OTA, multiple values (OTA keywords) are merged into a single string in the dc:type field.

Example: resourceClass:Linguistic+corporaCorpus
https://vlo.clarin.eu/search?2&fq=resourceClass:Linguistic+corporaCorpus

The records in original context actually have two values:
Linguistic corpora and Corpus
http://ota.ox.ac.uk/headers/1046.xml

The values arrive mingled already at the OAI-harvester:
https://vlo.clarin.eu/data/clarin/oai-pmh/Oxford_Text_Archive/oai_ota_oucs_1046.xml

So the error seems to be in the conversion on the OTA side.
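
A minimal sketch of the suspected difference, assuming the conversion builds dc:type from a list of keywords. The function names and the bare "record" wrapper element are hypothetical and only illustrate the point; they are not taken from the OTA conversion code. Only the two keyword values come from the record above.

```python
import xml.etree.ElementTree as ET

# Dublin Core element namespace used for dc:type
DC_NS = "http://purl.org/dc/elements/1.1/"


def types_mingled(keywords):
    """Suspected faulty behaviour: all keywords end up in one dc:type element."""
    record = ET.Element("record")  # hypothetical wrapper element
    ET.SubElement(record, f"{{{DC_NS}}}type").text = "".join(keywords)
    return record


def types_separate(keywords):
    """Expected behaviour: one dc:type element per keyword."""
    record = ET.Element("record")  # hypothetical wrapper element
    for keyword in keywords:
        ET.SubElement(record, f"{{{DC_NS}}}type").text = keyword
    return record


def type_values(record):
    """Collect the text of every dc:type element in the record."""
    return [el.text for el in record.findall(f"{{{DC_NS}}}type")]


keywords = ["Linguistic corpora", "Corpus"]
print(type_values(types_mingled(keywords)))   # ['Linguistic corporaCorpus']
print(type_values(types_separate(keywords)))  # ['Linguistic corpora', 'Corpus']
```

The first output matches the mingled facet value seen in the VLO; the second is what the VLO would need in order to show two separate resourceClass values.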

Change History (4)

comment:1 Changed 9 years ago by DefaultCC Plugin

Cc: herold@bbaw.de added

comment:2 Changed 9 years ago by matej.durco@oeaw.ac.at

Summary: changed from "OTA OLAC records mingle multi-values (in" to "OTA OLAC records mingle multi-values (keyword to dc:type field)"

comment:3 Changed 9 years ago by matej.durco@oeaw.ac.at

Wrote to martin.wynne about the issue.

comment:4 Changed 9 years ago by matej.durco@oeaw.ac.at

Owner: changed from Dieter Van Uytvanck to matej.durco@oeaw.ac.at
Status: changed from new to accepted

Partly resolved (by removing the "Corpus" values as obsolete, per info from Martin Wynne);
the remaining values are still mingled together. Check at the OAI endpoint:

http://ota.ox.ac.uk/cgi-bin/oai.pl?verb=GetRecord&identifier=oai:ota:oucs:0161&metadataPrefix=olac
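
A rough sketch of how such a check could be automated, assuming the set of valid keyword values is known in advance. The vocabulary, the sample record and the helper names below are illustrative assumptions, not the actual response of the endpoint; in practice the XML text would come from the GetRecord response.

```python
import xml.etree.ElementTree as ET

DC_TYPE = "{http://purl.org/dc/elements/1.1/}type"


def split_into_known(value, vocabulary):
    """Return known terms whose concatenation equals `value`, or None if impossible."""
    if value == "":
        return []
    for term in vocabulary:
        if term and value.startswith(term):
            rest = split_into_known(value[len(term):], vocabulary)
            if rest is not None:
                return [term] + rest
    return None


def find_mingled(xml_text, vocabulary):
    """Map each dc:type value that is a run of several known terms to those terms."""
    mingled = {}
    for el in ET.fromstring(xml_text).iter(DC_TYPE):
        value = (el.text or "").strip()
        if not value or value in vocabulary:
            continue  # empty or already a single valid value
        parts = split_into_known(value, vocabulary)
        if parts is not None and len(parts) > 1:
            mingled[value] = parts
    return mingled


# Hypothetical, simplified record; a real OLAC record has more elements.
SAMPLE = """<record xmlns:dc="http://purl.org/dc/elements/1.1/">
  <dc:title>Example title</dc:title>
  <dc:type>Linguistic corporaCorpus</dc:type>
</record>"""

# Assumed vocabulary: only the two values named in this ticket.
VOCABULARY = {"Linguistic corpora", "Corpus"}

print(find_mingled(SAMPLE, VOCABULARY))
```

Values that are neither in the vocabulary nor a concatenation of known terms are left unreported here, since they may simply be keywords missing from the assumed vocabulary.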
