Opened 6 years ago

Last modified 6 years ago

#1045 reopened defect

Records with NO title

Reported by: matej.durco@oeaw.ac.at Owned by: matej.durco@oeaw.ac.at
Priority: major Milestone:
Component: MetadataCuration Version:
Keywords: Cc: hanna.hedeland@uni-hamburg.de, Twan Goosen, Menzo Windhouwer

Description

There are a number of records which have an empty title (they appear with Unnamed record)

Query: https://vlo.clarin.eu/search?8&q=-name:*

(Strangely while it was |62407| records on 2018-01-30
it is |168884| as of 2018-02-20 !)

This is really bad for the users, so we need to investigate these and see how we can remedy (either by adding concept-mappings, or going back to the providers asking them to fill out this basic information somehow.)

Main collection contributing:

 Meertens collection: Liederenbank (110031)
 Institut für Deutsche Sprache, CLARIN-D Zentrum, Mannheim (14195)
 CLARIN Centres (9543)
 AGD (6695)
 CLARIN NL : D-LUCEA (3824)
 Meertens Collection: Soundbites (3448)
 Institut für Deutsche Sprache, CLARIN-D Zentrum, Mannheim (2295)
 TalkBank (1787)
 TLA: DiscAn (1271)
 Meertens collection: Etstoel (765)

Change History (8)

comment:1 Changed 6 years ago by Twan Goosen

At least some of these cases appear to be issues with the application of the mapping by the VLO; see https://github.com/clarin-eric/VLO/issues/147

comment:2 Changed 6 years ago by Twan Goosen

We have found a way of mitigating this and now are back to ~62k cases in production (and alpha). See VLO issue for details and updates.

comment:3 Changed 6 years ago by matej.durco@oeaw.ac.at

added two sub-issues for individual main offending collections:
#1049 - Liederbanken@Meertens
#1050 - IDS

comment:4 Changed 6 years ago by matej.durco@oeaw.ac.at

Cc: hanna.hedeland@uni-hamburg.de Twan Goosen Menzo Windhouwer added

I recollect, we had a special incosistent case at the meeting of two quite identical WeblichtService records (both with title in the metadata) - still one with title-facet empty, but I can't seem to find them again.
Can you help me spot it again?

comment:5 in reply to:  4 Changed 6 years ago by Twan Goosen

Replying to matej.durco@…:

I recollect, we had a special incosistent case at the meeting of two quite identical WeblichtService records (both with title in the metadata) - still one with title-facet empty, but I can't seem to find them again.
Can you help me spot it again?

These were probably an artefact of an issue with the importer. It has been resolved in the meantime, see VLO #147 on GitHub.

comment:6 Changed 6 years ago by Twan Goosen

Resolution: fixed
Status: newclosed

No longer a general issue, collection specific issues are covered by #1049 and #1050

comment:7 Changed 6 years ago by matej.durco@oeaw.ac.at

Well there are still a few collections, but let's see how much remains, if Meertens manages to resolve theirs.

comment:8 Changed 6 years ago by Twan Goosen

Resolution: fixed
Status: closedreopened

Good point, better to keep this open as a reminder to evaluate

Note: See TracTickets for help on using tickets.