Opened 3 years ago

Closed 2 years ago

#1088 closed defect (fixed)

OTA item missing from VLO search results

Reported by: martin.wynne@bodleian.ox.ac.uk Owned by: teckart@informatik.uni-leipzig.de
Priority: minor Milestone:
Component: VLO importer Version:
Keywords: Cc:

Description

I expected to be able to find the 'Arabic Speech Corpus' (hosted by the OTA) in the VLO. When I searched for 'Arabic Speech Corpus' at vlo.clarin.eu, the OTA item did not appear among the results.

I tried searching for several other OTA items in the VLO, and they appeared in the results.

The item at the OTA can be found here.

http://hdl.handle.net/20.500.12024/2561

Best,
Martin

Change History (2)

comment:1 Changed 3 years ago by teckart@informatik.uni-leipzig.de

Hello Martin,

thanks for the hint! The resource is included in the VLO index ([1]). The reason why it isn't directly visible on the results page is the duplicate detection of the VLO that groups similar records (based on their title and resource language) and presents only one item directly in the results list. When searching for "Arabic Speech Corpus" [2] you will find the OTA record when you click on "The search results include 1 record with the same title." on the first record.

We will have to investigate how we can favour records provided by the resource owner over those provided by catalogues like ELRA's. The discussion and future work will be tracked in a Github issue [3].

Best,
Thomas

[1] https://vlo.clarin.eu/record/https_58__47__47_hdl.handle.net_47_20.500.12024_47_2561_64_format_61_cmdi
[2] https://vlo.clarin.eu/search?q=Arabic+Speech+Corpus
[3] https://github.com/clarin-eric/VLO/issues/316

comment:2 Changed 2 years ago by Twan Goosen

Resolution: fixed
Status: newclosed
Note: See TracTickets for help on using tickets.