Opened 3 years ago
Closed 2 years ago
#1088 closed defect (fixed)
OTA item missing from VLO search results
Reported by: | martin.wynne@bodleian.ox.ac.uk | Owned by: | teckart@informatik.uni-leipzig.de |
---|---|---|---|
Priority: | minor | Milestone: | |
Component: | VLO importer | Version: | |
Keywords: | | Cc: | |
Description
I expected to be able to find the 'Arabic Speech Corpus' (hosted by the OTA) in the VLO. When I searched for 'Arabic Speech Corpus' at vlo.clarin.eu, the OTA item did not appear among the results.
I tried searching for several other OTA items in the VLO, and they appeared in the results.
The item at the OTA can be found here: http://hdl.handle.net/20.500.12024/2561
Best,
Martin
Change History (2)
comment:1 Changed 3 years ago by
comment:2 Changed 2 years ago by
Resolution: | → fixed |
---|---|
Status: | new → closed |
Hello Martin,
Thanks for the hint! The resource is included in the VLO index [1]. It is not directly visible on the results page because the VLO's duplicate detection groups similar records (based on their title and resource language) and presents only one item per group in the results list. When you search for "Arabic Speech Corpus" [2], you will find the OTA record by clicking "The search results include 1 record with the same title." on the first record.
We will have to investigate how we can favour records provided by the resource owner over those provided by catalogues like ELRA's. The discussion and future work will be tracked in a GitHub issue [3].
Best,
Thomas
[1] https://vlo.clarin.eu/record/https_58__47__47_hdl.handle.net_47_20.500.12024_47_2561_64_format_61_cmdi
[2] https://vlo.clarin.eu/search?q=Arabic+Speech+Corpus
[3] https://github.com/clarin-eric/VLO/issues/316
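
The grouping behaviour described in the reply can be illustrated with a minimal sketch. This is not the VLO's actual implementation: the `Record` fields, the title normalisation, and the "first record seen is the one shown" rule are all assumptions made for illustration, and the third record is hypothetical. The sketch only shows why a record with the same title and resource language as another ends up hidden behind the first one.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    # Hypothetical record shape; the real VLO index has many more fields.
    title: str
    language: str
    provider: str


def group_key(record):
    # Assumed normalisation: ignore case and extra whitespace in the title.
    normalised_title = " ".join(record.title.lower().split())
    return (normalised_title, record.language.lower())


def group_duplicates(records):
    """Return (shown_record, hidden_duplicates) pairs, one per group."""
    groups = defaultdict(list)
    for record in records:
        groups[group_key(record)].append(record)
    # Dicts keep insertion order, so the first record seen in each group
    # is the one listed directly; the rest are collapsed behind it.
    return [(members[0], members[1:]) for members in groups.values()]


records = [
    Record("Arabic Speech Corpus", "Arabic", "ELRA"),
    Record("Arabic Speech Corpus", "Arabic", "OTA"),
    Record("Another Corpus", "English", "OTA"),  # hypothetical entry
]

results = group_duplicates(records)
for shown, hidden in results:
    print(f"{shown.title} ({shown.provider}): "
          f"{len(hidden)} record(s) with the same title")
```

In this sketch the OTA copy of "Arabic Speech Corpus" is collapsed behind the ELRA copy only because the ELRA record happens to come first. Favouring the resource owner's record would mean choosing the displayed member of each group by some provider preference instead of by input order.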