Opened 13 years ago
Closed 12 years ago
#41 closed defect (wontfix)
subject mapping for OLAC
Reported by: | dietuyt | Owned by: | dietuyt |
---|---|---|---|
Priority: | major | Milestone: | |
Component: | VLO web app | Version: | |
Keywords: | Cc: |
Description
/CMD/Components/OLAC-DcmiTerms?/subject -> subject when multiple subject elements exist, give a preference to (in order of preference):
[@dcterms-type="LCSH"]
ignore (for now, until we have a resolution solution for the numeric codes): [@dcterms-type="DDC"] [@dcterms-type="LCC"]
Change History (9)
comment:1 Changed 13 years ago by
comment:2 Changed 13 years ago by
Also, if the subject contains an attribute olac-language that should be mapped on the language facet:
/CMD/Components/OLAC-DcmiTerms/subject[@olac-language] -> language
comment:5 Changed 13 years ago by
The LCSH subjects (several terms divided by --) could be processed by the OLAC2CMDI.xsl. However it would result in CMDI files with duplicate lines,eg:
<subject>Tagalog language</subject> <subject>Tagalog language</subject>
The best way to removed duplicate elements in XSLT is with a postprocessing XSL file, like source:metadata/trunk/toolkit/xslt/comp2schema-v2/cleanup-xsd.xsl
However this complicates the processing of all incoming OLAC records quite a lot as it depends on saxon-specific extensions of XSLT. So we should reconsider doing this during the import process.
comment:6 Changed 13 years ago by
The vlo importer ignored double values so:
<subject>Tagalog language</subject> <subject>Tagalog language</subject>
will result in one facet subject -> Tagalog language.
This:
<subject>Tagalog language</subject> <subject>Another lang</subject>
will result in two subject facets with different values.
comment:7 Changed 12 years ago by
Owner: | changed from patdui to herste |
---|---|
Status: | new → assigned |
comment:8 Changed 12 years ago by
Owner: | changed from herste to dietuyt |
---|
comment:9 Changed 12 years ago by
Resolution: | → wontfix |
---|---|
Status: | assigned → closed |
I double-checked and with the removal of GIAL the last provider with LCSH olac subjects has gone. This ticket is thus no longer relevant (it might be re-opened one day if we get such a provider again).
split the string by "--"
add the first 2 elements as a separate facet
eg Tagalog language--Foreign words and phrases--Spanish should become 2
facet values:
Tagalog language
Foreign words and phrases