Changes between Version 31 and Version 32 of CmdiVirtualLanguageObservatory
- Timestamp:
- 12/16/10 11:22:36 (13 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
CmdiVirtualLanguageObservatory
v31 v32 1 = = Notes on the re-implementation of the VLO ==1 = Notes on the re-implementation of the VLO = 2 2 3 == = inspiration sources ===3 == inspiration sources == 4 4 5 5 * VLO 1.0: http://ems06.mpi.nl/cgi-bin/flamenco.cgi/flamh/Flamenco 6 6 * OLAC facet browser: http://syslsl01.library.upenn.edu/dla/olac/index.html 7 7 8 == = technology to be used ===8 == technology to be used == 9 9 10 10 * SOLR for the backend … … 12 12 * CMDI for the input 13 13 14 == = data sources ===14 == data sources == 15 15 16 16 * The CMDI'zed IMDI corpus: http://www.mpi.nl/imdi/documents/cmdi-20100924.tar.gz … … 18 18 * CMDI version of the LRT inventory: http://www.mpi.nl/imdi/documents/lrt-20101201.tar.gz 19 19 20 == = requirements ===20 == requirements == 21 21 * in first instance same functionality as VLO 1.0 22 22 * using CMDI as direct input … … 26 26 * link the ISO-639-3 language codes to http://www.clarin.eu/external/language.php?code=eng so that users can get more information about a used language at each point 27 27 28 === suggested mapping CMDI OLAC profile > VLO 2.0 fields/facets === 28 == suggested mapping CMDI OLAC profile > VLO 2.0 fields/facets == 29 30 === Name === 29 31 30 32 /CMD/Components/OLAC-!DcmiTerms/title -> name 31 33 34 === Subject === 35 36 Use: 32 37 33 38 /CMD/Components/OLAC-!DcmiTerms/subject[@olac-linguistic-field] -> subject 34 35 if this is not available use: 36 37 /CMD/Components/OLAC-!DcmiTerms/subject -> subject 38 when multiple subject elements exist, give a preference to (in order of preference): 39 40 [@dcterms-type="LCSH"] 39 /CMD/Components/OLAC-!DcmiTerms/subject -> subject when [@dcterms-type="LCSH"] 41 40 42 41 ignore (for now, until we have a resolution solution for the numeric codes): 43 [@dcterms-type="DDC"]44 [@dcterms-type="LCC"]45 42 43 /CMD/Components/OLAC-!DcmiTerms/subject when [@dcterms-type="DDC"] 44 /CMD/Components/OLAC-!DcmiTerms/subject when [@dcterms-type="LCC"] 45 46 ignore (because it results in too much noise): 47 48 /CMD/Components/OLAC-!DcmiTerms/subject 49 50 === Organisation === 46 51 47 52 /CMD/Components/OLAC-!DcmiTerms/publisher -> organisation 48 53 54 === Id === 55 49 56 /CMD/Header/MdSelfLink -> id 57 58 === Language === 50 59 51 60 /CMD/Components/OLAC-!DcmiTerms/language[@olac-language] -> language … … 53 62 /CMD/Components/OLAC-!DcmiTerms/subject[@olac-language] -> language 54 63 64 === Origin === 65 55 66 /CMD/Header/MdSelfLink (URL after first ":") (via OAI-PMH) -> origin 67 68 === Genre === 56 69 57 70 /CMD/Components/OLAC-!DcmiTerms/type[@olac-linguistic-type] -> genre 58 71 72 === Description === 73 59 74 /CMD/Components/OLAC-!DcmiTerms/description -> description 60 75 76 === open in original context === 77 61 78 /CMD/Components/OLAC-!DcmiTerms/identifier (if starting with !http:// or hdl:) -> open in original context (now: IMDI browser) 79 80 === Year === 62 81 63 82 /CMD/Components/OLAC-!DcmiTerms/date -> year (new facet, extract yyyy from yyyy-mm-dd or yyyy-mm-ddThh:mm:ssZ or take over yyyy) … … 76 95 }}} 77 96 97 === Resource Type === 78 98 79 99 /CMD/Components/OLAC-!DcmiTerms/type[@dcterms-type="DCMIType"] -> resource type (new facet) 80 100 101 === Country === 102 81 103 /CMD/Components/OLAC-!DcmiTerms/spatial[@dcterms-type="ISO3166"] -> country 104 /CMD/Components/OLAC-!DcmiTerms/coverage[@dcterms-type="ISO3166"] -> country 82 105 83 /CMD/Components/OLAC-!DcmiTerms/coverage[@dcterms-type="ISO3166"] -> country 106 === Format === 84 107 85 108 /CMD/Components/OLAC-!DcmiTerms/format[@dcterms-type="IMT"] -> format (new facet, contains mime type)