Changes between Version 31 and Version 32 of CmdiVirtualLanguageObservatory


Ignore:
Timestamp:
12/16/10 11:22:36 (13 years ago)
Author:
dietuyt
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • CmdiVirtualLanguageObservatory

    v31 v32  
    1 == Notes on the re-implementation of the VLO ==
     1= Notes on the re-implementation of the VLO =
    22
    3 === inspiration sources ===
     3== inspiration sources ==
    44
    55 * VLO 1.0: http://ems06.mpi.nl/cgi-bin/flamenco.cgi/flamh/Flamenco
    66 * OLAC facet browser: http://syslsl01.library.upenn.edu/dla/olac/index.html
    77
    8 === technology to be used ===
     8== technology to be used ==
    99
    1010 * SOLR for the backend
     
    1212 * CMDI for the input
    1313
    14 === data sources ===
     14== data sources ==
    1515
    1616 * The CMDI'zed IMDI corpus: http://www.mpi.nl/imdi/documents/cmdi-20100924.tar.gz
     
    1818 * CMDI version of the LRT inventory: http://www.mpi.nl/imdi/documents/lrt-20101201.tar.gz
    1919
    20 === requirements ===
     20== requirements ==
    2121 * in first instance same functionality as VLO 1.0
    2222 * using CMDI as direct input
     
    2626 * link the ISO-639-3 language codes to http://www.clarin.eu/external/language.php?code=eng so that users can get more information about a used language at each point
    2727
    28 === suggested mapping CMDI OLAC profile > VLO 2.0 fields/facets ===
     28== suggested mapping CMDI OLAC profile > VLO 2.0 fields/facets ==
     29
     30=== Name ===
    2931
    3032/CMD/Components/OLAC-!DcmiTerms/title -> name
    3133
     34=== Subject ===
     35
     36Use:
    3237
    3338/CMD/Components/OLAC-!DcmiTerms/subject[@olac-linguistic-field] -> subject
    34 
    35 if this is not available use:
    36 
    37 /CMD/Components/OLAC-!DcmiTerms/subject -> subject
    38 when multiple subject elements exist, give a preference to (in order of preference):
    39 
    40 [@dcterms-type="LCSH"]
     39/CMD/Components/OLAC-!DcmiTerms/subject -> subject when [@dcterms-type="LCSH"]
    4140
    4241ignore (for now, until we have a resolution solution for the numeric codes):
    43 [@dcterms-type="DDC"]
    44 [@dcterms-type="LCC"]
    4542
     43/CMD/Components/OLAC-!DcmiTerms/subject when [@dcterms-type="DDC"]
     44/CMD/Components/OLAC-!DcmiTerms/subject when [@dcterms-type="LCC"]
     45
     46ignore (because it results in too much noise):
     47
     48/CMD/Components/OLAC-!DcmiTerms/subject
     49
     50=== Organisation ===
    4651
    4752/CMD/Components/OLAC-!DcmiTerms/publisher -> organisation
    4853
     54=== Id ===
     55
    4956/CMD/Header/MdSelfLink -> id
     57
     58=== Language ===
    5059
    5160/CMD/Components/OLAC-!DcmiTerms/language[@olac-language] -> language
     
    5362/CMD/Components/OLAC-!DcmiTerms/subject[@olac-language] -> language
    5463
     64=== Origin ===
     65
    5566/CMD/Header/MdSelfLink (URL after first ":") (via OAI-PMH) -> origin
     67
     68=== Genre ===
    5669
    5770/CMD/Components/OLAC-!DcmiTerms/type[@olac-linguistic-type] -> genre
    5871
     72=== Description ===
     73
    5974/CMD/Components/OLAC-!DcmiTerms/description -> description
    6075
     76=== open in original context ===
     77
    6178/CMD/Components/OLAC-!DcmiTerms/identifier (if starting with !http:// or hdl:) -> open in original context (now: IMDI browser)
     79
     80=== Year ===
    6281
    6382/CMD/Components/OLAC-!DcmiTerms/date -> year (new facet, extract yyyy from yyyy-mm-dd or yyyy-mm-ddThh:mm:ssZ or take over yyyy)
     
    7695}}}
    7796
     97=== Resource Type ===
    7898
    7999/CMD/Components/OLAC-!DcmiTerms/type[@dcterms-type="DCMIType"] -> resource type (new facet)
    80100
     101=== Country ===
     102
    81103/CMD/Components/OLAC-!DcmiTerms/spatial[@dcterms-type="ISO3166"] -> country
     104/CMD/Components/OLAC-!DcmiTerms/coverage[@dcterms-type="ISO3166"] -> country
    82105
    83 /CMD/Components/OLAC-!DcmiTerms/coverage[@dcterms-type="ISO3166"] -> country
     106=== Format ===
    84107
    85108/CMD/Components/OLAC-!DcmiTerms/format[@dcterms-type="IMT"] -> format (new facet, contains mime type)