Changes between Version 3 and Version 4 of CMD2RDF/sysarch


Ignore:
Timestamp:
05/21/14 14:31:06 (10 years ago)
Author:
Menzo Windhouwer
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • CMD2RDF/sysarch

    v3 v4  
    75751. Process the CMD records (in multiple threads)
    7676  a. for each record process the actions sequentially, where the output of each action is the input of the next
    77   b. the result of the last action should be RDF XML to be uploaded to the Graph Store
     77  b. the input of the first action should be a CMD record from the OAI harvester result set
     78  c. the result of the last action should be RDF XML to be uploaded to the Graph Store
    78792. Process the profiles (as used by the records) (in multiple threads)
    7980  a. for each profile process the actions sequentially
    80   b. the result of the last action should be RDF XML to be uploaded to the Graph Store
     81  b. the input of the first action is a CMD profile from the CMD2RDF cache
     82  c. the result of the last action should be RDF XML to be uploaded to the Graph Store
    81833. Process the components (as used by the profiles) (in multiple threads)
    8284  a. for each component process the actions sequentially
    83   b. the result of the last action should be RDF XML to be uploaded to the Graph Store
     85  b. the input of the first action is a CMD component from the CMD2RDF cache
     86  c. the result of the last action should be RDF XML to be uploaded to the Graph Store
    8487
    8588'''Note''': for phase 3 phase 2 could output the components, which might complicated as until now they would just request resources from the cache not store them. Phase 2 could also just request the used components from the Component Registry and thus trigger caching from them. This is some extra load for the Component Registry, but might keep the CMD2RDF caching API simpler.
     
    8992For inspiration: the [[https://github.com/TheLanguageArchive/oai-harvest-manager|OAI harvester]] allows to configure actions on harvested records, e.g., to convert OLAC to CMDI.
    9093
     94'''TODO:''' the harvest isn't incremental, but CMD2RDF could be. It could handle only the delta of a previous run, i.e., convert and enrich new CMD records, convert and enrich updated CMD records, remove graphs for deleted CMD records, the same for profiles and components. MD5 checksums might help to determine if CMD records have been changed ...
     95
     96
    9197= CMD-RDF
    9298