Changes between Version 3 and Version 4 of CMD2RDF/sysarch
- Timestamp:
- 05/21/14 14:31:06 (10 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
CMD2RDF/sysarch
v3 v4 75 75 1. Process the CMD records (in multiple threads) 76 76 a. for each record process the actions sequentially, where the output of each action is the input of the next 77 b. the result of the last action should be RDF XML to be uploaded to the Graph Store 77 b. the input of the first action should be a CMD record from the OAI harvester result set 78 c. the result of the last action should be RDF XML to be uploaded to the Graph Store 78 79 2. Process the profiles (as used by the records) (in multiple threads) 79 80 a. for each profile process the actions sequentially 80 b. the result of the last action should be RDF XML to be uploaded to the Graph Store 81 b. the input of the first action is a CMD profile from the CMD2RDF cache 82 c. the result of the last action should be RDF XML to be uploaded to the Graph Store 81 83 3. Process the components (as used by the profiles) (in multiple threads) 82 84 a. for each component process the actions sequentially 83 b. the result of the last action should be RDF XML to be uploaded to the Graph Store 85 b. the input of the first action is a CMD component from the CMD2RDF cache 86 c. the result of the last action should be RDF XML to be uploaded to the Graph Store 84 87 85 88 '''Note''': for phase 3 phase 2 could output the components, which might complicated as until now they would just request resources from the cache not store them. Phase 2 could also just request the used components from the Component Registry and thus trigger caching from them. This is some extra load for the Component Registry, but might keep the CMD2RDF caching API simpler. … … 89 92 For inspiration: the [[https://github.com/TheLanguageArchive/oai-harvest-manager|OAI harvester]] allows to configure actions on harvested records, e.g., to convert OLAC to CMDI. 90 93 94 '''TODO:''' the harvest isn't incremental, but CMD2RDF could be. It could handle only the delta of a previous run, i.e., convert and enrich new CMD records, convert and enrich updated CMD records, remove graphs for deleted CMD records, the same for profiles and components. MD5 checksums might help to determine if CMD records have been changed ... 95 96 91 97 = CMD-RDF 92 98