Changes between Version 26 and Version 27 of VLO/CMDI data workflow framework


Ignore:
Timestamp:
11/13/15 11:41:22 (9 years ago)
Author:
go.sugimoto@oeaw.ac.at
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • VLO/CMDI data workflow framework

    v26 v27  
    122122
    123123
     124== After a meeting 2015-11-11 (Menzo, Dieter, Davor, Matej and Go) ==
     125
     126This is the new diagram that Go comes up with after a technical discussion, which focuses on the Dashboard part. It may be different from what others think, but he thinks that the more modualisation of data ingestion steps will make the process much more clear. In his opinion, the roles of current harvester+viewer and curation modules are a little underspecified and overlapping, therefore, the diagram is an attempt to clarify a bit more what module/component will work on what in order to produce the expected results in Dashboard.
     127
     128
     129One of the important aspect toward the Dashboard is that it has two primary functionalities. The first functionality is to control all the ingestion modules (harvester, ), so that it can, for example, manually stop the harvesting, or changes the mapping definition, and re-index the published data sets. The second functionality is to
     130monitor the ingestion process. That means each module will communicate with the reports database to provide statistics about a particular data transaction. For instance, harvester will supply the statistics about the outcome of the harvesting, while mapper/normaliser will tell the coverage of facets and controlled vocabularies. Indexer will tell the total number of indexed records and broken links. Based on this database, the Dashboard will be able to produce data quality reports which can not only be viewed in the Dashboard itself, but also in a PDF file which each data provider can access. The reports database could provide API for internal and external services to create a viewer (eg harvesting viewer), but it is optional, because the Dashboard is the main interface.
     131
    124132== Reference ==
    125133The initial idea is developed under another document [https://docs.google.com/document/d/1OoxDEFoZKhmotk7tbrElqcn79acKnj4T897sNctMYH8/edit?usp=sharing] in which you can see the idea of even further future.