Changes between Version 8 and Version 9 of Collaborations/Perseus
- Timestamp: 09/25/15 15:48:09 (9 years ago)
Collaborations/Perseus
v8  v9
14  14    '''Possible RDA Collaboration Project:'''
15  15
16      - User Stories
    16  + ''Objective:''
    17  +
    18  + Leverage the PID Types API and Data Types Registry define and implement a common, interoperable model for the relationship between images, ocr scans and books.
    19  +
    20  + This solves a data management problem for Perseus/OPP, and make the data and solution available to other CLARIN centers.
    21  +
    22  + Secondary benefits:
    23  +
    24  + * makes Perseus/OPP Catalog data through CLARIN Federated search
    25  + * provides a step towards interoperability between CTS URNs and Handles.
    26  +
    27  + ''User Stories:''
17  28
18  29    * I want to be able to search for medieval German texts from between the 11th and 12th century and retrieve pictures of this book so I can feed it to (1) an OCR pipeline, (2) a transcription/crowdsourcing platform and (3) other machine learning processes.
 …   …
20  31    * I want to be able to create a monograph using linked data where I reference a picture of the manuscript of a cited work.
21  32
22      - Solutions provided:
    33  + ''Data Perseus and OPP have:''
23  34
24      - Data Management solution for OPP Image/OCR/Book data
25      - * defines a common interoperable model for the relationship between images, ocr scans and books
26      - * solves a data management problem for Perseus/OPP
27      - * makes this data and solution available to other CLARIN centers
28      -
29      - PID Management solution for CTS URNs
30      - * step towards interoperability between CTS URNs and Handles
31      -
32      - Availability of Perseus/OPP Catalog data through CLARIN Federated search
33      -
34      - Data Perseus and OPP have:
35  35    * Images of Manuscripts and Books [ OPP ]
36  36    * OCR Scans of Manuscripts and Books [ OPP ]
37  37    * Catalog Metadata (MODS and MADS and CTS) [ Perseus ]
38  38
39      - What we want to be able to do:
    39  + ''What we want to be able to do:''
40  40
    41  + * clearly distinguish between these data types
41  42    * assign a persistent identifier to each image
 …   …
46  47    * make all of this data searchable and retrieval through CLARIN Federated Search endpoint
47  48
48      - Relevant RDA Components:
    49  + ''Relevant RDA Components:''
49  50
50  51    * Data Types Registry to manage and describe the data types