wiki:Collaborations/Europeana

Version 8 (modified by Dieter Van Uytvanck, 9 years ago) (diff)

--

our contact persons

Christian Thomas, Susanne Haaf, Axel Herold (BBAW), Dieter Van Uytvanck

their contact persons

Alastair Dunning (coordination), Nuno Freire (technical, metadata), Antoine Isaac (metadata, LOD)

collaboration

harvesting metadata

full-text access

  • look at use cases with full text OCRed data (see note below)

information from Nuno Freire

Meanwhile, maybe you can have a look at what we have in Github. It contains only one newspaper title, but you can see the metadata available and the fulltext (as plain text files).
https://github.com/nfreire/HistoricalNewspapersCorpus/

Some more notes:
- We will not continue using Github (so don't expect to have the materials available through Git)
- The metadata formats in there are the final available ones: Dublin Core, EDM (Europeana Data Model - a richer format used in the Europeana network)
- If we find the storage capacity, we will make available also the ALTO files, along with the plain text. 

more information

Attachments (4)