Opened 6 years ago

Last modified 6 years ago

#1051 assigned task

tei-consolidation sprint

Reported by: matej.durco@oeaw.ac.at Owned by: haaf@bbaw.de
Priority: major Milestone:
Component: MetadataCuration Version:
Keywords: Cc: haaf@bbaw.de, Twan Goosen, Menzo Windhouwer

Description (last modified by matej.durco@oeaw.ac.at)

Currently we have (at least) 3 teiHeader-profiles and especially BBAW is suffering bad curation marks due to some modelling issues resulting in bad facet coverage.
Thus we would like to try to update / create a new TEI-based profile that is:

  • less faithful to the TEI for the sake of better facet coverage,
  • as general as possible (based on teiHeaders we know of)
  • but still possible to generate automatically (through a XSL-script) from at least the subset of teiHeaders we know

teiHeader developed by CLARIN-DK:
cr1:p_1380106710826/xsd curation: clarin.eu:cr1:p_1380106710826 - uses as concept-links concepts defined in CCR as "proxies" to tei-elements.

Older (2013) report in smc-browser on the various teiHeader profiles: https://clarin.oeaw.ac.at/smc-browser/docs/smc-report_teiHeader.html

DTA-teiHeader

Next to the published teiHeader: clarin.eu:cr1:p_1345180279115
curate: clarin.eu:cr1:p_1345180279115

A new one is currently in development: clarin.eu:cr1:p_1381926654438
curate: clarin.eu:cr1:p_1381926654438 (coverage: 7/15)

Example record in VLO

See attached reports from mapping checker for both profiles.

Following facets are not covered by the new profile:

facet XPath - issue
modality textDesc/channel/@mode - needs concept-link on @mode-attribute
keywords profileDesc/textClass/classCode
subject profileDesc/textClass/classCode
format ?
description ?
availability publicationStmt/availability/licence/@target,p - concept-link is on licence element, but content is inside p
license needs conceptlink on the licence/@targetattribute
resourceClass /textClass/textDesc/channel

teiHeader provides generic mechanism for all kinds of classifications: profileDesc/textClass/classCode qualified with @scheme-attribute. The values would be usable for the facets: keywords, subject, genre. But the mapping would need to be distinguished based on the scheme. Thus a single concept link cannot be applied. However, concept-facet-mapping should still be possible through fallback Xpath patterns.

Attachments (2)

VLO mapping for profile teiHeader (clarin.eu cr1 p_1345180279115).htm (13.0 KB) - added by matej.durco@oeaw.ac.at 6 years ago.
VLO mapping for profile teiHeader (clarin.eu cr1 p_1381926654438).htm (14.3 KB) - added by matej.durco@oeaw.ac.at 6 years ago.

Download all attachments as: .zip

Change History (5)

comment:1 Changed 6 years ago by matej.durco@oeaw.ac.at

Description: modified (diff)

comment:2 Changed 6 years ago by matej.durco@oeaw.ac.at

Cc: Twan Goosen Menzo Windhouwer added
Description: modified (diff)
Owner: changed from matej.durco@oeaw.ac.at to haaf@bbaw.de
Status: newassigned

comment:3 Changed 6 years ago by haaf@bbaw.de

I updated: clarin.eu:cr1:p_1381926654438 yesterday:

  1. corrected the conceptLinks for channel, abstract and channel/@mode according to VLO mapping for resource class, description and modalitiy
  2. eliminated channel/p and abstract/p

... still bad results in the Curation Module for these facets. How come?

Note: See TracTickets for help on using tickets.