Opened 7 years ago

Last modified 6 years ago

#1009 assigned defect

BBAW - teiHeader - licensing info missing

Reported by: matej.durco@oeaw.ac.at Owned by: haaf@bbaw.de
Priority: major Milestone:
Component: MetadataCuration Version:
Keywords: Cc: Twan Goosen, Menzo.Windhouwer@mpi.nl, matej.durco@oeaw.ac.at, Christian Thomas, haaf@bbaw.de, teckart@informatik.uni-leipzig.de, herold@bbaw.de

Description (last modified by Twan Goosen)

The licensing info is not being displayed in the VLO for the ~ 3000 resources of the BBAW collection:
https://vlo.clarin.eu/?fqType=organisation:or&fq=organisation:Berlin-Brandenburgische+Akademie+der+Wissenschaften+%28BBAW%29

The CMDI records are instances of the teiHeader profile
https://catalog.clarin.eu/ds/ComponentRegistry/rest/registry/1.2/profiles/clarin.eu:cr1:p_1381926654438/xsd

The profile actually has a licence element
even with the correct ConceptLink?
http://hdl.handle.net/11459/CCR_C-3800_12a79edd-0ffe-8d82-9831-45d125c54aee

However the element is of complexType and carries another <p> element with a verbose description of the licensing information
and the actual URI-reference to the license is in the licence@target attribute.

Snippet of the schema:

 <xs:element name="licence"
	cmd:ConceptLink="http://hdl.handle.net/11459/CCR_C-3800_12a79edd-0ffe-8d82-9831-45d125c54aee"
	minOccurs="1" maxOccurs="1">
	<xs:complexType>
		<xs:sequence>
			<xs:element name="p" minOccurs="1"
				maxOccurs="unbounded" type="xs:string"
				cue:DisplayPriority="1">
			</xs:element>
		</xs:sequence>
		<xs:attribute ref="xml:base"/>
		<xs:attribute ref="cmd:ref"/>
		<xs:attribute name="target" type="xs:anyURI">
			<xs:annotation/>
		</xs:attribute>
	</xs:complexType>
 </xs:element>

Sample of original metadata:
http://www.deutschestextarchiv.de/api/cmdi/sprengel_blumen_1793

licence-element snippet:

	<cmdp:licence target="http://creativecommons.org/licenses/by-nc-sa/3.0/de/">
	 <cmdp:p>Die Textdigitalisate des Polytechnischen Journals stehen unter der Lizenz "Creative Commons Namensnennung - Nicht-kommerziell - Weitergabe unter gleichen Bedingungen 3.0 Deutschland" (CC BY-NC-SA 3.0 DE).
	 </cmdp:p>
	</cmdp:licence>

Considering also attribute values in the concept to facet mapping actually has been already implemented as well (#836) so it needs to be tracked down, why this specific case is not being picked up.
First guess is that the attribute is disregarded because it is in no namespace.

This record (and 4 others) actually has the licensing information stated:
https://vlo.clarin.eu/record?1&docId=http_58__47__47_hdl.handle.net_47_11858_47_00-203C-0000-0029-FA32-D

This seems to be the case, because next to the target-attribute it has a string value
Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) that is recognized by the license-normalizer.

You can also visit the data in the curation instance of the vlo, where there are additional facets available for exploring:

https://vlo.minerva.arz.oeaw.ac.at/vlo/search;jsessionid=1DE499BD9E892F288264F188AB0C7285?1&fqType=_componentProfile:or&fq=_componentProfile:teiHeader&fqType=organisation:or&fq=organisation:Berlin-Brandenburgische+Akademie+der+Wissenschaften+%28BBAW%29&fqType=profileId:or&fq=profileId:clarin.eu:cr1:p_1381926654438

Change History (7)

comment:1 Changed 7 years ago by DefaultCC Plugin

Cc: herold@bbaw.de added

comment:2 Changed 7 years ago by Twan Goosen

Description: modified (diff)

Added reference to #836 which implements concept link based value extraction for attributes

comment:3 Changed 7 years ago by Twan Goosen

Considering also attribute values in the concept to facet mapping actually has been already implemented as well (#836) so it needs to be tracked down, why this specific case is not being picked up.
First guess is that the attribute is disregarded because it is in no namespace.

Rather the issue seems to be that in this (version of) the profile, the licence/@target URI has no concept link:

<Component name="availability" ConceptLink="http://hdl.handle.net/11459/CCR_C-2453_1f0c3ea5-7966-ae11-d3c6-448424d4e6e8" CardinalityMin="0" CardinalityMax="1">
    <Component name="licence" ConceptLink="http://hdl.handle.net/11459/CCR_C-3800_12a79edd-0ffe-8d82-9831-45d125c54aee" CardinalityMin="1" CardinalityMax="1">
        <AttributeList>
            <Attribute name="target" ValueScheme="anyURI"/>
        </AttributeList>
        <Element name="p" ValueScheme="string" CardinalityMin="1" CardinalityMax="unbounded" Multilingual="false" cue:DisplayPriority="1" xmlns:cue="http://www.clarin.eu/cmdi/cues/1"/>
    </Component>
</Component>

IFF the @target attribute in this specific context is supposed to contain a licence URI, I would suggest linking this attribute to concept http://hdl.handle.net/11459/CCR_C-6586_2c79d86a-5a75-0890-d407-7d9cb86b9beb.

comment:4 Changed 7 years ago by matej.durco@oeaw.ac.at

Thomas, Susanne,
do you agree to Twan's proposition to add a ConceptLink for the @target attribute?

comment:5 Changed 7 years ago by teckart@informatik.uni-leipzig.de

Sounds reasonable. I assume that there won't be more centres using this profile, so it is unlikely that we'll add noise to the VLO.

comment:6 Changed 7 years ago by haaf@bbaw.de

Yes, thanks for the hints and infos! We are about to update our profile wrt several other aspects shortly. So, I guess, we can include the concept link for @target, as well.

comment:7 Changed 6 years ago by matej.durco@oeaw.ac.at

Owner: changed from Dieter Van Uytvanck to haaf@bbaw.de
Status: newassigned

will be handled in the course of work on #1011

Note: See TracTickets for help on using tickets.