Viewable

This page collects information about the different data/resource types with respect to how they are displayed to the user, and about the corresponding Viewers.

Data / Resource Types

Resource Types: this is a tentative taxonomy of relevant data types. It should map (more or less 1-to-1) to the DataView@type attribute in the FCS-record data model.

Data

The tentative distinction between Data and Resource is meant to express the need for (structured) inline content that may not be addressable. We may also need to think of some kind of "Temporary Resources", i.e. intermediate (computed) data that would be available/addressable only for a limited period of time (and would not get a PID). This need has been expressed on multiple occasions; we have a similar situation with Virtual Collections or CMD-Profiles/Components.

Another point regarding Data is that it is expected to be mainly (though not exclusively) computed data, aggregating/summarizing result sets, collections, etc. While some services may be able to provide such summaries "natively", we have to think of a separate service that would be able to perform the computation: a Summarizer.
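To make the idea of a Summarizer more concrete, here is a minimal sketch of what such a computation could look like: counting the values of selected metadata fields over a result set. The function name `summarize`, the record layout and the field values are assumptions made for illustration; nothing here reflects an existing service interface.

```python
from collections import Counter


def summarize(records, facets):
    """Count how often each value occurs for the selected facets.

    `records` is a list of dicts (one per result); this layout is assumed.
    """
    summary = {facet: Counter() for facet in facets}
    for record in records:
        for facet in facets:
            value = record.get(facet)
            if value is not None:  # skip records lacking this field
                summary[facet][value] += 1
    return summary


# Hypothetical result set; field names and values are illustrative only.
records = [
    {"ResourceType": "Corpus", "Country": "AT"},
    {"ResourceType": "Corpus", "Country": "DE"},
    {"ResourceType": "Lexicon", "Country": "AT"},
]
summary = summarize(records, ["ResourceType", "Country"])
```

The result is itself a small piece of computed, inline data of the kind described above, and could be handed to a List or Table viewer.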

Formats and Viewers

work in progress TODO: make consistent with DataTypes?

Collecting existing Formats and corresponding possible Viewers.

Type | Format | Viewer | URL
List | XML, HTML | HTML-Lists (nested) |
Table | XML, HTML | HTML-Tables |
KWIC | FCS | "natively" as HTML-List/Table?, linfovis-DoubleTree for advanced Visualization |
Annotations | EAF | Annex viewer provided by MPI | sample view
Annotations | TCF | AnnotationViewer? provided by Tübingen | sample view
Geolocation | KML | Google-Maps? http://openlayers.org/; europeana4D | e4D interface
Time-based data | table? | chart, timeline? |
Book | eg TEI | bookviewer? |
Syntax tree | TCF | AnnotationViewer? |
Graph | dot, graphml | GraphViz?, IsaViz? |
Metadata | CMD | MDBrowser, ARBIL MDEditor |
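One way to use such a table in a client is a simple lookup from (type, format) to a viewer. The sketch below is only an illustration: the registry name `VIEWERS`, the function `pick_viewer` and the choice of entries are assumptions, and the viewers marked with a question mark in the table remain just as tentative here.

```python
# Hypothetical registry keyed by (type, format); a subset of the table rows.
VIEWERS = {
    ("List", "HTML"): "HTML-Lists (nested)",
    ("Table", "HTML"): "HTML-Tables",
    ("Annotations", "EAF"): "Annex",
    ("Annotations", "TCF"): "AnnotationViewer",
    ("Syntax tree", "TCF"): "AnnotationViewer",
    ("Graph", "dot"): "GraphViz",
    ("Metadata", "CMD"): "MDBrowser",
}


def pick_viewer(data_type, fmt):
    """Return the registered viewer name, or None if no viewer is known."""
    return VIEWERS.get((data_type, fmt))


viewer = pick_viewer("Annotations", "TCF")
unknown = pick_viewer("Book", "TEI")  # not registered in this sketch
```

Returning None for unregistered combinations leaves the fallback decision (for example, showing the raw data) to the caller.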

KWIC

The all-too-well-known basic KWIC table view, with the keyword aligned, and an advanced interactive visualization with the DoubleTree tool developed by linfovis, Bozen:
[Screenshots: KWIC view (Korpus C4); Linfovis DoubleTree]
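The basic keyword-aligned view can be sketched in a few lines. This is a minimal illustration, assuming whitespace-tokenized input and a fixed context width; the function names `kwic` and `render` are hypothetical and not part of FCS.

```python
def kwic(tokens, keyword, width=3):
    """Return (left context, keyword, right context) for each hit."""
    hits = []
    for i, token in enumerate(tokens):
        if token.lower() == keyword.lower():
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            hits.append((left, token, right))
    return hits


def render(hits):
    """Right-align the left contexts so all keywords share one column."""
    pad = max((len(left) for left, _, _ in hits), default=0)
    return [f"{left:>{pad}}  {kw}  {right}" for left, kw, right in hits]


text = "the cat sat on the mat and the dog sat on the rug"
hits = kwic(text.split(), "sat")
rows = render(hits)
```

The same triples could equally be emitted as rows of an HTML table with three cells, which is what "natively as HTML-List/Table" would amount to.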

Annotations

Annex, Annotation Viewer, Annis

The annotation exploration/visualization tool Annex operates on EAF (and other?) formats and displays the annotation layers synchronized with the video. The Annotation Viewer provided by Uni Tübingen supports the TCF format and has dedicated display types depending on the tier type; in particular, there is a special display for syntax trees:
[Screenshots: Annex; AnnotationViewer (for TCF) by Uni Tübingen]

Another tool for exploring multilevel linguistic corpora is Annis. A simple option for displaying syntax trees is an online service that renders a syntax tree as SVG/PNG from a phrase entered in bracket notation, such as ironcreek:
[Screenshots: the tool Annis; sample syntax tree rendering by the online service ironcreek]
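For reference, bracket notation is easy to turn into a nested structure that any tree renderer could consume. The following is a minimal sketch, assuming square brackets and whitespace-separated labels and words; the exact input syntax accepted by ironcreek is not documented here, and the function name `parse_brackets` is hypothetical.

```python
def parse_brackets(text):
    """Parse e.g. "[S [NP John] [VP sleeps]]" into (label, children) tuples."""
    tokens = text.replace("[", " [ ").replace("]", " ] ").split()
    pos = 0

    def next_token():
        nonlocal pos
        if pos >= len(tokens):
            raise ValueError("unbalanced brackets")
        token = tokens[pos]
        pos += 1
        return token

    def node():
        nonlocal pos
        label = next_token()  # the opening "[" was consumed by the caller
        children = []
        while True:
            token = next_token()
            if token == "]":
                return (label, children)
            # A "[" starts a nested constituent; anything else is a leaf word.
            children.append(node() if token == "[" else token)

    if next_token() != "[":
        raise ValueError("expected '[' at start")
    tree = node()
    if pos != len(tokens):
        raise ValueError("trailing input after tree")
    return tree


tree = parse_brackets("[S [NP John] [VP sleeps]]")
```

The resulting nested tuples can then be walked to emit SVG, or converted to dot for GraphViz.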

Collections

Visualizing collections operates primarily on the metadata of the resources that the collection groups, selecting certain fields/facets for display, e.g. publicationDate, ResourceType, Country or similar.

So one suitable display is a faceted browser, as used by the VLO (http://catalog.clarin.eu/ds/vlo/).

One interesting visualization type could be the Treemap (aka areamap), which makes it possible to visualize the inner structure of arbitrarily large data in a bounded space.

Treemap sample (from SmartMoney portal)

The main challenge is to introduce some sensible notion of size of individual resources.

The use of color makes it possible to project selected phenomena onto the map.
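To illustrate how a size measure translates into areas, here is a sketch of a single level of the simple slice-and-dice treemap layout: a rectangle is divided among the items in proportion to their sizes. The function name and the size values are assumptions; choosing what "size" means (number of records, tokens, bytes) is exactly the open question noted above.

```python
def slice_and_dice(items, x, y, w, h, vertical=True):
    """Split rectangle (x, y, w, h) among (name, size) items by size.

    With vertical=True the rectangle is cut into side-by-side strips;
    a nested treemap would alternate the direction on each level.
    """
    total = sum(size for _, size in items)
    if total <= 0:
        raise ValueError("total size must be positive")
    rects = []
    offset = 0.0
    for name, size in items:
        if vertical:
            width = w * size / total
            rects.append((name, x + offset, y, width, h))
            offset += width
        else:
            height = h * size / total
            rects.append((name, x, y + offset, w, height))
            offset += height
    return rects


# Hypothetical collection sizes, for illustration only.
items = [("Corpus", 60), ("Lexicon", 30), ("Grammar", 10)]
rects = slice_and_dice(items, 0, 0, 100, 50)
```

Each returned tuple is (name, x, y, width, height); a color per rectangle could then be derived from any other facet of the resource.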

Time

Simple frequency-distribution charts per time frame (normally a year), but also visualizations of duration:
[Screenshots: DWDS corpus time statistics, showing the occurrence of a word over time and text type in the DWDS "Kerncorpus"; tentative display of the publishing periods of various periodicals (at AAC)]
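The per-year frequency distribution behind such a chart is a plain aggregation. The sketch below assumes hits arrive as (year, text) pairs, which is an illustrative layout and not a DWDS or FCS format; the text chart merely stands in for a real charting widget.

```python
from collections import Counter


def frequency_by_year(dated_hits):
    """Return sorted (year, count) pairs from (year, text) hits."""
    counts = Counter(year for year, _ in dated_hits)
    return sorted(counts.items())


def text_chart(series, width=10):
    """Render one bar per year, scaled so the peak year fills `width`."""
    peak = max((count for _, count in series), default=1)
    return [f"{year} {'#' * round(width * count / peak)} {count}"
            for year, count in series]


# Hypothetical hits, for illustration only.
hits = [(1900, "a"), (1900, "b"), (1901, "c")]
series = frequency_by_year(hits)
chart = text_chart(series)
```

The same (year, count) series could be passed unchanged to a table viewer or a timeline widget.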

The open-source JS library SIMILE Widgets provides, among others, widgets for visualizing time information.

Geographic

Besides identifying locations on the map, this also covers visualizing the distribution of something on a map:
[Screenshots: CLARIN Virtual Language World in Google Earth (VLW overlay, languages_sites.kmz); geographic distribution from the Meertens names database (Corpus of First Names in The Netherlands)]
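Since KML is the format listed for geolocation data, a small sketch of producing it may be useful. It uses only Python's standard `xml.etree.ElementTree`; the function name `to_kml` and the sample places are assumptions, and real overlays such as the VLW one carry far more detail (styles, descriptions, folders).

```python
import xml.etree.ElementTree as ET

KML_NS = "http://www.opengis.net/kml/2.2"


def to_kml(places):
    """Build a minimal KML document from (name, longitude, latitude) tuples."""
    ET.register_namespace("", KML_NS)  # serialize without a namespace prefix
    kml = ET.Element(f"{{{KML_NS}}}kml")
    doc = ET.SubElement(kml, f"{{{KML_NS}}}Document")
    for name, lon, lat in places:
        placemark = ET.SubElement(doc, f"{{{KML_NS}}}Placemark")
        ET.SubElement(placemark, f"{{{KML_NS}}}name").text = name
        point = ET.SubElement(placemark, f"{{{KML_NS}}}Point")
        # KML orders coordinates as longitude,latitude
        ET.SubElement(point, f"{{{KML_NS}}}coordinates").text = f"{lon},{lat}"
    return ET.tostring(kml, encoding="unicode")


# Sample places, for illustration only.
kml_text = to_kml([("Vienna", 16.3738, 48.2082), ("Nijmegen", 5.8528, 51.8426)])
```

A file written from `kml_text` should be loadable in viewers that accept KML, such as Google Earth or an OpenLayers map.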

TimeMap

Combined geospatial and temporal visualization (europeana4D project page?, e4D interface):

Screenshot of the interactive TimeMap web user interface (europeana4D, http://tinyurl.com/e4d-project)

Open Issues

  • How do we proceed with "basic" types like List or Table - accept HTML directly?
Last modified on 08/05/11 14:27:04
