Viewable
This pages collects the information about different data/resource types wrt to their displaying to the user and the corresponding Viewers.
Data / Resource Types
This is a tentative taxonomy of relevant data types.
This should map (more-or-less 1-to-1) to the DataView@type
-attribute in the FCS-record data model.
Data
The tentative distinction between Data
and Resource
, is meant to express the need for (structured) inline content, that may not be addressable. We also may need to think of some kind of "Temporary Resources", ie some intermediate (computed) data that could be available/addressable only for a limited period of time (wouldn't get PID
). This need was actually expressed on multiple occassions, we have similar situation with Virtual Collections
or CMD-Profiles/Components
.
Another point regarding Data
is, that this is expected to be (not exclusively but) mainly computed data, aggregating/summarizing result-sets, collections, etc. While some services may be able to provide such summaries "natively", we have to think of a separate service, that would be able to perform the computation - a Summarizer.
Formats and Viewers
work in progress TODO: make consistent with DataTypes?
Collecting existing Formats and corresponding possible Viewers.
Type | Format | Viewer | URL |
List | XML, HTML | HTML-Lists (nested) | |
Table | XML, HTML | HTML-Tables | |
KWIC | FCS | "natively" as HTML-List/Table?, linfovis-DoubleTree for advanced Visualization | |
Annotations | EAF | Annexviewer provided by MPI | sample view |
Annotations | TCF | AnnotationViewer? provided by Tübingen | sample view |
Geolocation | KML | Google-Maps? http://openlayers.org/; europeana4D | e4D interface |
Time-based data | table? | chart, timeline? | |
Book | eg TEI | bookviewer? | |
Syntax tree | TCF | AnnotationViewer? | |
Graph | dot, graphml | GraphViz?, IsaViz? | |
Metadata | CMD | MDBrowser, ARBIL MDEditor |
KWIC
The all-to-well-known basic kwic table-view, with keyword aligned: | and an advanced interactive visualization with the DoubleTree-tool developed by linfovis,Bozen: |
Annotations
Annex, Annotation Viewer, Annis
Annotation exploration/visualization Tool Annex operates on EAF (and other?) formats, and displays the annotations layers synchronized with the video: | Annotation Viewer provided by Uni Tübingen supports the TCF format and has dedicated display types depending on the tier-type. Especially there is a special display for syntax trees: |
| |
Another tool for exploring multilevel linguistic corpora is Annis | A simple possibility for displaying syntax trees are online services providing the syntax tree rendered as SVG/PNG based on phrase entered in bracket notation, like ironcreek |
Collections
Visualizing collections operates primarily on Metadata of the resources, that the collection groups, selecting certain fields/facets for display, e.g. publicationDate
, ResourceType
, Country
or similar.
So one suitable display is Faceted browser as http://catalog.clarin.eu/ds/vlo/ VLO does.
One interesting visualization type could be Treemap (aka areamap), that allows to visualize the inner structure of arbitrarily large data in a bound space.
The main challenge is to introduce some sensible notion of size of individual resources.
Use of color allows to project selected phenomena on the map.
Time
simple frequency-distribution charts per time-frame (normally year): | but also visualizing duration: |
Occurrence of a word over time and text-type in the DWDS-"Kerncorpus" | Tentative display of publishing period of various periodicals (at AAC) |
The open-source JS-library SIMILE Widgets provides among others widgets to visualize time-information:
Geographic
Next to identifying locations on the map | also visualizing distribution of something on map |
Screenshot of CLARIN Virtual Language World in Google Earth | Screenshot of Corpus of First Names in The Netherlands |
TimeMap
Combined geospatial and temporal visualization (europeana4D project page?, '''e4D''' interface):
Open Issues
- How do we proceed with "basic" types like
List
orTable
- accept HTML directly?
Attachments (15)
-
EDC_FCS_Viewable.png (69.6 KB) - added by 13 years ago.
Resource Types
-
screen_DWDSbeta_stats_time.png (20.5 KB) - added by 13 years ago.
Screen DWDS corpus time statistics
-
periodicals03_charly.png (57.9 KB) - added by 13 years ago.
displaying periods
-
screen_meertens_nvb_map.png (171.5 KB) - added by 13 years ago.
screen Meertens names database geographic distribution
-
sample_TreeMap_smartmoney.png (64.4 KB) - added by 13 years ago.
Treemap sample (from SmarMoney? portal)
-
annis_main3.PNG (100.5 KB) - added by 13 years ago.
Annis - Tool for Search and Visualization in Multilevel Linguistic Corpora
-
screen_VLW_MPI.png (776.7 KB) - added by 13 years ago.
screenshot of GoogleEarth? with VLW-overlay (languages_sites.kmz
-
screen_AnnotationViewer.png (82.4 KB) - added by 13 years ago.
screenshot of AnnotationViewer? (for TCF) by Uni Tübingen
-
annis_main3.2.PNG (100.5 KB) - added by 13 years ago.
screenshot of the tool Annis
-
screen_VLW_MPI.jpg (231.5 KB) - added by 13 years ago.
screenshot of GoogleEarth? with VLW-overlay (languages_sites.kmz)
-
sample_syntax_tree.png (2.3 KB) - added by 13 years ago.
sample simple syntax tree rendering by online-service ironcreek
-
screenAnnex.png (160.0 KB) - added by 13 years ago.
screenshot of Annex
-
screen_korpusC4.png (54.9 KB) - added by 13 years ago.
screen KWIC-view Korpus C4
-
LinfoVis_DoubleTree.png (187.4 KB) - added by 13 years ago.
screen Linfovis DoubleTree?
-
screen_europeana4D_resize.png (506.9 KB) - added by 13 years ago.
Screenshot of the interactive TimeMap? web user interface http://tinyurl.com/e4d-project europeana4D