Changes between Version 16 and Version 17 of LanguageResourceSwitchboard/Hackathon


Ignore:
Timestamp:
05/18/17 08:45:02 (7 years ago)
Author:
Twan Goosen
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • LanguageResourceSwitchboard/Hackathon

    v16 v17  
    8484* Video conference: 15 May 2017, 10:00 CEST
    8585
    86 == Participants ==
    87 * Pavel Stranak (UFAL, Prague) stranak@ufal.mff.cuni.cz
    88 * Amir Kamran (UFAL, Prague) kamran@ufal.mff.cuni.cz
     86== Hackathon report ==
    8987
    90 * Bart Jongejan (Copenhagen) bartj@hum.ku.dk
     88=== Participants ===
     891. Pavel Stranak (UFAL, Prague) stranak@ufal.mff.cuni.cz
     901. Amir Kamran (UFAL, Prague) kamran@ufal.mff.cuni.cz
     911. Bart Jongejan (Copenhagen) bartj@hum.ku.dk
     921. Martin Matthiesen (CSC Finland) martin.matthiesen@csc.fi
     931. Tero Aalto (CSC Finland) tero.aalto@csc.fi
     941. Matej Durco (ACDH Austria) Matej.Durco@oeaw.ac.at
     951. Wolfgang Sauer (ACDH Vienna) wolfgang.sauer@oeaw.ac.at
     961. Tommi Pirinen (HZSK Hamburg) tommi.antero.pirinen@uni-hamburg.de
     971. Riccardo Del Gratta (ILC/CNR Italy) riccardo.delgratta@ilc.cnr.it
     981. Krista Liin (Center of Estonian Language Resources) krista.liin@ut.ee
    9199
    92 * Martin Matthiesen (CSC Finland) martin.matthiesen@csc.fi
    93 * Tero Aalto (CSC Finland) tero.aalto@csc.fi
     100=== Results ===
     101Three web services were connected to some degree during and after the session:
     1021. [https://ufal.mff.cuni.cz/udpipe UDPipe] web service version, carries out tokenization, morphological analysis, tagging, lemmatization, dependency parsing. Required some ad-hoc mapping from language codes to models, a more sustainable solution is to be implemented either in the switchboard or on the side of the tool. The version with a user friendly front end is to be connected as well.
     1031. ILC's tokenizer for various languages was connected as a proof of concept, but the service is not publicly available yet. Some complications with the request were encountered but resolved.
     1041. HTML to plain text conversion (by Bart Jongejan), a service provided by CLARIN-DK.
     105 * [https://cst.dk/html2text/?F=https://www.google.com&iLang=en&base=& example]
    94106
    95 * Matej Durco (ACDH Austria) Matej.Durco@oeaw.ac.at
    96 * Wolfgang Sauer (ACDH Austria) wolfgang.sauer@oeaw.ac.at
     107There is a good potential for integration for tools from HZSK (conversion service for transcriptions) and ACDH (REST API + web views for named entity recognition, entity linking), but some adaptations on the applications themselves are required. CSC and the Center of Estonian Language Resources also see possibilities (e.g. [http://keeleliin.keeleressursid.ee Keeleliin]) but have not made any concrete steps yet.
    97108
    98 * Tommi Pirinen (HZSK Hamburg) tommi.antero.pirinen@uni-hamburg.de
     109Many tools don't support processing of resources on basis of a URI through a query parameter of a GET request (yet), but do accept content to be processed via POST. We have to think about whether this should be resolved generically in the LRS, or by the individual tools, or maybe through some generic wrapper service.
    99110
    100 * Riccardo Del Gratta (CNR, Italy) riccardo.delgratta@ilc.cnr.it
    101 
    102 * Krista Liin (Center of Estonian Language Resources) krista.liin@ut.ee
     111A virtual follow-up meeting is to be scheduled in ~4 weeks time after the event.