Changes between Version 16 and Version 17 of LanguageResourceSwitchboard/Hackathon
- Timestamp:
- 05/18/17 08:45:02 (7 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
LanguageResourceSwitchboard/Hackathon
v16 v17 84 84 * Video conference: 15 May 2017, 10:00 CEST 85 85 86 == Participants == 87 * Pavel Stranak (UFAL, Prague) stranak@ufal.mff.cuni.cz 88 * Amir Kamran (UFAL, Prague) kamran@ufal.mff.cuni.cz 86 == Hackathon report == 89 87 90 * Bart Jongejan (Copenhagen) bartj@hum.ku.dk 88 === Participants === 89 1. Pavel Stranak (UFAL, Prague) stranak@ufal.mff.cuni.cz 90 1. Amir Kamran (UFAL, Prague) kamran@ufal.mff.cuni.cz 91 1. Bart Jongejan (Copenhagen) bartj@hum.ku.dk 92 1. Martin Matthiesen (CSC Finland) martin.matthiesen@csc.fi 93 1. Tero Aalto (CSC Finland) tero.aalto@csc.fi 94 1. Matej Durco (ACDH Austria) Matej.Durco@oeaw.ac.at 95 1. Wolfgang Sauer (ACDH Vienna) wolfgang.sauer@oeaw.ac.at 96 1. Tommi Pirinen (HZSK Hamburg) tommi.antero.pirinen@uni-hamburg.de 97 1. Riccardo Del Gratta (ILC/CNR Italy) riccardo.delgratta@ilc.cnr.it 98 1. Krista Liin (Center of Estonian Language Resources) krista.liin@ut.ee 91 99 92 * Martin Matthiesen (CSC Finland) martin.matthiesen@csc.fi 93 * Tero Aalto (CSC Finland) tero.aalto@csc.fi 100 === Results === 101 Three web services were connected to some degree during and after the session: 102 1. [https://ufal.mff.cuni.cz/udpipe UDPipe] web service version, carries out tokenization, morphological analysis, tagging, lemmatization, dependency parsing. Required some ad-hoc mapping from language codes to models, a more sustainable solution is to be implemented either in the switchboard or on the side of the tool. The version with a user friendly front end is to be connected as well. 103 1. ILC's tokenizer for various languages was connected as a proof of concept, but the service is not publicly available yet. Some complications with the request were encountered but resolved. 104 1. HTML to plain text conversion (by Bart Jongejan), a service provided by CLARIN-DK. 105 * [https://cst.dk/html2text/?F=https://www.google.com&iLang=en&base=& example] 94 106 95 * Matej Durco (ACDH Austria) Matej.Durco@oeaw.ac.at 96 * Wolfgang Sauer (ACDH Austria) wolfgang.sauer@oeaw.ac.at 107 There is a good potential for integration for tools from HZSK (conversion service for transcriptions) and ACDH (REST API + web views for named entity recognition, entity linking), but some adaptations on the applications themselves are required. CSC and the Center of Estonian Language Resources also see possibilities (e.g. [http://keeleliin.keeleressursid.ee Keeleliin]) but have not made any concrete steps yet. 97 108 98 * Tommi Pirinen (HZSK Hamburg) tommi.antero.pirinen@uni-hamburg.de 109 Many tools don't support processing of resources on basis of a URI through a query parameter of a GET request (yet), but do accept content to be processed via POST. We have to think about whether this should be resolved generically in the LRS, or by the individual tools, or maybe through some generic wrapper service. 99 110 100 * Riccardo Del Gratta (CNR, Italy) riccardo.delgratta@ilc.cnr.it 101 102 * Krista Liin (Center of Estonian Language Resources) krista.liin@ut.ee 111 A virtual follow-up meeting is to be scheduled in ~4 weeks time after the event.