Collection of Research Papers of the 1st International Conference on Language Technologies for All

PROGRAMME



December 5-6, 2019

 Session 5. Innovative aspects related to applications of Language Technologies in various areas, products and services
 Speech Technology for Swedish: Current Impact Areas for Applications and Edyson, an Innovative Tool for Accessing Speech Data
David House, Per Fallgren and Jens Edlund
 Using Language Technologies to Automate the UNDP Rapid Integrated Assessment Mechanism in Serbian
Vuk Batanović and Boško Nikolić
 CALL Solutions That Support Multilingualism: Application to the “Nano” Languages in the West-Nordic Region
Birna Arnbjörnsdóttir and Auður Hauksdóttir
 Linguistic Linked Open Data for All
John P. McCrae and Thierry Declerck
 Multilingual Natural Language Processing and Transformers: A Giant Step Forward
Radu Florian, Taesun Moon, Jian Ni and Parul Awasthy
 Multi-lingual Support in Connective Learning Scheme for Refining and Connecting the Open Educational Videos
Virach Sornlertlamvanich, Nannam Aksorn and Thatsanee Charoenporn
 Improvement of Thai NER and the Corpus
Thatsanee Charoenporn, Virach Sornlertlamvanich and Kitiya Suriyachay
 Toward Narrative-Based Conversational Interfaces
Ethel Ong
 SukatWika: An Analysis Software for Linguistic Properties of Texts
Kathrina Lorraine Lucasan, Angelina Aquino, Francis Paolo Santelices and Dina Ocampo
 GeTa - a Tool for the Controlled Semi-Automatic Multilevel Annotation of Classical Ethiopic
Cristina Vertan
 Session 6. Scientific aspects related to the state of the art in Language Technologies, for spoken, written and sign languages
 FarSpeech: Arabic Natural Language Processing for Live Arabic Speech
Naassih Gopee, Ahmed Ali, Mohamed Eldesouki and Kareem Darwish
 Handling Prosody and Tone Languages
Aijun Li and Wei Wang
 Talking with Robots: Opportunities and Challenges
Roger Moore
 Bangla Speech Synthesizer System for Bangladesh
Mohammad Nurul Huda
 A Real Time Instruction Extractor from Traffic Signal for Translation
Mohammad Nurul Huda
 Automated Bangla Sign Language Conversion System: Present and Future
Mohammad Nurul Huda
 Bangla Phonetic Features Extraction for Automatic Speech Recognition
Mohammad Nurul Huda
 Russian Sign Language: History, Grammar and Sociolinguistic Situation in Brief
Ildar Kagirov, Dmitry Ryumin, Denis Ivanko, Alexander Axyonov and Alexey Karpov
 Language Technology Research at MILE Laboratory
Angarai Ganesan Ramakrishnan and Madhavaraj Ayyavu
 How Aspects of Descriptive and Formal Linguistics Can Inform LT for All Languages
Lars Hellan
 Session 7. Infrastructural aspects
 Multilingual Crowd Sourcing Methodology for Developing Resources for Under-resourced Indian Languages
Karunesh Arora, Sunita Arora, Mukund Roy and Shyam Sundar Agrawal
 IARPA’s Contribution to Human Language Technology Development for Low Resource Languages
Carl Rubino
 Google Crowdsourced Speech Corpora and Related Open-Source Resources for Low-Resource Languages and Dialects: An Overview
Alena Butryna, Shan-Hui Cathy Chu, Isin Demirsahin, Alexander Gutkin, Linne Ha, Fei He, Martin Jansche, Cibu Johny, Anna Katanova, Oddur Kjartansson, Chenfang Li, Tatiana Merkulova, Yin May Oo, Knot Pipatsrisawat, Clara Rivera, Supheakmungkol Sarin, Pasindu de Silva, Keshan Sodimana, Richard Sproat, Theeraphol Wattanavekin and Jaka Aris Eko Wibawa
 Red T Translator/Interpreter Incident Database
Maya Hess
 Preserving and Developing Indigenous Languages in the South African Context
Justus Roux and Sonja Bosch
 Be Not like the Wind: Access to Language and Music Records, Next Steps
Nick Thieberger and Amanda Harris
 Machine Translation Literacy as a Social Responsibility
Lynne Bowker
 The Endangered Languages Project (ELP): Collaborative Infrastructure and Knowledge-Sharing to Support Indigenous and Endangered Languages
Anna Belew
 Importance of Frameworks in Language Technology - Case of Arabic
Karim Bouzoubaa
 Promoting Language Technology for Endangered Languages with Shared Tasks
Gina-Anne Levow
 Towards a Global Lexicographic Infrastructure
Thierry Declerck, Simon Krek, John P. McCrae and Tanja Wissik
 Language Technologies for Less-Resourced-Languages. LRL Workshop Series at the Language and Technology Conferences (LTC) from 2009 to 2019
Zygmunt Vetulani, Khalid Choukri, Joseph Mariani and Patrick Paroubek
 The Linguistic Data Consortium: Developing and Distributing Language Resources 4All
Denise DiPersio and Christopher Cieri
 European Language Grid: Language Technologies for Europe
Georg Rehm
 Spoof-Vulnerable Rendering in Khmer Unicode Implementations
Marc Durdin, Joshua Horton, Makara Sok and Rasmey Ty
 PanLex: A Lexical Infrastructure Tool
Laura Welcher and David Kamholz
 Development of the Parallel Corpus of Mexican Languages (CPLM)
Cynthia Montaño, Gerardo Sierra Martínez and Gemma Bel-Enguix
 Dictionary 4.0: Alternative Presentations for Indonesian Multilingual Dictionaries
Arbi Haza Nasution and Totok Suhardijanto
 MultiTAL - an Online Platform to List NLP Tools for Under-Resourced Languages
Damien Nouvel, Mathieu Valette and Driss Sadoun
 Session 8. Minority and Indigenous Languages
 A Speaking Atlas of Indigenous Languages of France and Its Overseas
Philippe Boula de Mareüil, Gilles Adda, Albert Rilliard and Frédéric Vernier
 ELLORA: Enabling Low Resource Languages with Technology
Kalika Bali, Monojit Choudhury, Sunayana Sitaram and Vivek Seshadri
 Dependency Parsing Based on Uzbek Corpus
Nilufar Abdurakhmonova
 Womb Grammars: A Constraint Solving Model for Learning the Grammar of Yorùbá
Ife Adebara
 The Contribution of StoryWeaver in Keeping Indigenous Languages Alive
Archana Nambiar
 Can We Defuse the Digital Timebomb? Linguistics, Speech Technology and the Irish Language Community
Ailbhe Ní Chasaide, Neasa Ní Chiaráin, Harald Berthelsen, Christoph Wendler, Andrew Murphy, Emily Barnes and Christer Gobl
 Spoken Language Technology for North-East Indian Languages
Viyazonuo Terhiija, Samudra Vijaya and Priyankoo Sarmah
 Nenek: Digital Self-documentation for Minority and Under-resourced Languages
Anuschka van 't Hooft and José Luis González
 Indigenous Language Revitalisation: The Context of Inpui Naga in Northeast India
Rajiandai Bariam
 Language Shift, Language Technology, and Language Revitalization: Challenges and Possibilities for St. Lawrence Island Yupik
Lane Schwartz
 Copyright in the Context of Tooling up Corsican and Other Less-resourced Languages
Laurent Kevers and Stella Retali-Medori
 Formal Models and Software Tools for the Computer Processing of the Tatar Language
Suleymanov Szhavdet, Aidar Khusainov and Rinat Gilmullin
 Indonesian Phoneme Set, Vocabulary, and Pronunciation for Automatic Speech Recognition and Speech Synthesizer
Dessi Puji Lestari, Roland Hartanto, Devin Hoesen, Guntario Sukma Cahyani and Sakriani Sakti
 Analysis of the Ethiopic Twitter Dataset for Abusive Speech in Amharic
Seid Muhie Yimam, Abinew Ali Ayele and Chris Biemann
 The IRCAM Realizations for the Amazigh Preservation and Revitalization in Morocco
Fadoua Ataa Allah and Aicha Bouhjar
 Language Technology for Indigenous Languages: Achievements and Challenges
Sjur Moshagen, Lene Antonsen and Trond Trosterud
 Language Technology Applications in Africa within an “Inclusive, Innovative and Reflective” Crisis Interface
Evelyn Chibaka
 Digital Surveillance and Digitally-disadvantaged Language Communities
Isabelle Zaugg
 Language Resources and Tools Development for Indonesian Languages
Budi Irmawati, Arik Aranta, Wirarama Wedhaswara, M. Iqbal D. Putra and Siti Oryza Khairunnisa
 Languages and Technology in Bhutan
Tenzin Namgyel and Tshewang Norbu
 Towards Speech Technologies for Romani Language in Slovakia
Milan Rusko, Sakhia Darjaa, Marián Trnka, Róbert Sabo and Štefan Beňuš
 NTeALan - AI/NLP/NLU Platforms for Sharing and Leveraging African Language Resources for Education in Africa
Elvis Mboning Tchiaze, Assoumou Jules, Jean Marc Bassahak, Juanita Fopa and Damien Nouvel
 Multilingual Neural Machine Translation in Low Resource Settings
Pulkit Madaan and Fatiha Sadat
 Komi Latin Letters Degrees of UNICODE Facilitation
Jack Rueter and Larisa Ponomareva
 Challenges with Minority Indigenous Languages and Language Technologies
Apolonia Tamata
 Session 9. Activities for language preservation, reclamation, and enhancement
 Building Corpora for Under-Resourced Languages in Indonesia
Totok Suhardijanto and Arawinda Dinakaramani
 Language Is the Carrier of Our Culture: Language Documentation as Revitalisation in Badimaya and Warriyangga
Rosie Sitorus, Jacqui Cook and Peter Salmon
 Lessons Learned after Development and Use of a Data Collection App for Language Documentation (Lig-Aikuma)
Laurent Besacier, Elodie Gauthier and Sylvie Voisin
 Language Technologies for Regional Languages of France: The RESTAURE Project
Delphine Bernhard, Myriam Bras, Pascale Erhart, Anne-Laure Ligozat and Marianne Vergez-Couret
 Cardamom: Comparative Deep Models for Minority and Historical Languages
John P. McCrae and Theodorus Fransen
 Empowering Indigenous Communities through Citizen Linguistics, Language Resources and Human Language Technologies
Christopher Cieri and Mark Liberman
 Can a Robot Help Save an Endangered Language?
Maximiliano Duran
 Accessing and Understanding Contents in Portuguese by Foreigners in Scientific Digital Libraries: Can This Methodology Be Generalized to Other Languages?
Francisco Claudio de Menezes
 Challenges and Opportunities in Processing Low Resource Languages: A Study on Persian
Mehrnoush Shamsfard
 Designing for Language Revitalisation
Steven Bird
 Contribution to the Universal Dependencies Treebank of Non-Standard Romanian Texts
Victoria Bobicev, Catalina Maranduc, Tudor Bumbu, Ludmila Malahov, Alexandru Colesnicov and Svetlana Cojocaru
 Translation Commons: No Language and No Linguist Left Behind
Jeannette Stewart
 Wa7 Szum’In’stum’ Ti Nqweleutenlhkalha Technology Help and Hindrance in Indigenous Language Revitalization
Lorna Williams
 21st Century Language Technology Tools – 21st Century Challenges vs. 21st Century Opportunities
Antti Arppe and Jordan Lachler
 Session 10. Scientific aspects related to handling language diversity
 Indicators of Languages in the Internet
Daniel Pimienta
 Challenges for Language Technologies in Critically Endangered Languages
Jhonnatan Rangel
 Situation and Challenges of Technologies for Indigenous Languages of India
Shweta Sinha and Shyam Sundar Agrawal
 Towards ASR That Supports Linguistic Diversity in Norway
Benedicte Haraldstad Frostad, Verena Schall and Sonja Myhre Holten
 European Language Monitor – Exploring European Language Policies On-Line
Sabine Kirchmeier
 Achieving the Goal of Language Technology for All
Angarai Ganesan Ramakrishnan
 Semi-supervised Learning by Machine Speech Chain for Multilingual Speech Processing, and Recent Progress on Automatic Speech Interpretation
Satoshi Nakamura, Sakriani Sakti and Katsuhito Sudoh
 Parsing the Less-configurational Georgian Language with a Context-Free Grammar
Oleg Kapanadze
 On Practical Realisation of Autosegmental Representations in Lexical Transducers of Tonal Bantu Languages
Anssi Yli-Jyrä
 Text-Independent Dialect Classification in Read and Spontaneous Speech
Oliver Jokisch and Johanna Dobbriner
 Session 11. Developing Language Technologies for all: Best Practices
 Rediscovering Past Narrations: The Oral History of the Romanian Language Preserved within the National Phonogramic Archive
Oana Niculescu, Maria Marin and Daniela Răuțu
 Developing Technologies for the Documentation and Description of the Low-resource Uralic Languages Zyrian Komi and North Saami
Niko Partanen, Thierry Poibeau and Michael Rießler
 Development of Technology for Indian Languages: Indian Government Initiatives
Sunil Srivastava
 Coherent Planning for Language Technology Development and Language Revitalization
Delyth Prys, Dewi Jones and Gruffudd Prys
 Archiving System of Endangered Languages in Japan: A Preliminary Report
Natsuko Nakagawa, Masahiro Yamada, Nobuko Kibe and Yukinori Takubo
 Efforts in the Development of an Augmented English–Nepali Parallel Corpus
Sharad Duwal and Bal Krishna Bal
 Rich Morphology, No Corpus – and We Still Made It. The Sámi Experience
Sjur Moshagen and Trond Trosterud
 Democratizing Access to Information: An Open and Inclusive Localization Model
Amel Fraisse
 The Case of Polish on Its Way to Become a Well-Resourced-Language
Zygmunt Vetulani and Grażyna Vetulani
 Analysis of Language Relatedness for the Development of Multilingual Automatic Speech Recognition for Ethiopian Languages
Martha Yifiru Tachbelie, Solomon Teferra Abate and Tanja Schultz
 Session 12. The Future of Language Technologies for All
 Increasing Diversity via Augmented and Distributed Online Conferences
Alejandrina Cristia
 Indigenous Language Technologies & Language Reclamation in Canada
Nathan Thanyehténhas Brinklow, Patrick Littell, Delaney Lothian, Aidan Pine and Heather Souter
 Poster Sessions
 LT4All Poster Summaries