Dr MIHAEL ARCAN

Ph.D

Contact Details

Research Fellow
E: mihael.arcan@nuigalway.ie
 
researcher
 

Biography

I am a research fellow at the Data Science Institute (DSI) and Adjunct Lecturer at the School of Computer Science at the National University of Ireland Galway (NUI Galway). I am working in the Unit for Natural Language Processing (UNLP) under the supervision of Dr Paul Buitelaar, where my main research topic focuses on terminology and knowledge graph injection into neural machine translation architecture. Recently, I am also following the work on dialogue systems and natural language generation with multi-modal data. I am primarily funded by the nationwide Insight SFI Research Centre for Data Analytics, which is also hosted within DSI.

Research Interests

Natural Language Processing
Neural Machine Translation
Natural Language Generation
Knowledge Graphs

Research Projects

  Project Start Date End Date
EI H2020 - Proposal Preparation Support: Passive Prediction for Mental Health Disorders 13-JAN-20 13-SEP-20
EI Commercial Case Feasibility Support Grant: Passive prediction of mental health disorders 01-NOV-19 30-APR-20
EI Innovation Partnership Programme: NURS – Neural Machine Translation for Under-Resourced Scenarios 01-JUL-18 31-AUG-19

Peer Reviewed Journals

  Year Publication
(2018) 'MixedEmotions: An Open-Source Toolbox for Multi-Modal Emotion Analysis'
Buitelaar, P; Wood, I; Negi, S; Arcan, M; McCrae, J; Abele, A; Robin, C; Andryushechkin, V; Ziad, H; Sagha, H; Schmitt, M; Schuller, B; Sanchez, J; Iglesias, C; Navarro, C; Giefer, A; Heise, N; Masucci, V; Danza, F; Caterino, C; Smrz, P; Hradis, M; Povolny, F; Klimes, M; Matejka, P; Tummarello, G (2018) 'MixedEmotions: An Open-Source Toolbox for Multi-Modal Emotion Analysis'. Ieee Transactions On Multimedia, [Details]
(2017) 'Leveraging bilingual terminology to improve machine translation in a CAT environment'
Arcan, M; Turchi, M; Tonelli, S; Buitelaar, P (2017) 'Leveraging bilingual terminology to improve machine translation in a CAT environment'. Natural Language Engineering, :1-26 [Details]
(2016) 'Domain adaptation for ontology localization'
McCrae, J; Arcan, M; Asooja, K; Gracia, J; Buitelaar, P; Cimiano, P (2016) 'Domain adaptation for ontology localization'. Journal Of Web Semantics, 36 :23-31 [Details]

Book Chapters

  Year Publication
(2020) 'Translation asymmetries of multiword expressions in machine translation'
Monti, J; Arcan, M; Sangati, F (2020) 'Translation asymmetries of multiword expressions in machine translation' In: Computational Phraseology. USA: John Benjamins. [DOI] [Details]

Conference Publications

  Year Publication
(2020) 28th International Conference on Computational Linguistics
Sarkar, R; Goswami, K; Arcan, M; McCrae, J (2020) Suggest me a movie for tonight: Leveraging Knowledge Graphs for Conversational Recommendation 28th International Conference on Computational Linguistics [Details]
(2020) Proceedings of the 5th Workshop on Indian Language Data Resource and Evaluation (WILDRE-5)
Suryawanshi, S; Chakravarthi, B; Verma, P; Arcan, M; McCrae, J; Buitelaar, B (2020) A Dataset for Troll Classification of Tamil Memes Proceedings of the 5th Workshop on Indian Language Data Resource and Evaluation (WILDRE-5) Marseille, France, [ARAN Link] [Details]
(2020) Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying
Suryawanshi, S; Chakravarthi, B; Arcan, M; Buitelaar, B (2020) Multimodal Meme Dataset (MultiOFF) for Identifying Offensive Content in Image and Text Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying Marseille, France, [ARAN Link] [Details]
(2020) Proceedings of the 2020 Globalex Workshop on Linked Lexicography
McCrae, J; Arcan, M (2020) NUIG at TIAD: Combining Unsupervised NLP and Graph Metrics for Translation Inference Proceedings of the 2020 Globalex Workshop on Linked Lexicography [ARAN Link] [Details]
(2019) 27th AIAI Irish Conference on Artificial Intelligence and Cognitive Science
Masoud, M; Torregrosa, D; Buitelaar, P; Arcan, M (2019) Back-translation Approach for Code-switching Machine Translation: A Case Study 27th AIAI Irish Conference on Artificial Intelligence and Cognitive Science [ARAN Link] [Details]
(2019) Proceedings of the 2nd Workshop on Technologies for MT of Low Resource Languages (LoResMT 2019)
Chakravarthi, B; Priyadharshini, R; Stearns, B; Jayapal, A; Srivedy, S; Arcan, M; Zarrouk, M; McCrae, J (2019) Multilingual Multimodal Machine Translation for Dravidian Languages utilizing Phonetic Transcription Proceedings of the 2nd Workshop on Technologies for MT of Low Resource Languages (LoResMT 2019) [Details]
(2019) International Conference on Knowledge Capture (K-CAP)
Moussallem, D; Ngonga Ngomo, AC; Buitelaar, P; Arcan, M (2019) Utilizing Knowledge Graphs for Neural Machine Translation Augmentation International Conference on Knowledge Capture (K-CAP) [Details]
(2019) Proceedings of the Second Workshop on Multilingualism at the Intersection of Knowledge Bases and Machine Translation
Chakravarthi, B; Arcan, M; McCrae, J; (2019) WordNet Gloss Translation for Under-resourced Languages using Multilingual Neural Machine Translation Proceedings of the Second Workshop on Multilingualism at the Intersection of Knowledge Bases and Machine Translation Dublin, Ireland, , pp.1-7 [ARAN Link] [Details]
(2019) Literary Machine Translation Workshop, co-located with MT-Summit 2019
Kuzman, T; Vintar, Š; Arcan, M (2019) Neural Machine Translation of Literary Texts from English to Slovene Literary Machine Translation Workshop, co-located with MT-Summit 2019 [ARAN Link] [Details]
(2019) Proceedings of the Translation Inference Across Dictionaries Workshop (TIAD 2019)
Arcan, M; Torregrosa, D; Ahmadi, S; McCrae, J (2019) Inferring Translation Candidates for Multilingual Dictionary Generation with Multi-Way Neural Machine Translation Proceedings of the Translation Inference Across Dictionaries Workshop (TIAD 2019) [ARAN Link] [Details]
(2019) Machine Translation summit 2019
Torregrosa, D; Pasricha, N; Chakravarthi, B; Masoud, M; Alonso, J; Casas, N; Arcan, M (2019) Leveraging Rule-Based Machine Translation Knowledge for Under-Resourced Neural Machine Translation Models Machine Translation summit 2019 [ARAN Link] [Details]
(2019) Social Media Mining for Health Applications Workshop
Delahunty, F; Johansson, R; Arcan, M (2019) Passive Diagnosis Incorporating the PHQ-4 for Depression and Anxiety Social Media Mining for Health Applications Workshop [ARAN Link] [Details]
(2019) Translation Inference Across Dictionaries
Torregrosa, D; Arcan, M; Ahmadi, S; McCrae, J (2019) TIAD 2019 Shared Task: Leveraging Knowledge Graphs with Neural Machine Translation for Automatic Multilingual Dictionary Generation Translation Inference Across Dictionaries [ARAN Link] [Details]
(2019) Proceedings of the Poster Track of LDK 2019
Ahmadi, S; Arcan, M; McCrae J (2019) Lexical Sense Alignment using Weighted Bipartite b-Matching Proceedings of the Poster Track of LDK 2019 [Details]
(2019) 2nd Conference on Language, Data and Knowledge (LDK 2019)
Chakravarthi, B; Arcan, M; McCrae, J (2019) Comparison of Different Orthographies for Machine Translation of Under-Resourced Dravidian Languages 2nd Conference on Language, Data and Knowledge (LDK 2019) [Details]
(2018) Language Resources and Evaluation Conference (LREC 2018)
Arcan, M; Montiel-Ponsoda, E; McCrae, J; Buitelaar, P (2018) Automatic Enrichment of Terminological Resources: the IATE RDF Example Language Resources and Evaluation Conference (LREC 2018) [ARAN Link] [Details]
(2018) Proceedings of the 9th Global WordNet Conference (GWC 2018)
Chakravarthi, B; Arcan, M; McCrae, J (2018) Improving wordnets for under-resourced languages using Machine Translation Proceedings of the 9th Global WordNet Conference (GWC 2018) [Details]
(2018) Workshop on eLexicography: Between Digital Humanities and Artificial Intelligence
Ahmadi, S; Arcan, M; McCrae J (2018) On Lexicographical Networks Workshop on eLexicography: Between Digital Humanities and Artificial Intelligence [Details]
(2018) Language Technologies and Digital Humanities 2018
Arcan, M (2018) A comparison of Statistical and Neural Machine Translation for Slovene, Serbian and Croatian Language Technologies and Digital Humanities 2018 [ARAN Link] [Details]
(2018) Irish Conference on Artificial Intelligence and Cognitive Science
Delahunty, F; Wood, ID; Arcan, M (2018) First Insights on a Passive Major Depressive Disorder Prediction System with Incorporated Conversational Chatbot Irish Conference on Artificial Intelligence and Cognitive Science [ARAN Link] [Details]
(2017) Workshop on Multi-Language Processing in a Globalising World (MLP2017)
McCrae, J; Arcan, M; Buitelaar, P (2017) Linking Knowledge Graphs across Languages with Semantic Similarity and Machine Translation Workshop on Multi-Language Processing in a Globalising World (MLP2017) [ARAN Link] [Details]
(2016) International Semantic Web Conference (ISWC16), Poster&Demo Section
Arcan, M; Dragoni, M; Buitelaar, P (2016) The ESSOT System Goes Wild: an Easy Way For Translating Ontologies International Semantic Web Conference (ISWC16), Poster&Demo Section [ARAN Link] [Details]
(2016) The Twelfth Biennial Conference of the Association for Machine Translation in the Americas (AMTA 2016)
Chatterjee, R; Arcan, M; Negri, M; Turchi, M (2016) Instance Selection for Online Automatic Post-Editing in a Multi-domain Scenario The Twelfth Biennial Conference of the Association for Machine Translation in the Americas (AMTA 2016) [ARAN Link] [Details]
(2016) Third Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial-3)
Popovic, M; Arcan, M; Klubicka, F (2016) Language related issues for machine translation between closely related South Slavic languages Third Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial-3) [ARAN Link] [Details]
(2016) Proceedings of the 10th Conference on Language Technologies and Digital Humanities, Ljubljana, Slovenia, September
Arcan, M; Popovic, M; Buitelaar, P (2016) Asistent -- a machine translation system for Slovene, Serbian and Croatian Proceedings of the 10th Conference on Language Technologies and Digital Humanities, Ljubljana, Slovenia, September [ARAN Link] [Details]
(2016) European Association for Machine Translation (EAMT-2016)
Popovic, M; Arcan, M; Lommel, A (2016) Potential and Limits of Using Post-edits as Reference Translations for MT Evaluation European Association for Machine Translation (EAMT-2016) [ARAN Link] [Details]
(2016) International Conference on Applications of Natural Language to Information Systems
Arcan, M; Dragoni, M; Buitelaar, P (2016) ESSOT: an expert supporting system for ontology translation International Conference on Applications of Natural Language to Information Systems , pp.60-73 [ARAN Link] [Details]
(2016) International Semantic Web Conference
Arcan, M; Dragoni, M; Buitelaar, P (2016) Translating ontologies in real-world settings International Semantic Web Conference , pp.241-256 [ARAN Link] [Details]
(2016) Language Resource and Evaluation Conference (LREC)
Popovic, M; Arcan, M (2016) PE2rr corpus: manual error annotation of automatically pre-annotated MT post-edits Language Resource and Evaluation Conference (LREC) [ARAN Link] [Details]
(2016) International Conference on Computational Linguistics (COLING-2016)
Arcan, M; McCrae, J; Buitelaar, P (2016) Expanding wordnets to new languages with multilingual sense disambiguation International Conference on Computational Linguistics (COLING-2016) [ARAN Link] [Details]
(2016) Language Resource and Evaluation Conference (LREC)
Arcan, M; Lane, C; O Droighneain, E; Buitelaar, P (2016) IRIS: English-Irish Machine Translation System Language Resource and Evaluation Conference (LREC) [ARAN Link] [Details]
(2015) Second Italian Conference on Computational Linguistics (CLiC-it 2015)
Monti, J; Sangati, F; Arcan, M (2015) TED-MWE: a bilingual parallel corpus with MWE annotation Second Italian Conference on Computational Linguistics (CLiC-it 2015) [ARAN Link] [Details]
(2015) European Association for Machine Translation (EAMT-2015)
Popovic, M; Arcan, M; Avramidis, E; Burchardt, A; Lommel, A (2015) Poor man’s lemmatisation for automatic error classification European Association for Machine Translation (EAMT-2015) [ARAN Link] [Details]
(2015) International Semantic Web Conference (Posters \& Demos)
Arcan, M; Asooja, K; Ziad, H; Buitelaar, P (2015) OTTO - Ontology Translation System International Semantic Web Conference (Posters \& Demos) [ARAN Link] [Details]
(2015) 18th Annual Conference of the European Association for Machine Translation (EAMT-15)
Popovic, M; Arcan, M (2015) Identifying main obstacles for statistical machine translation of morphologically rich South Slavic languages 18th Annual Conference of the European Association for Machine Translation (EAMT-15) [ARAN Link] [Details]
(2015) Association for Computational Linguistics (ACL-2015)
Arcan, M; Turchi, M; Buitelaar, P (2015) Knowledge Portability with Semantic Expansion of Ontology Labels Association for Computational Linguistics (ACL-2015) , pp.708-718 [ARAN Link] [Details]
(2015) PARSEME 5th General Meeting (Poster)
Monti, J; Sangati, F; Arcan, M (2015) Multi-word expressions in a parallel bilingual spoken corpus: data annotation and initial identification results PARSEME 5th General Meeting (Poster) [Details]
(2015) Proceedings of the 18th Annual Conference of the European Association for Machine Translation
Arcan, M; Buitelaar, P (2015) MixedEmotions: Social Semantic Emotion Analysis for Innovative Multilingual Big Data Analytics Markets Proceedings of the 18th Annual Conference of the European Association for Machine Translation [Details]
(2014) 4th International Workshop on Computational Terminology (Computerm), co-located with COLING 2014
Arcan, M; Giuliano, C; Turchi, M; Buitelaar, P (2014) Identification of bilingual terms from monolingual documents for statistical machine translation 4th International Workshop on Computational Terminology (Computerm), co-located with COLING 2014 [ARAN Link] [Details]
(2014) 9th Edition of Language Resources and Evaluation Conference (LREC2014)
Vulcu, G; Buitelaar, P; Negi, S; Pereira, B; Arcan, M; Coughland, B; Sanchez Rada, JF; Iglesias Fernandez, CA (2014) Generating Linked-Data based Domain-Specific Sentiment Lexicons from Legacy Language and Semantic Resources 9th Edition of Language Resources and Evaluation Conference (LREC2014) , pp.1-4 [ARAN Link] [Details]
(2014) Proceedings of the 11th Biennial Conference of the Association for Machine Translation in the Americas (AMTA 2014)
Arcan, M; Turchi, M; Tonelli, S; Buitelaar, P (2014) Enhancing statistical machine translation with bilingual terminology in a CAT environment Proceedings of the 11th Biennial Conference of the Association for Machine Translation in the Americas (AMTA 2014) , pp.54-68 [ARAN Link] [Details]
(2013) Machine Translation Summit XIV
Arcan, M; Thomas, SM; De Brandt, D; Buitelaar, P (2013) Translating the FINREP taxonomy using a domain-specific corpus Machine Translation Summit XIV [ARAN Link] [Details]
(2013) Proceedings of the XIV Machine Translation Summit - European projects presentation
Arcan, M; Buitelaar, P (2013) MONNET: Multilingual Ontologies for Networked Knowledge Proceedings of the XIV Machine Translation Summit - European projects presentation [Details]
(2013) 2nd Workshop on Linked Data in Linguistics (LDL-2013): Representing and linking lexicons, terminologies and other language data
Buitelaar, P; Arcan, M; Iglesias Fernandez, CA; Sanchez Rada, JF; Strapparava, C (2013) Linguistic linked data for sentiment analysis 2nd Workshop on Linked Data in Linguistics (LDL-2013): Representing and linking lexicons, terminologies and other language data , pp.1-8 [ARAN Link] [Details]
(2013) Proceedings of the NAACL HLT 2013 Student Research Workshop
Arcan, M; Buitelaar, P (2013) Ontology Label Translation Proceedings of the NAACL HLT 2013 Student Research Workshop , pp.40-46 [ARAN Link] [Details]
(2012) Proceedings of the International Conference on Computational Linguistics (COLING)
Arcan, M; Federmann, C; Buitelaar, P (2012) Experiments with Term Translation Proceedings of the International Conference on Computational Linguistics (COLING) , pp.67-82 [ARAN Link] [Details]
(2012) Proceedings of the Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation
Arcan, M; Buitelaar, P; Federmann, C (2012) Using domain-specific and collaborative resources for term translation Proceedings of the Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation , pp.86-94 [ARAN Link] [Details]
(2011) Meddisciplinarnost v slovenistiki (Obdobja 30)
Arhar Holdt, S; Arcan, M (2011) Avtomatsko pridobivanje besednih zvez iz korpusa z uporabo leksikona SSJ Meddisciplinarnost v slovenistiki (Obdobja 30) , pp.13-20 [ARAN Link] [Details]
(2011) Proceedings of the 6th International Conference on Ontology Matching
Aggarwal, N; Wunner, T; Arcan, M; Buitelaar, P; O'Riain, S (2011) A similarity measure based on semantic, terminological and linguistic information Proceedings of the 6th International Conference on Ontology Matching , pp.264-265 [Details]
(2006) Konferenca za jezikovne tehnologije
Arcan, M; Vintar, S (2006) Avtomaticno Prepoznavanje Lastnih Imen Konferenca za jezikovne tehnologije , pp.1-6 [Details]

Thesis

  Year Publication
(2017) Machine translation of domain-specific expressions within ontologies and documents.
Arcan, M (2017) Machine translation of domain-specific expressions within ontologies and documents. Thesis [Details]

Modules Taught

  Term/Year Module Title Module Code Subject / Desc
2019, 2020 Advanced Natural Language Processing CT5121
2016, 2017 Introduction to Natural Language Processing CT5021