Dr MIHAEL ARCAN

Ph.D

Contact Details

Research Fellow
E: mihael.arcan@nuigalway.ie
 
 

Biography

I am a research fellow at the Insight Centre for Data Analytics at NUI Galway. I work in the Unit for Natural Language Processing (UNLP) under the supervision of Dr Paul Buitelaar, where my main research topic is term translation with neural machine translation (NMT). I defended my PhD thesis, titled "Machine Translation of Domain-Specific Expressions within Ontologies and Documents".

I studied German Language at the University of Ljubljana, Slovenia, where I defended my Diploma thesis "Named Entity Recognition for German and Slovene" under the supervision of Dr Stojan Bračič and Dr Špela Vintar. In 2009 I obtained my master's degree in Computational Linguistics at the Ruhr University in Bochum, Germany. My thesis, supervised by Prof. Dr Ralf Klabunde, dealt with the extraction of semantic relations from the Slovene national corpus.
After my studies, I worked for Lionbridge as a developer of language technologies for Slovene, and for the Slovenian project “Communication in Slovene”.

At Insight@NUIG (and previously at DERI) I worked on the MONNET (Multilingual Ontologies for Networked Knowledge) project, a European project with the goal of providing standards and technology to facilitate multilingual access to Semantic Web knowledge resources. During my studies, I focused on the challenges of multilinguality in language resources across different projects.

Research Projects

Project | Start Date | End Date
NURS – Neural Machine Translation for Under-Resourced Scenarios | 01-JUL-18 | 31-AUG-19

Peer Reviewed Journals

Buitelaar, P; Wood, ID; Negi, S; Arcan, M; McCrae, JP; Abele, A; Robin, C; Andryushechkin, V; Ziad, H; Sagha, H; Schmitt, M; Schuller, BW; Sanchez, JF; Iglesias, CA; Navarro, C; Giefer, A; Heise, N; Masucci, V; Danza, FA; Caterino, C; Smrz, P; Hradis, M; Povolny, F; Klimes, M; Matejka, P; Tummarello, G (2018) 'MixedEmotions: An Open-Source Toolbox for Multi-Modal Emotion Analysis'. IEEE Transactions on Multimedia.
Arcan, M; Turchi, M; Tonelli, S; Buitelaar, P (2017) 'Leveraging bilingual terminology to improve machine translation in a CAT environment'. Natural Language Engineering: 1-26.
McCrae, JP; Arcan, M; Asooja, K; Gracia, J; Buitelaar, P; Cimiano, P (2016) 'Domain adaptation for ontology localization'. Journal of Web Semantics, 36: 23-31.

Conference Publications

Chakravarthi, BR; Priyadharshini, R; Stearns, B; Jayapal, A; Srivedy, S; Arcan, M; Zarrouk, M; McCrae, JP (2019) 'Multilingual Multimodal Machine Translation for Dravidian Languages utilizing Phonetic Transcription'. Proceedings of the 2nd Workshop on Technologies for MT of Low Resource Languages (LoResMT 2019).
Moussallem, D; Ngonga Ngomo, AC; Buitelaar, P; Arcan, M (2019) 'Utilizing Knowledge Graphs for Neural Machine Translation Augmentation'. International Conference on Knowledge Capture (K-CAP).
Chakravarthi, BR; Arcan, M; McCrae, JP (2019) 'WordNet Gloss Translation for Under-resourced Languages using Multilingual Neural Machine Translation'. Proceedings of the Second Workshop on Multilingualism at the Intersection of Knowledge Bases and Machine Translation, Dublin, Ireland, pp.1-7.
Kuzman, T; Vintar, Š; Arcan, M (2019) 'Neural Machine Translation of Literary Texts from English to Slovene'. Literary Machine Translation Workshop, co-located with MT-Summit 2019.
Arcan, M; Torregrosa, D; Ahmadi, S; McCrae, JP (2019) 'Inferring Translation Candidates for Multilingual Dictionary Generation with Multi-Way Neural Machine Translation'. Proceedings of the Translation Inference Across Dictionaries Workshop (TIAD 2019).
Torregrosa, D; Pasricha, N; Chakravarthi, BR; Masoud, M; Alonso, J; Casas, N; Arcan, M (2019) 'Leveraging Rule-Based Machine Translation Knowledge for Under-Resourced Neural Machine Translation Models'. Machine Translation Summit 2019.
Delahunty, F; Johansson, R; Arcan, M (2019) 'Passive Diagnosis Incorporating the PHQ-4 for Depression and Anxiety'. Social Media Mining for Health Applications Workshop.
Torregrosa, D; Arcan, M; Ahmadi, S; McCrae, J (2019) 'TIAD 2019 Shared Task: Leveraging Knowledge Graphs with Neural Machine Translation for Automatic Multilingual Dictionary Generation'. Translation Inference Across Dictionaries.
Ahmadi, S; Arcan, M; McCrae, J (2019) 'Lexical Sense Alignment using Weighted Bipartite b-Matching'. Proceedings of the Poster Track of LDK 2019.
Chakravarthi, BR; Arcan, M; McCrae, JP (2019) 'Comparison of Different Orthographies for Machine Translation of Under-Resourced Dravidian Languages'. 2nd Conference on Language, Data and Knowledge (LDK 2019).
Arcan, M; Montiel-Ponsoda, E; McCrae, JP; Buitelaar, P (2018) 'Automatic Enrichment of Terminological Resources: the IATE RDF Example'. Language Resources and Evaluation Conference (LREC 2018).
Chakravarthi, BR; Arcan, M; McCrae, JP (2018) 'Improving wordnets for under-resourced languages using Machine Translation'. Proceedings of the 9th Global WordNet Conference (GWC 2018).
Ahmadi, S; Arcan, M; McCrae, J (2018) 'On Lexicographical Networks'. Workshop on eLexicography: Between Digital Humanities and Artificial Intelligence.
Arcan, M (2018) 'A comparison of Statistical and Neural Machine Translation for Slovene, Serbian and Croatian'. Language Technologies and Digital Humanities 2018.
Delahunty, F; Wood, ID; Arcan, M (2018) 'First Insights on a Passive Major Depressive Disorder Prediction System with Incorporated Conversational Chatbot'. Irish Conference on Artificial Intelligence and Cognitive Science.
McCrae, JP; Arcan, M; Buitelaar, P (2017) 'Linking Knowledge Graphs across Languages with Semantic Similarity and Machine Translation'. Workshop on Multi-Language Processing in a Globalising World (MLP2017).
Arcan, M; Dragoni, M; Buitelaar, P (2016) 'The ESSOT System Goes Wild: an Easy Way For Translating Ontologies'. International Semantic Web Conference (ISWC16), Poster&Demo Section.
Chatterjee, R; Arcan, M; Negri, M; Turchi, M (2016) 'Instance Selection for Online Automatic Post-Editing in a Multi-domain Scenario'. The Twelfth Biennial Conference of the Association for Machine Translation in the Americas (AMTA 2016).
Popovic, M; Arcan, M; Klubicka, F (2016) 'Language related issues for machine translation between closely related South Slavic languages'. Third Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial-3).
Arcan, M; Popovic, M; Buitelaar, P (2016) 'Asistent -- a machine translation system for Slovene, Serbian and Croatian'. Proceedings of the 10th Conference on Language Technologies and Digital Humanities, Ljubljana, Slovenia, September.
Popovic, M; Arcan, M; Lommel, A (2016) 'Potential and Limits of Using Post-edits as Reference Translations for MT Evaluation'. European Association for Machine Translation (EAMT-2016).
Arcan, M; Dragoni, M; Buitelaar, P (2016) 'ESSOT: an expert supporting system for ontology translation'. International Conference on Applications of Natural Language to Information Systems, pp.60-73.
Arcan, M; Dragoni, M; Buitelaar, P (2016) 'Translating ontologies in real-world settings'. International Semantic Web Conference, pp.241-256.
Popovic, M; Arcan, M (2016) 'PE2rr corpus: manual error annotation of automatically pre-annotated MT post-edits'. Language Resource and Evaluation Conference (LREC).
Arcan, M; McCrae, JP; Buitelaar, P (2016) 'Expanding wordnets to new languages with multilingual sense disambiguation'. International Conference on Computational Linguistics (COLING-2016).
Arcan, M; Lane, C; O Droighneain, E; Buitelaar, P (2016) 'IRIS: English-Irish Machine Translation System'. Language Resource and Evaluation Conference (LREC).
Monti, J; Sangati, F; Arcan, M (2015) 'TED-MWE: a bilingual parallel corpus with MWE annotation'. Second Italian Conference on Computational Linguistics (CLiC-it 2015).
Popovic, M; Arcan, M; Avramidis, E; Burchardt, A; Lommel, A (2015) 'Poor man’s lemmatisation for automatic error classification'. European Association for Machine Translation (EAMT-2015).
Arcan, M; Asooja, K; Ziad, H; Buitelaar, P (2015) 'OTTO - Ontology Translation System'. International Semantic Web Conference (Posters & Demos).
Popovic, M; Arcan, M (2015) 'Identifying main obstacles for statistical machine translation of morphologically rich South Slavic languages'. 18th Annual Conference of the European Association for Machine Translation (EAMT-15).
Arcan, M; Turchi, M; Buitelaar, P (2015) 'Knowledge Portability with Semantic Expansion of Ontology Labels'. Association for Computational Linguistics (ACL-2015), pp.708-718.
Monti, J; Sangati, F; Arcan, M (2015) 'Multi-word expressions in a parallel bilingual spoken corpus: data annotation and initial identification results'. PARSEME 5th General Meeting (Poster).
Arcan, M; Buitelaar, P (2015) 'MixedEmotions: Social Semantic Emotion Analysis for Innovative Multilingual Big Data Analytics Markets'. Proceedings of the 18th Annual Conference of the European Association for Machine Translation.
Arcan, M; Giuliano, C; Turchi, M; Buitelaar, P (2014) 'Identification of bilingual terms from monolingual documents for statistical machine translation'. 4th International Workshop on Computational Terminology (Computerm), co-located with COLING 2014.
Vulcu, G; Buitelaar, P; Negi, S; Pereira, B; Arcan, M; Coughland, B; Sanchez Rada, JF; Iglesias Fernandez, CA (2014) 'Generating Linked-Data based Domain-Specific Sentiment Lexicons from Legacy Language and Semantic Resources'. 9th Edition of Language Resources and Evaluation Conference (LREC2014), pp.1-4.
Arcan, M; Turchi, M; Tonelli, S; Buitelaar, P (2014) 'Enhancing statistical machine translation with bilingual terminology in a CAT environment'. Proceedings of the 11th Biennial Conference of the Association for Machine Translation in the Americas (AMTA 2014), pp.54-68.
Arcan, M; Thomas, SM; De Brandt, D; Buitelaar, P (2013) 'Translating the FINREP taxonomy using a domain-specific corpus'. Machine Translation Summit XIV.
Arcan, M; Buitelaar, P (2013) 'MONNET: Multilingual Ontologies for Networked Knowledge'. Proceedings of the XIV Machine Translation Summit - European projects presentation.
Buitelaar, P; Arcan, M; Iglesias Fernandez, CA; Sanchez Rada, JF; Strapparava, C (2013) 'Linguistic linked data for sentiment analysis'. 2nd Workshop on Linked Data in Linguistics (LDL-2013): Representing and linking lexicons, terminologies and other language data, pp.1-8.
Arcan, M; Buitelaar, P (2013) 'Ontology Label Translation'. Proceedings of the NAACL HLT 2013 Student Research Workshop, pp.40-46.
Arcan, M; Federmann, C; Buitelaar, P (2012) 'Experiments with Term Translation'. Proceedings of the International Conference on Computational Linguistics (COLING), pp.67-82.
Arcan, M; Buitelaar, P; Federmann, C (2012) 'Using domain-specific and collaborative resources for term translation'. Proceedings of the Sixth Workshop on Syntax, Semantics and Structure in Statistical Translation, pp.86-94.
Arhar Holdt, S; Arcan, M (2011) 'Avtomatsko pridobivanje besednih zvez iz korpusa z uporabo leksikona SSJ'. Meddisciplinarnost v slovenistiki (Obdobja 30), pp.13-20.
Aggarwal, N; Wunner, T; Arcan, M; Buitelaar, P; O'Riain, S (2011) 'A similarity measure based on semantic, terminological and linguistic information'. Proceedings of the 6th International Conference on Ontology Matching, pp.264-265.
Arcan, M; Vintar, S (2006) 'Avtomaticno Prepoznavanje Lastnih Imen'. Konferenca za jezikovne tehnologije, pp.1-6.

Thesis

Arcan, M (2017) 'Machine translation of domain-specific expressions within ontologies and documents'. Thesis.

Modules Taught

Term/Year | Module Title | Module Code | Subject / Desc
2018 | Advanced Natural Language Processing | CT5121 |
2016, 2017 | Introduction to Natural Language Processing | CT5021 |