Machine Translation @ the NCLT & the CNGL

Dublin City University, Ireland

Home People Projects Publications Events Announcements Links
  • Apr 5, 2012 - 5 papers from our group accepted for publication at the 16th Annual Conference of European Association of Machine Translation(EAMT-2012). Trento, Italy
  • April 13, 2012 - DCLRS Seminar by Ana Guerberof, visiting researcher in SALIS, DCU on Productivity and quality in the post-editing of outputs from translation memories and machine translation at 4pm, Friday, 13th April, in L2.21 School of Computing, DCU.
  • April 25, 2012 - Irish Times publishes article on Sara Morrissey's work on Sign Language Machine Translation system
  • April 27, 2012 DCLRS Seminar: 3 Guest speakers, Grzegorz A. Kleparski, Marta Pikor-Niedziałek and Edyta Więcławska from University of Rzeszow, Poland at 4pm, Friday, 27th April, in L2.21 School of Computing, DCU.
  • Mar 27, 2012 - DCLRS Seminar 30th March 2012: On Stochastic Tree Distances and their training via Expectation-Maximisation by Dr Dr. Martin Emms from Trinity College Dublin at 4pm on Friday, 30th March 2012 in Room L2.21, School of Computing, Dublin City University
  • Mar 28, 2012 - CNGL/NCLT Seminar Series 28th March 2012: The Regression Model of Machine Translation Ergun Biçici (DCU) on Wednesday, 28th March 2012 at 4pm in Room L2.21, School of Computing, Dublin City University
  • Mar 22, 2012 - DCLRS Seminar 23rd March 2012: Finite-state temporality and the Content Value Chain by Dr Tim Fernando, Trinity College Dublin at 4pm on Friday, 23rd March 2012 in Room L2.21, School of Computing, Dublin City University
  • Mar 12, 2012 - CNGL/NCLT Seminar Series 14th March 2012: Domain and Personalised Tuning for Machine Translation: the CNGL demonstrator by Sara Morrissey (DCU) on Wednesday, 14th March 2012 at 4pm in Room L2.21, School of Computing, Dublin City University
  • Mar 5, 2012 - Hadoop Hackathon 2012 hosted by CNGL/NCLT on the weekend of 10th / 11th March 2012 in DCU. The Hackathon will consist of two days of group work, using Map Reduce and working on Natural language problems. Miles Osbourne from Edinburgh University will facilitate the event.
  • Feb-28, 2012 - CNGL to help devise W3C Standards for Multilingual Web Content: The Centre for Next Generation Localisation (CNGL) is to play a central role in the development of new W3C (World Wide Web Consortium) standards to simplify creation of Web content in the world's languages. CNGL researchers and industry partners will work with international collaborators on the new MultilingualWeb-LT (Language Technology) Working Group to develop standard means to support the translation and adaptation of Web content to local needs, from its creation through to its delivery to end users.
  • Feb 22, 2012 - DCLRS Seminar 24th Feb. 2012: MT evaluation: an industrial perspective by Dr Johann Roturier, Principal Research Engineer at CNGL partner, Symantec on Friday, 24th February 2012 at 4 pm in Room L2.21, School of Computing, Dublin City University
  • Apr 11 - The Centre for Next Generation Localisation (CNGL) is recruiting a Research Project Leader with an established international track record and reputation in the area of Machine Translation to join the Centre’s Research Leadership team and lead what is probably one of the world’s largest academic Machine Translation research groups comprising over 50 researchers. The position is based within the School of Computing at DCU and the post-holder will contribute to the normal teaching and administrative functions of a senior member of academic staff within the School. Read the complete advertisement in our new announcements section.
  • Apr 11 - We've had nine papers accepted for presentation at the 15th Annual Conference of the European Association for Machine Translation (EAMT 2011), to take place in Leuven, Belgium on May 30-31, 2011. The papers are entitled Combining Semantic and Syntactic Generalization in Example-Based Machine Translation. (with Sarah Ebling, Andy, Martin Volk and Sudip), CCG Contextual labels in Hierarchical Phrase-Based SMT (with Hal, Jie and Andy), Towards Using Web-Crawled Data for Domain Adaptation in Statistical Machine Translation (with Pavel, Antonio, Andy, Prokopis Prokopidis and Vassilis Papavassiliou), Using Example-Based MT to Support Statistical MT when Translating Homogeneous Data in a Resource-Poor Setting (with Sandipan, Sara, Andy and Mikel), (with Ankit, Yanjun and Andy), Assessing Three Transcription Methods for Sign Language Machine Translation and Evaluation (with Sara), Experiments on Domain Adaptation for Patent Machine Translation in the PLuTO project (with Alexandru Ceausu, John Tinsley, Andrew Way, Jian Zhang and Paraic Sheridan), Towards a User-Friendly Webservice Architecture for Statistical Machine Translation in the PANACEA project (with Antonio, Marc Poch, Pavel and Andy) and A Comparative Evaluation of Research vs. Online MT Systems (with Antonio, Federico Gaspari, Sudip and Andy)
  • Mar 7 - We've had a paper accepted for presentation at ACL-HLT 2011, to take place in Portland on June 19-24, 2011. The paper is entitled Consistent Translation using Discriminative Learning: A Translation Memory-inspired Approach. (with Yanjun, Yifan, Josef and Andy)
  • Jan 24 - We've had a paper accepted for presentation at the International Florida Artificial Intelligence Research Society Conference (FLAIRS-24), to take place on May 18-20, 2011 in Palm Beach, Florida. The paper is entitled Statistical Machine Translation with Terminology as Prior Knowledge (with Tsuyoshi and Andy).
  • Dec 9 - Dublin City University is supporting our proposal for collaboration with IIIT Hyderabad under the Indian Government Department of Science and Technology (DST) India-Ireland Cooperative Science Programme. The support provided by both DCU and DST will enable collaboration betweeen Irish and Indian researchers by covering travel and accomodation expenses over the following 24 months for visits from our group at DCU and from the group headed by Prof. Rajeev Sangal at IIIT Hyderabad.
  • Apr 11 - The Centre for Next Generation Localisation (CNGL) is recruiting a Research Project Leader with an established international track record and reputation in the area of Machine Translation to join the Centre’s Research Leadership team and lead what is probably one of the world’s largest academic Machine Translation research groups comprising over 50 researchers. The position is based within the School of Computing at DCU and the post-holder will contribute to the normal teaching and administrative functions of a senior member of academic staff within the School. Read the complete advertisement in our new announcements section.
  • Apr 11 - We've had nine papers accepted for presentation at the 15th Annual Conference of the European Association for Machine Translation (EAMT 2011), to take place in Leuven, Belgium on May 30-31, 2011. The papers are entitled Combining Semantic and Syntactic Generalization in Example-Based Machine Translation. (with Sarah Ebling, Andy, Martin Volk and Sudip), CCG Contextual labels in Hierarchical Phrase-Based SMT (with Hal, Jie and Andy), Towards Using Web-Crawled Data for Domain Adaptation in Statistical Machine Translation (with Pavel, Antonio, Andy, Prokopis Prokopidis and Vassilis Papavassiliou), Using Example-Based MT to Support Statistical MT when Translating Homogeneous Data in a Resource-Poor Setting (with Sandipan, Sara, Andy and Mikel), (with Ankit, Yanjun and Andy), Assessing Three Transcription Methods for Sign Language Machine Translation and Evaluation (with Sara), Experiments on Domain Adaptation for Patent Machine Translation in the PLuTO project (with Alexandru Ceausu, John Tinsley, Andrew Way, Jian Zhang and Paraic Sheridan), Towards a User-Friendly Webservice Architecture for Statistical Machine Translation in the PANACEA project (with Antonio, Marc Poch, Pavel and Andy) and A Comparative Evaluation of Research vs. Online MT Systems (with Antonio, Federico Gaspari, Sudip and Andy).
  • Mar 7 - We've had a paper accepted for presentation at ACL-HLT 2011, to take place in Portland on June 19-24, 2011. The paper is entitled Consistent Translation using Discriminative Learning: A Translation Memory-inspired Approach. (with Yanjun, Yifan, Josef and Andy).
  • Jan 24 - We've had a paper accepted for presentation at the International Florida Artificial Intelligence Research Society Conference (FLAIRS-24), to take place on May 18-20, 2011 in Palm Beach, Florida. The paper is entitled Statistical Machine Translation with Terminology as Prior Knowledge (with Tsuyoshi and Andy).
  • Dec 9 - Dublin City University is supporting our proposal for collaboration with IIIT Hyderabad under the Indian Government Department of Science and Technology (DST) India-Ireland Cooperative Science Programme. The support provided by both DCU and DST will enable collaboration betweeen Irish and Indian researchers by covering travel and accomodation expenses over the following 24 months for visits from our group at DCU and from the group headed by Prof. Rajeev Sangal at IIIT Hyderabad.
  • Nov 8 - We've had a paper accepted for presentation at the Machine Translation and Morphologically-rich Languages Research Workshop, to take place on 23-27 January, 2011 in Haifa, Israel. The paper is entitled MWE-sensitive Word Aligner in Factored Translation Model (with Tsuyoshi and Andy).
  • Oct 4 - We've had a paper accepted for presentation at the First Symposium on Patent Information Processing (SPIP), to take place on December 10, 2010 in Tokyo, Japan. The paper is entitled Statistical Machine Translation with Terminology (with Tsuyoshi and Andy).
  • Sep 29 - Two Postdoctoral research positions are available in machine translation at the Centre for Next Generation Localisation at Dublin City University, Ireland. The successful applicants will extend our active team of researchers. Post-doctoral positions are typically for 2 years, and funded in the range of Euro 38K-44K. We would like to fund these positions as soon as possible. Ideal candidates will have a strong background in statistical modelling, natural language processing and machine translation, as well as a track record of publications in the area. To apply, please send a CV and contact details of three referees by Friday 22nd October 2010 to Andy Way at away[AT]computing.dcu.ie in .pdf format, indicating also your preferred starting date, if successful. Please use this email address also for informal inquiries.
  • Sep 16 - We've had a paper accepted for presentation at the International Conference on Asian Language Processing 2010 (IALP 2010), to take place from December 28-30, 2010 in Harbin, China. The paper is entitled Hierarchical Pitman-Yor Language Model in Machine Translation (with Tsuyoshi and Andy). We've also had a poster accepted for presentation at the Workshop on Validation in Statistics and Machine Learning, to be held from October 6-7, 2010 in Berlin, Germany. The poster is entitled Statistical Significance Test in Machine Translation (with Tsuyoshi and Andy).
  • Aug 13 - We've had four papers accepted for presentation at the Fourth Workshop on Syntax and Structure in Statistical Translation (SSST-4), to take place on 28th August 2010, in Beijing, China, as part of COLING 2010. The papers are entitled Source-side Syntactic Reordering Patterns with Functional Words for Improved Phrase-based SMT (with Jie, Jinhua and Andy), HMM Word-to-Phrase Alignment with Dependency Constraints (with Yanjun and Andy), Deep Syntax Language Models and Statistical Machine Translation (with Yvette and Josef) and Seeding Statistical Machine Translation with Translation Memory Output through Tree-Based Structural Alignment (with Ventsislav).
  • Aug 13 - We've had a paper accepted for EMNLP 2010: Conference on Empirical Methods in Natural Language Processing, to take place from October 9-11, 2010 at MIT, Cambridge MA. The paper is entitled Facilitating Translation Using Source Language Paraphrase Lattices (with Jinhua, Jie and Andy).
  • Aug 03 - We have received €6,500 as part of the Faculty Research Capacity Building Fund from the Faculty of Computing and Engineering of DCU. The funded project, coordinated by Andy Way, Gareth Jones, Steve Gotz and Yanjun Ma, aims to develop MT and IR related mobile applications on devices such as iPad, iPhone and Nexus One. The Faculty Research Capacity Building Fund is being made available for the first time for the strategic development of research capacity within the Faculty of Computing and Engineering. Research Capacity Building (RCB) is, inter alia, about developing skills and confidence, supporting linkages and collaborative projects, developing appropriate dissemination, investing in infrastructure, ensuring the research is allied to School/Faculty/University strategy, and building elements of sustainability/continuity. It should concern the individual, the team(s), the institution and the wider national elements (economy, industry, community etc.).
  • Aug 03 - We've had eight papers accepted for publication at the Ninth Conference of the Association for Machine Translation in the Americas (AMTA 2010), which will be held on October 31-November 5, 2010 in Denver, Colorado. The papers are entitled Supertags as Source Language Context in Hierarchical Phrase-Based SMT (with Rejwanul, Sudip, Antal van den Bosch and Andy), Using TERp to Augment System Combination for SMT (with Jinhua and Andy), Improved Phrase-based SMT with Syntactic Reordering Patterns Learned from Lattice Scoring (with Jie, Jinhua and Andy), Combining Multi-Domain Statistical Machine Translation Models using Automatic Classification (with Pratyush, Jinhua, Sudip, Baoli Li, Andy and Josef), Improving the Post-Editing Experience Using Translation Recommendation: A User Study (with Yifan, Yanjun, Johann Roturier, Andy and Josef), Accuracy-Based Scoring for Phrase-Based Statistical Machine Translation (with Sergio, Yanjun, Daniel Galron and Andy), Maximising TM Performance through Sub-Tree Alignment and SMT (with Ventsislav) and f-align: An Open-Source Alignment Tool for LFG f-Structures (with Anton and Josef).
  • Jul 06 - We've had two papers accepted for publication. The first paper is to be presented at the Workshop on Applications of Pattern Analysis, and it is titled Gap Between Theory and Practice: Noise Sensitive Word Alignment in Machine Translation (with Tsuyoshi, Yvette and Andy). The second paper is to be presented at the Fourth International Workshop On Cross Lingual Information Access (collocated with COLING2010), and it is titled Multi-Word Expression Sensitive Word Alignment (with Tsuyoshi, Alfredo Maldonado-Guerra, Yvette and Andy).
  • Jun 30 - The CNGL has released a 'Twanslate' application which translates World Cup 2010 tweets from Twitter using our MaTrEx MT system for a range of European languages! This received some nice press in the Sunday Times, and in the Irish Times innovation supplement.
  • Jun 15 - We've had two papers accepted for COLING 2010, which will be held in Beijing from Aug 23-27, 2010. The papers are entitled A Discriminative Latent Variable-Based DE Classifier for Chinese--English SMT (with Jinhua and Andy) and SMT-TM Integration as Ranking (with Yifan, Yanjun, Andy and Josef).
  • Jun 15 - We've had a paper accepted for presentation at the 7th International Conference on Natural Language Processing (Icetal 2010), to take place on August 16-18, 2010, in Reykjavik, Iceland. The paper is entitled OpenMaTrEx: A free/open-source marker-driven example-based machine translation system (with Sandipan, Mikel, Declan, Sergio, John and Andy).
  • May 24 - We've just had a paper published in the Localisation Focus journal. The paper is entitled Integrated Language Technology as part of Next Generation Localisation (with Julie Carson-Berndsen, Harold, Carl Vogel and Andy).
  • May 24 - We've had three papers accepted for the Joint Fifth Workshop on Statistical Machine Translation and Metrics MATR, to be held at ACL 2010, which will be held in Uppsala, Sweden, on July 15-16, 2010. The first paper is titled MaTrEx: The DCU MT System for WMT 2010 (with Sergio, Rejwanul, Sandipan, Pratyush, Ankit, Jinhua, Pavel, Sudip, Mikel and Andy). The second paper is titled The DCU Dependency-based Metric in WMT-Metrics MATR 2010 (with Yifan, Jinhua, Andy and Josef), and the third paper is titled An Augmented Three-Pass System Combination Framework: DCU Combination System for WMT 2010 (with Jinhua, Pavel and Andy).
  • May 24 - We've had a paper accepted for ACL 2010, which will be held in Uppsala, Sweden from 11-16 July, 2010. The paper is entitled Bridging SMT and TM with Translation Recommendation (with Yifan, Yanjun, Josef and Andy)
  • May 24 - Andy is on the programme committee for the Student Session at the European Summer School in Logic, Language and Information, which will be held in Denmark from August 9-20, 2010. He is also on the programme committee for AMTA-2010, which will be held in Denver, Colorado, from October 31-November 5, 2010, and has agreed to act as co-Chair (with Patrick Pantel) for Tutorials at the 49th Annual Meeting of the Association for Computational Linguistics, to be held in Portland, Oregon, from June 19-24, 2011.
  • Apr 23 - Our Machine Translation group has released ‘OpenMaTrEx’, a free/open-source example-based machine translation (EBMT) system based on the marker hypothesis. The OpenMaTrEx EBMT system release comprises a marker-driven chunker, a collection of chunk aligners, and two engines: one based on the simple proof-of-concept monotone recombinator (released last January as 'Marclator') and a Moses-based decoder. This free/open-source release results from collaboration with Prof. Mikel L. Forcada of Universitat d’Alacant in Spain who is currently a visiting researcher within the CNGL MT group at DCU through an ETS Walton Award from Science Foundation Ireland (SFI).
  • Mar 31 - We've just had four papers accepted for presentation at the 14th Annual Conference of the EAMT (EAMT 2010), which will be held on May 27–-28 2010 in Saint-Raphaël, France. The papers are entitled Statistical Analysis of Alignment Characteristics for Phrase-based Machine Translation (with Patrik Lambert, Simon Petitrenaud, Yanjun and Andy), Lattice Score-Based Data Cleaning For Phrase-Based Statistical Machine Translation (with Jie, Andy amd Julie Berndsen), TMX Markup: A Challenge When Adapting SMT to the Localisation Environment (with Jinhua, Johann Roturier and Andy), and The Impact of Source-Side Reordering on Hierarchical Phrase-Based SMT (with Jinhua and Andy).
  • Dec 11 - We congratulate John on successfully defending his PhD. Thanks are due to the external examiner Prof. Jan Hajic (Charles Univ., Prague), to the internal examiner, Prof. Josef Van Genabith, and to Dr. Yanjun Ma for chairing the viva. Thanks are also due to Dr. Mary Hearne, Dr. Khalil Sima'an and to Dr. Ventzi Zhechev for their input into the ATTEMPT project on which John worked. John will be staying with us for some time yet in CNGL as a postdoctoral researcher.
  • Dec 8 - After 2 years of being looked after by John, the European Association for Machine Translation (EAMT) website is now administered by Sergio.
  • Dec 4 - Following the 3rd Workshop on Example-Based Machine Translation, which was held on November 12–-13 2009 at Dublin City University, a summary of the workshop was made available
  • Nov 17 - We've just had two journal papers accepted for publication. The first paper is entitled Metric and Reference Factors in Minimum Error Rate Training and will appear in Machine Translation (with Yifan and Andy). The second paper is entitled An Incremental Three-pass System Combination Framework by Combining Multiple Hypothesis Alignment Methods and will appear in the International Journal of Asian Language Processing (with Jinhua and Andy)
  • Nov 17 - Andy has been invited to the third Google Faculty Summit, to take place in Zurich 8-10 February 2010. The Natural Language Technologies stream of the 2010 Europe, Middle East and Africa (EMEA) Faculty Summit will address topics at the intersection of research in Natural Language Understanding and applied techniques for scalable Natural Language Processing.
  • Nov 17 - Andy is on the programme committee for NAACL-HLT 2010, which will be held in Los Angeles from June 1-June 6th, 2010. He is also on the programme committee for COLING 2010, which will be held in Beijing from Aug 23-27, 2010, and for ACL 2010, which will be held in Uppsala, Sweden from 11-16 July, 2010.
  • Oct 28 - We've just had three papers accepted for presentation at the 3rd Workshop on Example-Based Machine Translation, which will be held on November 12–-13 2009 at Dublin City University. The papers are entitled Hybrid Rule-Based -- Example-Based MT: Feeding Apertium with Sub-sentential Translation Units (with Felipe Sánchez-Martínez, Mikel, and Andy), Evaluating Syntax-Driven Approaches to Phrase Extraction for MT (with Ankit, Sergio, Declan and John), and A review of EBMT using proportional analogies (with Harold, Sandipan and Sudip)
  • Sep 30 - We've just had three papers accepted for presentation at the 23rd Pacific Asia Conference on Language, Information and Computation (PACLIC 23), which will be held on September 3--5 2009 at the City University of Hong Kong. The papers are entitled Experiments on Domain Adaptation for English--Hindi SMT (with Rejwanul, Sudip, Josef and Andy), Capturing lexical variation in MT evaluation using automatically built sense-cluster inventories (with Marianna, Yifan and Andy), and Dependency Relations as Source Context in Phrase-Based SMT (with Rejwanul, Sudip, Antal van den Bosch and Andy)
  • Sep 23 - Following the IAMT business meeting at MT Summit XII in Ottawa, Canada, a new IAMT Committee has been confirmed for the 2009-2011 period, including Andy as President-Elect
  • Sep 1 - We've just had a paper accepted for presentation at the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), which will be held on September 7--11 2009 in Bled, Slovenia. The paper is entitled Noise Reduction Experiments in Machine Translation (with Tsuyoshi, Sudip and Andy)
  • Aug 11 - We've just had a paper accepted for presentation at the International Conference on Asian Language Processing (IALP 2009), which will be held on Dececember 7--9 2009 in Singapore. The paper is entitled A Three-pass System Combination Framework by Combining Multiple Hypothesis Alignment Methods (with Jinhua and Andy)
  • Jul 28 - Mikel and Andy are co-organising the 3rd Workshop on Example-Based Machine Translation with the theme Going open-source to revive EBMT at DCU November 12-13, 2009.
  • Jul 8 - Mikel L. Forcada, a visiting professor on sabbatical leave from Universitat d'Alacant, Spain, will be based at CNGL, DCU from now until June 2010. Mikel is working with Prof. Andy Way on integration of an existing rule-based open-source machine translation platform with efficient corpus-based machine translation modules and tools. Mikel got an ETS Walton Award to fund this work.
  • Jun 18 - We've just had two papers accepted for publication in Machine Translation. The first paper is entitled Bidirectional Data-Driven Machine Translation for Irish and German Sign Languages (with Sara and Andy). The second paper is entitled Parallel Treebanks and their Exploitability in Machine Translation (with John and Andy)
  • Jun 18 - We've just had a paper accepted for presentation at the Interspeech 2009 conference, which will be held on September 6--10, 2009, in Brighton, UK. The paper is entitled Using Same-Language Machine Translation to Create Alternative Target Sequences for Text-To-Speech Synthesis (with Peter Cahill, Julie Berndsen, Jinhua and Andy)
  • Jun 18 - We've just had four papers accepted for presentation at MT Summit 2009, the Twelfth Machine Translation Summit, which will be held on August 26--30 at the Château Laurier, Ottawa, Canada. The papers are entitled Source-Side Context-Informed Hypothesis Alignment for Combining Outputs from Machine Translation Systems (with Jinhua and Andy), Tracking Relevant Alignment Characteristics for Machine Translation (with Patrik, Yanjun, Sylwia and Andy), Improving the Objective Function in Minimum Error Rate Training (with Yifan and Andy) and Using Percolated Dependencies for Phrase Extraction in SMT (with Ankit and Andy)
  • Jun 4 - We've just had a paper accepted for presentation at the RANLP 2009 conference, which will be held on September 14--16, 2009, in Borovets, Bulgaria. It is entitled Lexicalized Semi-Incremental Dependency Parsing (with Hany, Khalil Sima'an and Andy)
  • Jun 2 - We've just had two papers accepted for presentation at the EMNLP 2009 conference, which will be held on August 6--7 at the Suntec Singapore International Convention & Exhibition Centre, immediately following ACL/IJCNLP 2009. The papers are entitled A Syntactified Direct Translation Model with Linear-time Decoding (with Hany, Khalil Sima'an and Andy) and Accuracy-Based Scoring for DOT: Towards Direct Error Minimization for Data-Oriented Translation (with Daniel Galron, Sergio, Andy and I. Dan Melamed)
  • Jun 2 - We've just had a paper accepted for presentation at the NEWS 2009 workshop to take place on 7th August, 2009 at ACL/IJCNLP 2009. It is entitled English--Hindi Transliteration Using Context-Informed PB-SMT: the DCU System (with Rejwanul, Ankit, Sudip and Andy)
  • May 27 - We've just had a paper accepted for presentation at the TALN'09 conference to be held in Senlis, France on June 24--26 2009. It is entitled Données bilingues pour la TAS français-anglais : impact de la langue source et direction de traduction originales sur la qualité de la traduction (with Sylwia)
  • Apr 27 - We've just had a paper accepted for presentation at the ACL-IJCNLP 2009 Student Research Workshop to be held in Singapore on August 2--7 2009. It is entitled Data Cleaning for Word Alignment (with Tsuyoshi)
  • Apr 27 - We've just had a paper accepted for presentation at the TALN'09 conference to be held in Senlis, France on June 24--26 2009. It is entitled La place de la désambiguïsation lexicale dans la Traduction Automatique Statistique (with Marianna)
  • Apr 27 - We've just had a paper accepted for presentation at the Sign Language Corpora: Linguistics Issues workshop to be held in London on July 24--25 2009. It is entitled An Assessment of Appropriate Sign Language Representation for Machine Translation in the Healthcare Domain (with Sara)
  • Apr 12 - We've just had a paper accepted for presentation at the 14th International LFG Conference (LFG09) to be held in Cambridge, UK on July 13--16 2009. It is entitled F-Structure Transfer-Based Statistical Machine Translation (with Yvette, Anton Bryl and Josef)
  • Apr 12 - Andy is on the programme committee for EMNLP, which will be held on August 6--7 at the Suntec Singapore International Convention & Exhibition Centre
  • Apr 12 - Andy is on the programme committee for MT Summit XII, which will take place from August 25--29 2009 at the Chateau Laurier in Ottawa, Ontario, Canada.
  • Apr 12 - Andy is giving a keynote at the 2nd International Conference on Arabic Language Resources and Tools run by the MEDAR project, to take place on 22-23 April 2009 at the Hotel Grand Hyatt Cairo, Egypt.
  • Apr 6 - We've just had 5 papers accepted for publication at EAMT-2009, to be held in in Barcelona, Spain from May 14--15, 2009. They are entitled Using Supertags as Source Language Context in SMT (with Rejwanul, Sudip, Yanjun and Andy), Tuning Syntactically Enhanced Word Alignment for Statistical Machine Translation (with Yanjun, Patrik and Andy), Learning Labelled Dependencies in Machine Translation Evaluation (with Yifan and Andy), Marker-based Filtering of Bilingual Phrase Pairs for SMT (with Felipe Sanchez-Martinez and Andy) and Optimal Bilingual Data for French-English PB-SMT (with Sylwia and Andy)
  • Feb 25 - We've just had a paper accepted for presentation at the eLexicography in the 21st century (eLEX2009) conference, to be held in Louvain-la-Neuve (Belgium) on October 22--24 2009. It is entitled Discovering word sense relations from corpora using translation and distributional information (with Marianna)
  • Jan 15 - DCU will be hosting the 12th annual CLUK(I) colloquium on April 23--24 this year. The first call for papers can be found here
  • Jan 12 - Congratulations to Hany who successfully defended his PhD thesis, entitled Lexical Syntax for Statistical Machine Translation
  • Jan 12 - We've just had a paper accepted for presentation at the Fourth Workshop on Statistical Machine Translation, EACL 2009, to take place in Athens, Greece, on March 30 and 31, 2009. The paper is entitled MaTrEx: the DCU MT System for WMT 2009 (with Jinhua, Yifan, Sergio and Andy).
  • Jan 9 - We've just had and article accepted for publication in the Special Issue on Machine Translation of Asian Languages by ACM TALIP. It's entitled Bilingually Motivated Word Segmentation for Statistical Machine Translation (with Yanjun and Andy)
  • Dec 17 - We've just had 2 papers accepted for publication at EACL-2009, to be held in Athens, Greece in from March 30--April 3, 2009. They are entitled Bilingually Motivated Domain-Adapted Word Segmentation for Statistical Machine Translation (with Yanjun and Andy)and Data-Driven Semantic Analysis for Multilingual WSD and Lexical Selection in Translation (with Marianna)
  • Dec 15 - We've just had a paper accepted for publication at CICLing-2009, to be held in Mexico City in from March 1--9, 2009, entitled Parallel Treebanks in Phrase-Based Statistical Machine Translation (with John, Mary and Andy)
  • Dec 10 - Marianna is on the programme committee for the RECITAL 2009 in Senlis, France from June 24--26, 2009.
  • Dec 10 - We've just submitted our system description paper for the NLP Tools Contest: Statistical Machine Translation (English to Hindi), part of the 6th International Conference on Natural Language Processing in Pune, India, to be held on 20-22 December 2008.
  • Nov 10 - Andy is on the programme committee for the 12th EACL conference in Athens, Greece from March 30--April 3, 2008.
  • Nov 6 - Andy is attending the JHU 2009 Summer Workshop Planning Meeting from Nov 7--9 to defend a joint proposal with Khalil Sima'an.
  • Sep 30 - Congratulations to Patrik and his partner who just had a little baby girl on September 19th!
  • Sep 26 - Sara received the runner-up prize in the Localisation Research Centre's 2008 Best Thesis Award. Congratulations to her!
  • Sep 25 - We've just had a paper accepted for publication at the Second IEEE Spoken Language Technology (SLT) workshop, to be held in Goa, India in December 2008, entitled A Syntactic Language Model based on Incremental CCG Parsing (with Hany, Andy and Khalil Sima'an)
  • Aug 22 - We've just had paper accepted for publication at IWSLT 2008 in Honolulu, Hawaii entitled Exploiting Alignment Techniques in MaTrEx: The DCU MT System for IWSLT 2008 (with Yanjun, John, Hany, Jinhua and Andy).
  • Aug 1 - We've just had 1 paper and 1 poster accepted for publication at EAMT 2008 in Hamburg. The paper is entitled "Packed Rules for Probabilistic Transfer-Based Machine Translation" (with Yvette and Josef) and the poster is entitled "Word Association Models and Search Strategies for Discriminative Word Alignment" (with Patrik and Rafael Banchs).
  • Jul 30 - We are currently recruiting one PhD student in Machine Translation. The main focus of the work will concern tuning our range of MT systems to the different genres and text types seen in our industrial partners' documentation. See here for more information.
  • Jul 28 - We are currently recruiting one Post-doctoral researcher in Machine Translation. Research will focus on incorporating syntax into mainstream models of Statistical Machine Translation, as well as research and development of Tree-to-Tree models. See here for more information.
  • Jul 28 - Andy is on the organising committee (as one of the publicity chairs) for ACL-IJCNLP to take place in Singapore 2-7 August 2009.
  • Jul 1 - Andy and John will serve on the programme committees of AMTA and the AMTA Student Research Workshop repectively, to take place in Waikiki, HI from October 21-25.
  • Jun 25 - Sara has been chosen to present her PhD work at the Young Researcher's Consortium at ICCHP (11th International Conference on Computers Helping People with Special Needs) in Linz, Austria from July 7-11.
  • Jun 15 - We've just had a paper accepted for publication at CoLing 2008 in Manchester. The paper is entitled Automatic Generation of Parallel Treebanks (with Ventsi and Andy).
  • May 07 - Congratulations to Ventsi and his wife who just had a little baby boy on April 23!
  • May 02 - We are pleased to announce that Prof. Mikel Forcada from the Universitat d'Alacant will be spending a year with our MT group from June 2009 given his successful application for an SFI Walton Scholarship
  • May 02 - Congratulations to Sara who successfully defended her PhD thesis, entitled Data-Driven Machine Translation for Sign Languages! See here for more info
  • Apr 29 - Congratulations to Patrik who successfully defended his PhD thesis, entitled Exploiting Lexical Information and Discriminative Alignment Training in Statistical Machine Translation, at the UPC in Barcelona.
  • Apr 25 - Congratulations to Karolina who successfully defended her PhD thesis entitled A Novel Dependency-Based Evaluation Metric for Machine Translation! See here for more details.
  • May 07 - We've just had a paper accepted for TALN-08 entitled Comparing Constituency and Dependency Representations for SMT Prase-Extraction (with Mary, Sylwia and John)
  • May 02 - We've just had two papers accepted for LREC-08. The papers are entitled The ATIS Sign Language Corpus (with Sara, Andy, Jan Bungeroth, Daniel Stein, Hermann Ney and Lynette van Zijl) and Cross-corpus Evaluation of Word Alignment (with Sylwia Ozdowska).
  • May 02 - We've just had a paper accepted for publication in Machine Translation. The paper is entitled Evaluating Machine Translation with LFG Dependencies (with Karolina, Andy and Josef)
  • Apr 29 - Andy is on the Programme Committee for IWSLT 2008, to take place in Hawaii on October 20--21. IWSLT is co-located with AMTA 2008 this year.
  • Mar 4 - Andy is a keynote speaker at the 2nd Symposium on Innovations in Machine Translation Technologies taking place in Tokyo between March 19--21. This is hosted by the National Institute of Information and Communications Technology (NICT).
  • Mar 4 - Andy is on the programme committee for the SSST workshop at ACL-08, to take place in June in Columbus, OH.
  • Jan 21 - John was awarded a Microsoft Postgraduate Research Scholarship for word entitled Exploiting Parallel Treebanks to Improve Data-Driven Machine Translation. This story is featured in an article on the main DCU page here (and a clearer version of the picture here)
  • Jan 21 - Andy will be reviewing the EuroMatrix project in Prague Feb 04-05.
  • Jan 20 - Andy is one of the keynote speakers at the Mixing Approaches to Machine Translation workshop in Donostia in the Basque Country on Feb 14.
  • Jan 18 - Andy will be giving a talk on EBMT, and conducting some reviewing at the Second Symposium on Innovations in Machine Translation Technologies put on by the National Institute of Information and Communications Technology (NICT) in Tokyo on March 19-21.
  • Dec 9 - John attended TLT'07 in Bergen, Norway and presented a paper entitled Exploiting Parallel Treebanks to Improve Statistical Machine Translation
  • Oct 25 - It's Official: our MaTrEx system is the best MT system (well, for one language pair, on one data set, according to one evaluation ...) At the recent IWSLT-07 workshop in Trento, Italy, it was announced officially that the output from our Arabic-->English system, built by Hany Hassan, was preferred more than any other system according to human evaluators. This is on top of good performance as indicated by the BLEU score. For those interested, consult the presentation of our system's results as prepared by Andy Way, or the full-length paper.
  • Oct 24 - Sara attended the 4th IBM CAS Software & Systems Engineering Symposium and presented a poster entitled Combining Data-Driven Systems for Improved Sign Language Translation which received a special mention and the 'Best Poster Awards Ceremony'
  • Oct 8 - We have a paper accepted for TLT '07 in Bergen, Norway entitled Exploiting Parallel Treebanks for use in Statistical Machine Translation (with John, Mary, and Andy)
  • Sep 19 - Publications section updated with pdfs of all recent papers, including theses
  • Sep 17 - New article in our Events section detailing our participation at the recent conferences: TMI and MT Summit
  • Aug 31 - Congratulations to Bart who successfully defended his PhD thesis. His extern was Dr. Paul Bennett of the University of Manchester, Mary was internal examiner and Ray Walshe was Chair. See here for more info
  • July 9 - Sara is featured in an article on the main DCU page, in a story regarding the Google Anita Borg Scholarship. You can read it here
  • June 29 - We have five papers accepted for TMI 2007 in Skövde, Sweden in September 2007. The papers are entitled Exploiting Source Similarity for SMT using Context-Informed Features (with Nicolas Stroppa, Andy Way, and Antal van den Bosch), Capturing Translational Divergences with a Statistical Tree-to-Tree Aligner (with Mary Hearne, John Tinsley, Ventsislav Zhechev, and Andy Way), Hand in Hand: Automatic Sign Language to Speech Translation (with Andy Way, Daniel Stein, Philippe Dreuw, Hermann Ney, and Sara Morrissey), A Cluster-Based Representation for Multi-System MT Evaluation (with Nicolas Stroppa and Karolina Owczarzak), and Alignment-Guided Chunking (with Yanjun Ma, Nicolas Stroppa, and Andy Way).
  • June 8 - We had a paper accepted for the Conference and Workshop on Assistive Technology for People with Vision and Hearing Impairments entitled Joining Hands: Developing a Sign Language Machine Translation System with and for the Deaf Community (with Sara Morrissey and Andy Way)
  • May 31 - We have three papers accepted for the MT Summit XI in Copenhagen in September 2007. The papers are entitled Comparing Rule-Based and Data-Driven Approaches to Spanish-to-Basque Machine Translation (with Gorka Labaka and Kepa Sarasola from the University of the Basque Country, and Nicolas Stroppa), Robust Language Pair-Independent Sub-Tree Alignment (with Mary Hearne, John Tinsley and Ventsislav Zhechev), and Towards a Hybrid Data-Driven MT System for Sign Language Translation (with Sara Morrissey, and Daniel Stein, Jan Bungeroth, and Hermann Ney from RWTH Aachen).
  • May 28 - We have a post-doctoral research position available from August 1st. Please view the job advertisement and contact Prof. Andy Way if you're interested!
  • May 17 - Congratulations to Yanjun Ma who has been successfully proposed as an EACL Student Board representative.
  • May 9 - We have a paper accepted for the 2nd Workshop on SMT at ACL 2007, to take place in Prague, in June. The paper is entitled Labelled Dependencies in Machine Translation Evaluation, and is joint work with Karolina Owczarzak, Josef Van Genabith and Andy Way.
  • April 11 - The 2nd call for papers for TMI 07, chaired by Andy, has been annouced, along with a new website.
  • April 3 - Sara has been selected as a finalist for the Google Europe Anita Borg Memorial Scholarship for Women in Computing
  • Mar 26 - Andy is on the programme committee for the MT track in EMNLP, which directly follows ACL 2007.
  • Mar 23 - We have two papers accepted for ACL 2007, to take place in Prague, in June. The first is entitled Integrating Supertags into Phrase-based Statistical Machine Translation (with Hany Hassan and Khalil Sima'an) and the second is called Bootstrapping Word Alignment Via Word Packing with Yanjun Ma and Nicolas Stroppa.
  • Mar 1 - We have a paper accepted for the Syntax and Structure in Statistical Translation (SSST) workshop at NAACL-HLT 2007 entitled Dependency-Based Automatic Evaluation for Machine Translation (with Karolina Owczarzak and Josef Van Genabith).
  • Feb 19 - 23: Daniel Stein, a PhD student in RWTH Aachen, visited to partake in collaborative research in the area of Sign Language MT
  • Jan 12 - Nicolas gave a talk entitled Memory-Based Machine Translation at CLIN in the Netherlands
  • Jan 11 - Nicolas attended the METIS-II workshop in Leuven, Belgium
  • Dec 18 - Congratulations to Declan who successfully defended his PhD thesis. His extern was Prof. Philippe Langlais of theUniversity of Montreal, Josef was internal examiner and Mary was Chair.
  • Andy is programme chair for TMI-07, to take place in Skövde, Sweden from September 7 - 9th
  • Andy will serve on the programme committee for HLT-NAACL 2007
  • November 27-28 : Nicolas presented the DCU EBMT system at IWSLT in Kyoto.
  • Mary gave an invited talk at the first International Symposium on Parallel Treebanks in Stockholm.
  • Hany, Khalil, Mary and Andy have had a paper accepted for the IEEE/ACL 2006 Workshop on Spoken Language Translation
  • Andy has just received a grant from the DCU OVPR under their International Visitors Programme to have Prof. Hermann Ney of RWTH Aachen visit us, probably in November 2006 and April 2007.
  • September 6-7: Nicolas presented the DCU EBMT system at the NIST MT Workshop in Washington.
  • August 8-12: Bart, Karolina and Andy present papers at AMTA in Boston.
  • July 10-13 : Ríona and Mary presented a joint paper at LFG-06 in Konstanz, Germany.
  • June 19-20: Bart, Declan and Mary present papers at EAMT in Oslo.
  • We took over as webmasters for the LFG-DOP site.
  • Andy is programme chair for the MT track at ACL-07 in Prague from June 24 - 29th 2007
  • Antal, Nicolas, and Andy have had a paper accepted for the METIS-II Workshop on New Approaches to Machine Translation in Leuven.
Last update: March 22 2012
Related Sites: NCLT | School of Computing | School of Applied Languages and Intercultural Studies | Dublin City University