
    Luisa Coheur

    The task of Statistical Machine Translation depends on large amounts of training corpora. Despite the availability of several parallel corpora, these are typically composed of declarative sentences, which may not be appropriate when the goal is to translate other types of sentences, e.g., interrogatives. There have been efforts to create corpora of questions, especially in the context of the evaluation of Question-Answering systems. One of these corpora is the UIUC dataset, composed of nearly 6,000 questions and widely used in the task of Question Classification. In this work, we make available the Portuguese version of the UIUC dataset, which we manually translated, as well as the translation guidelines. We show the impact of this corpus on the performance of a state-of-the-art SMT system when translating questions. Finally, we present a taxonomy of translation errors, according to which we analyze the output of the automatic translation before and after using the corpus as training data.
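    As a hedged illustration of how such an impact study can be scored, the sketch below compares a baseline system with one retrained on the question corpus, using corpus-level BLEU via sacrebleu. The file names are hypothetical, and this is not the paper's actual evaluation code.

        # Hypothetical evaluation sketch: corpus-level BLEU before and after
        # adding the translated question corpus to the SMT training data.
        import sacrebleu

        def load_lines(path):
            with open(path, encoding="utf-8") as f:
                return [line.strip() for line in f]

        references = load_lines("uiuc_test.pt")        # manual Portuguese translations
        baseline = load_lines("baseline_output.pt")    # trained on declaratives only
        retrained = load_lines("retrained_output.pt")  # trained with the question corpus

        for name, hyps in [("baseline", baseline), ("with question corpus", retrained)]:
            bleu = sacrebleu.corpus_bleu(hyps, [references])
            print(f"{name}: BLEU = {bleu.score:.2f}")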
    As a linguistic phenomenon, collocations have been the subject of numerous studies, both in theoretical and descriptive linguistics and, more recently, in Natural Language Processing. In the area of Machine Translation there are still improvements to be made, as major translation engines do not handle collocations appropriately and end up producing unsatisfactory literal translations. Taking as a starting point our previous work on machine translation error analysis (Costa et al., 2015), in this article we present a corpus annotated with collocation errors and their classification. We believe that, in order to clearly understand the difficulties that collocations pose to Machine Translation engines, a detailed linguistic analysis of their errors is necessary.
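    To make the annotation task concrete, here is a hypothetical record structure for one annotated collocation error; the paper's actual annotation scheme may differ.

        # Hypothetical schema for a collocation-error annotation record.
        from dataclasses import dataclass

        @dataclass
        class CollocationError:
            source: str      # source-language collocation
            mt_output: str   # literal machine translation produced
            reference: str   # acceptable target-language rendering
            error_type: str  # error class from the classification

        example = CollocationError(
            source="prestar atenção",
            mt_output="lend attention",   # literal, unsatisfactory
            reference="pay attention",
            error_type="literal translation of the collocate",
        )
        print(example)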
    We focus on the task of linking topically related segments in a collection of documents. In this scope, an existing corpus of learning materials was annotated with links between its segments. Using this corpus, we evaluate clustering, topic models, and graph-community detection algorithms in an unsupervised approach to the linking task. We propose several schemes to weight the word co-occurrence graph in order to discover word communities, as well as a method for assigning segments to the discovered communities. Our experimental results indicate that the graph-community approach might be more suitable for this task.
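    The following minimal sketch illustrates the graph-community approach, assuming networkx, a simple co-occurrence count as the edge weight, and word-overlap assignment; the paper evaluates several weighting schemes and a more careful assignment method.

        # Build a word co-occurrence graph, discover word communities, and
        # assign each segment to the community with most shared words.
        from itertools import combinations
        from collections import Counter
        import networkx as nx
        from networkx.algorithms.community import greedy_modularity_communities

        segments = [
            ["sorting", "algorithm", "complexity"],
            ["graph", "traversal", "algorithm"],
            ["complexity", "analysis", "sorting"],
        ]  # toy tokenized segments

        cooc = Counter()
        for seg in segments:
            for u, v in combinations(sorted(set(seg)), 2):
                cooc[(u, v)] += 1  # within-segment co-occurrence counts

        G = nx.Graph()
        for (u, v), w in cooc.items():
            G.add_edge(u, v, weight=w)

        communities = list(greedy_modularity_communities(G, weight="weight"))
        for i, seg in enumerate(segments):
            best = max(range(len(communities)),
                       key=lambda c: len(communities[c] & set(seg)))
            print(f"segment {i} -> community {best}")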
    Several cases of autistic children successfully interacting with virtual assistants such as Siri or Cortana have been reported recently. In this demo we describe ChatWoz, an application that can be used as a Wizard of Oz to collect real data for dialogue systems, but also to allow children to interact with their caregivers through it, as it is based on a virtual agent. ChatWoz is composed of an interface controlled by the caregiver, which establishes what the agent will utter in a synthesised voice. Several elements of the interface can be controlled, such as the emotions shown on the agent's face. In this paper we focus on the child-caregiver interaction scenario and detail the features implemented to support it.
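    A minimal sketch of the wizard-to-agent control flow is given below; the field names are assumptions for illustration, not ChatWoz's actual interface.

        # Hypothetical shape of a message the caregiver's interface sends
        # to the embodied agent for rendering.
        from dataclasses import dataclass

        @dataclass
        class WizardMessage:
            utterance: str  # text the agent speaks via synthesised voice
            emotion: str    # facial emotion shown by the virtual agent

        msg = WizardMessage(utterance="Hello! How are you today?", emotion="happy")
        print(msg)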
    This paper describes a system to identify entailment and quantify semantic similarity between pairs of Portuguese sentences. The system relies on a corpus to build a supervised model, and employs the same features regardless of the task. Our experiments cover two types of features, contextualized embeddings and lexical features, which we evaluate separately and in combination. The model is derived from a voting strategy on an ensemble of distinct regressors, for similarity measurement, or calibrated classifiers, for entailment detection. Applying such a system to other languages mainly depends on the availability of corpora, since all features are either multilingual or language independent. We obtain competitive results on a recent Portuguese corpus, where our best result is obtained by joining embeddings with lexical features.
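    A minimal sketch of the two model flavours, assuming scikit-learn and a precomputed feature matrix (embeddings concatenated with lexical features), is shown below; the concrete regressors, classifiers and features used in the paper differ.

        # Similarity: voting over distinct regressors.
        # Entailment: a calibrated classifier.
        import numpy as np
        from sklearn.ensemble import VotingRegressor
        from sklearn.linear_model import Ridge, LogisticRegression
        from sklearn.svm import SVR
        from sklearn.calibration import CalibratedClassifierCV

        X = np.random.rand(100, 20)           # stand-in feature matrix
        y_sim = np.random.rand(100) * 5       # similarity scores on a 0-5 scale
        y_ent = np.random.randint(0, 2, 100)  # binary entailment labels

        similarity_model = VotingRegressor([("ridge", Ridge()), ("svr", SVR())])
        similarity_model.fit(X, y_sim)

        entailment_model = CalibratedClassifierCV(LogisticRegression(max_iter=1000))
        entailment_model.fit(X, y_ent)

        print(similarity_model.predict(X[:3]))
        print(entailment_model.predict_proba(X[:3])[:, 1])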
    Although current Question Generation systems can be used to automatically generate questions for students’ assessments, these need validation and, often, manual corrections. However, this information is never used to improve the performance of QG systems, where it can play an important role. In this work, we present a system, GEN, that learns from such (implicit) feedback in an online learning setting. Following an example-based approach, it takes as input a small set of sentence/question pairs and creates patterns, which are then applied to learning materials. Each generated question, after being corrected by the teacher, is used as a new seed in the next iteration, so more patterns are created each time. We also take advantage of the teacher’s corrections to score the patterns and thus rank the generated questions. We measure the teacher’s post-editing effort and show that GEN improves over time, reducing the average corrections needed per question from 70% to 30%.
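    The online loop can be pictured as follows; the scoring rule and names below are assumptions for illustration, not GEN's actual implementation.

        # Each teacher correction updates the score of the pattern that
        # produced the question; the corrected question becomes a new seed.
        from difflib import SequenceMatcher

        pattern_scores = {}  # pattern id -> running score
        seeds = [("The sun is a star.", "What is the sun?")]

        def update_score(pattern_id, generated, corrected):
            # Reward patterns whose questions need little post-editing.
            similarity = SequenceMatcher(None, generated, corrected).ratio()
            prev = pattern_scores.get(pattern_id, 0.0)
            pattern_scores[pattern_id] = 0.8 * prev + 0.2 * similarity

        generated = "What the sun is?"   # question produced by pattern "p1"
        corrected = "What is the sun?"   # teacher's post-edited version
        update_score("p1", generated, corrected)
        seeds.append(("The sun is a star.", corrected))  # new seed for next iteration
        print(pattern_scores)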
    We present a simple approach to create a “persona” conversational agent. First, we take advantage of a large collection of subtitles to train a generative model based on neural networks. Second, we handcraft a small corpus of interactions that specify our character (from now on, the “persona corpus”). Third, we enrich a retrieval-based engine with this corpus. Finally, we combine both into a single agent. A preliminary evaluation shows that the generative model can hardly implement a coherent “persona”, but can successfully complement the retrieval model.
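    A minimal sketch of the combination strategy: retrieval over the persona corpus with a confidence threshold, falling back to the generative model. The TF-IDF engine and threshold below are assumptions; the paper's actual components differ.

        from sklearn.feature_extraction.text import TfidfVectorizer
        from sklearn.metrics.pairwise import cosine_similarity

        persona_corpus = {
            "what is your name": "I'm Filipe, nice to meet you!",
            "where do you live": "I live in Lisbon.",
        }
        questions = list(persona_corpus)
        vectorizer = TfidfVectorizer()
        index = vectorizer.fit_transform(questions)

        def generative_reply(user_input):
            return "<reply from the subtitle-trained generative model>"  # placeholder

        def answer(user_input, threshold=0.5):
            sims = cosine_similarity(vectorizer.transform([user_input]), index)[0]
            best = sims.argmax()
            if sims[best] >= threshold:          # persona question: use retrieval
                return persona_corpus[questions[best]]
            return generative_reply(user_input)  # otherwise fall back to generation

        print(answer("what is your name"))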
    We present JUST.ASK, a freely and publicly available Question Answering system. Its architecture is composed of the usual Question Processing, Passage Retrieval and Answer Extraction components. Several details on the information generated and manipulated by each of these components are also provided to the user when interacting with the demonstration. Since JUST.ASK also learns to answer new questions based on users’ feedback, the user is invited to identify the correct answers, which are then used to retrieve answers to future questions.
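    Schematically, the three-stage pipeline looks as follows; the function bodies are placeholders, not JUST.ASK's implementation.

        def process_question(question):
            # e.g. classify the expected answer type and extract keywords
            return {"type": "PERSON", "keywords": ["wrote", "Os Lusíadas"]}

        def retrieve_passages(analysis):
            # e.g. query a search engine with the extracted keywords
            return ["Os Lusíadas was written by Luís de Camões in 1572."]

        def extract_answers(analysis, passages):
            # e.g. select and rank candidates matching the expected answer type
            return ["Luís de Camões"]

        analysis = process_question("Who wrote Os Lusíadas?")
        passages = retrieve_passages(analysis)
        print(extract_answers(analysis, passages))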
    Portuguese Sign Language (LGP), like Portuguese itself, evolved naturally, acquiring grammatical characteristics distinct from those of Portuguese. Thus, developing a translator between the two does not consist merely of mapping each word to a sign (signed Portuguese), but of guaranteeing that the resulting signs satisfy the grammar of Portuguese Sign Language and that the translations are semantically correct. Previous work relies exclusively on manual translation rules and is very limited in the range of grammatical phenomena covered, producing little more than signed Portuguese. In this article we present the first translation system from Portuguese to Portuguese Sign Language, PE2LGP, which, in addition to manual rules, relies on translation rules built automatically from a reference corpus. Given a Portuguese sentence, the system returns a sequence of glosses with markers that identify expressions...
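    A toy illustration of why gloss translation is more than word-for-word mapping: LGP, for instance, tends to front temporal expressions and drop some function words. The lexicon and rule below are simplified assumptions, not PE2LGP's corpus-derived rules.

        lexicon = {"eu": "EU", "vou": "IR", "à": None, "escola": "ESCOLA", "amanhã": "AMANHÃ"}
        TIME_GLOSSES = {"AMANHÃ"}

        def translate(sentence):
            glosses = [g for w in sentence.lower().split() if (g := lexicon.get(w))]
            # Reordering rule: move temporal expressions to the front.
            time = [g for g in glosses if g in TIME_GLOSSES]
            rest = [g for g in glosses if g not in TIME_GLOSSES]
            return " ".join(time + rest)

        print(translate("Eu vou à escola amanhã"))  # -> "AMANHÃ EU IR ESCOLA"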
    Two sentences can be related in many different ways. Distinct tasks in natural language processing aim to identify different semantic relations between sentences. We developed several models for natural language inference and semantic textual similarity for the Portuguese language. We took advantage of pre-trained models (BERT); additionally, we studied the role of lexical features. We tested our models on several datasets (ASSIN, SICK-BR and ASSIN2), and the best results were usually achieved with ptBERT-Large, trained on a Brazilian corpus and fine-tuned on these datasets. Besides obtaining state-of-the-art results, this is, to the best of our knowledge, the most comprehensive study of natural language inference and semantic textual similarity for the Portuguese language.
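    A minimal inference sketch with Hugging Face Transformers is shown below; the model id refers to BERTimbau (a Portuguese BERT) and stands in for the paper's ptBERT-Large, and the classification head and label count are assumptions.

        import torch
        from transformers import AutoTokenizer, AutoModelForSequenceClassification

        model_name = "neuralmind/bert-large-portuguese-cased"  # assumed stand-in
        tokenizer = AutoTokenizer.from_pretrained(model_name)
        model = AutoModelForSequenceClassification.from_pretrained(model_name, num_labels=3)

        # Encode a premise/hypothesis pair as one sequence, as in standard NLI setups.
        batch = tokenizer("Um homem toca guitarra.", "Alguém toca um instrumento.",
                          return_tensors="pt")
        with torch.no_grad():
            logits = model(**batch).logits
        print(logits.softmax(dim=-1))  # class scores (head still untrained here)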
    Abstract. We present a syntax/semantics interface that was developed having in mind a set of problems identified in the Edite system, which was based on a traditional syntax/semantics interface. In our syntax/semantics interface, syntactic and semantic rules are independent, semantic rules are hierarchically organized, and partial analyses can be produced. Keywords. Syntax/semantics interface, partial results, hierarchically organized semantic rules.
    Abstract. This report addresses the problem of maintaining linguistic data collections adequate to the needs of different applications. We posit that when developing NLP applications, one has to manage not only the software development process, but also the linguistic data: handling them separately will reduce the complexity of the process as a whole, thereby increasing the overall quality. Data consistency is also improved since there is only one collection to manage. We present two illustrative experiments that benefitted ...
    There are several tools for the Portuguese language. However, due to different choices underlying these tools' behaviour (different preprocessing, different labels, etc.), it is difficult to get an idea of each one's comparative performance. In this work, we propose an evaluation of publicly available, free tools that perform Part-of-Speech Tagging and Named Entity Recognition for the Portuguese language. We evaluate twelve different models for the first task and eight for the second. All the resources used in this evaluation (mapping tables between labels, testing corpora, etc.) will be made available, allowing the results presented here to be replicated/fine-tuned. We also present a qualitative analysis of two dependency parsers. To the best of our knowledge, no recent work considering the recently available tools has been carried out for the Portuguese language.
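    Since each tool ships its own tagset, scoring requires mapping its labels onto a common one first; the sketch below shows the idea with a toy mapping, not the paper's published tables.

        MAPPING = {"SUBST": "N", "NOUN": "N", "V-FIN": "V", "VERB": "V"}  # tool tag -> common tag

        def accuracy(gold, predicted):
            gold = [MAPPING.get(tag, tag) for tag in gold]
            predicted = [MAPPING.get(tag, tag) for tag in predicted]
            return sum(g == p for g, p in zip(gold, predicted)) / len(gold)

        gold_tags = ["N", "V", "N"]
        tool_tags = ["SUBST", "V-FIN", "NOUN"]  # another tool's native tagset
        print(f"accuracy after mapping: {accuracy(gold_tags, tool_tags):.2f}")  # 1.00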
    An algorithm for text analysis is presented: the leaves analysis algorithm. Our approach, inscribed in the 5P methodology, is characterised by several points. The declarative source of the algorithm is completely separated from the linguistic descriptions; here we set ourselves apart from approaches based on unification grammars, where the grammar itself has the double function of expressing the linguistic descriptions while serving as the algorithm's declarative source. We can choose the fineness of the analysis by extracting more or less information from the descriptions, according to the functionality we want to provide. We also emphasise the general principles governing language behaviour (diabolic transition).
