1. Articles in category: Machine Translation

    7465-7488 of 7717 « 1 2 ... 309 310 311 312 313 314 315 ... 320 321 322 »
    1. Machine Learning Methods in Natural Language Processing

      Machine Learning Methods in Natural Language Processing Michael Collins MIT CSAIL Some NLP Problems Information extraction – Named entities – Relationships between entities Finding linguistic structure – Part-of-speech tagging – Parsing Machine translation Common Themes Need to learn mapping from one discrete structure to another – Strings to hidden state sequences Named-entity extraction, part-of-speech tagging – Strings to strings Machine translation – Strings to underlying trees Pa
      Read Full Article
    2. Slides from a talk given at Conll 2006

      An SVM Approach for Natural Language Learning Michael Collins MIT EECS/CSAIL Joint work with Peter Bartlett, David McAllester, Ben Taskar Supervised Learning in NLP • Goal is to learn a function F : X → Y , where X is a set of possible inputs, Y is a set of possible outputs. • We have a training sample (x1 , y1 ), (x2 , y2 ), . . . , (xn , yn ) where each (xi , yi ) ∈ X × Y E.g., each xi is a sentence, each yi is a gold-standard parse Global Linear Models • Three components: GEN is a func
      Read Full Article
    3. Hidden-Variable Models for Discriminative Reranking.

      Hidden–Variable Models for Discriminative Reranking Terry Koo MIT CSAIL maestro@mit.edu Michael Collins MIT CSAIL mcollins@csail.mit.edu Abstract We describe a new method for the representation of NLP structures within reranking approaches. We make use of a conditional log–linear model, with hidden variables representing the assignment of lexical items to word clusters or word senses. The model learns to automatically make these assignments based on a discriminative training criterion. Trainin
      Read Full Article
    4. Clause Restructuring for Statistical Machine Translation.

      Clause Restructuring for Statistical Machine Translation Michael Collins MIT CSAIL mcollins@csail.mit.edu Philipp Koehn School of Informatics University of Edinburgh pkoehn@inf.ed.ac.uk Ivona Kuˇ erov´ c a MIT Linguistics Department kucerova@mit.edu Abstract We describe a method for incorporating syntactic information in statistical machine translation systems. The first step of the method is to parse the source language string that is being translated. The second step is to apply a series of
      Read Full Article
    5. Discriminative Syntactic Language Modeling for Speech Recognition.

      Discriminative Syntactic Language Modeling for Speech Recognition Michael Collins MIT CSAIL mcollins@csail.mit.edu Brian Roark OGI/OHSU roark@cslu.ogi.edu Murat Saraclar Bogazici University murat.saraclar@boun.edu.tr Abstract We describe a method for discriminative training of a language model that makes use of syntactic features. We follow a reranking approach, where a baseline recogniser is used to produce 1000-best output for each acoustic input, and a second “reranking” model is then used
      Read Full Article
    6. Discriminative Reranking for Natural Language Parsing.

      Discriminative Reranking for Natural Language Parsing Michael Collins and Terry Koo Massachusetts Institute of Technology This paper considers approaches which rerank the output of an existing probabilistic parser. The base parser produces a set of candidate parses for each input sentence, with associated probabilities that de ne an initial ranking of these parses. A second model then attempts to improve upon this initial ranking, using additional features of the tree as evidence. The strength
      Read Full Article
    7. A Discriminative Model for Tree-to-Tree Translation.

      A Discriminative Model for Tree-to-Tree Translation Brooke Cowan MIT CSAIL brooke@csail.mit.edu Ivona Ku˘ erov´ c a MIT Linguistics Department kucerova@mit.edu Michael Collins MIT CSAIL mcollins@csail.mit.edu Abstract This paper proposes a statistical, treeto-tree model for producing translations. Two main contributions are as follows: (1) a method for the extraction of syntactic structures with alignment information from a parallel corpus of translations, and (2) use of a discriminative, fea
      Read Full Article
    8. Chinese Syntactic Reordering for Statistical Machine Translation.

      Chinese Syntactic Reordering for Statistical Machine Translation Philipp Koehn Michael Collins Chao Wang MIT CSAIL School of Informatics MIT CSAIL 32 Vassar Street, Room 362 32 Vassar Street, Room G-484 2 Buccleuch Place, 5BP 2L2 Edinburgh, EH8 9LW, UK Cambridge, MA 02139, USA Cambridge, MA 02139, USA wangc@csail.mit.edu mcollins@csail.mit.edu pkoehn@inf.ed.ac.uk Abstract Syntactic reordering approaches are an effective method for handling word-order differences between source and target langua
      Read Full Article
    9. Head-Driven Statistical Models for Natural Language Parsing

      HEAD-DRIVEN STATISTICAL MODELS FOR NATURAL LANGUAGE PARSING Michael Collins A DISSERTATION in Computer and Information Science Presented to the Faculties of the University of Pennsylvania in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy 1999 Professor Mitch Marcus Supervisor of Dissertation Professor Jean Gallier Graduate Group Chair COPYRIGHT Michael Collins 1999 Acknowledgements Mitch Marcus was a wonderful advisor. He gave consistently good advice, and all
      Read Full Article
    10. Machine translation of online product support articles using a data-driven MT system.

      Machine translation of online product support articles using a data-driven MT system Stephen D. Richardson September 2004 Available Documents: Word 187KB At AMTA 2002, we reported on a pilot project to machine translate Microsoft's Product Support Knowledge Base into Spanish. The successful pilot has since resulted in the permanent deployment of both Spanish and Japanese versions of the knowledge base, as well as ongoing pilot projects for French and German. The translated articles in each case
      Read Full Article
    11. Towards a Simple and Accurate Statistical Approach to Learning Translation Relationships among Words.

      Towards a Simple and Accurate Statistical Approach to Learning Translation Relationships among Words Robert C. Moore July 2001 Available Documents: PDF 57 KB We report on a project to derive word translation relationships automatically from parallel corpora. Our effort is distinguished by the use of simpler, faster models than those used in previous high-accuracy approaches. Our methods achieve accuracy on single-word translations that seems comparable to any work previously reported, up to near
      Read Full Article
    12. Statistical Machine Translation Using Labeled Semantic Dependency Graphs.

      Statistical Machine Translation Using Labeled Semantic Dependency Graphs Anthony Aue; Arul Menezes; Robert Moore; Chris Quirk; Eric Ringger October 2004 Available Documents: PDF 120K We present a series of models for doing statistical machine translation based on labeled semantic dependency graphs. We describe how these models were employed to augment an existing example-based MT system, and present results showing that doing so led to a significant improvement in translation quality as measured
      Read Full Article
    13. Training a Sentence-Level Machine Translation Confidence Measure.

      Training a Sentence-Level Machine Translation Confidence Measure Chris Quirk May 2004 Available Documents: PDF 240KB We present a supervised method for training a sentence level confidence measure on translation output using a human-annotated corpus. We evaluate a variety of machine learning methods. The resultant measure, while trained on a very small dataset, correlates well with human judgments, and proves to be effective on one task based evaluation. Although the experiments have only been r
      Read Full Article
      Mentions: Lisbon Portugal
    14. Fast and Accurate Sentence Alignment of Bilingual Corpora.

      Fast and Accurate Sentence Alignment of Bilingual Corpora Robert C. Moore October 2002 Available Documents: PDF 60 KB We present a new method for aligning sentences with their translations in a parallel bilingual corpus. Previous approaches have generally been based either on sentence length or word correspondences. Sentence-length-based methods are relatively fast and fairly accurate. Word-correspondence-based methods are generally more accurate but much slower, and usually depend on cognates o
      Read Full Article
    7465-7488 of 7717 « 1 2 ... 309 310 311 312 313 314 315 ... 320 321 322 »
  1. Categories

    1. Default:

      Discourse, Entailment, Machine Translation, NER, Parsing, Segmentation, Semantic, Sentiment, Summarization, WSD
  2. Popular Articles

  3. Organizations in the News

    1. (24 articles) Google
    2. (14 articles) Google Translate
    3. (14 articles) Microsoft
    4. (8 articles) Microsoft Word
    5. (8 articles) ETS
    6. (8 articles) SDL
    7. (6 articles) NLP
    8. (5 articles) PR Newswire
    9. (5 articles) IBM
    10. (5 articles) Tsinghua University
    11. (4 articles) Systran
    12. (4 articles) Symantec
  4. Locations in the News

    1. (16 articles) China
    2. (8 articles) Beijing
    3. (8 articles) Wakefield
    4. (8 articles) Maidenhead
    5. (8 articles) Amazon
    6. (7 articles) New York
    7. (7 articles) Massachusetts
    8. (7 articles) Baidu
    9. (6 articles) North Korea
    10. (6 articles) England
    11. (4 articles) Singapore
    12. (4 articles) India
  5. People in the News

    1. (3 articles) Yann LeCun
    2. (3 articles) Timo Honkela
    3. (2 articles) Mark Zuckerberg
    4. (2 articles) John McCarthy
    5. (2 articles) John Markoff
    6. (2 articles) Jean Senellart
    7. (2 articles) Sundar Pichai
    8. (1 articles) Eric Schmidt
    9. (1 articles) Michael Jordan
    10. (1 articles) Rajnish Kumar
    11. (1 articles) Jeff Dean
    12. (1 articles) Jörg Tiedemann