    1. Marissa Mayer of Google Speaks about the Future of Search

      On the Official Google Blog, Marissa Mayer has written about her role at Google and the future of search. She explains that search will become more mobile, that voice and natural language search will probably receive more emphasis, and that other advances will follow. Mayer also discusses the impact of universal search and personalization. Finally, Google will likely ...
    2. An interlingua-based Chinese-English MT system

      Abstract: Chinese-English machine translation is a significant and challenging problem in information processing. The paper presents an interlingua-based Chinese-English natural language translation system (ICENT). It introduces the mechanism used for Chinese language analysis, which comprises syntactic parsing and semantic analysis, and describes the design of the interlingua in detail. Experimental results and a system evaluation are given. The results are satisfactory. Content Type: Journal Article. Category: Correspondence. DOI: 10.1007/BF02943286. Authors: Xuan Qi (National University of Defense Technology, School of Computer Science, 410073 Changsha, P.R. China); Huiping Zhou (National University of Defense Technology, School of Computer Science, 410073 Changsha, P.R. China); Huowang Chen, National ...
    3. Disambiguation of partial cognates

      Abstract: Partial cognates are pairs of words in two languages that have the same meaning in some, but not all, contexts. Detecting the actual meaning of a partial cognate in context can be useful for Machine Translation tools and for Computer-Assisted Language Learning tools. We propose a supervised and a semi-supervised method to disambiguate partial cognates between two languages: French and English. The methods use only automatically labeled data; therefore they can be applied to other pairs of languages as well. The aim of our work is to automatically detect the meaning of a French partial cognate word in a specific ...
    4. AlfaBetic Translates Your Blog For A Worldwide Audience, Free of Charge

      AlfaBetic is a translation service that offers publishers a free way to have their content translated into ten of the web's most popular languages, which it says will expose them to 842 million potential new readers. The service first runs the text through a computer translation step, similar to the web-based translation systems offered by Google Translate and a number of other sites. After generating this ...
    5. MSD Recombination for Statistical Machine Translation into Highly-Inflected Languages

      Freely available tools and language resources were used to build the VoiceTRAN statistical machine translation (SMT) system. Various configuration variations of the system are presented and evaluated. The VoiceTRAN SMT system outperformed the baseline conventional rule-based MT system in both English-Slovenian in-domain test setups. To further increase the generalization capability of the translation model for lower-coverage out-of-domain test sentences, an “MSD-recombination” approach was proposed. This approach not only allows a better exploitation of conventional translation models, but also performs well in the more demanding translation direction; that is, into a highly inflectional language. Using this approach in the out-of-domain setup ...
    6. Cross Sentence Alignment for Structurally Dissimilar Corpus Based on Singular Value Decomposition

      Extracting alignment pairs is a critical step in constructing a bilingual corpus knowledge base for Example-Based Machine Translation systems. Different methods have been proposed for aligning parallel corpora between two languages. However, most of them focus on structurally similar languages such as English and French. This paper presents a method for cross-aligning a Portuguese-Chinese bilingual comparable corpus. The proposed approach is based on Singular Value Decomposition techniques and similarity measurement, which addresses the problem of aligning structurally dissimilar corpora and improves the accuracy of the alignment result. Content Type: Book Chapter. DOI: 10.1007/978-3-540-85984-0_104. Authors: Anna Ho, University of Macau Faculty of ...
    7. Semantic language and multi-language MT approach based on SL

      Abstract: Any natural language is regarded as a representation of semantic language. Translation between two languages (I, J) is regarded as a transformation between two representations. A natural language I is translated into another natural language J in only two steps. One is semantic analysis of language I based on the “semantic-element-representation-base of language-I”; the other is deployment into the representation of language J based on the “semantic-element-representation-base of language-J”. For translating among N natural languages, only N translation systems need to be developed, rather than N(N-1)/2, or 2N systems. Content Type: Journal Article. DOI: 10.1007/BF02945475. Authors: QingShi Gao, University of Science and Technology 100083 Beijing ...
    8. ParaMT: A Paraphraser for Machine Translation

      In this paper we present ParaMT, a bilingual/multilingual paraphraser to be applied in machine translation. We select paraphrases of support verb constructions and use the NooJ linguistic environment to formalize and generate translation equivalences through the use of dictionaries and local grammars with syntactic and semantic content. Our research shows that linguistic paraphrasal knowledge is a key element in converting source-language text into controlled-language text that yields more successful translation results. Content Type: Book Chapter. DOI: 10.1007/978-3-540-85980-2_21. Authors: Anabela Barreiro, New York University; Faculdade de Letras da Universidade do Porto & CLUP-Linguateca. Book Series: Lecture Notes in Computer ...
    9. Statistical Machine Translation of Broadcast News from Spanish to Portuguese

      In this paper we describe the work carried out to develop an automatic system for translating broadcast news from Spanish to Portuguese. Two challenging topics in speech and language processing were involved: Automatic Speech Recognition (ASR) of the Spanish news and Statistical Machine Translation (SMT) of the results into Portuguese. ASR of broadcast news is based on the AUDIMUS.MEDIA system, a hybrid ANN/HMM system with multiple stream decoding. A 22.08% Word Error Rate (WER) was achieved in a Spanish Broadcast News task, which is comparable to other international state-of-the-art systems. Parallel ...
    10. Adaptive Translation: Finding Interlingual Mappings Using Self-Organizing Maps

      This paper presents a method for creating interlingual word-to-word or phrase-to-phrase mappings between any two languages using the self-organizing map algorithm. The method can be used as a component in a statistical machine translation system. The conceptual space created by the self-organizing map serves as a kind of interlingual representation. The specific problems of machine translation are discussed in some detail. The proposed method helps alleviate two problems. The main problem addressed is that different languages divide the conceptual space differently. The approach can also help in dealing with lexical ambiguity. Content Type: Book Chapter. DOI: 10 ...
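
For readers unfamiliar with the Word Error Rate figure quoted in item 9, the metric is conventionally defined as the word-level edit distance (substitutions, deletions, and insertions) between a reference transcript and the recognizer output, divided by the number of words in the reference. The following is a minimal sketch of that standard definition, not code from any of the listed papers; the function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Hypothetical helper: tokenization is a plain whitespace split,
    which is an assumption and not taken from the listed papers.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into nothing
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)
```

Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference, so it is a ratio rather than a bounded percentage.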
