1. Articles in category: Machine Translation

    7369-7392 of 7543 « 1 2 ... 305 306 307 308 309 310 311 ... 313 314 315 »
    1. Tokenizer for a natural language processing system

      The present invention is a segmenter used in a natural language processing system. The segmenter segments a textual input string into tokens for further natural language processing. In accordance with one feature of the invention, the segmenter includes a tokeinzer engine that proposes segmentations and submits them to a linguistic knowledge component for validation. In accordance with another feature of the invention, the segmentation system includes language specific data that contains a precedence hierarchy for punctuation. If proposed tokens in the input string contain punctuation, they can illustratively be broken into subtokens based on the precedence hierarchy.
      Read Full Article
    2. Training machine learning by sequential conditional generalized iterative scaling

      A system and method facilitating training machine learning systems utilizing sequential conditional generalized iterative scaling is provided. The invention includes an expected value update component that modifies an expected value based, at least in part, upon a feature function of an input vector and an output value, a sum of lambda variable and a normalization variable. The invention further includes an error calculator that calculates an error based, at least in part, upon the expected value and an observed value. The invention also includes a parameter update component that modifies a trainable parameter based, at least in part, upon the ...
      Read Full Article
      Mentions: Scgis lamda
    3. Linguistic disambiguation system and method using string-based pattern training learn to resolve ambiguity sites

      A linguistic disambiguation system and method creates a knowledge base by training on patterns in strings that contain ambiguity sites. The string patterns are described by a set of reduced regular expressions (RREs) or very reduced regular expressions (VRREs). The knowledge base utilizes the RREs or VRREs to resolve ambiguity based upon the strings in which the ambiguity occurs. The system is trained on a training set, such as a properly labeled corpus. Once trained, the system may then apply the knowledge base to raw input strings that contain ambiguity sites. The system uses the RRE- and VRRE-based knowledge base ...
      Read Full Article
    4. Computer-aided reading system and method with cross-language reading wizard

      A computer-aided reading system offers assistance to a user who is reading in a non-native language, as the user needs help, without requiring the user to divert attention away from the text. In one implementation, the reading system is implemented as a reading wizard for a browser program. The reading wizard is exposed via a graphical user interface (UI) that allows the user to select a word, phrase, sentence, or other grouping of words in the non-native text. The reading wizard automatically determines whether the selected one word comprises part of a phrase; allows the user to choose whether to ...
      Read Full Article
    5. What's the Use of a Crummy Translation?

      I'm currently visiting Microsoft Research Asia (in Beijing) for two weeks (thanks for having me, guys!). I speak basically no Chinese. I took one half of a semester about 6 years ago. I know much more Japanese; enough so that I can read signs that indicate direction, dates and times, but that's about it... the remainder is too divergent for me to make out at all (perhaps a native Japanese speaker would feel differently, but certainly not a like me). My experience here has reminded me of a paper
      Read Full Article
    6. New ImTranslator.com Portal Released

      Irvine, Calif. May, 2007 Smart Link Corporation announced the launch of the ImTranslator.com portal that is packed with new features. Completely redesigned and re-thought. No more annoying popup windows or ads. You can now enjoy all our applications from one convenient location. Improved dictionaries now contain all the suggested words/translations that we have received from [...]
      Read Full Article
      Mentions: Firefox Google Yahoo
    7. ACL and Emnlp 2007 Report

      ACL/EMNLP just concluded. Overall, I thought both conferences were a success, though by now I am quite ready to return home. Prague was very nice. I especially enjoyed Tom Mitchell's invited talk on linking fMRI experiments to language. They actually use lexical semantic information to be able to identify what words people are thinking about when they scan their brains. Scary mind-reading stuff going on here. I think this is a very interesting avenue of research---probably not one I'll follow my
      Read Full Article
    8. Promt Translation Server on AOL Reference portal

      Irvine, Calif. August, 2006 - Smart Link Corporation and PROMT Ltd. - the leading providers of innovative multilingual technologies - announced that AOL has chosen ImTranslator powered by PROMT® Translation Server for AOL Research and Learn’s online reference area. Through the partnership, ImTranslator will provide AOL with advanced ImTranslator tools, such as multilingual Spell-checker, Virtual [...]
      Read Full Article
    9. Speech recognition with feedback from natural language processing for adaptation of acoustic model

      A speech processing system including a speech recognition unit to receive input speech, and a natural language processing unit. The speech recognition unit performs speech recognition on input speech using acoustic models to produce a speech recognition result. The natural-language processing unit performs natural language processing on the speech recognition result, and includes: a machine translation unit; a dialog management unit; and a feedback unit. The feedback unit configured to feed back information obtained as a result of the natural language processing performed in the machine translation unit and the dialog management unit. The speech recognition unit processes feedback information ...
      Read Full Article
    10. Computer-aided reading system and method with cross-language reading wizard

      A computer-aided reading system offers assistance to a user who is reading in a non-native language, as the user needs help, without requiring the user to divert attention away from the text. In one implementation, the reading system is implemented as a reading wizard for a browser program. The reading wizard is exposed via a graphical user interface (UI) that allows the user to select a word, phrase, sentence, or other grouping of words in the non-native text, and view multiple translations of the selected text in the user's own native language. The multiple translations are presented in a ...
      Read Full Article
    11. Computer-aided reading system and method with cross-language reading wizard

      A computer-aided reading system offers assistance to a user who is reading in a non-native language, as the user needs help, without requiring the user to divert attention away from the text. In one implementation, the reading system is implemented as a reading wizard for a browser program. The reading wizard is exposed via a graphical user interface (UI) that allows the user to select a word, phrase, sentence, or other grouping of words in the non-native text. The reading wizard automatically determines whether the selected one word comprises part of a phrase; presents one or more translations of at ...
      Read Full Article
    12. Normalizing and classifying locale-specific information

      A method and apparatus are disclosed for transforming information from one semantic environment to another, for example, to facilitate electronic information searches. In one implementation, a SOLx system 1700 includes a Normalization/Translation NorTran Workbench 1702 and a SOLx server 1708. The NorTran Workbench 1702 is used to develop a knowledge base based on information from a source system 1712, to normalize legacy content 1710 according to various rules, and to develop a database 1706 of translated content. During run time, the SOLx server 1708 receives transmissions from the source system 1712, normalizes the transmitted content, accesses the database 1706 ...
      Read Full Article
    13. System and method for machine learning a confidence metric for machine translation

      A machine translation system is trained to generate confidence scores indicative of a quality of a translation result. A source string is translated with a machine translator to generate a target string. Features indicative of translation operations performed are extracted from the machine translator. A trusted entity-assigned translation score is obtained and is indicative of a trusted entity-assigned translation quality of the translated string. A relationship between a subset of the extracted features and the trusted entity-assigned translation score is identified.
      Read Full Article
    14. Scaleable machine translation

      A method translates a textual input in a first language to a textual output in a second language. An input logical form is generated based on the textual input. When a plurality of transfer mappings in a transfer mapping database match the input logical form (or at least a portion thereof) one or more of those plurality of matching transfer mappings is selected based on a predetermined metric. Textual output is generated based on the selected transfer logical form.
      Read Full Article
    15. Projecting dependencies to generate target language dependency structure

      In one embodiment of the present invention, a decoder receives a dependency tree as a source language input and accesses a set of statistical models that produce outputs combined in a log linear framework. The decoder also accesses a table of treelet translation pairs and returns a target dependency tree based on the source dependency tree, based on access to the table of treelet translation pairs, and based on the application of the statistical models.
      Read Full Article
    16. Method for extracting name entities and jargon terms using a suffix tree data structure

      A method for entity name and jargon term recognition and extraction. An embodiment of the present invention uses a suffix tree data structure to determine frequently occurring phrases. In one embodiment text to be analyzed is preprocessed. The text is then separated into clauses and a suffix tree is created for the text. The suffix tree is used to determine repetitious segments. Unrecognized text fragment, occurring with a high frequency, have a comparably high probability of being a name entity or jargon term. The set of repetitious segments is then filtered to obtain a set of possible entity names and ...
      Read Full Article
    17. Apparatus, method, and computer-readable medium for language translation

      An interface unit issues input/output instructions regarding an input of a translation target sentence, an output of a translated sentence, and a translation control. A machine translating apparatus translates a document in a certain language into a document in another language. A translation memory device translates a sentence by searching an original/translation database in which sentences in a certain language and original/translation sentences in another language corresponding thereto have been accumulated. A data compatible processing unit makes original/translation information translated by the machine translating apparatus and original/translation information translated by the translation memory device common ...
      Read Full Article
      Mentions: Tokyo Yokohama
    18. Statistical method and apparatus for learning translation relationships among words

      A parallel bilingual training corpus is parsed into its content words. Word association scores for each pair of content words consisting of a word of language L1 that occurs in a sentence aligned in the bilingual corpus to a sentence of language L2 in which the other word occurs. A pair of words is considered "linked" in a pair of aligned sentences if one of the words is the most highly associated, of all the words in its sentence, with the other word. The occurrence of compounds is hypothesized in the training data by identifying maximal, connected sets of linked ...
      Read Full Article
    19. Method and system for theme-based word sense ambiguity reduction

      Word sense ambiguity, for "thematic" words in a sentence, is achieved based on thematic prediction. The senses of "thematic" words are disambiguated in a sentence by determining and weighting possible themes for that sentence. Possible themes are determined for that sentence based on thematic information associated with the different senses of each word in the sentence. A highly deterministic thematic-based word sense disambiguation method is used to preprocess the sentence prior to further syntactic and semantic analysis, thereby enhancing accuracy and decreasing the demand for computational resources (memory and CPU) by reducing input ambiguities.
      Read Full Article
      Mentions: United Nations
    20. Method for named-entity recognition and verification

      A method for named-entity (NE) recognition and verification is provided. The method can extract at least one to-be-tested segments from an article according to a text window, and use a predefined grammar to parse the at least one to-be-tested segments to remove ill-formed ones. Then, a statistical verification model is used to calculate the confidence measurement of each to-be-tested segment to determine where the to-be-tested segment has a named-entity or not. If the confidence measurement is less than a predefined threshold, the to-be-tested segment will be rejected. Otherwise, it will be accepted.
      Read Full Article
      Mentions: Taipei
    21. Method for generating natural language in computer-based dialog systems

      The invention relates to a method for generating natural language in computer based dialog system. The invention generating an input tree from an input file of a simple semantic representation language (SSRL) statements; inserting elementary syntactic information into the tree; applying elementary operations such as sub-tree insertion and manipulation; accessing lexical and language specific information; generating syntactic structures oriented towards normalized interface structures from the structured deep syntactic input tree; realizing and expanding structure types by inserting and/or transforming of pre-defined structures, inserting determiners and ordering sub-structures; generating morphologic transformations; carrying out final refinements, comprising phonetic refinement, pretty print ...
      Read Full Article
    22. Speech recognition with feedback from natural language processing for adaptation of acoustic model

      A speech processing system including a speech recognition unit to receive input speech, and a natural language processor. The speech recognition unit performs speech recognition on input speech using acoustic models to produce a speech recognition result. The natural-language processor performs natural language processing on speech recognition result, and includes: a speech zone detector configured to detect correct zones from the speech recognition result; a feedback unit to feed back information obtained as a result of the natural language processing performed on the speech recognition result to said speech recognition unit. The feedback information includes the detected correct zones. The ...
      Read Full Article
    23. Discovery of inference rules from text

      The present invention provides a facility for discovering a set of inference rules, such as "X is author of Y.apprxeq.X wrote Y", "X solved Y.apprxeq.X found a solution to Y", and "X caused Y.apprxeq.Y is triggered by X", by analyzing a corpus of natural language text. The corpus is parsed to identify grammatical relationships between words and to build dependency trees formed of the relationships between the words. Paths linking words in the dependency trees are identified. If two paths tend to link the same sets of words, their meanings are taken to be ...
      Read Full Article
      Mentions: England Canada Boston
    24. System, method and computer program product for performing unstructured information management and automatic text analysis, including a search operator functioning as a Weighted And (wand)

      Disclosed is a system architecture, components and a searching technique for an Unstructured Information Management System (UIMS). The UIMS may be provided as middleware for the effective management and interchange of unstructured information over a wide array of information sources. The architecture generally includes a search engine, data storage, analysis engines containing pipelined document annotators and various adapters. The searching technique makes use of a two-level searching technique. A search query includes a search operator containing of a plurality of search sub-expressions each having an associated weight value. The search engine returns a document or documents having a weight value ...
      Read Full Article
    7369-7392 of 7543 « 1 2 ... 305 306 307 308 309 310 311 ... 313 314 315 »
  1. Categories

    1. Default:

      Discourse, Entailment, Machine Translation, NER, Parsing, Segmentation, Semantic, Sentiment, Summarization, WSD
  2. Popular Articles

  3. Organizations in the News

    1. (73 articles) Google
    2. (50 articles) Google Translate
    3. (35 articles) Punjabi
    4. (35 articles) Malayalam
    5. (35 articles) Kannada
    6. (32 articles) Telugu
    7. (26 articles) Google Search
    8. (18 articles) Oxford University Press
    9. (16 articles) Google India
    10. (15 articles) Google Maps
    11. (12 articles) Microsoft
    12. (10 articles) PR Newswire
  4. Locations in the News

    1. (45 articles) India
    2. (36 articles) Indian
    3. (25 articles) China
    4. (22 articles) Indonesia
    5. (16 articles) Brazil
    6. (15 articles) Thailand
    7. (13 articles) Urdu
    8. (12 articles) Japan
    9. (12 articles) New Delhi
    10. (12 articles) Africa
    11. (11 articles) Beijing
    12. (10 articles) Hong Kong
  5. People in the News

    1. (2 articles) Geoffrey Hinton
    2. (2 articles) Sharon Goldwater
    3. (2 articles) Tom Gruber
    4. (2 articles) Tommi Jaakkola
    5. (2 articles) Tony O'Dowd
    6. (2 articles) Regina Barzilay
    7. (1 articles) Yoshua Bengio
    8. (1 articles) Cade Metz
    9. (1 articles) Ilya Sutskever
    10. (1 articles) Richard Socher
    11. (1 articles) Sundar Pichai
    12. (1 articles) Slav Petrov