1. Articles in category: Machine Translation

    7849-7872 of 7939 « 1 2 ... 325 326 327 328 329 330 331 »
    1. Translating apparatus, dictionary search apparatus, and translating method

      If a context process range extending unit cannot obtain context information required by a context processing unit, from a range to be translated, it extends the context process range. Then, the context processing unit performs a context process, and passes extracted context information to a translation processing unit in order to perform translation, based on the extended context process range.
      Read Full Article
    2. Compression method, method for compressing entry word index data for a dictionary, and machine translation system

      A n-gram statistical analysis is employed to acquire frequently appearing character strings of n characters or more, and individual character strings having n characters or more are replaced by character translation codes of 1 byte each. The correlation between the original character strings having n characters and the character translation codes is registered in a character translation code table. Assume that a character string of three characters, i.e., a character string of three bytes, "sta," is registered as 1-byte code "e5" and that a character string of four characters, i.e., a character string of four bytes, "tion," is ...
      Read Full Article
    3. Apparatus and method for information retrieval, and storage medium storing program therefor

      A user inputs a retrieval query represented by a set of propositions using a modal operator through an interface. The retrieval query is passed to a document set gathering unit through a retrieval input unit. The document set gathering unit refers to an index, gathers a set of documents having a true proposition, and writes it to a work area. A similarity computation unit computes the similarity of the gathered set of documents and writes it to the work area. The retrieval result output unit refers to the work area, ranks the gathered sets of documents in consideration of a ...
      Read Full Article
      Mentions: anAND
    4. Multilingual electronic transfer dictionary containing topical codes and method of use

      A multilingual electronic transfer dictionary provides for automatic topic disambiguation by including one or more topic codes in definitions contained the dictionary. Automatic topic disambiguation is accomplished by determining the frequencies of topic codes within a block of text. Dictionary entries having more frequently occurring topic codes are preferentially selected over those having less frequently occurring topic codes. When the topic codes are members of a hierarchical topical coding system, such as the International Patent Classification system, an iterative method can be used with starts with a coarser level of the coding system and is repeated at finer levels until ...
      Read Full Article
    5. Automated translation of annotated text based on the determination of locations for inserting annotation tokens and linked ending, end-of-sentence or language tokens

      A system and method for translating an annotated source document in a first natural language to a target document in a second natural language having corresponding annotations, includes computer storage, a computer receiving module for receiving input textual information in a first language and for storing the input textual information in the computer storage, the input textual information including annotations and a translation engine for creating a first token string including first language tokens, annotation tokens that apply to the first language tokens, and ending tokens. Prior to translation, the annotation tokens are removed from the first token string and ...
      Read Full Article
    6. Systems and methods for determinizing and minimizing a finite state transducer for pattern recognition

      A pattern recognition system and method for optimal reduction of redundancy and size of a weighted and labeled graph presents receiving speech signals, converting the speech signals into word sequence, interpreting the word sequences in a graph where the graph is labeled with word sequences and weighted with probabilities and determinizing the graph by removing redundant word sequences. The size of the graph can also be minimized by collapsing some nodes of the graph in a reverse determinizing manner. The graph can further be tested for determinizability to determine if the graph can be determinized. The resulting word sequence in ...
      Read Full Article
      Mentions: Detroit sub
    7. Analyzing inflectional morphology in a spoken language translation system

      At least one speech input is received and at least one token is generated from speech input. Morphemes of the tokens are reduced to at least one feature. Furthermore, an inflection type of the token is identified. At least one dictionary is searched for entries comprising features that match the features reduced from the morphemes. At least one lexical feature structure is generated for the token by inserting at least one morphological feature associated with the inflection type into the entry feature. An output is provided comprising at least one lexical feature structure.
      Read Full Article
      Mentions: Japan Inventiona
    8. Type-based selection of rules for semantically disambiguating words

      In semantically disambiguating words, where more than one disambiguation applies to the context in which a word occurs, a rule can be selected based on the type of information from which it was obtained. The rules can be derived from different types of information in a corpus such as a dictionary, and rules can be selected in accordance with a prioritization of the types of information.
      Read Full Article
    9. Using ranked translation choices to obtain sequences indicating meaning of multi-token expressions

      To provide information about the meaning of a multi-token expression in a first language, where the information is understandable in a second language, subexpressions are obtained, such as tokens, chunks, and sentences. The multi-token expression could, for example, be a sentence or an input text with more than one sentence. Translation choices are obtained in the second language for a set of the subexpressions. A subset of the translation choices of a subexpression are ranked, and the ranked translation choices are used to produce a sequence of translation choices for the multi-token expression as a whole. Information is then presented ...
      Read Full Article
    10. Method and apparatus for style control in natural language generation

      A method and an apparatus for style control in natural language recognition and generation are provided, wherein an acoustic input is received comprising at least one source language. The acoustic input comprises words, sentences, and phrases in a natural spoken language. Source expressions are recognized in the source language. Style parameters are determined for the source expression. The style parameters may be extracted from the source expression, set by the user, or randomly selected by the natural language system. A recognized source expression is selected and confirmed by a user through a user interface. The recognized source expressions are translated ...
      Read Full Article
      Mentions: Osaka Inventiona
    11. Method and apparatus for performing spoken language translation

      A method and an apparatus for performing spoken language translation are provided, wherein a speech input is received comprising at least one source language. The speech input comprises words, sentences, and phrases in a natural spoken language. Source expressions are recognized in the source language. Misrecognitions of the source expressions resulting from factors comprising noise and speaker variation are minimized by the generation of intermediate data structures that encode at least one recognition hypothesis. Furthermore, misrecognitions are minimized by the generation of candidate recognized source expressions by processing the intermediate data structures using models comprising a general language model and ...
      Read Full Article
      Mentions: Japan Inventiona
    12. Method and system for managing a common dictionary and updating dictionary data selectively according to a type of local processing system

      A common dictionary management system that can reduce the work of data registration into dictionaries. Exchange data generated in any local dictionary of a natural language processing system is sent to the local dictionaries of other natural language processing systems automatically. A local dictionary management portion and a common dictionary management portion collect updated dictionary data from a plurality of local dictionaries and updating time information. The local dictionary management portion updates the common dictionary and the local dictionaries according to the collected dictionary data and the time information. The local dictionary management portion compares the information with the latest ...
      Read Full Article
    13. Related sentence retrieval system having a plurality of cross-lingual retrieving units that pairs similar sentences based on extracted independent words

      The present invention implements retrieval of mutually related sentences even between sentences in a wide variety of languages. A related sentence retrieval system is provided with n (n is 3 or a greater natural number) cross-lingual retrieval systems for bidirectionally retrieving related sentences between sentences written in language P and sentences written in other languages A to F, a language P sentence input unit is shared by the cross-lingual retrieval systems, and the language P sentence output unit is also shared by them to implement bidirectional retrieval of related sentences via language P between sentences in the other n languages ...
      Read Full Article
    14. Cross-lingual retrieval system and method that utilizes stored pair data in a vector space model to process queries

      The cross-lingual retrieval system of the present invention retrieves second language sentences which are appropriate translations of a query written in a first language using pair data without being greatly influenced by the difference in expression between the query and sentences to be the object of retrieval, length of the query or sentences to be the object of retrieval or the like. In the system, a pair data storing unit stores multiple pairs each having a first language sentence and a second language sentence having the same meaning. As a query receiving unit receives a query written in the first ...
      Read Full Article
    15. Grouping words with equivalent substrings by automatic clustering based on suffix relationships

      A set of words of a natural language is grouped by automatically obtaining suffix relation data that indicate a relation value for each of a set of relationships between suffixes that occur in the natural language, and, then, by automatically clustering the words in the set using the relation values from the suffix relation data, to obtain group data indicating groups of words. Two or more words in a group have suffixes as in one of the relationships and, preceding the suffixes, equivalent substrings. The relationships can be pairwise relationships, and the relation value can indicate the number of occurrences ...
      Read Full Article
    16. Automatic construction of conditional exponential models from elementary features

      An apparatus for translating a series of source words in a first language to a series of target words in a second language. For an input series of source words, at least two target hypotheses, each comprising a series of target words, are generated. Each target word has a context comprising at least one other word in the target hypothesis. For each target hypothesis, a language model match score comprises an estimate of the probability of occurrence of the series of words in the target hypothesis. At least one alignment connecting each source word with at least one target word ...
      Read Full Article
    17. System and method for estimating accuracy of an automatic natural language translation

      A computer system and method for natural language translation uses a translation process to translate a source natural language segment (e.g. English) of one or more source words/elements into a target natural language (e.g. German) segment of one or more target words/elements. An evaluation module determines a confidence measure of the natural language translation. Typically, the confidence measure indicates less confidence as the complexity of the translation increases. Various novel features for determining complexity and confidence measure at different steps in the translation are used. The translation process can be terminated if the confidence measure fails ...
      Read Full Article
    18. Method and apparatus for interactive source language expression recognition and alternative hypothesis presentation and selection

      In a language translation system, a method for providing a guide for a user. In one embodiment, the method comprises the steps of receiving an input that is representative of at least one word in a source language, and generating at least one recognition hypothesis in the source language in response to the input. The method further comprise the steps of selecting a best hypothesis from the at least one recognition hypothesis in the source language, presenting the best hypothesis in the source language to a user, and presenting alternatives to a portion of the best hypothesis in the source ...
      Read Full Article
      Mentions: Japan Inventiona
    19. Method and apparatus for adaptive speech recognition hypothesis construction and selection in a spoken language translation system

      A method and apparatus for adaptive speech recognition hypothesis are provided, wherein a number of ordered recognition hypotheses are generated and presented in response to a received speech input comprising natural spoken language. Generation of the recognition hypotheses comprises assigning basic probabilities to at least one basic component of the speech input using language models and calculating an overall probability of each of the recognition hypotheses using the assigned basic probabilities. The best hypothesis is selected by a user from the recognition hypotheses. Hypothesis generation is adapted in response to the selected best hypotheses, wherein the selected hypothesis is analyzed ...
      Read Full Article
      Mentions: Japan Inventiona
    20. Automated system for generating natural language translations that are domain-specific, grammar rule-based, and/or based on part-of-speech analysis

      Systems and methods for automated natural language translation take source natural language and translate it into a target natural language. The language translation is facilitated through the use of domain specific dictionaries, controlling grammar rules, and probability values associated with some dictionary entries. Domain specific dictionaries provide different translations of words depending on the context in which the words are used. Controlling grammar rules are used to supersede all other grammar rules to eliminate inaccurate and undesirable translations. Probability values are used to determine the likelihood a particular multi-word combination is a particular part of speech.
      Read Full Article
    21. Method of and system for disambiguating syntactic word multiples

      A method and system are provided for disambiguating multiples of syntactically related words automatically using the notion of semantic similarity between words. Based on syntactically related words derived from a sample text, a set is formed containing each associating word and the words associated in the syntactic relationship with it. The associating words are expanded to all word senses. Pair wise intersections of the resulting sets are formed so as to form pairs of semantically compatible word clusters which may be stored as pairs of cooccurrence restriction codes.
      Read Full Article
    22. Systems and methods for determinization and minimization a finite state transducer for speech recognition

      A pattern recognition system and method for optimal reduction of redundancy and size of a weighted and labeled graph presents receiving speech signals, converting the speech signals into word sequences, interpreting the word sequences in a graph where the graph is labeled with word sequences and weighted with probabilities and determinizing the graph by removing redundant word sequences. The size of the graph can also be minimized by collapsing some nodes of the graph in a reverse determinizing manner. The graph can further be tested for determinizability to determine if the graph can be determinized. The resulting word sequence in ...
      Read Full Article
      Mentions: Detroit
    23. Method and apparatus for providing syntactic analysis and data structure for translation knowledge in example-based language translation

      Syntactic analysis is performed on an input and on entries of a bilingual example database using at least one parse tree. The parse trees, formed using a context-free grammar, comprise a number of nodes and each node comprises at least one production rule. Furthermore, at least one node comprises nested production rules. The nested production rules comprise production rules for different combinations of the linguistic constituents of the input. The syntactic analysis comprises recognizing linguistic constituents, ordering the linguistic constituents, representing the linguistic constituents using an adapted feature structure analysis representation, and manipulating the adapted feature structure analysis representation using ...
      Read Full Article
      Mentions: Japan Inventiona
    24. Method and apparatus for parsing in a spoken language translation system

      A method and apparatus for parsing in a spoken language translation system are provided, wherein an input is received comprising at least one input sentence or expression. A parsing table is accessed and consulted for a next action, wherein the parser looks up in the next action in the parsing table. During parsing operations, the parser may perform shift actions and reduce actions. In performing a shift action, a next item of the input string is shifted onto a stack or intermediate data structure of the parser. A new parse node is generated, and a feature structure or lexical feature ...
      Read Full Article
      Mentions: Japan Inventiona
    7849-7872 of 7939 « 1 2 ... 325 326 327 328 329 330 331 »
  1. Categories

    1. Default:

      Discourse, Entailment, Machine Translation, NER, Parsing, Segmentation, Semantic, Sentiment, Summarization, WSD
  2. Popular Articles

  3. Organizations in the News

    1. (47 articles) Association for Computational Linguistics
    2. (43 articles) Google
    3. (37 articles) Stanford
    4. (35 articles) NLP
    5. (30 articles) Microsoft
    6. (22 articles) IBM
    7. (16 articles) Icml
    8. (16 articles) Collins
    9. (15 articles) Naacl
    10. (14 articles) Google Translate
    11. (14 articles) POS
    12. (12 articles) Bleu
  4. Locations in the News

    1. (25 articles) China
    2. (24 articles) Darpa
    3. (17 articles) USA
    4. (13 articles) New York
    5. (13 articles) Beijing
    6. (12 articles) Australia
    7. (10 articles) Amazon
    8. (10 articles) India
    9. (9 articles) Japan
    10. (9 articles) Russian
    11. (9 articles) Springer
    12. (9 articles) Brisbane
  5. People in the News

    1. (29 articles) Christopher Manning
    2. (19 articles) C. D. Manning
    3. (17 articles) Daniel Jurafsky
    4. (16 articles) D. Jurafsky
    5. (14 articles) D. Klein
    6. (9 articles) Spence Green
    7. (8 articles) Andrew Y. Ng
    8. (8 articles) D. Cer
    9. (8 articles) Dan Klein
    10. (8 articles) Richard Socher
    11. (7 articles) P. Koehn
    12. (7 articles) Michael Collins