1. Articles in category: Machine Translation

    7993-8016 of 8094
    1. Automated natural language processing

      An automated natural language translation system takes source natural language text (preferably in Japanese) and translates it into a target natural language (preferably English). The system also allows an operator to re-translate automatically selected portions of the source text. The system includes an improvement directed to transforming kanas in the source text into alphabetic letters of the target language which allows the presence of a word or phrase boundary to be recognized in the middle of a kana. The system also includes an improvement involving performing concurrently on the source text both a morphological analysis and a syntactic analysis.
    2. Linguistic disambiguation system and method using string-based pattern training to learn to resolve ambiguity sites

      A linguistic disambiguation system and method creates a knowledge base by training on patterns in strings that contain ambiguity sites. The string patterns are described by a set of reduced regular expressions (RREs) or very reduced regular expressions (VRREs). The knowledge base utilizes the RREs or VRREs to resolve ambiguity based upon the strings in which the ambiguity occurs. The system is trained on a training set, such as a properly labeled corpus. Once trained, the system may then apply the knowledge base to raw input strings that contain ambiguity sites. The system uses the RRE- and VRRE-based knowledge base ...
    3. Parameterized word segmentation of unsegmented text

      The present invention segments a non-segmented input text. The input text is received and segmented based on parameter values associated with parameterized word formation rules. In one illustrative embodiment, the input text is processed into a form which includes parameter indications, but which preserves the word-internal structure of the input text. Thus, the parameter values can be changed without entirely re-processing the input text.
    4. Integrated authoring and translation system

      The present invention is a system of integrated, computer-based processes for monolingual information development and multilingual translation. An interactive text editor enforces lexical and grammatical constraints on a natural language subset used by the authors to create their text, which they help disambiguate to ensure translatability. The resulting translatable source language text undergoes machine translation into any one of a set of target languages, without the translated text requiring any postediting.
    5. Method and system for detecting frequent association patterns

      A text-mining system and method automatically extracts useful information from a large set of tree-structured data by generating successive sets of candidate tree-structured association patterns for comparison with the tree-structured data. The number of times is counted that each of the candidate association patterns matches with a tree in the set of tree-structured data in order to determine which of the candidate association patterns frequently matches with a tree in the data set. Each successive set of candidate association patterns is generated from the frequent association patterns determined from the previous set of candidate association patterns.
      Mentions: IBM Corp.
    6. Simulating human intelligence in computers using natural language dialog

      A method and apparatus for simulating human intelligence and natural language dialog capability is disclosed. The present invention contains a cognitive model of human intelligence (20), a mathematical model of information abstraction, synthetic dialog interaction (202), a method of language-independent computer learning through training (201), interaction and document reading (203) and a method of efficient computer implementation (200) of all preceding parts. The cognitive model (20) is the theoretical basis of the entire invention, describes the way humans learn and interact in general terms, provides a mathematical basis for natural language (40) learning and interaction and establishes a basis for ...
    7. Information processing apparatus and method, and computer readable memory therefor

      The structure of entered document image data is analyzed and a character string in a text block that has been analyzed is subjected to pattern recognition. Synonyms and equivalents of words obtained as results of language analysis are extracted and words obtained as results of language analysis are converted to words of another language. A character string in a text block that has been analyzed is translated to another language. At least results of analyzing the structure of document image data, results of character recognition and results of language analysis are stored, and at least one of the results of ...
      Mentions: Microsoft
    8. Translating apparatus, dictionary search apparatus, and translating method

      If a context process range extending unit cannot obtain context information required by a context processing unit, from a range to be translated, it extends the context process range. Then, the context processing unit performs a context process, and passes extracted context information to a translation processing unit in order to perform translation, based on the extended context process range.
    9. Compression method, method for compressing entry word index data for a dictionary, and machine translation system

      An n-gram statistical analysis is employed to acquire frequently appearing character strings of n characters or more, and individual character strings having n characters or more are replaced by character translation codes of 1 byte each. The correlation between the original character strings having n characters and the character translation codes is registered in a character translation code table. Assume that a character string of three characters, i.e., a character string of three bytes, "sta," is registered as 1-byte code "e5" and that a character string of four characters, i.e., a character string of four bytes, "tion," is ...
    10. Apparatus and method for information retrieval, and storage medium storing program therefor

      A user inputs a retrieval query represented by a set of propositions using a modal operator through an interface. The retrieval query is passed to a document set gathering unit through a retrieval input unit. The document set gathering unit refers to an index, gathers a set of documents having a true proposition, and writes it to a work area. A similarity computation unit computes the similarity of the gathered set of documents and writes it to the work area. The retrieval result output unit refers to the work area, ranks the gathered sets of documents in consideration of a ...
    11. Multilingual electronic transfer dictionary containing topical codes and method of use

      A multilingual electronic transfer dictionary provides for automatic topic disambiguation by including one or more topic codes in definitions contained in the dictionary. Automatic topic disambiguation is accomplished by determining the frequencies of topic codes within a block of text. Dictionary entries having more frequently occurring topic codes are preferentially selected over those having less frequently occurring topic codes. When the topic codes are members of a hierarchical topical coding system, such as the International Patent Classification system, an iterative method can be used which starts with a coarser level of the coding system and is repeated at finer levels until ...
    12. Automated translation of annotated text based on the determination of locations for inserting annotation tokens and linked ending, end-of-sentence or language tokens

      A system and method for translating an annotated source document in a first natural language to a target document in a second natural language having corresponding annotations, includes computer storage, a computer receiving module for receiving input textual information in a first language and for storing the input textual information in the computer storage, the input textual information including annotations and a translation engine for creating a first token string including first language tokens, annotation tokens that apply to the first language tokens, and ending tokens. Prior to translation, the annotation tokens are removed from the first token string and ...
    13. Systems and methods for determinizing and minimizing a finite state transducer for pattern recognition

      A pattern recognition system and method for optimal reduction of redundancy and size of a weighted and labeled graph involves receiving speech signals, converting the speech signals into word sequences, interpreting the word sequences in a graph, where the graph is labeled with word sequences and weighted with probabilities, and determinizing the graph by removing redundant word sequences. The size of the graph can also be minimized by collapsing some nodes of the graph in a reverse determinizing manner. The graph can further be tested for determinizability to determine if the graph can be determinized. The resulting word sequence in ...
    14. Analyzing inflectional morphology in a spoken language translation system

      At least one speech input is received and at least one token is generated from the speech input. Morphemes of the tokens are reduced to at least one feature. Furthermore, an inflection type of the token is identified. At least one dictionary is searched for entries comprising features that match the features reduced from the morphemes. At least one lexical feature structure is generated for the token by inserting at least one morphological feature associated with the inflection type into the entry feature. An output is provided comprising at least one lexical feature structure.
    15. Type-based selection of rules for semantically disambiguating words

      In semantically disambiguating words, where more than one disambiguation applies to the context in which a word occurs, a rule can be selected based on the type of information from which it was obtained. The rules can be derived from different types of information in a corpus such as a dictionary, and rules can be selected in accordance with a prioritization of the types of information.
    16. Using ranked translation choices to obtain sequences indicating meaning of multi-token expressions

      To provide information about the meaning of a multi-token expression in a first language, where the information is understandable in a second language, subexpressions are obtained, such as tokens, chunks, and sentences. The multi-token expression could, for example, be a sentence or an input text with more than one sentence. Translation choices are obtained in the second language for a set of the subexpressions. A subset of the translation choices of a subexpression are ranked, and the ranked translation choices are used to produce a sequence of translation choices for the multi-token expression as a whole. Information is then presented ...
    17. Method and apparatus for style control in natural language generation

      A method and an apparatus for style control in natural language recognition and generation are provided, wherein an acoustic input is received comprising at least one source language. The acoustic input comprises words, sentences, and phrases in a natural spoken language. Source expressions are recognized in the source language. Style parameters are determined for the source expression. The style parameters may be extracted from the source expression, set by the user, or randomly selected by the natural language system. A recognized source expression is selected and confirmed by a user through a user interface. The recognized source expressions are translated ...
    18. Method and apparatus for performing spoken language translation

      A method and an apparatus for performing spoken language translation are provided, wherein a speech input is received comprising at least one source language. The speech input comprises words, sentences, and phrases in a natural spoken language. Source expressions are recognized in the source language. Misrecognitions of the source expressions resulting from factors comprising noise and speaker variation are minimized by the generation of intermediate data structures that encode at least one recognition hypothesis. Furthermore, misrecognitions are minimized by the generation of candidate recognized source expressions by processing the intermediate data structures using models comprising a general language model and ...
    19. Method and system for managing a common dictionary and updating dictionary data selectively according to a type of local processing system

      A common dictionary management system that can reduce the work of data registration into dictionaries. Exchange data generated in any local dictionary of a natural language processing system is sent to the local dictionaries of other natural language processing systems automatically. A local dictionary management portion and a common dictionary management portion collect updated dictionary data from a plurality of local dictionaries and updating time information. The local dictionary management portion updates the common dictionary and the local dictionaries according to the collected dictionary data and the time information. The local dictionary management portion compares the information with the latest ...
    20. Related sentence retrieval system having a plurality of cross-lingual retrieving units that pairs similar sentences based on extracted independent words

      The present invention implements retrieval of mutually related sentences even between sentences in a wide variety of languages. A related sentence retrieval system is provided with n (n is 3 or a greater natural number) cross-lingual retrieval systems for bidirectionally retrieving related sentences between sentences written in language P and sentences written in other languages A to F, a language P sentence input unit is shared by the cross-lingual retrieval systems, and the language P sentence output unit is also shared by them to implement bidirectional retrieval of related sentences via language P between sentences in the other n languages ...
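The rule selection summarized in item 15 (type-based selection of rules for semantically disambiguating words) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the `Rule` fields, the three information types, and their priority order are hypothetical.

```python
from dataclasses import dataclass

# Assumed priority of information types (hypothetical names);
# an earlier position in the list means a higher priority.
TYPE_PRIORITY = ["collocation", "example", "subject_label"]


@dataclass(frozen=True)
class Rule:
    word: str          # word to be disambiguated
    context_word: str  # word that must appear in the context for the rule to apply
    sense: str         # sense the rule assigns
    info_type: str     # type of dictionary information the rule was derived from


def applicable_rules(rules, word, context):
    """Return every rule for `word` whose trigger occurs in `context`."""
    return [r for r in rules if r.word == word and r.context_word in context]


def select_rule(rules, word, context):
    """When several rules apply, keep the one with the highest-priority type."""
    candidates = applicable_rules(rules, word, context)
    if not candidates:
        return None
    return min(candidates, key=lambda r: TYPE_PRIORITY.index(r.info_type))


RULES = [
    Rule("bank", "river", "riverside", "example"),
    Rule("bank", "money", "financial institution", "collocation"),
]

chosen = select_rule(RULES, "bank", {"river", "money"})
print(chosen.sense if chosen else "no rule applies")
```

Both toy rules match the context here, so the choice falls to the type ranking alone, which is the situation the abstract describes.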
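The frequency-based topic disambiguation summarized in item 11 (multilingual electronic transfer dictionary containing topical codes) can be sketched roughly as follows. This is a toy illustration under stated assumptions, not the patented method: the dictionary contents, the English-to-French entries, the topic codes attached to them, and the scoring by summed code frequency are all hypothetical, and the hierarchical coarse-to-fine iteration is not modeled.

```python
from collections import Counter

# Hypothetical toy transfer dictionary:
# source word -> list of (translation, set of topic codes).
TOY_DICTIONARY = {
    "cell": [("cellule", {"C12"}), ("pile", {"H01"})],
    "voltage": [("tension", {"H01"})],
    "electrode": [("électrode", {"H01"})],
}


def topic_frequencies(tokens, dictionary):
    """Count how often each topic code occurs among the candidate
    dictionary entries of all tokens in a block of text."""
    counts = Counter()
    for token in tokens:
        for _translation, codes in dictionary.get(token, []):
            counts.update(codes)
    return counts


def choose_translation(word, dictionary, frequencies):
    """Prefer the entry whose topic codes are most frequent in the block.
    Ties go to the entry listed first."""
    entries = dictionary.get(word, [])
    if not entries:
        return None
    best = max(entries, key=lambda e: sum(frequencies[c] for c in e[1]))
    return best[0]


block = ["cell", "voltage", "electrode"]
freqs = topic_frequencies(block, TOY_DICTIONARY)
print(choose_translation("cell", TOY_DICTIONARY, freqs))
```

In this block the code shared by "voltage" and "electrode" dominates the counts, so the entry for "cell" carrying the same code is preferred over the other one.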