1. Articles in category: Segmentation

    481-504 of 923 « 1 2 ... 18 19 20 21 22 23 24 ... 37 38 39 »
    1. Modification of annotated bilingual segment pairs in syntax-based machine translation

      Systems and methods for automatically modifying an annotated bilingual segment pair are provided. An annotated bilingual segment pair ("Pair") may be modified to generate improved translation rules used in machine translation of documents from a source language to a target language. Because a single Pair may be used to translate a phrase, many Pairs are used in a machine translation system and manual correction of each model is impractical. Each Pair may be modified by re-labeling syntactic categories within the Pair, re-structuring a tree within the Pair, and/or re-aligning source words to target words within the Pair. In exemplary ...

      Read Full Article
    2. Character-based automated text summarization

      Methods, devices, systems and tools are presented that allow the summarization of text, audio, and audiovisual presentations, such as movies, into less lengthy forms. High-content media files are shortened in a manner that preserves important details, by splitting the files into segments, rating the segments, and reassembling preferred segments into a final abridged piece. Summarization of media can be customized by user selection of criteria, and opens new possibilities for delivering entertainment, news, and information in the form of dense, information-rich content that can be viewed by means of broadcast or cable distribution, "on-demand" distribution, internet and cell phone digital ...

      Read Full Article
    3. Character-based automated shot summarization

      Methods, devices, systems and tools are presented that allow the summarization of text, audio, and audiovisual presentations, such as movies, into less lengthy forms. High-content media files are shortened in a manner that preserves important details, by splitting the files into segments, rating the segments, and reassembling preferred segments into a final abridged piece. Summarization of media can be customized by user selection of criteria, and opens new possibilities for delivering entertainment, news, and information in the form of dense, information-rich content that can be viewed by means of broadcast or cable distribution, "on-demand" distribution, internet and cell phone digital ...

      Read Full Article
    4. Semantic segmentation and tagging engine

      In accordance with the embodiments of the present invention, a method and engine for assigning semantic tags to segments within media. The invention receives media and extracts textual information related to the media's content. It processes the textual information and creates a list of topics related to the content. The invention segments the media and intelligently assigns topical tags to the segments. The semantically segmented media data is outputted for storage or analysis.

      Read Full Article
    5. System and method for automatic call segmentation at call center

      A system and method for automatic call segmentation including steps and means for automatically detecting boundaries between utterances in the call transcripts; automatically classifying utterances into target call sections; automatically partitioning the call transcript into call segments; and outputting a segmented call transcript. A training method and apparatus for training the system to perform automatic call segmentation includes steps and means for providing at least one training transcript with annotated call sections; normalizing the at least one training transcript; and performing statistical analysis on the at least one training transcript.

      Read Full Article
    6. Method and apparatus for organizing segments of media assets and determining relevance of segments to a query

      The invention pertains to methods, systems, and apparatus for identifying media items relevant to a selected subject matter, the method comprising determining the subject matter of a first media item, the first media item comprising at least one of audio content and video content, determining the classification within an ontology of the subject matter of the first media item, analyzing the ontology to identify other subject matter related to the subject matter of the first media item, and performing a search for other media items relevant to the subject matter of the first media item as a function of at ...

      Read Full Article
    7. Caption and/or metadata synchronization for replay of previously or simultaneously recorded live programs

      A synchronization process between captioning data and/or corresponding metatags and the associated media file parses the media file, correlates the caption information and/or metatags with segments of the media file, and provides a capability for textual search and selection of particular segments. A time-synchronized version of the captions is created that is synchronized to the moment that the speech is uttered in the recorded media. The caption data is leveraged to enable search engines to index not merely the title of a video, but the entirety of what was said during the video as well as any associated ...

      Read Full Article
    8. Automatic classification of consumers into micro-segments

      A campaign is received, at a micro-segmentation system, from an offer provider. The micro-segmentation system is a third-party system. The campaign indicates a set of target attributes and one or more offers corresponding to the set of target attributes. A set of user attributes pertaining to each of the plurality of users is received at the micro-segmentation system. The set of attributes is defined by an attribute knowledge structure. Permission is received at the micro-segmentation system from each of the plurality of users to receive an offer from the micro-segmentation system. Data associated with the micro-segment classification is provided from ...

      Read Full Article
    9. Preprocessing of text

      Performance of statistical machine learning techniques, particularly classification techniques applied to the extraction of attributes and values concerning products, is improved by preprocessing a body of text to be analyzed to remove extraneous information. The body of text is split into a plurality of segments. In an embodiment, sentence identification criteria are applied to identify sentences as the plurality of segments. Thereafter, the plurality of segments are clustered to provide a plurality of clusters. One or more of the resulting clusters are then analyzed to identify segments having low relevance to their respective clusters. Such low relevance segments are then ...

      Read Full Article
    10. Multi-pass speech recognition

      According to example configurations, a speech recognition system is configured to receive an utterance. Based on analyzing at least a portion of the utterance using a first speech recognition model on a first pass, the speech recognition system detects that the utterance includes a first group of one or more spoken words. The speech recognition system utilizes the first group of one or more spoken words identified in the utterance as detected on the first pass to locate a given segment of interest in the utterance. The given segment can include one or more that are unrecognizable by the first ...

      Read Full Article
    11. Extracting rich temporal context for business entities and events

      Methods and apparatus for performing computer-implemented extraction of temporal information for business entities and events are disclosed. In one embodiment, a sequence of text is obtained. A label is assigned to one or more of a plurality of segments of the text such that each of the one or more of the plurality of segments of the text is classified as temporal data in one of a plurality of classes of temporal data. One or more rules are applied to the one or more segments of the text that have been classified as temporal data to generate a structured representation ...

      Read Full Article
    12. System, method and computer program product for identifying products associated with polarized sentiments

      An overall average review rating for a product may be determined, based on user ratings that are associated with opinions of a product, within a dimension corresponding to a user trait. A segment variation score for each of a plurality of segments of the dimension may be determined. Each segment may correspond to one or more values of the user trait corresponding to the dimension. A total variation score may be determined for the dimension based on the segment variation scores determined for each of the plurality of segments of the dimension. The total variation score for the dimension may ...

      Read Full Article
    13. NLP-based entity recognition and disambiguation

      Methods and systems for entity recognition and disambiguation using natural language processing techniques are provided. Example embodiments provide an entity recognition and disambiguation system (ERDS) and process that, based upon input of a text segment, automatically determines which entities are being referred to by the text using both natural language processing techniques and analysis of information gleaned from contextual data in the surrounding text. In at least some embodiments, supplemental or related information that can be used to assist in the recognition and/or disambiguation process can be retrieved from knowledge repositories such as an ontology knowledge base. In one ...

      Read Full Article
    14. Letter model and character bigram based language model for handwriting recognition

      A handwriting recognition system is described that includes a language model with scoring to improve recognition accuracy, such as for words outside of a selected language model. The handwriting recognition system increases the accuracy of handwriting recognizers that perform segmentation of ink into atomic elements (segments) and then classify each ink segment separately. After segmentation, a shape classifier estimates the class (letter) probabilities for each segment of ink by producing a corresponding score. The system applies the language model scoring to the shape classification results and typically selects the class with the highest score as the recognition result. Because the ...

      Read Full Article
    15. Generating photogenic routes from starting to destination locations

      A method of computing at least one photogenic route from a starting location to a destination location, including; computing photogenic values for images in a large collection representing a geographic region that includes the starting location and the destination location; computing a photogenic index for each route segment based on computed photogenic values of images taken along the route segment; computing at least one photogenic route from the starting location to the destination location and presenting the route(s) to a user.

      Read Full Article
    16. Joint segmentation and named entity recognition using dual decomposition in Chinese discharge summaries.

      Joint segmentation and named entity recognition using dual decomposition in Chinese discharge summaries.

      J Am Med Inform Assoc. 2013 Aug 9;

      Authors: Xu Y, Wang Y, Liu T, Liu J, Fan Y, Qian Y, Tsujii J, Chang EI

      Abstract OBJECTIVE: In this paper, we focus on three aspects: (1) to annotate a set of standard corpus in Chinese discharge summaries; (2) to perform word segmentation and named entity recognition in the above corpus; (3) to build a joint model that performs word segmentation and named entity recognition. DESIGN: Two independent systems of word segmentation and named entity recognition were built ...

      Read Full Article
      Mentions: Tsujii J Liu J Wang Y
    17. Method and system of selecting word sequence for text written in language without word boundary markers

      The present disclosure discloses a method and apparatus of selecting a word sequence for a text written in a language without word boundary in order to solve the problem of having excessively large computation load when selecting an optimal word sequence in existing technologies. The disclosed method includes: segmenting a segment of the text to obtain different word sequences; determining a common word boundary for the word sequences; and performing optimal word sequence selection for portions of the word sequences prior to the common word boundary. Because optimal word sequence selection is performed for portions of word sequences prior to ...

      Read Full Article
    18. Semi-supervised training for statistical word alignment

      A system and method for aligning words in parallel segments is provided. A first probability distribution of word alignments within a first corpus comprising unaligned word-level parallel segments according to a model estimate is calculated. The model estimate is modified according to the first probability distribution. One or more sub-models associated with the modified model estimate are discriminatively re-ranked according to word-level annotated parallel segments. A second probability distribution of the word alignments within the first corpus is calculated according to the re-ranked sub-models associated with the modified model estimate.
      Read Full Article
    19. Automatic segmentation of video

      Content items may be segmented and labeled by topic to provide for the capture, analysis, indexing, retrieval and/or distribution of information within information rich media, such as audio or video, with greater functionality, accuracy and speed. The segments and other related information may be stored in a database and made accessible to users through, for example, a search service and/or an on-demand service. Automatic segmentation may include receiving a text representation, calculating relevance intervals based on the text representation, determining a nodal representation based on the relevance intervals, and determining segments of the content item based on the ...
      Read Full Article
    481-504 of 923 « 1 2 ... 18 19 20 21 22 23 24 ... 37 38 39 »
  1. Categories

    1. Default:

      Discourse, Entailment, Machine Translation, NER, Parsing, Segmentation, Semantic, Sentiment, Summarization, WSD
  2. Popular Articles

  3. Organizations in the News

    1. (23 articles) NLP
    2. (21 articles) Microsoft
    3. (17 articles) Cagr
    4. (14 articles) USD
    5. (14 articles) SMEs
    6. (14 articles) Apac
    7. (14 articles) IBM
    8. (13 articles) Service
    9. (12 articles) Market Data Tables
    10. (12 articles) Region
    11. (12 articles) Intel
    12. (12 articles) Google
  4. Locations in the News

    1. (30 articles) India
    2. (21 articles) Germany
    3. (19 articles) Pune
    4. (18 articles) Japan
    5. (13 articles) China
    6. (13 articles) France
    7. (10 articles) Mexico
    8. (9 articles) Canada
    9. (8 articles) Spain
    10. (7 articles) Netherlands
    11. (7 articles) Africa
    12. (7 articles) Brazil