1. Articles in category: Segmentation

    1. System and method for call center dialog management

      A system and method for call center dialog management are disclosed. The method comprises: presenting a contact with a first call center dialog segment having a current call center dialog property; receiving from the contact a contact dialog segment; identifying a dialog property keyword within the contact dialog segment; replacing the current call center dialog property with a new call center dialog property in response to the dialog property keyword; and presenting a second call center dialog segment having the new call center dialog property to the contact. The system of the present invention discloses means for implementing the method.
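
      The keyword-driven property replacement described in this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the keyword table, the property names (`speech_rate`, `volume`, `language`), and the function `update_dialog_property` are all hypothetical.

```python
# Hypothetical keyword table: maps a dialog property keyword to the new
# dialog property it triggers. Names and values are illustrative only.
KEYWORD_TO_PROPERTY = {
    "slower": {"speech_rate": "slow"},
    "louder": {"volume": "high"},
    "spanish": {"language": "es"},
}


def update_dialog_property(current: dict, contact_segment: str) -> dict:
    """Return the dialog properties to use for the next call center segment."""
    updated = dict(current)  # leave the caller's dictionary untouched
    for word in contact_segment.lower().split():
        token = word.strip(".,!?")  # drop simple trailing punctuation
        if token in KEYWORD_TO_PROPERTY:
            # Replace the current property with the one tied to the keyword.
            updated.update(KEYWORD_TO_PROPERTY[token])
    return updated


current = {"speech_rate": "normal", "volume": "normal", "language": "en"}
new = update_dialog_property(current, "Could you speak slower, please?")
```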
    2. Improving Text Segmentation with Non-systematic Semantic Relation

      Text segmentation is a fundamental problem in natural language processing, with applications in information retrieval, question answering, and text summarization. Almost all previous work on unsupervised text segmentation is based on the assumption of lexical cohesion, which is indicated by relations between words in two units of text. However, these approaches take into account only reiteration, a category of lexical cohesion that covers word repetition, synonyms, and superordinates. In this research, we investigate the non-systematic semantic relation, which is classified as collocation in lexical cohesion. This relation holds between two words or phrases in a discourse when ...
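
      As a point of comparison, the reiteration-only baseline this abstract criticizes can be sketched with plain word repetition: score the cohesion between adjacent sentences by the cosine similarity of their word counts, and place a boundary where the score drops. This is a simplified illustration, not the authors' method; the function names and the `threshold` value are assumptions, and a realistic system would also remove stop words.

```python
import math
from collections import Counter


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def cohesion_scores(sentences):
    """Word-repetition cohesion between each pair of adjacent sentences."""
    bags = [Counter(s.lower().split()) for s in sentences]
    return [cosine(bags[i], bags[i + 1]) for i in range(len(bags) - 1)]


def boundaries(sentences, threshold=0.1):
    """Indices of sentences that start a new segment (threshold is assumed)."""
    scores = cohesion_scores(sentences)
    return [i + 1 for i, score in enumerate(scores) if score < threshold]


sentences = [
    "the cat sat on the mat",
    "the cat chased the mouse",
    "stock prices fell sharply today",
    "investors sold stock after prices fell",
]
segment_starts = boundaries(sentences)
```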
    3. Text retrieval from early printed books

      Retrieving text from early printed books is particularly difficult because the words in these documents are set very close to one another and, as in medieval manuscripts, ligatures and abbreviations are used heavily. To address these problems, we propose a word indexing and retrieval technique that does not require word segmentation and is tolerant of errors in character segmentation. Two main principles characterize the approach. First, characters are identified in the pages and clustered with a self-organizing map (SOM). During retrieval, the similarity of characters is estimated considering the proximity of cluster centroids in the SOM ...
    4. Disfluency detection for a speech-to-speech translation system using phrase-level machine translation with weighted finite state transducers

      A computer-implemented method for creating a disfluency translation lattice includes providing a plurality of weighted finite state transducers, including a translation model, a language model, and a phrase segmentation model, as input; performing a cascaded composition of the weighted finite state transducers to create a disfluency translation lattice; and storing the disfluency translation lattice to a computer-readable medium.
    5. Skeleton Simplification by Key Points Identification

      Current thinning-based skeletonisation algorithms extract the morphological features of an object in an image, but the skeletonised objects are coarsely represented. This paper proposes an algorithm which goes beyond that approach by changing the coarse line segments into perfect “straight” line segments, obtaining points, angles, line segment sizes and proportions. Our technique is applied in the post-processing phase of the skeleton and improves it no matter which skeletonisation technique is used, as long as the structure is made of one-pixel-wide continuous line segments. This proposal is a first step towards human activity recognition through the analysis ...
    6. A Local Generative Model for Chinese Word Segmentation

      This paper presents a local generative model for Chinese word segmentation, which has a faster learning process than discriminative models and supports unsupervised learning. It is able to make use of larger resources. In this model, four successive characters are used to determine whether a character interval should be a word boundary or not. The Gibbs sampling algorithm, together with three additional rules, is applied for unsupervised learning. Besides words, the word candidates that are generated by our model can improve the performance of Chinese information retrieval. The experiments show that in supervised learning our method outperforms ...
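
      The idea of deciding each character interval from four successive characters can be illustrated with a simple count-based classifier trained on segmented text. This is a rough sketch under stated assumptions, not the paper's generative model or its Gibbs sampler: the window layout (two characters on each side of the interval), the `#` padding symbol, and the functions `windows`, `train`, and `segment` are all hypothetical.

```python
from collections import defaultdict

PAD = "#"  # hypothetical padding symbol for positions outside the sentence


def windows(chars):
    """Yield (interval index, 4-character window) for each character interval.

    Interval i lies between chars[i] and chars[i + 1]; the window is assumed
    to hold the two characters on each side of that interval.
    """
    padded = [PAD] + list(chars) + [PAD]
    for i in range(len(chars) - 1):
        yield i, "".join(padded[i:i + 4])


def train(segmented_sentences):
    """Count, per window, how often the interval was or was not a boundary."""
    counts = defaultdict(lambda: [0, 0])  # window -> [non-boundary, boundary]
    for words in segmented_sentences:
        chars = "".join(words)
        ends, pos = set(), 0
        for word in words[:-1]:
            pos += len(word)
            ends.add(pos - 1)  # interval right after the word's last character
        for i, win in windows(chars):
            counts[win][int(i in ends)] += 1
    return counts


def segment(chars, counts):
    """Split chars at intervals whose window was mostly a boundary in training."""
    words, start = [], 0
    for i, win in windows(chars):
        non_boundary, boundary = counts.get(win, (0, 0))  # unseen: no boundary
        if boundary > non_boundary:
            words.append(chars[start:i + 1])
            start = i + 1
    words.append(chars[start:])
    return words


model = train([["我", "喜欢", "北京"], ["我", "爱", "北京"]])
result = segment("我喜欢北京", model)
```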
    7. Sanskrit Compound Processor

      Sanskrit is very rich in compound formation. Typically, a compound does not encode the relation between its components explicitly. To understand the meaning of a compound, it is necessary to identify its components, discover the relations between them, and finally generate a paraphrase of the compound. In this paper, we discuss the automatic segmentation and type identification of a compound using simple statistics that result from manually annotated data.
      Content Type: Book Chapter
      DOI: 10.1007/978-3-642-17528-2_5
      Authors: Anil Kumar, Department of Sanskrit Studies, University of Hyderabad, India; Vipul Mittal, Language Technologies Research Centre, IIIT, Hyderabad, India; Amba Kulkarni, Department of Sanskrit Studies ...
    8. A word spotting framework for historical machine-printed documents

      In this paper, we propose a word spotting framework for accessing the content of historical machine-printed documents without the use of an optical character recognition engine. A preprocessing step is performed to improve the quality of the document images, while word segmentation is accomplished using two complementary segmentation methodologies. In the proposed methodology, synthetic word images are created from keywords, and these images are compared to all the words in the digitized documents. A user feedback process is used to refine the search procedure. The methodology has been evaluated in early Modern ...
    9. Transmembrane helix prediction using amino acid property features and latent semantic analysis

      Background: Prediction of transmembrane (TM) helices by statistical methods suffers from a lack of sufficient training data. Current best methods use hundreds or even thousands of free parameters in their models, which are tuned to fit the little data available for training. Further, they are often restricted to the generally accepted topology "cytoplasmic-transmembrane-extracellular" and cannot adapt to membrane proteins that do not conform to this topology. Recent crystal structures of channel proteins have revealed novel architectures, showing that the above topology may not be as universal as previously believed. Thus, there is a need for methods that can better predict ...
    10. Text Segmentation by Clustering Cohesion

      Automatic linear text segmentation, which aims to detect the best topic boundaries, is a difficult and very useful task in many text processing systems. Some methods have addressed this problem with reasonable results, but they also have drawbacks. In this work, we propose a new method, called ClustSeg, based on a predefined window and a clustering algorithm to decide the topic cohesion. We compare our proposal against the best-known methods and obtain better performance than these algorithms.
      Content Type: Book Chapter
      DOI: 10.1007/978-3-642-16687-7_37
      Authors: Raúl Abella Pérez, Advanced Technologies Application Centre (CENATAV), 7a #21812 ...