Also categorized in Summarization:
Automatic analysis of Medical Dialogue in the Home Hemodialysis Domain: Structure Induction and Summarization

Systems and methods for determining the topic structure of a portion of text

PatFT
Systems and methods for determining the topic structure of a document including text utilize a Probabilistic Latent Semantic Analysis (PLSA) model and select segmentation points based on similarity values between pairs of adjacent text blocks. PLSA forms a framework for both text segmentation and topic identification. The use of PLSA provides an improved representation for the sparse information in a text block, such as a sentence or a sequence of sentences. Topic characterization of each text segment is derived from PLSA parameters that relate words to "topics", latent variables in the PLSA model, and "topics" to text segments. A system ...
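The segmentation step described in the abstract can be illustrated with a minimal sketch. It assumes each text block already has a topic distribution P(topic | block) estimated by a PLSA model, which is not implemented here. The cosine similarity measure, the `threshold` parameter, and the "local minimum below threshold" rule are illustrative assumptions; the abstract states only that segmentation points are selected from similarity values between pairs of adjacent text blocks. The function names are hypothetical.

```python
import math


def cosine(p, q):
    """Cosine similarity between two topic-weight vectors."""
    dot = sum(a * b for a, b in zip(p, q))
    norm = math.sqrt(sum(a * a for a in p)) * math.sqrt(sum(b * b for b in q))
    return dot / norm if norm else 0.0


def adjacent_similarities(block_topics):
    """Similarity between each pair of adjacent text blocks."""
    return [
        cosine(block_topics[i], block_topics[i + 1])
        for i in range(len(block_topics) - 1)
    ]


def segmentation_points(block_topics, threshold=0.5):
    """Return indices of blocks that start a new segment.

    Assumed rule (not taken from the source): a boundary is placed where
    the adjacent-block similarity is a local minimum and falls below
    `threshold`.
    """
    sims = adjacent_similarities(block_topics)
    points = []
    for i, s in enumerate(sims):
        left = sims[i - 1] if i > 0 else float("inf")
        right = sims[i + 1] if i + 1 < len(sims) else float("inf")
        if s < threshold and s <= left and s <= right:
            points.append(i + 1)  # the boundary falls just before block i + 1
    return points


# Hypothetical P(topic | block) values for four blocks and two topics.
blocks = [
    [0.9, 0.1],
    [0.8, 0.2],
    [0.1, 0.9],
    [0.2, 0.8],
]
print(segmentation_points(blocks))  # [2]
```

In this toy input the first two blocks are dominated by one topic and the last two by the other, so the only similarity dip is between the second and third blocks, and the sketch reports a single boundary before block index 2.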
Mentions: Royal Statistical Society, Description of Related Art, In Sigir