Also published in SpringerLink Home:
Personalised Multimedia Summaries
x hide permanently

A Novel System for Unlabeled Discourse Parsing in the RST Framework

SpringerLink Home
This paper presents UDRST, an unlabeled discourse parsing system in the RST framework. UDRST consists of a segmentation model and a parsing model. The segmentation model exploits subtree features to rerank N-best outputs of a base segmenter, which uses syntactic and lexical features in a CRF framework. In the parsing model, we present two algorithms for building a discourse tree from a segmented text: an incremental algorithm and a dual decomposition algorithm. Our system achieves 77.3% in the unlabeled score on the standard test set of the RST Discourse Treebank corpus, which improves 5.0% compared to HILDA [6 ...
Mentions: Japan JapanAkira Shimazu School of Information Science