Method for identifying verifiable statements in text

A method, system, and computer-usable medium are disclosed for identifying verifiable statements in a corpus of text. A training corpus containing manually annotated instances of verifiable and non-verifiable statements is parsed into segmented statements, from which features are extracted. The extracted features and the annotated statements are then processed with a machine learning algorithm to generate a verifiable statement classification model. A verifiable statement classification system then references this model to distinguish between verifiable and non-verifiable statements contained within an input corpus of text.
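The flow described above (segment, extract features, train a model, classify new text) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the disclosure: the regex-based segmenter, the bag-of-words features with a `<HAS_NUMBER>` marker, the choice of multinomial Naive Bayes as the machine learning algorithm, the function names, and the six hand-written training statements.

```python
import math
import re
from collections import Counter, defaultdict


def segment(text):
    """Split raw text into statements at sentence-ending punctuation."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def extract_features(statement):
    """Lowercase word tokens, plus a marker if the statement contains a digit."""
    tokens = re.findall(r"[a-z']+", statement.lower())
    if re.search(r"\d", statement):
        tokens.append("<HAS_NUMBER>")  # hypothetical feature, chosen for illustration
    return tokens


def train(annotated):
    """Fit a multinomial Naive Bayes model from (statement, label) pairs."""
    label_counts = Counter()
    feature_counts = defaultdict(Counter)
    vocab = set()
    for statement, label in annotated:
        label_counts[label] += 1
        for feature in extract_features(statement):
            feature_counts[label][feature] += 1
            vocab.add(feature)
    return {"labels": label_counts, "features": feature_counts, "vocab": vocab}


def classify(model, statement):
    """Return the label with the highest smoothed log-probability."""
    total = sum(model["labels"].values())
    vocab_size = len(model["vocab"])
    best_label, best_score = None, -math.inf
    for label, n in model["labels"].items():
        counts = model["features"][label]
        denom = sum(counts.values()) + vocab_size  # Laplace (add-one) smoothing
        score = math.log(n / total)
        for feature in extract_features(statement):
            if feature in model["vocab"]:  # features unseen in training are skipped
                score += math.log((counts[feature] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def classify_text(model, text):
    """Segment an input text and label each statement."""
    return [(s, classify(model, s)) for s in segment(text)]


# Hypothetical, hand-annotated training corpus (illustrative only).
TRAINING = [
    ("The bridge opened in 1937.", "verifiable"),
    ("The company employs 500 people.", "verifiable"),
    ("Water boils at 100 degrees Celsius.", "verifiable"),
    ("I think the movie was wonderful.", "non-verifiable"),
    ("This is the best pizza ever.", "non-verifiable"),
    ("That painting feels sad to me.", "non-verifiable"),
]

model = train(TRAINING)
results = classify_text(
    model, "The tower opened in 1889. I think the pizza was wonderful."
)
for statement, label in results:
    print(f"{label}: {statement}")
```

A realistic system would presumably use a far larger annotated corpus and richer features; the sketch only shows how the annotated statements, the extracted features, and the resulting model relate to one another.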