
Addressing Limited Data for Textual Entailment Across Domains. (arXiv:1606.02638v1 [cs.CL])

We seek to address the lack of labeled data, and the high cost of annotation, for textual entailment in some domains. To that end, we first create, for experimental purposes, an entailment dataset for the clinical domain and a highly competitive supervised entailment system, ENT, that is effective out of the box on two domains. We then explore self-training and active learning strategies to address the lack of labeled data.
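
As a rough illustration of the self-training idea mentioned in the abstract, the sketch below grows a small labeled set by adding unlabeled premise/hypothesis pairs that a classifier labels confidently, then refits. It is not the ENT system and does not reproduce the paper's method: the word-overlap feature, the midpoint threshold classifier, the `margin` parameter, the function names, and the example sentences are all assumptions made up for this illustration.

```python
def overlap(premise, hypothesis):
    """Fraction of hypothesis tokens that also appear in the premise."""
    premise_tokens = set(premise.lower().split())
    hypothesis_tokens = hypothesis.lower().split()
    if not hypothesis_tokens:
        return 0.0
    return sum(t in premise_tokens for t in hypothesis_tokens) / len(hypothesis_tokens)


def fit_threshold(labeled):
    """Toy classifier: midpoint between the mean overlap of each class.

    Assumes `labeled` contains at least one entailed and one non-entailed pair.
    """
    pos = [overlap(p, h) for p, h, y in labeled if y]
    neg = [overlap(p, h) for p, h, y in labeled if not y]
    return (sum(pos) / len(pos) + sum(neg) / len(neg)) / 2


def self_train(labeled, unlabeled, margin=0.2, max_rounds=5):
    """Repeatedly pseudo-label confident pairs and refit the threshold."""
    labeled, pool = list(labeled), list(unlabeled)
    for _ in range(max_rounds):
        threshold = fit_threshold(labeled)
        confident, rest = [], []
        for premise, hypothesis in pool:
            score = overlap(premise, hypothesis)
            # Confidence here is simply the distance from the decision threshold.
            if abs(score - threshold) >= margin:
                confident.append((premise, hypothesis, score > threshold))
            else:
                rest.append((premise, hypothesis))
        if not confident:
            break  # nothing confident enough to add; stop early
        labeled.extend(confident)
        pool = rest
    return fit_threshold(labeled), labeled, pool


# Hypothetical seed data: (premise, hypothesis, entailed?)
seed = [
    ("the patient smokes daily", "the patient smokes", True),
    ("the patient denies chest pain", "the patient has fever", False),
]
# Hypothetical unlabeled pairs.
unlabeled_pairs = [
    ("he quit smoking last year", "he quit smoking"),
    ("no history of diabetes", "patient has asthma"),
    ("blood pressure is stable", "blood pressure is high"),
]

threshold, grown, remaining = self_train(seed, unlabeled_pairs)
```

In this toy run the two pairs far from the threshold are pseudo-labeled and the borderline pair stays in the pool. An active learning variant would instead send such low-confidence pairs to a human annotator.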