Discovering and Controlling for Latent Confounds in Text Classification Using Adversarial Domain Adaptation
Author(s) -
Virgile Landeiro,
Minh Tuan Tran,
Aron Culotta
Publication year - 2019
Publication title -
society for industrial and applied mathematics ebooks
Language(s) - English
Resource type - Book series
DOI - 10.1137/1.9781611975673.34
Subject(s) - domain adaptation , adversarial system , computer science , artificial intelligence , latent variable , labeled data , training set , machine learning , adaptation (eye) , domain (mathematical analysis) , deep neural networks , invariant (physics) , class (philosophy) , artificial neural network , natural language processing , psychology , classifier (uml) , mathematics , mathematical analysis , neuroscience , mathematical physics
In text classification, the testing data often systematically differ from the training data, a problem called dataset shift. In this paper, we investigate a type of dataset shift we call confounding shift. Such a setting exists when two conditions are met: (a) there is a confound variable Z that influences both text features X and class label Y ; (b) the relationship between Z and Y changes from training to testing. While recent work in this area has required confounds to be known ahead of time, this is unrealistic for many settings. To address this shortcoming, we propose a method both to discover and to control for potential confounds. The approach first uses neural network-based topic modeling to discover potential confounds that differ between training and testing data, then uses adversarial training to fit a classification model that is invariant to these discovered confounds. We find the resulting method to improve over state-of-the-art domain adaptation method, while also producing results that are competitive with those obtained when confounds are known ahead of time.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom