Adversarial training methods for semi-supervised text classification
Source: OpenAI
May 25, 2016
Abstract
Adversarial training provides a means of regularizing supervised learning algorithms, while virtual adversarial training extends supervised learning algorithms to the semi-supervised setting. However, both methods require making small perturbations to numerous entries of the input vector, which is inappropriate for sparse high-dimensional inputs such as one-hot word representations. We extend adversarial and virtual adversarial training to the text domain by applying perturbations to the word embeddings in a recurrent neural network rather than to the original input itself. The proposed method achieves state-of-the-art results on multiple benchmark semi-supervised and purely supervised tasks. We provide visualizations and analysis showing that the learned word embeddings have improved in quality and that, during training, the model is less prone to overfitting. Code is available at this https URL.
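The central idea, perturbing dense word embeddings instead of sparse one-hot inputs, can be illustrated with a small sketch. This is not the authors' code. The abstract does not give the perturbation formula, so the sketch assumes an L2-normalized gradient step of size `epsilon`, and it substitutes a mean-of-embeddings logistic classifier for the recurrent network to stay short. All function names, weights, and numbers are hypothetical.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def loss_and_grad(embeddings, weights, label):
    """Logistic loss of a mean-of-embeddings classifier (a stand-in for
    the recurrent network) and its gradient w.r.t. each word embedding."""
    n = len(embeddings)
    dim = len(weights)
    mean = [sum(e[d] for e in embeddings) / n for d in range(dim)]
    p = sigmoid(sum(w * m for w, m in zip(weights, mean)))
    loss = -math.log(p) if label == 1 else -math.log(1.0 - p)
    # d loss / d mean = (p - label) * weights; each word contributes 1/n.
    grad = [[(p - label) * w / n for w in weights] for _ in embeddings]
    return loss, grad

def adversarial_perturbation(grad, epsilon):
    """Assumed form: scale the loss gradient to L2 norm epsilon, taken
    over the whole sequence of embeddings."""
    norm = math.sqrt(sum(g * g for row in grad for g in row))
    if norm == 0.0:
        return [[0.0 for _ in row] for row in grad]
    return [[epsilon * g / norm for g in row] for row in grad]

def perturb(embeddings, perturbation):
    """Add the perturbation to the embeddings, not to the one-hot input."""
    return [[e + r for e, r in zip(e_row, r_row)]
            for e_row, r_row in zip(embeddings, perturbation)]

# Hypothetical two-word sentence with 3-dimensional embeddings.
embeddings = [[0.2, -0.1, 0.4], [0.5, 0.3, -0.2]]
weights = [1.0, -0.5, 0.8]
label = 1
epsilon = 0.1

clean_loss, grad = loss_and_grad(embeddings, weights, label)
r_adv = adversarial_perturbation(grad, epsilon)
adv_loss, _ = loss_and_grad(perturb(embeddings, r_adv), weights, label)
```

In adversarial training, a loss such as `adv_loss` would be added to the clean loss during optimization, so the model is encouraged to stay robust to small shifts in embedding space. The virtual adversarial variant, which does not need labels, is not shown here.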
Authors
Takeru Miyato, Andrew M. Dai, Ian Goodfellow