arxiv:1801.06146

Universal Language Model Fine-tuning for Text Classification

Published on Jan 18, 2018

Authors:

Jeremy Howard, Sebastian Ruder

Abstract

AI-generated summary: ULMFiT is a transfer learning method for NLP that outperforms previous approaches on text classification, substantially reducing error rates, and remains effective with very little labeled data.

Inductive transfer learning has greatly impacted computer vision, but existing approaches in NLP still require task-specific modifications and training from scratch. We propose Universal Language Model Fine-tuning (ULMFiT), an effective transfer learning method that can be applied to any task in NLP, and introduce techniques that are key for fine-tuning a language model. Our method significantly outperforms the state-of-the-art on six text classification tasks, reducing the error by 18-24% on the majority of datasets. Furthermore, with only 100 labeled examples, it matches the performance of training from scratch on 100x more data. We open-source our pretrained models and code.
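The two headline figures in the abstract are simple arithmetic, and a short sketch may make them concrete. It assumes the 18-24% figure is a relative error reduction, which the abstract does not state explicitly. The function name and the error rates below are hypothetical, chosen only to illustrate the calculation; they are not results from the paper.

```python
def relative_error_reduction(baseline_error: float, new_error: float) -> float:
    """Return the fraction by which new_error improves on baseline_error."""
    if baseline_error <= 0:
        raise ValueError("baseline_error must be positive")
    return (baseline_error - new_error) / baseline_error


# Hypothetical error rates in percent (illustrative only, not from the paper).
baseline = 5.0
improved = 4.0
reduction = relative_error_reduction(baseline, improved)
print(f"relative error reduction: {reduction:.0%}")

# "100x more data": 100 labeled examples with transfer learning are compared
# against training from scratch on 100 * 100 labeled examples.
labeled_examples = 100
equivalent_from_scratch = labeled_examples * 100
print(f"equivalent from-scratch examples: {equivalent_from_scratch}")
```

Under this reading, a drop from a 5.0% to a 4.0% error rate is a 20% reduction, which would fall inside the reported 18-24% range.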

