AUTHOR=Gupta Sukrat , Kanchinadam Teja , Conathan Devin , Fung Glenn TITLE=Task-Optimized Word Embeddings for Text Classification Representations JOURNAL=Frontiers in Applied Mathematics and Statistics VOLUME=5 YEAR=2020 URL=https://www.frontiersin.org/journals/applied-mathematics-and-statistics/articles/10.3389/fams.2019.00067 DOI=10.3389/fams.2019.00067 ISSN=2297-4687 ABSTRACT=

Word embeddings have introduced a compact and efficient way of representing text for further downstream natural language processing (NLP) tasks. Most word embedding algorithms are optimized at the word level. However, many NLP applications require text representations of groups of words, like sentences or paragraphs. In this paper, we propose a supervised algorithm that produces a task-optimized weighted average of word embeddings for a given task. Our proposed text embedding algorithm combines the compactness and expressiveness of the word-embedding representations with the word-level insights of a BoW-type model, where weights correspond to actual words. Numerical experiments across different domains show the competence of our algorithm.