Power of Deep Learning: Quantifying Language to Explain Cross-Sectional Returns

Sean Cao,Yongtae Kim,Angie Wang,Houping Xiao

Power of Deep Learning: Quantifying Language to Explain Cross-Sectional Returns

2020

Sean Cao
Yongtae Kim
Angie Wang
Houping Xiao

When quantifying qualitative information from unstructured textual data, traditional bag-of-words approaches capture only semantic features of single words/phrases. The context, the sequence of words, and the relations among words (i.e., higher-order interaction features) are ignored. We introduce deep neural networks (NNs) to encode and mimic human intelligence in processing natural language. Using the NN-based artificial intelligence, we construct a new sentiment measure that is specific to performance discussions and is adjusted for complex contextual negations. We find that this performance-specific sentiment explains cross-sectional returns and future operating performance better than umbrella sentiment proxies used in the literature.

Keywords:

Natural language
Human intelligence
deep neural networks
Sentiment analysis
ENCODE
Negation
Natural language processing
Artificial intelligence
Artificial neural network
Deep learning
Computer science

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations