We introduce a new text-mining methodology that extracts sentiment information from news articles to predict asset returns. Unlike more common sentiment scores used for stock return prediction (e.g., those sold by commercial vendors or built with dictionary-based methods), our supervised learning framework constructs a sentiment score that is specifically adapted to the problem of return prediction. Our method proceeds in three steps: 1) isolating a list of sentiment terms via predictive screening, 2) assigning sentiment weights to these words via topic modeling, and 3) aggregating terms into an article-level sentiment score via penalized likelihood. We derive theoretical guarantees on the accuracy of estimates from our model under minimal assumptions. In our empirical analysis, we text-mine the Dow Jones Newswires, one of the most actively monitored streams of news articles in the financial system, and show that our supervised sentiment model excels at extracting return-predictive signals in this context.
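
The three steps named above can be illustrated with a minimal sketch. It is not the paper's implementation. Everything concrete in it is an assumption made for illustration: the function names (`screen_terms`, `fit_topics`, `score_article`), the tuning parameters (`alpha`, `kappa`, `lam`), the screening rule based on how often a word co-occurs with a positive return, the rank-based regression used to fit a two-topic model, the `log(p * (1 - p))` penalty, and the toy data.

```python
import numpy as np

def screen_terms(counts, returns, alpha=0.1, kappa=2):
    """Step 1 (assumed rule): keep words whose articles coincide with
    positive returns much more or much less often than half the time."""
    up = returns > 0                       # sign of each article's return
    present = counts > 0                   # does word j appear in article i?
    k = present.sum(axis=0)                # number of articles containing each word
    f = (present & up[:, None]).sum(axis=0) / np.maximum(k, 1)
    keep = (np.abs(f - 0.5) >= alpha) & (k >= kappa)
    return np.flatnonzero(keep)

def fit_topics(counts, returns):
    """Step 2 (assumed form): regress word frequencies on a rank-based
    sentiment proxy to get positive and negative topic weights."""
    n = len(returns)
    p_hat = (np.argsort(np.argsort(returns)) + 1) / n   # return rank scaled to (0, 1]
    totals = counts.sum(axis=1)
    ok = totals > 0                        # skip articles with no screened words
    freqs = counts[ok] / totals[ok, None]
    design = np.column_stack([p_hat[ok], 1 - p_hat[ok]])
    topics, *_ = np.linalg.lstsq(design, freqs, rcond=None)
    topics = np.clip(topics, 1e-8, None)   # keep weights strictly positive
    topics /= topics.sum(axis=1, keepdims=True)
    return topics[0], topics[1]            # positive topic, negative topic

def score_article(word_counts, o_pos, o_neg, lam=1.0):
    """Step 3 (assumed penalty): grid-search the sentiment p that maximizes
    the mixture log-likelihood plus lam * log(p * (1 - p))."""
    total = word_counts.sum()
    if total == 0:
        return 0.5                         # no sentiment words: neutral score
    grid = np.linspace(0.01, 0.99, 99)
    mix = grid[:, None] * o_pos + (1 - grid[:, None]) * o_neg
    loglik = (word_counts * np.log(mix)).sum(axis=1) / total
    penalty = lam * np.log(grid * (1 - grid))   # shrinks the score toward 0.5
    return float(grid[np.argmax(loglik + penalty)])

# Toy data: columns are the hypothetical words "gain", "loss", "the".
counts = np.array([[3, 0, 2], [2, 0, 1], [4, 1, 2],
                   [0, 3, 2], [0, 2, 1], [1, 4, 2]])
returns = np.array([0.02, 0.01, 0.03, -0.02, -0.01, -0.03])

kept = screen_terms(counts, returns)               # indices of screened words
o_pos, o_neg = fit_topics(counts[:, kept], returns)
new_article = np.array([5, 0, 3])                  # mostly "gain"
print(kept, score_article(new_article[kept], o_pos, o_neg))
```

In this sketch the penalty pulls scores toward the neutral value 0.5, so an article containing few sentiment words is not assigned an extreme score; a larger `lam` strengthens that shrinkage.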