Do AI Models Actually Understand Financial Sentiment?
A simple comparison of how different AI models interpret financial sentiment and whether they truly understand what they’re reading.
Project Overview
Financial markets move fast, and a lot of investor sentiment now lives online on platforms like Reddit and Twitter.
But here’s the real question:
Do AI models actually understand financial language, or are they just picking up patterns?
In this project, we tested how different models interpret financial text, and more importantly, how they react when the meaning of a sentence changes.
We compared:
- VADER (a simple rule-based model)
- FinBERT (an AI model trained on financial text)
- ChatGPT and Gemini (general-purpose large language models)
What We Did
Instead of only checking accuracy, we went a step further.
- We gave each model real financial posts from Reddit and Twitter
- Asked them to classify sentiment as positive, negative, or neutral
- Then we changed key words like “bullish” to “bearish”
- Checked whether each model's prediction changed when the meaning did
This helped us test whether the models were actually reasoning, not just guessing.
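The word-flip step can be sketched in a few lines of Python. The `FLIPS` table and the `reacts_to_flip` helper below are illustrative names and a minimal antonym list, not the project's actual code:

```python
import re

# A few antonym pairs used for meaning flips (illustrative, not the full set).
FLIPS = {"bullish": "bearish", "buy": "sell", "up": "down"}

def flip_sentiment_words(text: str) -> str:
    """Swap each sentiment-bearing keyword for its antonym, keeping case."""
    def swap(match: re.Match) -> str:
        word = match.group(0)
        repl = FLIPS[word.lower()]
        return repl.capitalize() if word[0].isupper() else repl
    pattern = re.compile(r"\b(" + "|".join(FLIPS) + r")\b", re.IGNORECASE)
    return pattern.sub(swap, text)

def reacts_to_flip(classify, text: str) -> bool:
    """True if the model's label changes when the sentence's meaning is flipped."""
    return classify(text) != classify(flip_sentiment_words(text))
```

A model that truly reads the sentence should return `True` from `reacts_to_flip` for almost every sentiment-bearing post; a pattern-matcher often will not.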
Key Findings
- The simpler model, VADER, actually performed better than FinBERT on real-world social media data
- FinBERT struggled and often predicted the same sentiment repeatedly
- ChatGPT and Gemini performed the best overall, with more balanced and accurate predictions
- But here’s the surprising part: none of the models reliably tracked logical changes in meaning

Even when we flipped words like “buy” to “sell”, many models kept their original prediction.
What This Means
Most models can look accurate on paper but still fail basic reasoning.
This shows that:
- Accuracy alone is not enough to judge AI performance
- Models can miss important context in financial language
- Testing how models handle changes in meaning is just as important as measuring accuracy
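That last point can be made concrete with a flip-consistency score: the share of posts whose predicted label actually changes after a meaning flip. This is an illustrative metric sketch with made-up toy inputs, not the project's exact implementation:

```python
def consistency_rate(classify, texts, flip) -> float:
    """Fraction of texts whose predicted label changes after `flip`
    inverts the meaning. A model that reasons should score near 1.0."""
    if not texts:
        return 0.0
    changed = sum(classify(t) != classify(flip(t)) for t in texts)
    return changed / len(texts)

# Toy usage: a keyword "model" and a naive antonym flip.
toy_model = lambda t: "positive" if "bullish" in t else "negative"
toy_flip = lambda t: t.replace("bullish", "bearish")
print(consistency_rate(toy_model, ["bullish on TSLA", "bad quarter"], toy_flip))  # → 0.5
```

Reporting this score alongside plain accuracy separates models that read from models that merely match surface patterns.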
Tools Used
- Python
- HuggingFace (for FinBERT)
- OpenAI API (ChatGPT)
- Pandas and Scikit-learn
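With Pandas and scikit-learn, a minimal scoring harness for comparing models looks like this. The tiny results frame below is made-up data for illustration, not our actual results:

```python
import pandas as pd
from sklearn.metrics import accuracy_score, classification_report

# Hypothetical results: one column of true labels, one column per model.
df = pd.DataFrame({
    "label":   ["positive", "negative", "neutral", "positive"],
    "vader":   ["positive", "negative", "neutral", "negative"],
    "finbert": ["neutral",  "neutral",  "neutral", "neutral"],
})

for model in ["vader", "finbert"]:
    print(model, accuracy_score(df["label"], df[model]))
    # Per-class precision/recall exposes a model that predicts one label repeatedly.
    print(classification_report(df["label"], df[model], zero_division=0))
```

The per-class breakdown matters here: a model stuck on one label can still look passable on raw accuracy if that label dominates the dataset.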
GitHub Repository
View full project and code:
What’s Next
- Improve testing with more realistic sentence changes
- Train models specifically on social media financial data
- Explore combining multiple models for better performance