Introduction

Social media traders use sarcasm extensively: "GME is definitely going to $1000 🚀" (often sarcastic). Simple sentiment models miss sarcasm, misclassifying sarcastic bearish comments as bullish. Sarcasm detection is critical for accurate retail sentiment signals.

Sarcasm Challenges in Finance

Finance sarcasm is context-dependent and subtle. Examples:

  • "Great earnings! Stock down 5% 📉" (sarcastic negative sentiment)
  • "Just loaded the dip, time to lose money 💰" (self-deprecating, likely bullish despite wording)
  • "This company is printing money 🤑 for shareholders" (could be sarcastic in bear market)
Simple keyword matching fails.

Sarcasm Detection Methods

1. Explicit markers: emoji (🚀, 📉, 🤑), ALL CAPS, repeated punctuation (!!!, ???). These are imperfect but useful heuristics.
2. Sentiment contradiction: statement contradicts typical sentiment (positive words + negative emoji).
3. Contextual: prior posts determine context. User consistently bearish, suddenly positive post is likely sarcasm.

Contextual Sarcasm Detection with LLMs

Fine-tune transformer models on sarcasm-annotated financial social media. Feed post + context (prior posts from same user) to model. Classifier learns patterns: context-specific sarcasm, emotional contradiction, insider jargon.

Accuracy on held-out test set: 78% correctly identifying sarcasm, 92% identifying literal posts. Main confusion: heavily contextualized insider jargon.

Building Sarcasm Training Data

Manually annotate Reddit and Twitter posts as sarcastic/literal. Sample diverse accounts: bulls, bears, day traders, value investors. Ensure training data reflects target distribution. Crowdsource annotation with multiple reviewers for hard cases.

Financial Sarcasm Lexicon

Build lexicon of finance-specific sarcasm phrases:

  • "To the moon 🚀" = often sarcastic (stock likely down)
  • "This company is printing money" = context-dependent
  • "Big brain play" = sarcasm (risky trade)
  • "Not financial advice" = about to give risky trade advice (sarcastic disclaimer)

Emoji Analysis

Emoji provides strong sarcasm cues. 🚀 + negative sentiment = sarcasm. 📉 + positive words = contradiction. Build emoji-sentiment mapping: which emoji combinations predict sarcasm? Analyze 100k posts, find combinations that reliably indicate sarcasm.

Empirical Results

Applied sarcasm-aware sentiment to 10,000 daily Reddit WSB posts. Compared to naive sentiment:

  • Naive sentiment correlation with next-day returns: 0.35
  • Sarcasm-aware sentiment correlation: 0.52
  • 48% improvement in signal quality

Implementation

Use transformer fine-tuned on financial sarcasm data. Combine with explicit marker detection (emoji, caps, punctuation). For each post, output: sentiment + sarcasm probability. Adjust final sentiment based on sarcasm confidence.