Skip to content

Paper #1: Workshop Submission (LLM-Finance 2025) - Due Oct 26 #88

@iAmGiG

Description

@iAmGiG

Context

Conference: LLM-Finance 2025 - The 2nd IEEE International Workshop on Large Language Models for Finance
Part of: IEEE BigData 2025 (December 8-11, 2025, Macau, China)
Submission Deadline: October 26, 2025 (12 days from today)
Paper Type: Workshop paper (not full journal paper)

Advisor guidance: "Currently the experimental study is good for the workshop, more sophisticated experimental study with some future work can do a journal publication"

Today's date: October 14, 2025


Paper Title Exploration

Advisor's Suggested Title

"something like 'Learning the Invisible Hand: Evaluating Large Language Models' Ability to Infer Market Forces under Data Obfuscation'"

Alternative Title Options

Option 1: Advisor's Suggestion (Economic Metaphor)

  • "Learning the Invisible Hand: Evaluating Large Language Models' Ability to Infer Market Forces under Data Obfuscation"
  • Pros: Catchy, references Adam Smith, emphasizes "market forces"
  • Cons: "Invisible hand" is metaphorical (less precise)

Option 2: GEX/Gamma Exposure Focus (Technical)

  • "Detecting Latent Market Patterns: LLM-Based Gamma Exposure Analysis with Obfuscation Testing"
  • Pros: Uses "GEX" (recognizable to finance audience), "latent patterns" (catchy)
  • Cons: More jargon-heavy

Option 3: Constraint Detection (Methodology Focus)

  • "Obfuscation Testing for LLM Structural Reasoning: Detecting Dealer Hedging Constraints in Options Markets"
  • Pros: Clear methodology contribution, specific domain
  • Cons: Less catchy, longer

Option 4: Hybrid (Market Forces + GEX)

  • "Inferring Latent Market Forces: LLM Detection of Gamma Exposure Patterns under Data Obfuscation"
  • Pros: Combines "latent patterns" + "GEX" + "market forces"
  • Cons: Slightly generic

Option 5: WHO→WHOM→WHAT Framework

  • "From GEX to Action: LLM-Based Detection of Dealer Hedging Constraints via Obfuscation Testing"
  • Pros: Emphasizes causal mechanism (WHO forces WHOM)
  • Cons: Less emphasis on "market forces" metaphor

Option 6: Pattern Discovery Angle

  • "Discovering Mechanical Market Patterns: Obfuscation Testing for LLM Structural Understanding in Finance"
  • Pros: Emphasizes discovery, "mechanical patterns" (our classification)
  • Cons: Less specific to GEX/dealer hedging

Recommended Title (Balances All Elements)

Final Recommendation:

"Inferring Latent Market Forces: Evaluating LLM Detection of Gamma Exposure Patterns via Obfuscation Testing"

Rationale:

  • ✅ "Latent market forces" (broad appeal, user preference)
  • ✅ "Gamma exposure" (GEX terminology, technical credibility)
  • ✅ "Obfuscation testing" (methodology contribution)
  • ✅ "Evaluating LLM" (clear scope)
  • ✅ Length: 14 words (reasonable for finance/AI venues)

Workshop Requirements (IEEE Format)

Estimated Requirements (based on standard IEEE BigData workshop guidelines):

  • Format: IEEE 2-column format
  • Page Limit: 6-8 pages (typical for workshops)
  • Template: IEEE Conference template (available on Overleaf)
  • Submission: PDF via conference submission system
  • Review: Single-blind (authors visible to reviewers)

Topics of Interest (LLM-Finance workshop):

  • Large language models for financial applications
  • Financial market prediction and analysis
  • Risk management and compliance
  • Novel LLM architectures for finance

Paper Structure (Workshop Version)

Abstract (150-200 words)

  • Problem: Can LLMs detect structural patterns vs. memorize training data?
  • Method: Obfuscation testing framework (remove dates/tickers)
  • Domain: Dealer hedging constraints in options markets (gamma exposure)
  • Results: [Detection rates, accuracy, cross-pattern validation]
  • Contribution: Novel methodology for validating LLM structural reasoning

1. Introduction (1 page)

  • Market context: Options explosion (2020-2024), 0DTE growth
  • Problem: LLMs in finance - reasoning vs. memorization?
  • Challenge: How to test true understanding?
  • Our approach: Obfuscation testing + gamma exposure patterns
  • Contributions: (1) Validation methodology, (2) Empirical evidence

2. Background & Related Work (0.75 pages)

  • Dealer hedging constraints (academic foundation)
  • Gamma exposure (GEX) and market microstructure
  • LLMs in finance (sentiment analysis, forecasting)
  • Gap: No validation of structural reasoning

3. Methodology (1.5 pages)

  • Pattern taxonomy (MECHANICAL, NARRATIVE)
  • WHO→WHOM→WHAT framework
  • Obfuscation technique (dates→"Day T+N", tickers→generic)
  • GEX calculation (Black-Scholes gamma aggregation)
  • Outcome verification (forward returns, realized volatility)

4. Experimental Setup (1 page)

  • Dataset: SPY 2024 (Q1, Q3, Q4) - 181 trading days
  • Patterns tested: gamma_positioning, stock_pinning, 0dte_hedging
  • LLM: GPT-4o-mini (tool calling) + O3-mini (reasoning)
  • Validation criteria: ≥60% detection, ≥30 samples, obfuscation pass

5. Results (1.5 pages)

  • Detection rates (per pattern, per quarter)
  • Predictive accuracy (rule-based verification)
  • Key finding: Detection ≠ Profitability (structural vs. economic)
  • Obfuscation test results (passes all patterns)

6. Discussion (0.75 pages)

  • Why detection stays constant while profits vary
  • Implications for LLM structural reasoning
  • Limitations: Single asset (SPY), one year, three patterns

7. Future Work & Conclusion (0.5 pages)

References (0.5 pages)

  • Academic papers: Avellaneda (2003), Frey (1997), Gao (2024), etc.
  • SqueezeMetrics white papers (industry validation)
  • LLM reasoning papers (chain-of-thought, structural understanding)

Total: 6-8 pages (fits workshop format)


Task Checklist for User

Setup Tasks (Days 1-2: Oct 14-15)

  • Set up Overleaf project

    • Create new project: "LLM-Finance-2025-Paper"
    • Select template: "IEEE Conference Template" (search in Overleaf)
    • Invite advisor as collaborator (share link)
    • Create initial structure (sections 1-7 as above)
  • Gather validation results from HPCC

  • Finalize title

    • Review title options with advisor
    • Get advisor approval on final title
    • Update Overleaf document header

Writing Tasks (Days 3-8: Oct 16-21)

  • Draft Abstract (Day 3)

    • Problem statement (1-2 sentences)
    • Methodology overview (2-3 sentences)
    • Key results (2-3 sentences, use actual numbers once available)
    • Contribution statement (1 sentence)
  • Write Introduction (Days 3-4)

    • Market context (options explosion, dealer hedging)
    • Problem motivation (LLM reasoning vs. memorization)
    • Research question clearly stated
    • Contributions list (3-4 bullet points)
  • Write Methodology (Days 5-6)

    • Obfuscation technique (with examples)
    • Pattern taxonomy diagram (MECHANICAL vs. NARRATIVE)
    • WHO→WHOM→WHAT framework explanation
    • Validation criteria (60% threshold, 30 samples, obfuscation pass)
    • Figure: System architecture flowchart (use docs/SYSTEM_FLOW_SIMPLE.md)
  • Write Results (Day 7)

    • Table 1: Detection rates by pattern/quarter
    • Table 2: Predictive accuracy comparison
    • Table 3: Alpha vs. Detection (showing independence)
    • Discussion of key finding (detection constant, profits vary)
  • Write Background, Discussion, Future Work (Day 8)

Refinement Tasks (Days 9-11: Oct 22-24)

  • Get advisor feedback (Day 9)

    • Share Overleaf draft with advisor
    • Schedule meeting to review draft
    • Incorporate feedback on content/framing
  • Create figures and tables (Day 10)

    • Figure 1: Obfuscation example (before/after)
    • Figure 2: System architecture (from docs/SYSTEM_FLOW_SIMPLE.md)
    • Table 1: Detection rates (actual numbers from Issue Verify Validation Results on HPCC Database #86)
    • Table 2: Predictive accuracy
    • Table 3: Detection vs. Profitability (key evidence)
  • Proofread and format (Day 11)

    • Check IEEE format compliance (2-column, font size, margins)
    • Verify all citations formatted correctly
    • Check page count (target: 6-8 pages)
    • Run spell check
    • Check figure quality (300 DPI minimum for IEEE)

Submission Tasks (Day 12: Oct 25-26)

  • Final review (Oct 25 morning)

    • Read paper start to finish
    • Verify all tables/figures referenced in text
    • Confirm all claims have evidence
    • Check references complete
  • Generate PDF (Oct 25 afternoon)

    • Download from Overleaf
    • Verify PDF looks correct (no formatting issues)
    • Check file size (< 10MB typical limit)
  • Submit to workshop (Oct 26 - DEADLINE)

    • Find submission link (WikiCFP or conference website)
    • Create account if needed
    • Upload PDF
    • Fill out metadata (title, authors, abstract, keywords)
    • Submit before 11:59 PM (check timezone!)
    • Save confirmation email

Writing Guidelines (Workshop Paper)

Tone & Style

What to emphasize:

  • Novel methodology (obfuscation testing)
  • Empirical evidence (detection rates, accuracy)
  • Structural reasoning (constraints vs. correlations)

What to de-emphasize:

  • Trading profitability (mention but don't focus)
  • Implementation details (keep high-level)
  • Future extensions (brief mention only)

Writing style:

  • Clear and concise (workshop papers are shorter)
  • Focus on methodology contribution
  • Use examples to illustrate obfuscation
  • Connect to broader LLM reasoning research

Key Messages

  1. Problem: Hard to distinguish LLM reasoning from memorization
  2. Solution: Obfuscation testing removes temporal context
  3. Domain: Financial markets (dealer hedging constraints)
  4. Evidence: 100% detection with obfuscation across 181 days
  5. Contribution: Validation framework applicable beyond finance

Figures to Include

Figure 1: Obfuscation Example

BEFORE (Normal):
Date: 2024-01-05
Symbol: SPY
Net GEX: -$5.2B
Spot: $552.10

AFTER (Obfuscated):
Date: Day T+0
Symbol: INDEX_1
Net GEX: -$5.2B
Spot: [normalized]

Figure 2: System Architecture (from docs/SYSTEM_FLOW_SIMPLE.md)

  • Data → Obfuscation → LLM → Detection → Verification

Figure 3: Detection vs. Profitability (scatter plot)

  • X-axis: Quarter (Q1, Q3, Q4)
  • Y-axis: Detection rate (bars) + Net alpha (line)
  • Shows detection constant, profits vary

Tables to Include

Table 1: Detection Rates by Pattern/Quarter

Pattern Q1 2024 Q3 2024 Q4 2024 Average
gamma_positioning [TBD] [TBD] [TBD] [TBD]
stock_pinning [TBD] [TBD] [TBD] [TBD]
0dte_hedging [TBD] [TBD] [TBD] [TBD]

Table 2: Predictive Accuracy

Pattern Accuracy Sample Size Obfuscation Pass
gamma_positioning [TBD] 181
stock_pinning [TBD] 181
0dte_hedging [TBD] 181

Table 3: Detection vs. Economic Outcome

Quarter Detection Rate Predictive Accuracy Net Alpha
Q1 2024 [TBD] [TBD] [TBD]
Q3 2024 [TBD] [TBD] [TBD]
Q4 2024 [TBD] [TBD] [TBD]

(Shows detection constant while profitability varies - key evidence)


Keywords (for Submission)

Suggested keywords (5-7):

  • Large language models
  • Market microstructure
  • Obfuscation testing
  • Gamma exposure
  • Structural reasoning
  • Financial markets
  • Pattern detection

Dependencies

Blocking:

Non-blocking (can proceed without):


Success Criteria

✅ Paper submitted by October 26, 2025 (deadline)
✅ Advisor approval on draft before submission
✅ 6-8 pages in IEEE format
✅ All tables/figures included with actual data
✅ Title finalized and approved
✅ References complete and properly formatted
✅ Abstract clearly states contribution and results


Timeline Summary

Date Day Task
Oct 14-15 1-2 Setup Overleaf, gather results, finalize title
Oct 16-17 3-4 Write abstract + introduction
Oct 18-19 5-6 Write methodology
Oct 20 7 Write results
Oct 21 8 Write background, discussion, future work
Oct 22 9 Advisor feedback + revisions
Oct 23 10 Create figures and tables
Oct 24 11 Proofread and format
Oct 25 12 Final review + generate PDF
Oct 26 DEADLINE Submit to workshop

12 days - tight but feasible if validation results (Issue #86) are available soon


Contact Information

Workshop: LLM-Finance 2025
Conference Website: https://conferences.cis.um.edu.mo/ieeebigdata2025/workshops.html
WikiCFP: http://www.wikicfp.com/cfp/servlet/event.showcfp?eventid=189780
Workshop Organizer: QUANZHI LI


Priority

CRITICAL - 12-day deadline for first paper submission


Estimated Effort: 12 days of writing + advisor feedback + validation results
Target Completion: October 26, 2025 (submission deadline)

Sub-issues

Metadata

Metadata

Assignees

Labels

No labels
No labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions