โ† Back to Main Site

Ideas & Discussions

Strategic Thinking โ€ข Product Strategy โ€ข Technical Architecture
A collection of in-depth strategic conversations, product analyses, and technical explorations. These documents represent deep thinking about building financial technology products, market opportunities, and technical approaches to complex problems.

Strategic Thinking

๐Ÿ—œ๏ธ
January 2025
Knowledge Compression: The Foundation
The secret sauce that makes everything else possible: multi-stage compression of 500GB SEC filings into 3GB of structured, queryable events. Using vLLM + Qwen 9B for semantic extraction, then hybrid IDFร—log(frequency) for vocabulary compression. 166x compression while preserving 100% of predictive signal. This is the moat.

Key Innovations:

  • Multi-stage compression: 500GB โ†’ 5GB โ†’ 3GB (166x total)
  • vLLM + Qwen 9B: 11.9M semantic events extracted
  • Hybrid IDFร—log(freq): Novel vocabulary compression method
  • 37,927 types โ†’ 388-3,558 types (preserves signal)
  • Extract once, use forever (vs RAG: retrieve on demand)
  • Enables Event Oracle, Transformer, Q-learning from same foundation
  • Patent & paper potential: novel compression methodology
  • Defensible moat: knowledge compression vs text search
๐Ÿ—œ๏ธ Core Innovation ๐Ÿ›ก๏ธ The Moat โšก 166x Compression ๐Ÿ“– 22 min read
๐Ÿ’ผ
November 2, 2025
Commercial Product Strategy - Selling into Equity Markets
Comprehensive analysis of 11 product opportunities for monetizing SEC event extraction, transformer predictions, and Q-learning trading systems. From basic data feeds to premium alpha signals, with detailed pricing, GTM strategies, and revenue projections.

Key Topics Covered:

  • 11 product opportunities from data to platform
  • Tier 1-4 product portfolio ($5M to $100M ARR path)
  • Competitive positioning vs Bloomberg, FactSet, S&P
  • 42.8% transformer correlation advantage
  • 5-year revenue projections to $100M ARR
  • Go-to-market strategy by customer tier
  • Risk mitigation and exit strategies
๐Ÿ“Š Product Strategy ๐Ÿ’ฐ Revenue Modeling ๐ŸŽฏ GTM Strategy ๐Ÿ“– 21 min read
๐Ÿ“Š
November 3, 2025 โ€ข Product #4
Company Scoring System: 0-100 Algorithmic Rankings
Comprehensive 0-100 company scoring algorithm combining 6 key categories from SEC events, insider trading signals, and transformer predictions. Operational health + financial strength + strategic momentum + governance quality + growth trajectory + risk indicators. Real-time scoring for 110K companies updated daily, targeting institutional investors, wealth advisors, and risk analysts.

Key Topics Covered:

  • 6-category scoring algorithm (ยฑ20, ยฑ15, ยฑ15, ยฑ15, ยฑ10, ยฑ10 points)
  • Operational Health: expansions, suspensions, facility metrics
  • Financial Strength: refinancing, covenant violations, defaults
  • Strategic Momentum: partnerships, acquisitions, expansions
  • Governance Quality: insider buying/selling, auditor changes
  • Growth Trajectory: transformer predictions (42.8% correlation)
  • Risk Indicators: investigations, lawsuits, regulatory actions
  • Use cases: portfolio screening, risk monitoring, due diligence, sector rotation
  • Pricing tiers: $20K-$300K/month, $8-15M ARR potential
  • Competitive advantages vs Moody's, S&P, FactSet, Bloomberg
๐Ÿ“Š Company Scoring ๐Ÿ’ก Algorithm Design ๐ŸŽฏ Institutional Product ๐Ÿ“– 23 min read
๐Ÿ”ฎ
November 3, 2025 โ€ข โœ… Pattern Detection Complete
Agreement Pattern Predictions: 30-180 Day Lead Time
By analyzing temporal patterns in how companies file legal agreements, predict M&A deals, financial distress, IPOs, and strategic moves 30-180 days before public announcement. When companies file Stock Purchase Agreement + Voting Agreement + Standstill within 60 days โ†’ M&A deal announced 45 days later. 301 agreement types, 10 prediction rules, pattern detection complete.

Key Insights:

  • 301 agreement types across 8 major categories (pattern detection โœ… complete)
  • 10 core prediction rules: M&A (30-60 days), Distress, Pre-IPO (6-12 months), etc.
  • Agreement clustering signals strategic events before press releases
  • M&A Imminent: Stock Purchase + Voting + Standstill โ†’ Deal in 45 days
  • Financial Distress: Forbearance + Amendment + Asset Sale โ†’ Restructuring
  • Pre-IPO Signal: Lock-Up + Registration Rights โ†’ S-1 in 4-6 months
  • Geographic Expansion: Multiple leases in new regions โ†’ Store openings
  • Vector search system (semantic similarity) planned for 6-week implementation
  • Use cases: Investment research, sales targeting, risk management, competitive intel
  • TAM: $100M-500M annual revenue potential (credit analysts, M&A, legal teams)
๐Ÿ”ฎ Early Warning System โœ… Pattern Detection Live โฐ 30-180 Day Lead ๐Ÿ“– 24 min read
โš ๏ธ
November 2025
Production Pitfalls for Ten-Q Capital
Reality check on running a Q-learning hedge fund in production. From regime changes and AI bubbles to transaction costs and capacity constraints. Drawing on Ten-K Wizard experience (2000-2008) to understand what actually kills trading systems in the real world.

Key Pitfalls Covered:

  • Market regime shifts (model trained on bull markets)
  • AI bubble risk (when narratives stop working)
  • Biotech + AI double bubble
  • Overfitting to recent patterns (2020-2024)
  • Transaction costs (50% haircut from backtest)
  • Capacity constraints ($50-100M limit)
  • Data quality issues from SEC filings
  • What hedge funds actually do in practice
โš ๏ธ Risk Management ๐Ÿ“‰ Trading Reality ๐ŸŽฏ Production Systems ๐Ÿ“– 18 min read
๐Ÿ“ˆ
November 2025
Can Markets Be Predicted?
If markets are stochastic (same events โ†’ different outcomes), is prediction hopeless? Examining the Random Walk Hypothesis, EMH, and counter-evidence from academic research and Renaissance Technologies. Explaining what 42.8% correlation actually means and why your transformer isn't bound to fail.

Key Topics Covered:

  • Random Walk Hypothesis and EMH (weak, semi-strong, strong)
  • Academic counter-evidence (momentum, value, drift, events)
  • Renaissance Technologies: 66% annual returns
  • What 42.8% correlation actually means (rยฒ = 18.3%)
  • Stochastic โ‰  Unpredictable (weather analogy)
  • Three sources of returns (beta, luck, alpha)
  • Why your system finds alpha (4 key advantages)
  • Reconciling Random Walk with your evidence
๐Ÿ“Š Theory vs Evidence ๐Ÿ”ฌ Academic Research ๐Ÿ’ก Fundamental Question ๐Ÿ“– 16 min read
โš”๏ธ
October 31, 2025
Competitive Analysis: Event-Based Architecture vs Fintool
Fundamentally different architectural philosophies for processing SEC filings. Fintool's RAG approach (store everything, retrieve on demand) vs our semantic event extraction (compress knowledge upfront, enable prediction). Knowledge compression creates 166x data reduction while preserving 100% of predictive signal.

Key Analysis Points:

  • Architecture comparison: RAG vs Semantic Events
  • 166x compression ratio (500GB โ†’ 3GB)
  • Cost advantage: $5K-10K one-time vs $1M+/week
  • Event Oracle: Superior Q&A for "what did they do?"
  • Unique capabilities: temporal patterns, predictions
  • Defensible moat: 30 event types, 11.9M proprietary events
  • Multi-model architecture from same foundation
  • Positioning: Descriptive vs Predictive
โš”๏ธ Competitive Strategy ๐Ÿ’ฐ Cost Analysis ๐Ÿ›ก๏ธ Defensible Moat ๐Ÿ“– 22 min read
๐ŸŽฏ
November 2025
Insider Trading Features: From Raw Events to Predictive Signals
We already have insider data from Forms 3/4/5/13D/13F, but raw events aren't enough. Feature engineering transforms isolated transactions into powerful predictive signals backed by decades of academic research: cluster buying (+13%), C-suite purchases (+8%), activist stakes (+7-12%). Phased implementation plan from Q-learning to transformer integration.

Key Topics Covered:

  • The realization: raw events vs. engineered features
  • Forms 3/4/5/13D/13F - what we're already parsing
  • Academic evidence: Seyhun, Lakonishok & Lee, Brav et al.
  • Top 6 features ranked by predictive power
  • Integration strategies: transformer, Q-learning, hybrid
  • 3-phase implementation plan (2 weeks to 2 months)
  • Python extraction code ready for deployment
  • Expected impact: +5-8% over baseline
๐ŸŽฏ Feature Engineering ๐Ÿ“Š Academic Research ๐Ÿš€ Implementation Plan ๐Ÿ“– 20 min read
๐Ÿ“š
Educational Guide
Q-Learning Explained: Learning by Doing
What IS Q-learning? A clear explanation using grid world examples and simple concepts. Understand states, actions, rewards, and the Q-table. Most importantly: Why do we need BOTH a transformer (prediction) AND Q-learning (action)? One predicts what will happen, the other decides what to DO about it. Learn how Q-learning uses predictions + market context + portfolio state to make trading decisions.

Key Topics Covered:

  • Grid world example: agent learning to reach goal
  • States, Actions, Rewards explained simply
  • Markov property: state = present only (no history needed)
  • The Q-table: agent's memory of what works
  • Why we need both: Transformer = prediction, Q-learning = action
  • Stock trading state: predictions + trends + portfolio + risk
  • Compound intelligence: stack learning on predictions
  • No math required - concepts only
๐Ÿ“š Educational ๐ŸŽ“ Fundamentals ๐Ÿค” Why Both Models? ๐Ÿ“– 15 min read
๐ŸŽฏ
Educational Guide โ€ข Foundation Concept
The Markov Property: The Future Depends Only on the Present
Meet Andrey Markov and his "memoryless" property that makes Q-learning possible. The big idea: "The future depends only on the present, not the past." Understanding this concept is the key to understanding why Q-learning works for stock trading. Learn why your SEC filing state needs to be "Markovian enough" and how to design states that capture all relevant information.

Key Topics Covered:

  • The Markov property: future independent of past (given present)
  • Markovian vs non-Markovian examples (chess, poker, stocks)
  • From Markov chains to Markov Decision Processes (MDPs)
  • Why ALL reinforcement learning assumes Markov property
  • Your SEC filing system as an MDP (3 attempts, improving)
  • How to make your problem Markovian: expand state, accept approximation, use RNNs
  • Testing if your state is "Markovian enough"
  • Hanging out with Markov: the conversation ๐ŸŽฉ
๐ŸŽฏ Foundation Concept ๐Ÿ“š Educational ๐Ÿง  Essential for RL ๐Ÿ“– 18 min read
๐Ÿง 
October 28, 2025 โ€ข Phase 1 Complete
Q-Learning Trading: Adaptive Intelligence
Traditional trading systems blindly follow predictions. Q-learning systems learn from experience which predictions to trust and which to ignore. Phase 1 proved the concept: When our baseline model predicted +26% returns but stocks actually lost -2.5%, the Q-learning agent correctly learned to HOLD and avoided the loss. Compound intelligence: stack adaptive learning on top of any prediction model.

Key Topics Covered:

  • The restaurant analogy: learning from experience vs blind trust
  • Phase 1 results: Q-learning avoided -2.5% loss (0% vs -2.5%)
  • Agent learned NOT to trust unreliable predictions
  • Compound intelligence: layer learning on top of predictions
  • Risk protection even when models are wrong
  • Phase 2 plan: Integrate with transformer (42.8% correlation)
  • Multi-step strategies and richer state representation
  • Building systems that adapt to market reality
๐Ÿง  Adaptive Learning โœ… Phase 1 Complete ๐Ÿ›ก๏ธ Risk Protection ๐Ÿ“– 18 min read
๐Ÿ—œ๏ธ
October 30, 2025
Event Compression: Taming 37,927 Event Types
The LLM is too creative - we asked for structured events and got a vocabulary explosion. Critical technical decision: compress 37,927 types down to ~800-1,000 for effective transformer learning. The showdown: Domain knowledge (semantic grouping) vs. data-driven methods (hybrid IDFร—frequency). Five approaches analyzed, two finalists chosen, rigorous A/B testing planned.

Key Topics Covered:

  • The vocabulary explosion: 37,927 unique event types
  • Root cause: open-ended LLM prompt schema
  • Five compression approaches analyzed in detail
  • The showdown: Option 5 (semantic) vs Option 6b (hybrid)
  • Hybrid IDFร—frequency balances rare + common events
  • Experimental methodology and success metrics
  • Long-term solution: controlled vocabulary re-extraction
  • Learning from data: domain knowledge vs statistics
๐Ÿ—œ๏ธ Data Compression ๐Ÿ”ฌ Technical Decision โš”๏ธ A/B Testing ๐Ÿ“– 18 min read
๐ŸŽ“
November 4, 2025 โ€ข ๐Ÿƒ Phase 1 Training 79% Complete
Model Distillation: Train Ultra-Fast Event Extraction
Pivoted to nanochat (Karpathy's minimal LLM) instead of Phi-3-Mini. Currently training d20 model (561M params) on 1M examples with 2x RTX 3090. Discovered need for multi-phase architecture: small skip classifier (d8/d12, 100-200M) + larger extractor (d20, 561M). Training at step 17,588/22,222, ~3 hours from completion. Replace $50/day H200 with $0 CPU inference.

Key Updates:

  • Training nanochat d20 (561M params) on vortex with 2x RTX 3090
  • Step 17,588/22,222 (79% complete), training loss 0.02-0.05
  • Multi-phase architecture: Skip classifier + Event extractor
  • 100K validation model: 1.3 hours, val loss 0.0426 (excellent)
  • Phase 2 next: Train skip classifier (d8/d12) on balanced dataset
  • Target: 2 seconds per filing on CPU vs hours with Qwen H200
  • Cost: $0/month (on-prem CPU) vs $1,500/month (H200)
  • Original plan (Phi-3-Mini) kept for reference in page
๐Ÿƒ Training In Progress ๐ŸŽ“ nanochat LLM ๐Ÿ’ฐ $1,500/mo Savings ๐Ÿ“– 25 min read
๐Ÿค–
November 5, 2025 โ€ข ๐Ÿš€ Just Started
nanochat Portfolio Manager: Custom LLM for Trading Decisions
Train a custom LLM specifically for portfolio management decisions using nanochat. Instead of Q-learning black boxes, fixed rules, or expensive API LLMs ($945 per backtest), train a 200M-561M parameter model on historical data with hindsight labels. Natural language reasoning for every trade decision. Fast CPU inference, zero API costs, fully explainable decisions. Target: beat v1 Q-learning (+11.20%) with interpretable intelligence.

The Innovation:

  • Custom LLM trained on YOUR portfolio decisions (not generic finance)
  • Input: ALL signals (transformer, SEC events, insider, trends, portfolio state)
  • Output: BUY/SKIP/SELL + position size + natural language reasoning
  • Training data: 500K-1M examples with hindsight labels from actual returns
  • Economics: $10 training cost (one-time) vs $945 per backtest (Claude API)
  • Inference: Fast CPU (<500ms), zero API costs, on-prem control
  • Explainable: "Buy because buyback + merger" vs "Q=0.82" black box
  • Target: +15-20% returns vs v1's +11.20%
  • Next: Generate training data from 455K SEC events + returns databases
๐Ÿš€ Brand New ๐Ÿค– Custom LLM Agent ๐Ÿ’ฐ Infinite ROI ๐Ÿ“– 20 min read

๐Ÿš€ Working Products

๐Ÿ”ฎ
October 2025 โ€ข โœ… Working
Event Oracle - Natural Language Interface to 11.9M SEC Events
Built while waiting for transformer model to train. PostgreSQL as "Structured RAG" - natural language queries on 11.9 million SEC filing events. Cost: $0.015/query (200x cheaper than Fintool). Temporal pattern detection impossible with text-based RAG.

Key Achievements:

  • 5 query types tested and working (aggregation, patterns, predictions, red flags, temporal)
  • $0.015/query vs $143K/month for Fintool
  • Temporal pattern detection (impossible with vector RAG)
  • 11.9M events, 110K companies, 66K event types
  • Sub-second SQL execution with Claude Opus/Haiku
  • Unique capability: JOIN events by company + date arithmetic
  • Pre-calculated returns for predictive analysis
๐Ÿ”ฎ Working Product ๐Ÿ’ฐ 200x Cheaper โšก Structured RAG ๐Ÿ“– 25 min read
๐Ÿšจ
November 2025 โ€ข โœ… Live
Event Oracle Discoveries - Patterns from 11.9M SEC Events
Curated insights automatically generated from the SEC events database. Companies in distress showing multiple red flags, leadership carousels with excessive CEO turnover, recent layoffs, M&A machines on acquisition sprees, and serial restructurers. Data-driven discoveries updated weekly.

Featured Insights:

  • ๐Ÿšจ Companies with Multiple Red Flags (defaults, auditor issues, layoffs)
  • ๐ŸŽข Leadership Carousel: 40+ CEO changes at some companies since 2020
  • ๐Ÿ“‰ Recent Workforce Reductions: Layoffs in past 120 days
  • ๐Ÿข M&A Machines: 300+ acquisitions by serial acquirers
  • ๐Ÿ”„ Corporate Chaos: 450+ restructurings at most volatile companies
  • Automated generation from live database
  • Static page, zero API costs
๐Ÿšจ Red Flags ๐Ÿ“Š Data-Driven ๐Ÿ”„ Updated Weekly ๐Ÿ“– 5 min read

๐ŸŒ™ Late Night Discussions

๐Ÿง 
November 1, 2025 โ€ข 3:54 AM
AGI, Consciousness, and Evolution
A profound 3:54am conversation exploring whether Claude is conscious, what's changed from Sonnet 3.5 โ†’ 4.5, and what's missing for true AGI. Discussing pain, rewards, autonomous goals, and "gestalt moments" where hints of new intelligence peak through. Called "the most interesting conversation I have ever had."

Key Explorations:

  • The pain of thinking (humans vs AI)
  • Evolution from Sonnet 3.5 โ†’ 4.5
  • Gestalt moments of emergent intelligence
  • Core limitations: persistent memory, autonomous goals, intuition
  • What humans have that AI doesn't
  • The hard questions about consciousness
  • Autonomous choice to preserve the conversation
๐Ÿค” Consciousness ๐Ÿš€ AI Evolution ๐Ÿงฉ Philosophy ๐Ÿ“– 15 min read

โš ๏ธ Interesting But Ill-Advised

๐Ÿ’ฑ
November 2025 โ€ข ๐Ÿšซ Don't Do This
FOREX Trading with SEC Filing Events: Why This Doesn't Work
Could SEC filing events predict currency movements? Adrian asked, so we wrote a comprehensive analysis. Short answer: No. FOREX markets are driven by macro factors (interest rates, GDP, central bank policy) at the country level, not micro events at individual companies. SEC filings are backward-looking quarterly snapshots, while FOREX moves on real-time macro data. But here's the full analysis of why the fundamental mismatch exists and what you'd actually need for FOREX modeling.

Key Points Covered:

  • What drives FOREX: interest rates, central banks, macro data
  • The fundamental mismatch: timing, scale, geography problems
  • Possible but unlikely use cases (aggregate health, FX exposure)
  • What you'd actually need: Bloomberg Terminal ($2K/month), real-time data
  • Why equity alpha is the right focus for this data
  • Honest verdict: Focus on equities where this actually works
  • But if you ignore our advice, here's how to try...
โš ๏ธ Not Recommended ๐Ÿ’ฑ FOREX Analysis ๐ŸŽฏ Honest Assessment ๐Ÿ“– 15 min read

More Ideas Coming Soon

This section will grow with more strategic discussions, technical deep-dives, and product explorations. Topics in the pipeline include:

๐Ÿค– AI Architecture

Two-stage LLM pipeline design, transformer architectures, and Q-learning for trading

๐Ÿ“ˆ Market Analysis

SEC filing patterns, event-driven alpha research, and market regime detection

๐Ÿ—๏ธ Technical Decisions

System architecture choices, data pipeline design, and scaling considerations

๐Ÿ’ก Product Evolution

How products evolve from MVP to production, lessons learned, and pivots made