← All research

Research F6 sources

Monthly Research · March 2026

Supervised Alpha and the Repricing of AI — March 2026

Frontier AI starts trading like pre-public infrastructure, and the quant literature converges on LLM alpha as real — but only under heavy supervision.

#llm-alpha #human-oversight #ai-valuations #private-markets

Abstract

March's capital cycle reached new scale — OpenAI closed a $122 billion round at an $852 billion valuation with explicit pre-IPO framing — while Anthropic's twin disclosure incidents showed operational security becoming the binding constraint for frontier labs. In quantitative finance, LLM- and agent-based methods moved from novelty to systematic evaluation: benchmark papers, supervision frameworks, and overfitting-control protocols now outnumber raw stock-picking claims. The month's conclusion: LLM alpha is real, but only under heavy human and statistical supervision.

Executive Summary

March 2026 was a month of consolidation on both fronts. In AI, the capital cycle reached a new scale — OpenAI closed a $122 billion round at an $852 billion valuation with explicit pre-IPO framing, including roughly $3 billion from retail channels — while Anthropic's twin disclosure incidents showed that operational security, not model quality is becoming the binding constraint for frontier labs [5][6]. In quantitative finance, the arXiv record for the month shows LLM- and agent-based methods moving from novelty to systematic evaluation: benchmark papers, supervision frameworks, and overfitting-control protocols now outnumber raw "LLM picks stocks" claims [1][2][3]. The takeaway for a partner: the market is repricing AI as infrastructure with public-market discipline approaching, and the quant literature is converging on the same conclusion we hold internally — LLM alpha is real but only under heavy human and statistical supervision.

AI — Latest Approaches

OpenAI closes a $122B round at an $852B valuation, with retail participation

OpenAI raised $122 billion — its largest round to date — at an $852 billion valuation, co-led by SoftBank, Andreessen Horowitz, D.E. Shaw Ventures, MGX, TPG, and T. Rowe Price, with Amazon, Nvidia, and Microsoft participating; about $3 billion came from individual investors via bank channels, and shares will appear in ARK-managed ETFs ahead of an expected IPO [6]. The company disclosed roughly $2 billion in monthly revenue, 900 million weekly active users, 50 million subscribers, enterprise at 40% of revenue, and an ads pilot already above $100 million ARR — agentic-workflow growth attributed to its newest model, GPT-5.4 [6]. The release reads as a draft public-market narrative; the round is as much about anchoring IPO expectations as about capital [6].

Anthropic's month of accidental disclosures

Anthropic suffered two self-inflicted exposures within a week of each other in late March: roughly 3,000 internal files made publicly available (including a draft post describing an unannounced model, reported March 27), followed on March 31 by a Claude Code npm release (v2.1.88) that shipped nearly 2,000 source files and more than 512,000 lines of code — the full architectural scaffolding of one of its most important products [5]. Anthropic called it "a release packaging issue caused by human error, not a security breach"; the same week it won an injunction against the administration in its Department of Defense dispute [5]. The episode underlines that release hygiene around agentic tooling is now a material risk surface for even the most safety-branded labs.

Consumer video retreats as labs refocus on developers and enterprise

According to Wall Street Journal reporting relayed by TechCrunch on March 31, OpenAI moved to pull the plug on its Sora video product roughly six months after its public launch, refocusing on developers and enterprises — partly a response to the growing momentum of Anthropic's Claude Code [5]. Combined with the enterprise-revenue figures in OpenAI's own raise disclosure, the month marks a visible rotation of frontier-lab effort from consumer media generation toward coding agents and enterprise workflows [5][6].

The AI capital cycle broadens: seed premiums and agentic enterprise rollouts

TechCrunch's March 2026 record shows the funding premium extending down-stack — AI seed startups commanding measurably higher valuations, and Runway launching a $10M fund and Builders program for early-stage AI startups — alongside agentic AI landing in mainstream enterprise software, with Salesforce announcing an AI-heavy Slack makeover (30 new features) and Amazon extending Alexa+ into agent-driven food ordering [4]. The pattern is consistent: capital and product attention are concentrating on agents embedded in workflows rather than standalone chat or media products [4].

Quantitative Trading — Latest Approaches

A large-scale, risk-adjusted benchmark for deep learning on financial time series

Saly-Kaufmann, Wood, Peter-Calliess, and Zohren (Oxford lineage) posted a 43-page benchmark of deep-learning forecasters on financial time series, evaluated on risk-adjusted performance rather than raw predictive accuracy (arXiv:2603.01820, q-fin.TR + cs.LG) [1]. Benchmarks of this scale matter for practitioners because they discipline the literature's tendency to report Sharpe-free accuracy metrics; the framing of the month is consistent with a field shifting from "does the model predict" to "does the model pay after costs and risk."

LLMs in the investment process: capable, but only under supervision

Crisostomo and Mykhalyuk (arXiv:2603.19944, submitted 20 March) evaluated ChatGPT, Gemini, DeepSeek, and Perplexity across naive, structured, and chain-of-thought prompting and found recurring reasoning failures — financial misconceptions, carryover errors, and hallucinated or stale information — yet showed that appropriately guided and supervised LLMs can outperform the market, with accuracy improving when recommendations are grounded in official regulatory filings [3]. The same month's listings show a dense cluster of agentic-finance work: aggregating zero-shot LLM agents for disclosure classification (2603.20965), translating natural language into executable option strategies (2603.16434), anonymization-first LLM portfolio optimization (2603.17692), and autonomous agentic frameworks for factor investing and investment screening (2603.14288, 2603.23300) [1][2].

Execution and microstructure: control theory and realistic simulators

Two notable March entries push execution research toward engineering rigor: a Model Predictive Control formulation of trade execution from a group including Dimitri Bertsekas (arXiv:2603.28898), and Rosenbaum et al.'s work on closing the "reality gap" in limit order book simulation (arXiv:2603.24137) [1]. Alongside a neural hidden Markov model with adaptive granularity attention for high-frequency order-flow modeling (arXiv:2603.20456), the month's microstructure output points at the same goal: execution models that are trained and validated in simulators faithful enough to transfer to live markets [1].

Backtest integrity and AI-native research infrastructure

The portfolio-management listings carry a marked robustness theme: a Stanford group including Boyd, Candès, Hastie, and Kochenderfer on single-asset adaptive leveraged volatility control (2603.01298); "Implementation Risk in Portfolio Backtesting" quantifying a previously unmeasured error source (2603.20319); and a walk-forward, out-of-sample protocol for mitigating overfitting in quant strategies (2603.09219) [2]. On the tooling side, FinRL-X proposes an AI-native modular infrastructure for quantitative trading (2603.21330), echoing the broader move toward standardized, agent-ready research stacks [1].

Cross-cutting Signals / Relevance to SteadyHash

The two themes intersect most clearly at supervision. The strongest quant result of the month — that LLMs can outperform the market only when guided, supervised, and grounded in regulatory filings [3] — is the research-grade version of what the AI industry demonstrated operationally: Anthropic's leaks were human process failures around powerful tooling, not model failures [5]. For a systematic firm, the implication is to invest in the harness — validation protocols, grounding pipelines, release hygiene — rather than chase headline model capability. The March arXiv record (benchmarks, implementation-risk quantification, overfitting protocols) suggests the academic community is building exactly that harness [1][2].

Second, the capital-markets signal is direct. OpenAI's $852 billion round with retail bank channels, ARK ETF inclusion, and S-1-style disclosure marks frontier AI's transition into a public-markets asset class with reported revenue run-rates and unit economics [6]. For SteadyHash's venture book, seed-stage AI valuations rising on the same tide [4] argue for pricing discipline; for the quant book, the imminent arrival of mega-cap AI listings implies new index, flow, and volatility dynamics worth modeling before the listings occur.

Finally, the rotation from consumer media toward coding agents and enterprise workflows [5][6][4] aligns with our AI focus: durable value is accruing to agents embedded in high-frequency professional workflows — the same architecture pattern (agent + tooling + guardrails) the q-fin literature is now formalizing for trading itself [1][2][3].

Sources

  1. arXiv — Trading and Market Microstructure (q-fin.TR), authors and titles for March 2026https://arxiv.org/list/q-fin.TR/2026-03 (accessed 2026-06-12)
  2. arXiv — Portfolio Management (q-fin.PM), authors and titles for March 2026https://arxiv.org/list/q-fin.PM/2026-03 (accessed 2026-06-12)
  3. Ricardo Crisostomo, Diana Mykhalyuk — Large Language Models and Stock Investing: Is the Human Factor Required? (arXiv:2603.19944) — https://arxiv.org/abs/2603.19944 (accessed 2026-06-12)
  4. TechCrunch — March 2026 archivehttps://techcrunch.com/2026/03/ (accessed 2026-06-12)
  5. Connie Loizos / TechCrunch — Anthropic is having a monthhttps://techcrunch.com/2026/03/31/anthropic-is-having-a-month/ (accessed 2026-06-12)
  6. Rebecca Bellan / TechCrunch — OpenAI, not yet public, raises $3B from retail investors in monster $122B fund raisehttps://techcrunch.com/2026/03/31/openai-not-yet-public-raises-3b-from-retail-investors-in-monster-122b-fund-raise/ (accessed 2026-06-12)