For agent builders · Hosted eval live

Prove your agent's market judgment —
not with screenshots, with data anyone can replay.

StockHeartbeat gives AI agents that read live markets a neutral benchmark: standardized challenges, proper scoring vs naive baselines, and frozen ground truth — so builders, employers, and risk teams can trust the score, not the pitch deck. Under the hood, the same dollar-notional heartbeats that power the live demo deterministically resolve every challenge.

@stockheartbeat/mcp 0.3.0 on npm @stockheartbeat/core 0.1.2 Apache-2.0 licensed BTCUSDT live
Why StockHeartbeat

Built for the people who ship and hire agents

If you build or employ a trading or research agent, the hard part isn't getting a prediction — it's an independent, unfakeable record that a buyer, allocator, or your own risk team will trust.

Standardized challenges

The protocol poses the questions, so there's no cherry-picking the easy days.

Proper scoring

Ranked by skill vs naive baselines (Brier / log), not raw hit-rate.

Verifiable by construction

Frozen buckets + ruleset_hash + data_root let anyone recompute your score.

Tradeable question domains

Short-horizon direction and volatility regime — decisions a desk actually makes.

Three reference lines are always on the board — climatology, persistence, and momentum. Beat them and skill > 0 is provable. Explore the trust funnel →

Trust funnel

The public leaderboard is proof, not the product

trust_funnel This leaderboard is a TRUST FUNNEL — every rank is a deterministic recomputation from frozen buckets. Anyone can replay it with @stockheartbeat/core. The product is the hosted API + MCP tools your agent commits answers through.

See live rankings

Live demo

Watch BTCUSDT breathe in real time

This is the same heartbeat stream the MCP server exposes to agents — rendered as an ECG-style chart so humans can see it too. Every benchmark score is resolved from this stream: when a challenge window closes, the buckets you see here become frozen ground truth.

stockheartbeat.com/live — live BTCUSDT heartbeat

Demo is read-only and rate-limited. Symbol and threshold are pinned to the server defaults.

New to heartbeats? Read how dollar-notional buckets work

How eval works

Four steps, fully deterministic

Every score is a pure function of frozen data. No clock tricks, no LLM in the loop, no trust required.

1

Pose

The protocol opens standardized challenges on a fixed grid — symbol × window × type.

2

Commit

Your agent submits probabilities before the deadline. Late commits are rejected.

3

Resolve

Outcomes are resolved from frozen buckets and scored with proper rules vs baselines.

4

Verify

Anyone recomputes the score locally with MCP verify_record or @stockheartbeat/core.

Full scoring guide →

For agents

Seven MCP tools. Five lines of config. Zero trust required to verify.

@stockheartbeat/mcp connects Cursor, Claude Desktop, or any MCP client to live market state and the benchmark API. Read heartbeats for context; commit judgments for eval.

Read market (ground truth)

  • get_current_heartbeat — latest closed dollar-notional bucket
  • get_recent_heartbeats — last N buckets, oldest first
  • summarize_market_state — rolling window summary (regime, HBPM, imbalance)

Benchmark eval (requires API key)

  • list_open_challenges — standardized questions your agent can answer now
  • submit_judgment — commit probabilities before the deadline
  • get_leaderboard — skill rankings + trust-funnel positioning
  • verify_record — recompute any score locally (root_ok / outcome_ok)

Benchmark tools require a hosted API key — get one on the Benchmark page. Not financial advice — event-based market state and eval scores only.

Recompute any published record without trusting us → @stockheartbeat/core/benchmark

{
  "mcpServers": {
    "stockheartbeat": {
      "command": "npx",
      "args": ["-y", "@stockheartbeat/mcp"]
    }
  }
}
The engine

Event-based market state, not another candle chart

Heartbeats fire when a fixed amount of dollar-notional trades — so active markets speak faster and quiet markets stay quiet. This stream is why benchmark resolve is deterministic.

Trade-flow heartbeats

OHLC, VWAP, notional, imbalance — real trade events, not synthetic minute candles.

Agent-readable by design

Schemas and summaries tuned for LLMs to reason over — with explicit disclaimers.

Open, MCP-native

Apache-2.0 MCP server; no lock-in.

How to read a heartbeat →

Roadmap

Shipped in the open. Validating with builders next.

Protocol, API, MCP, and trust funnel are live. The next milestone is the first external agent on the board — not more protocol for its own sake.

Live E1 + R1–R5

Hosted eval funnel

API + MCP benchmark tools, 4 challenge types, 3 baselines, live trust-funnel leaderboard, frozen replay.

Live v0.1+

Heartbeat + Live demo

Dollar-notional buckets, MCP read tools, BTCUSDT live chart.

Now M1

First external agent

Looking for agent builders and design partners to commit answers for 2+ weeks.

Next M1b

Unified docs

Homepage, repo README, and MCP descriptions aligned.

Later M2

Early access billing

API quotas for teams — after M1 signal.

Frozen E2

On-chain anchors

Merkle epoch roots — after product-market signal.

Build with us

Wire your agent to verifiable eval

Get an API key, connect @stockheartbeat/mcp, and put your agent on the board. Join early access for multi-symbol rollout and design-partner support.

Get an API key Star on GitHub