LLM API Test

Configuration

Protocol

Test Rounds

Concurrency

Accuracy & Stability

Stagger between requests (ms): Delay between starting concurrent requests. 0 = simultaneous burst.
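A minimal sketch of how a stagger delay like this could be applied, assuming an asyncio-based client. `run_staggered` and `fake_request` are hypothetical names for illustration, not part of the tool.

```python
import asyncio
import time


async def run_staggered(factories, stagger_ms):
    """Start each request stagger_ms after the previous one."""

    async def delayed(index, factory):
        # Request i starts i * stagger_ms after the first.
        # With stagger_ms = 0 every request starts together (a burst).
        await asyncio.sleep(index * stagger_ms / 1000)
        return await factory()

    return await asyncio.gather(
        *(delayed(i, f) for i, f in enumerate(factories))
    )


async def fake_request():
    # Hypothetical stand-in for a real API call; returns its start time.
    started = time.monotonic()
    await asyncio.sleep(0.01)
    return started
```

Calling `asyncio.run(run_staggered([fake_request] * 3, 50))` returns three start times spaced roughly 50 ms apart.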

Warmup before measurement: Send one throwaway request per model+protocol before timing.
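The warmup idea can be sketched as follows for a single model+protocol pair. `timed_rounds` and the `send` callable are hypothetical; the real tool may time requests differently.

```python
import time


def timed_rounds(send, rounds, warmup=True):
    """Time `rounds` calls to send(), optionally after one untimed call."""
    if warmup:
        # Throwaway request: absorbs connection setup and cold-start cost.
        send()
    durations_ms = []
    for _ in range(rounds):
        start = time.perf_counter()
        send()
        durations_ms.append((time.perf_counter() - start) * 1000)
    return durations_ms
```

With `warmup=True` and `rounds=2`, `send` is called three times but only two durations are recorded.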

Use accurate tokenizer (tiktoken): Loads tiktoken from a CDN for client-side token counts. Server-reported usage is always preferred when available.
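A rough sketch of that preference order, under stated assumptions: the usage field names (`completion_tokens`, `output_tokens`) follow common OpenAI-style and Anthropic-style responses, the `cl100k_base` encoding is an arbitrary choice, and the four-characters-per-token fallback is only a heuristic. `count_output_tokens` is a hypothetical helper, not the tool's actual code.

```python
def count_output_tokens(text, usage=None, use_tiktoken=True):
    """Prefer server-reported usage, then tiktoken, then a rough estimate."""
    # 1. Server-reported usage wins whenever it is present.
    if usage:
        for key in ("completion_tokens", "output_tokens"):  # assumed field names
            if key in usage:
                return usage[key]
    # 2. Optional accurate tokenizer, if the package is installed and loads.
    if use_tiktoken:
        try:
            import tiktoken

            encoding = tiktoken.get_encoding("cl100k_base")  # assumed encoding
            return len(encoding.encode(text))
        except Exception:
            pass  # fall through to the heuristic
    # 3. Heuristic estimate: about four characters per token.
    return max(1, len(text) // 4)
```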

Use same-origin proxy (/proxy): When deployed to Cloudflare Pages with the bundled Function, route requests through /proxy on this same origin. This avoids CORS restrictions and per-origin connection limits.

Proxy URL (optional): Custom Cloudflare Worker / reverse proxy URL. Leave empty if using the same-origin proxy above.

Saved Endpoints

Select a previously saved endpoint to auto-fill protocol, URL, and key.

API URL (Base URL): Auto-filled based on protocol. Edit if using a custom endpoint.

API Key

Available Models

Click "Fetch Models" or type models below

Custom Models (one per line, optional)

Custom Prompts (one per line)

Results

Model	Prompt	Round	First Token (ms)	Output Speed (t/s)	Result	Error	Status
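The two timing columns could be derived from three timestamps per streamed request. This is a sketch under an assumption: output speed is taken as output tokens divided by the time between the first token and the end of the stream, which may differ from how the tool defines it. `stream_metrics` is a hypothetical helper.

```python
def stream_metrics(start_s, first_token_s, end_s, output_tokens):
    """Return (first_token_ms, tokens_per_second) for one streamed response."""
    # Time to first token: request start until the first streamed chunk.
    first_token_ms = (first_token_s - start_s) * 1000
    # Assumed definition: speed is measured over the generation phase only.
    generation_s = end_s - first_token_s
    speed = output_tokens / generation_s if generation_s > 0 else 0.0
    return first_token_ms, speed
```

For a request that starts at 0.0 s, yields its first token at 0.5 s, and finishes at 2.5 s with 100 output tokens, this gives 500 ms and 50 t/s.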

Statistics

Model	Avg First Token (ms)	Avg Output Speed (t/s)	Success Rate
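A sketch of how per-model statistics like these could be aggregated from result rows. The row keys (`model`, `ok`, `first_token_ms`, `tokens_per_s`) are hypothetical, and averaging over successful rounds only is an assumption.

```python
from collections import defaultdict
from statistics import mean


def summarize(rows):
    """Aggregate per-round results into per-model averages and success rate."""
    by_model = defaultdict(list)
    for row in rows:
        by_model[row["model"]].append(row)

    summary = {}
    for model, model_rows in by_model.items():
        # Assumption: failed rounds count toward success rate but not averages.
        ok = [r for r in model_rows if r["ok"]]
        summary[model] = {
            "avg_first_token_ms": mean(r["first_token_ms"] for r in ok) if ok else None,
            "avg_output_speed": mean(r["tokens_per_s"] for r in ok) if ok else None,
            "success_rate": len(ok) / len(model_rows),
        }
    return summary
```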

History Records

Real-time Result Viewer

Test Result Detail