Experiments
Manage and run AI experiments
| Name | Description | Status | Runs | Avg Latency | Avg Cost | Updated |
|---|---|---|---|---|---|---|
| GPT-4o vs Claude Sonnet Comparison | Comparing reasoning capabilities | completed | 12 | 1.45s | $0.003200 | just now |
| Prompt Engineering Test v3 | Testing different prompt strategies | running | 8 | 890ms | $0.001500 | just now |
| Hallucination Analysis | Measuring hallucination rates across models | draft | 0 | 0ms | $0.00 | just now |
| Token Optimization Study | Analyzing token usage patterns | completed | 24 | 2.10s | $0.008900 | just now |
| Temperature Scaling Test | Temperature effects on output quality | idle | 5 | 1.10s | $0.002100 | just now |