Experiments

Manage and run AI experiments

| Name | Description | Status | Runs | Avg Latency | Avg Cost | Updated |
| --- | --- | --- | --- | --- | --- | --- |
| GPT-4o vs Claude Sonnet Comparison | Comparing reasoning capabilities | completed | 12 | 1.45s | $0.003200 | just now |
| Prompt Engineering Test v3 | Testing different prompt strategies | running | 8 | 890ms | $0.001500 | just now |
| Hallucination Analysis | Measuring hallucination rates across models | draft | 0 | 0ms | $0.00 | just now |
| Token Optimization Study | Analyzing token usage patterns | completed | 24 | 2.10s | $0.008900 | just now |
| Temperature Scaling Test | Temperature effects on output quality | idle | 5 | 1.10s | $0.002100 | just now |