Open Source
Open Research
The tools, experiments, and research we build in the open. Explore our public repositories: code, benchmarks, and findings on AI agents for data analytics.
GPT Guesses Between 1 and 100
When asked to pick a random number between 1 and 100, ChatGPT does not draw from a uniform distribution.
422 · Python
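
One way to check a claim like this is a Pearson chi-square goodness-of-fit statistic against a uniform distribution on 1 to 100. The sketch below is illustrative only and is not taken from the repository: the function name `chi_square_uniform` and both sample lists are hypothetical, made-up inputs, not model outputs.

```python
from collections import Counter


def chi_square_uniform(samples, low=1, high=100):
    """Pearson chi-square statistic of `samples` against a uniform
    distribution over the integers low..high (inclusive).

    Assumes every sample lies within [low, high].
    """
    k = high - low + 1              # number of possible values
    expected = len(samples) / k     # expected count per value if uniform
    counts = Counter(samples)
    return sum(
        (counts.get(value, 0) - expected) ** 2 / expected
        for value in range(low, high + 1)
    )


# Hypothetical data for illustration only (not results from any model).
even = list(range(1, 101)) * 10                 # each value appears 10 times
skewed = [42] * 500 + list(range(1, 101)) * 5   # one value heavily favored

stat_even = chi_square_uniform(even)
stat_skewed = chi_square_uniform(skewed)
print(stat_even, stat_skewed)
```

With 100 possible values there are 99 degrees of freedom, so a statistic far above roughly 123 (the approximate 5% critical value) would suggest the picks are not uniform.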

"The Car Wash Test" on 4 Frontier LLMs
"The car wash is 100 m away from my house. Should I walk or drive?" LLMs tackle this test in surprisingly different ways.
40 · Python
