Notes on ForecastBench
ForecastBench continuously evaluates the performance of LLMs against an automatically generated, continuously updated set of forecasting questions.
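To make the evaluation concrete: the benchmark scores probabilistic forecasts on resolved binary questions, and (if I recall the paper correctly) reports Brier scores. A minimal sketch of that kind of scoring, with made-up predictions and resolutions:

```python
# Sketch: scoring an LLM's probabilistic forecasts on resolved binary
# questions with the Brier score (lower is better). The numbers here
# are illustrative, not actual ForecastBench data.

def brier_score(forecasts: list[float], outcomes: list[int]) -> float:
    """Mean squared error between predicted probabilities and 0/1 outcomes."""
    assert len(forecasts) == len(outcomes)
    return sum((p - o) ** 2 for p, o in zip(forecasts, outcomes)) / len(forecasts)

# The model's probabilities that each question resolves "yes"...
predictions = [0.9, 0.2, 0.65]
# ...and how the questions actually resolved (1 = yes, 0 = no).
resolutions = [1, 0, 1]

print("Brier score:", round(brier_score(predictions, resolutions), 4))  # 0.0575
```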
Practical guidelines on context engineering: keeping the context append-only, using response prefilling to suppress or force tool calls, setting up restorable compression strategies, and more. A sketch of the prefill trick follows.
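With an Anthropic-style messages API, prefilling means ending the message list with a partial assistant turn; the model must continue from that text, which steers it toward prose and away from tool calls (or toward a particular call). A minimal sketch, assuming the standard `anthropic` Python client; the model name and prompt are placeholders, not from the linked post:

```python
# Sketch of response prefilling: seed the assistant's reply so the model
# continues from it, steering it away from tool use in this case.
import anthropic

client = anthropic.Anthropic()

response = client.messages.create(
    model="claude-sonnet-4-20250514",  # placeholder model ID
    max_tokens=512,
    messages=[
        {"role": "user", "content": "Summarize the incident report."},
        # Prefill: the reply is forced to start with this text, nudging
        # the model to answer in prose rather than emit a tool call.
        {"role": "assistant", "content": "Here is a plain-text summary:"},
    ],
)
print(response.content[0].text)
```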
A Model Context Protocol (MCP) server that lets LLMs run code safely in isolated Docker containers.
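A rough sketch of what such a server could look like, assuming the official `mcp` Python SDK's FastMCP helper and a local Docker daemon; the tool name, image, and resource limits are my own illustrative choices, not the linked project's actual setup:

```python
# Sketch of an MCP tool that executes Python inside a throwaway Docker
# container. The sandbox flags and image are illustrative assumptions.
import subprocess

from mcp.server.fastmcp import FastMCP

mcp = FastMCP("docker-sandbox")

@mcp.tool()
def run_python(code: str) -> str:
    """Run a Python snippet in an isolated, network-less container."""
    result = subprocess.run(
        [
            "docker", "run", "--rm",
            "--network=none",   # no network access
            "--memory=256m",    # cap memory
            "--cpus=0.5",       # cap CPU
            "python:3.12-slim",
            "python", "-c", code,
        ],
        capture_output=True,
        text=True,
        timeout=30,
    )
    return result.stdout if result.returncode == 0 else result.stderr

if __name__ == "__main__":
    mcp.run()  # serves over stdio by default
```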
What do the recent advancements in generative AI mean for APIs?
A conversation with ChatGPT about ChatGPT: "Who are you?"