Evals Resources

Blogs

  • Aparna Dhinakaran - Medium · The AI Engineer (Newsletter) - Co-founder and Chief Product Officer (CPO) of Arize AI; writes about evals and tooling. Previously: Cornell (computer vision PhD), Uber (ML), and UC Berkeley AI Research. (updated 2026-02-01)
  • Ian W. (Promptfoo creator) - Blog - Posts that often cover evals and tooling. (updated 2026-02-01)
  • Eugene Yan - Blog - Practical ML/AI writing with useful evaluation and production insights. (updated 2026-02-01)
  • Hamel Husain - Blog - Machine learning engineer with 20+ years of experience; independent consultant helping companies build AI products. Formerly at Airbnb and GitHub; open-source contributor. (updated 2026-02-01)

Resources

  • Vivek Menon - Awesome AI Eval (GitHub) - Curated list of AI evaluation tools, papers, and resources. (updated 2026-02-01)

Communities

  • Hacker News - Search: evals - Live HN search results for evals discussions. (updated 2026-02-01)
  • Reddit - r/LocalLLaMA - Local-first AI community with lots of eval, tooling, and control discussions. (updated 2026-02-01)