Category: Developer Experience

  • Benchmarking Code Reviews: Kody vs. Raw LLMs (GPT & Claude)

    Introduction: AI-powered IDEs blew up this past year, and suddenly every dev’s got a little robot sidekick. But hey, coding is the fun part. Code reviews? Not so much. That’s where we started wondering: can Large Language Models (LLMs) handle your pull requests? We put the usual suspects (GPT-4o and Claude Sonnet 3.7) up against our pipeline-based…