Yesterday , shared a post about the benchmarks for Anthropic's new Claude models, beating GPT-4 in many areas. So naturally, I wanted to test this out myself and took some time today to do some experiments. I've shared my findings in this quick video, mainly diving into the developer experiences when you switch between different LLM providers and things to keep in mind. Hope you find it useful! Let me know if you've played around with the Anthropic API and what your observations have been!