
A comparison of DeepSeek AI and OpenAI models across various benchmarks, including AIME 2024, Codeforces, GPQA Diamond, MATH-500, MMLU, and SWE-bench Verified. The results show DeepSeek-R1 leading in multiple categories, highlighting its competitive performance in AI reasoning, mathematics, and coding tasks.