Programming Language Benchmarks

Baidu unveils proprietary ERNIE 5 beating GPT-5 performance on charts, document understanding and more

While Baidu did not release full benchmark details or raw scores publicly, its performance positioning suggests a deliberate ...

Hackaday

Hackaday Podcast Episode 340: The Best Programming Language, Space Surgery, And Hacking Two 3D Printers Into One

Elliot Williams and Al Williams got together to share their favorite hacks of the week with you. If you listen in, you’ll hear exciting news about the upcoming SuperCon and the rare occurrence of Al ...

Hackaday

Ask Hackaday: What’s The Top Programming Language Of 2025

We did an informal poll around the Hackaday bunker and decided that, for most of us, our favorite programming language is solder. However, [Stephen Cass] over at IEEE Spectrum released their annual ...

blockchain

OpenAI's General-Purpose Reasoning Models Outperform Humans at 2025 ICPC World Finals in AI Programming Benchmark

According to OpenAI on X (formerly Twitter), their general-purpose reasoning models successfully solved all 12 problems at the 2025 International Collegiate Programming Contest (ICPC) World Finals, ...

GitHub

Elfsong/Awesome-Code-Benchmark

Software Development Life Cycle Perspective A Survey of Benchmarks for Code Large Language Models and Agents from Xi’an Jiaotong University HumanEval Evaluating Large Language Models Trained on Code ...

Slator

German Gov-Backed AI Benchmark Tracks Large Language Models in 200 Languages

A new multilingual AI benchmarking initiative backed by the German Government aims to advance equitable access to language technologies by highlighting where today’s large language models (LLMs) ...

TechCrunch

AI coding tools are shifting to a surprising place: The terminal

For years, code-editing tools like Cursor, Windsurf, and GitHub’s Copilot have been the standard for AI-powered software development. But as agentic AI grows more powerful and vibe coding takes off, a ...

blockchain

OpenAI o3-pro Vision-Language Model Sets New Benchmark in Complex Reasoning for Mathematics, Science, and Programming

According to DeepLearning.AI, OpenAI has released o3-pro, an advanced vision-language model specifically engineered to surpass previous iterations like o3 and o1-pro in complex reasoning tasks, ...
