Capabilities Model Performance Management

Google Gemini 2 is the New Top Ranked Model and Improves Agent Capabilities

Gemini 2.0 Experimental Advanced, complex task handling: This version shows significantly improved performance on complex tasks such as coding, math, reasoning, and following instructions, positioning ...

NBC News

AI's capabilities may be exaggerated by flawed tests, according to new study

Researchers behind a new study say that the methods used to evaluate AI systems’ capabilities routinely oversell AI performance and lack scientific rigor. The study, led by researchers at the Oxford ...

Forbes

IBM’s InstructLab: A New Era For AI Model Creation And Performance

Paul Smith-Goodson is an analyst covering quantum computing and AI. IBM and Red Hat recently introduced InstructLab, a new AI ...

GLM 4.7 AI Brings Stronger Reasoning, Higher HLE Scores & Cleaner Web Output with Tools

GLM version 4.7 lifts software engineering accuracy from 68% to 73.8%, helping you ship cleaner code and UI faster. Terminal Bench rises from 24.5% to 41%, giving teams steadier ...

VentureBeat

Microsoft’s new Phi-4 AI models pack big performance in small packages

Microsoft has introduced a new class of ...

Computer Weekly

What should platform engineering look like?

Platform engineering is based on the principles of product management and the product model applied to digital and IT systems. Fast-moving digital teams show resistance to strict process frameworks ...
