Blog - AI Image Tools

Deep Dives

AI evals are becoming the new compute bottleneck

Running AI eval...

Granite 4.1 LLMs: A Hands-On Look at How IBM Built Them

IBM's Granite 4...

Training mRNA Language Models Across 25 Species for $165: What Actually Worked

OpenMed built a...

QIMMA: The Arabic LLM Leaderboard That Actually Checks Its Homework

Most Arabic LLM...

VAKRA: A Reality Check for AI Agents That Actually Use Tools

IBM Research's ...

Google’s TurboQuant Shrinks LLM Memory by 6x Without the Usual Quality Hit

Google Research...

Google’s AMIE AI Tried Taking Patient Histories Before Real Doctor Visits — Here’s How It Went

Google Research...

TurboQuant: Google’s New Compression Trick That Actually Works

Google Research...

Google and NHS test AI for breast cancer screening: two studies, real results

Google Research...

Google tested 6 LLMs on superconductivity physics. The results are telling.

Google research...

ConvApparel: Why Your AI User Simulators Are Still Bad and How to Fix Them

Google Research...

Google Research Tries to Figure Out if LLMs Have People Skills

Google Research...

1 2