Latest Articles
Testing LLMs on Superconductivity Research Questions
Google researchers tested six LLMs on expert-level high-temperature superconductivity questions. NotebookLM and a custom system...
Google’s AI takes on the NHS breast screening bottleneck — two new studies, real results
Google Research just dropped two companion studies in Nature Cancer on using AI in NHS...
TurboQuant: Google’s New Trick for Squeezing AI Models Without Breaking Them
Google Research's TurboQuant, QJL, and PolarQuant algorithms promise extreme vector compression for LLMs with zero...
Vibe Coding XR: When Gemini and XR Blocks Make Prototyping Actually Fun
Google's Vibe Coding XR combines Gemini with XR Blocks to turn natural language prompts into...
How many raters do you actually need for AI benchmarks? Google has answers
Google Research challenges the standard 1-5 rater approach in AI benchmarks, showing that depth over...
Google’s New Framework Puts LLM Personality Tests on the Couch
Google Research introduces a framework that adapts psychological questionnaires into situational judgment tests to measure...
Google’s New AI Agents Can Draw Your Figures and Review Your Paper
Google Research introduces PaperVizAgent and ScholarPeer, two AI agents designed to help with academic figure...
ConvApparel: Why Your AI User Simulator Is Probably Lying to You
Google's ConvApparel dataset exposes how LLM-based user simulators fail to mimic real humans—they're too patient,...
Vantage: Google’s AI Experiment for Scoring Future-Ready Skills
Google Research and NYU unveil Vantage, an AI-powered tool that uses generative conversations to assess...
MoGen: How Google Is Using Synthetic Neurons to Speed Up Brain Mapping
Google Research's new MoGen model generates synthetic neurons to train AI, cutting reconstruction errors by...
Simula: A Smarter Way to Generate Synthetic Data by Designing Datasets, Not Just Samples
Google Research's Simula framework treats synthetic data generation as mechanism design, using reasoning to build...
ReasoningBank: Giving AI Agents a Memory That Actually Learns from Failure
Google's ReasoningBank framework lets agents distill generalizable reasoning strategies from both successes and failures, moving...