Papers
-
Reinforced Generation of Combinatorial Structures: Ramsey Numbers
-
Solving an Open Problem in Theoretical Physics using AI-Assisted Discovery
-
FlashAttention-4: Algorithm and Kernel Pipelining Co-Design for Asymmetric Hardware Scaling
-
ZipMap: Linear-Time Stateful 3D Reconstruction with Test-Time Training
-
Architecting Trust in Artificial Epistemic Agents
-
LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory
-
Discovering Multiagent Learning Algorithms with Large Language Models
-
The Geometry of Noise: Why Diffusion Models Don't Need Noise Conditioning
-
Unified Latents (UL): How to train your latents
-
On Surprising Effectiveness of Masking Updates in Adaptive Optimizers
-
Intelligent AI Delegation
-
Accelerating Mathematical and Scientific Discovery with Gemini Deep Think
-
Self-Consistency Improves Chain of Thought Reasoning in Language Models
-
Accelerating Scientific Research with Gemini: Case Studies and Common Techniques
-
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
-
TranslateGemma Technical Report
-
Reasoning Models Generate Societies of Thought
-
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities.
-
Prompt Repetition Improves Non-Reasoning LLMs
-
Towards a Science of Scaling Agent Systems
-
T5Gemma 2: Seeing, Reading, and Understanding Longer
-
Efficiently Reconstructing Dynamic Scenes One D4RT at a Time
-
SIMA 2: A Generalist Embodied Agent for Virtual Worlds
-
Gemini Robotics 1.5: Pushing the Frontier of Generalist Robots with Advanced Embodied Reasoning, Thinking, and Motion Transfer
-
ATLAS: Practical Scaling Laws for Multilingual Models
-
Cortex: Workflow-Aware Resource Pooling and Scheduling for Agentic Servin
-
Training Agents Inside of Scalable World Models
-
AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data
-
An AI System to Help Scientists Write Expert-Level Empirical Software
-
Measuring the environmental impact of delivering AI at Google Scale
-
Why do LLMs attend to the first token
-
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
-
Scaling Data-Constrained Language Models
-
Non-preemptive Throughput Maximizationunder Time-varying Capacity
-
AlphaEvolve: A coding agent for scientific and algorithmic discovery
-
Gemini Robotics: Bringing AI into the Physical World
-
Lessons from Defending Gemini Against Indirect Prompt Injections
-
Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use
-
LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities
-
It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization
-
How new data permeates LLM knowledge and how to dilute it
-
Migrating Code At Scale With LLMs At Google
-
Gemini: A Family of Highly Capable Multimodal Models
-
Gemma 3 Technical Report
-
Gemini Embedding: Generalizable Embeddings from Gemini
-
EmbeddingGemma: Powerful and Lightweight Text Representations
-
Titans: Learning to Memorize at Test Time
-
VDB-GPDF: Online Gaussian Process Distance Field with VDB Structure
-
OpenVLA: An Open-Source Vision-Language-Action Model
