Papers

Filter by company

Pretrained Vision-Language-Action Models are Surprisingly Resistant to Forgetting in Continual Learning

Microsoft / The University of Texas

Published on: 2026-03-04 1 author
Phi-4-reasoning-vision-15B Technical Report

Microsoft

Published on: 2026-03-04 1 author
UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?

Microsoft / Shanghai Jiao Tong University

Published on: 2026-03-03 1 author
Beyond Pixel Histories: World Models with Persistent 3D State

Microsoft / University of Edinburgh

Published on: 2026-03-03 1 author
Modular Memory is the Key to Continual Learning Agents

Microsoft / University of Bremen

Published on: 2026-03-02 1 author
Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

Microsoft

Published on: 2026-02-26 1 author
Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

Microsoft

Published on: 2026-02-26 1 author
WebGym: Scaling Training Environments for Visual Web Agents with Realistic Tasks

Microsoft / University of Illinois Urbana-Champaign

Published on: 2026-02-25 1 author
Experiential Reinforcement Learning

Microsoft / University of Southern California

Published on: 2026-02-15 1 author
WizardLM: Empowering large pre-trained language models to follow complex instructions

Microsoft / Peking University

1 author
Florence: A New Foundation Model for Computer Vision

Microsoft

1 author
LLM-in-Sandbox Elicits General Agentic Intelligence

Microsoft / Tsinghua University

Published on: 2026-02-12 1 author
On-Policy Context Distillation for Language Models

Microsoft / Microsoft Research

Published on: 2026-02-12 1 author
CineScene: Implicit 3D as Effective Scene Representation for Cinematic Video Generation

Microsoft

Published on: 2026-02-06 1 author
See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning

Microsoft / Tsinghua University

Published on: 2026-02-05 1 author
LIVE: Long-horizon Interactive Video World Modeling

Microsoft / The Chinese University of Hong Kong

Published on: 2026-02-03 1 author
Closing the Loop: Universal Repository Representation with RPG-Encoder

Microsoft

Published on: 2026-02-03 1 author
CUA-Skill: Develop Skills for Computer Using Agent

Microsoft

Published on: 2026-02-02 1 author
AgentRx: Diagnosing AI Agent Failures from Execution Trajectories

Microsoft

Published on: 2026-02-02 6 authors
X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests

Microsoft

Published on: 2026-02-01 1 author
LLM-42: Enabling Determinism in LLM Inference with Verified Speculation

Microsoft / Microsoft Research

Published on: 2026-01-30 1 author
Lost in Transmission: When and Why LLMs Fail to Reason Globally

Microsoft

Published on: 2026-01-30 1 author
Efficient Autoregressive Video Diffusion with Dummy Head

Microsoft

Published on: 2026-01-28 1 author
Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

Microsoft

Published on: 2026-01-22 1 author
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Microsoft / MIT

Published on: 2026-01-15 1 author
Controlled LLM Training on Spectral Sphere

Microsoft / Renmin University

Published on: 2026-01-13 1 author
InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training

Microsoft / Peking University

Published on: 2026-01-08 1 author
Thinking with Blueprints: Assisting Vision-Language Models in Spatial Reasoning via Structured Object Representation

Microsoft / National University of Singapore

Published on: 2026-01-05 1 author
From Word to World: Can Large Language Models be Implicit Text-based World Models?

Microsoft / Southern University of Science and Technology

Published on: 2025-12-21 1 author
Sigma-MoE-Tiny Technical Report

Microsoft / Microsoft Research

Published on: 2025-12-19 1 author
FlashPortrait: 6x Faster Infinite Portrait Animation with Adaptive Latent Prediction

Microsoft / Fudan University

Published on: 2025-12-18 1 author
Spatia: Video Generation with Updatable Spatial Memory

Microsoft / The University of Sydney

Published on: 2025-12-17 1 author
Native and Compact Structured Latents for 3D Generation

Microsoft / Tsinghua University

Published on: 2025-12-16 1 author
Wait, Wait, Wait... Why Do Reasoning Models Loop?

Microsoft / MIT

Published on: 2025-12-15 1 author
Glance: Accelerating Diffusion Models with 1 Sample

Microsoft / Wissenschaftliche Hochschule für Unternehmensführung

Published on: 2025-12-11 1 author
Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion

Microsoft / Beijing Jiaotong University

Published on: 2025-12-05 1 author
VIGS-SLAM: Visual Inertial Gaussian Splatting SLAM

Microsoft / ETH Zurich

Published on: 2025-12-02 1 author
The Art of Scaling Test-Time Compute for Large Language Models

Microsoft / Indian Institute of Technology Delhi

Published on: 2025-12-01 1 author
ThetaEvolve: Test-time Learning on Open Problems

Microsoft

Published on: 2025-11-28 1 author
LatBot: Distilling Universal Latent Actions for Vision-Language-Action Models

Microsoft / University of Chinese Academy of Sciences

Published on: 2025-11-28 1 author
SageServe: Optimizing LLM Serving on Cloud Data Centers with Forecast Aware Auto-Scaling

Microsoft / University of Illinois Urbana-Champaign

Published on: 2025-11-12 1 author
Shifting Work Patterns with Generative AI

Microsoft / Harvard University

Published on: 2025-10-13 1 author
LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs

Microsoft / University of California

Published on: 2025-10-09 5 authors
Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning

Microsoft / MIT

Published on: 2025-09-14 1 author
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

MetaGPT, Google, Microsoft / Nanyang Technological University, Université de Montréal, University of Illinois at Urbana-Champaign

Published on: 2025-08-02 1 author
Autoregressive Speech Synthesis without Vector Quantization

Microsoft / The Chinese University of Hong Kong

Published on: 2025-05-27 1 author
LLMs Get Lost In Multi-Turn Conversation

Microsoft

Published on: 2025-05-09 1 author
I-Con: A Unifying Framework for Representation Learning

Google, Microsoft / MIT

Published on: 2025-04-23 5 authors
AI-Instruments: Embodying Prompts as Instruments to Abstract & Reflect Graphical Interface Commands as General-Purpose Tools

Microsoft

Published on: 2025-02-26 1 author
AI at Work Is Here. Now Comes the Hard Part

Microsoft

Published on: 2024-05-08 Venue: Microsoft Work Trend Index

1 2 Next

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: