Papers

Filter by company

Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning

Skywork AI

Published on: 2025-06-06 1 author
Splat and Replace: 3D Reconstruction with Repetitive Elements

Adobe / Massachusetts Institute of Technology, National Institute for Research in Digital Science and Technology, Université Côte d’Azur

Published on: 2025-06-06 1 author
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

Apple / Swiss Federal Institute of Technology Lausanne

Published on: 2025-06-04 1 author
HybGRAG: Hybrid Retrieval-Augmented Generation on Textual and Relational Knowledge Bases

Amazon / Carnegie Mellon University

Published on: 2025-06-02 1 author
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation

Snowflake

Published on: 2025-06-02 1 author
Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Tencent, Apple / The University of Hong Kong, University of Illinois at Urbana-Champaign

Published on: 2025-05-31 1 author
M+: Extending MemoryLLM with Scalable Long-Term Memory

Amazon / Massachusetts Institute of Technology, University of California

Published on: 2025-05-30 1 author
Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents

Published on: 2025-05-29 5 authors
BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model

Published on: 2025-05-29 11 authors
Skywork Open Reasoner 1 Technical Report

Skywork AI

Published on: 2025-05-29 1 author
GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

NVIDIA

Published on: 2025-05-27 1 author
More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives

Moonshot AI / Renmin University of China

Published on: 2025-05-27 1 author
Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach

Perplexity

Published on: 2025-05-27 1 author
Autoregressive Speech Synthesis without Vector Quantization

Microsoft / The Chinese University of Hong Kong

Published on: 2025-05-27 1 author
Vision as LoRA

ByteDance / University of Birmingham

Published on: 2025-05-26 8 authors
syftr: Pareto-Optimal Generative AI

DataRobot

Published on: 2025-05-26 1 author
SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond

MiniMax / City University of Hong Kong, Hong Kong University of Science and Technology

Published on: 2025-05-26 1 author
Gemini Robotics: Bringing AI into the Physical World

Google

Published on: 2025-05-25 1 author
OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks

MiniMax / Fudan University

Published on: 2025-05-24 1 author
A Minimalist Method for Fine-tuning Text-to-Image Diffusion Models

Published on: 2025-05-23 4 authors
One RL to See Them All: Visual Triple Unified Reinforcement Learning

MiniMax / Northwestern Polytechnical University, Shanghai Jiao Tong University

Published on: 2025-05-23 1 author
GiGL: Large-Scale Graph Neural Networks at Snapchat

Snap

Published on: 2025-05-23 1 author
Incremental Sequence Classification with Temporal Consistency

UiPath

Published on: 2025-05-22 8 authors
From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition

ByteDance / Singapore University of Technology and Design

Published on: 2025-05-22 1 author
DAPO: An Open-Source LLM Reinforcement Learning System at Scale

ByteDance / Tsinghua University

Published on: 2025-05-20 1 author
M-RewardBench: Evaluating Reward Models in Multilingual Settings

Cohere / Allen Institute for AI

Published on: 2025-05-20 1 author
Lessons from Defending Gemini Against Indirect Prompt Injections

Google

Published on: 2025-05-20 1 author
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning

Moonshot AI / Peking University, University of Chinese Academy of Sciences

Published on: 2025-05-19 1 author
Progressive Autoregressive Video Diffusion Models

Adobe / Stony Brook University

Published on: 2025-05-18 1 author
FastVLM: Efficient Vision Encoding for Vision Language Models

Apple

Published on: 2025-05-15 1 author
VGGT: Visual Geometry Grounded Transformer

Meta Platforms / University of Oxford

Published on: 2025-05-14 6 authors
Qwen3 Technical Report

Alibaba

Published on: 2025-05-14 1 author
The Leaderboard Illusion

Cohere / Princeton University

Published on: 2025-05-12 1 author
MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

MiniMax

Published on: 2025-05-12 1 author
LLMs Get Lost In Multi-Turn Conversation

Microsoft

Published on: 2025-05-09 1 author
Reasoning Models Don't Always Say What They Think

Published on: 2025-05-08 15 authors
A Survey on Test-Time Scaling in Large Language Models: What, How, Where, and How Well

Salesforce / City University of Hong Kong

Published on: 2025-05-04 1 author
Command A: An Enterprise-Ready Large Language Model

Cohere

Published on: 2025-05-01 1 author
InteractRank: Personalized Web-Scale Search Pre-Ranking with Cross Interaction Features

Pinterest

Published on: 2025-05-01 1 author
Investigating the Overlooked Hessian Structure: From CNNs to LLMs

ByteDance / Beijing Institute of Mathematical Sciences and Applications, Hong Kong Baptist University, Hong Kong University of Science and Technology, Rutgers University

Published on: 2025-05-01 1 author
The Leaderboard Illusion

Cohere / Allen Institute for Artificial Intelligence, Massachusetts Institute of Technology, Princeton University, Stanford University, University of Washington, University of Waterloo

Published on: 2025-04-29 13 authors
TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate

Published on: 2025-04-28 4 authors
Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use

Google / Stanford University

Published on: 2025-04-28 1 author
Perception Encoder: The best visual embeddings are not at the output of the network

Meta Platforms / Ritsumeikan Global Innovation Research Organization

Published on: 2025-04-28 1 author
Kimi-Audio Technical Report

Moonshot AI

Published on: 2025-04-25 1 author
I-Con: A Unifying Framework for Representation Learning

Google, Microsoft / Massachusetts Institute of Technology

Published on: 2025-04-23 5 authors
Describe Anything: Detailed Localized Image and Video Captioning

NVIDIA / University of California

Published on: 2025-04-22 11 authors
LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

Google / Johannes Kepler University Linz

Published on: 2025-04-22 5 authors
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing

Apple

Published on: 2025-04-22 1 author
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second

Apple

Published on: 2025-04-21 1 author

Prev 146 147 148 149 150 151 152 153 154 155 156 Next

Go to section

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: