Papers

Vessel-Aware Deep Learning for OCTA-Based Detection of AMD

Stony Brook University

Published on: 2026-03-06 5 authors
LucidNFT: LR-Anchored Multi-Reward Preference Optimization for Generative Real-World Super-Resolution

Hong Kong University of Science and Technology

Published on: 2026-03-06 6 authors
Energy-Driven Adaptive Visual Token Pruning for Efficient Vision-Language Models

Hong Kong University of Science and Technology

Published on: 2026-03-06 2 authors
Unify the Views: View-Consistent Prototype Learning for Few-Shot Segmentation

Tongji University

Published on: 2026-03-06 3 authors
Who We Are, Where We Are: Mental Health at the Intersection of Person, Situation, and Large Language Models

Oslo Metropolitan University, Stony Brook University, University of Texas

Published on: 2026-03-06 6 authors
Domain-Adaptive Model Merging across Disconnected Modes

Nanchang University, Peking University, Southeast University, Tongji University

Published on: 2026-03-06 5 authors
OVGGT: O(1) Constant-Cost Streaming Visual Geometry Transformer

National Taiwan University, National Taiwan University of Science and Technology

Published on: 2026-03-06 6 authors
Omni-Masked Gradient Descent: Memory-Efficient Optimization via Mask Traversal with Improved Convergence

Peking University

Published on: 2026-03-06 5 authors
Exploring Open-Vocabulary Object Recognition in Images using CLIP

Iwate Prefectural University

Published on: 2026-03-06 2 authors
Skeleton-to-Image Encoding: Enabling Skeleton Representation Learning via Vision-Pretrained Models

Hebei University of Technology, KTH Royal Institute of Technology, Lancaster University, Nanyang Technological University, Shenzhen MSU-BIT University, VinUniversity

Published on: 2026-03-06 8 authors
CR-QAT: Curriculum Relational Quantization-Aware Training for Open-Vocabulary Object Detection

Incheon National University, Korea Advanced Institute of Science & Technology, University of Seoul

Published on: 2026-03-06 5 authors
PROBE: Probabilistic Occupancy BEV Encoding with Analytical Translation Robustness for 3D Place Recognition

Published on: 2026-03-06 3 authors
Imagine How To Change: Explicit Procedure Modeling for Change Captioning

Aalto University, Chinese Academy of Sciences, Sichuan University, University of Chinese Academy of Sciences

Published on: 2026-03-06 5 authors
Breaking Smooth-Motion Assumptions: A UAV Benchmark for Multi-Object Tracking in Complex and Adverse Conditions

Xidian University

Published on: 2026-03-06 9 authors
Towards High-resolution and Disentangled Reference-based Sketch Colorization

The University of Tokyo, Waseda University

Published on: 2026-03-06 8 authors
An Interactive Multi-Agent System for Evaluation of New Product Concepts

TCL Technology / Seoul National University of Science and Technology

Published on: 2026-03-06 3 authors
HarvestFlex: Strawberry Harvesting via Vision-Language-Action Policy Adaptation in the Wild

Beijing Academy of Agriculture and Forestry Sciences, ShanghaiTech University

Published on: 2026-03-06 4 authors
Agent Hunt: Bounty Based Collaborative Autoformalization With LLM Agents

AI4REASON Institute, Chalmers University of Technology, The University of Melbourne, University of Gothenburg

Published on: 2026-03-06 3 authors
Technical Report: Automated Optical Inspection of Surgical Instruments

National University of Computer and Emerging Sciences Islamabad

Published on: 2026-03-06 3 authors
Rank-Factorized Implicit Neural Bias: Scaling Super-Resolution Transformer with FlashAttention

University of Seoul

Published on: 2026-03-06 4 authors
TADPO: Reinforcement Learning Goes Off-road

Carnegie Mellon University

Published on: 2026-03-06 6 authors
Track-SQL: Enhancing Generative Language Models with Dual-Extractive Modules for Schema and Context Tracking in Multi-turn Text-to-SQL

Guangdong Laboratory of Artificial Intelligence and Digital Economy, Guangdong University of Technology, Peng Cheng Laboratory, Shantou University

Published on: 2026-03-06 6 authors
MM-ISTS: Cooperating Irregularly Sampled Time Series Forecasting with Multimodal Vision-Text LLMs

Academy of Sciences Hong Kong, East China Normal University, The Hong Kong Polytechnic University

Published on: 2026-03-06 6 authors
RePer-360: Releasing Perspective Priors for 360° Depth Estimation via Self-Modulation

Published on: 2026-03-06 5 authors
Restoring Linguistic Grounding in VLA Models via Train-Free Attention Recalibration

Fudan University, Singapore Management University, Tsinghua University

Published on: 2026-03-06 4 authors
Demystifying KAN for Vision Tasks: The RepKAN Approach

Sejong University

Published on: 2026-03-06 1 author
EvoESAP: Non-Uniform Expert Pruning for Sparse MoE

Mohamed bin Zayed University of Artificial Intelligence, Westlake University, Zhejiang University

Published on: 2026-03-06 5 authors
MASFactory: A Graph-centric Framework for Orchestrating LLM-Based Multi-Agent Systems with Vibe Graphing

Beijing University of Posts and Telecommunications, Shanghai Jiao Tong University

Published on: 2026-03-06 9 authors
Preventing Learning Stagnation in PPO by Scaling to 1 Million Parallel Environments

Google / University of Oxford

Published on: 2026-03-06 7 authors
EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation

Tencent / City University of Hong Kong

Published on: 2026-03-06 6 authors
MOSIV: Multi-Object System Identification from Videos

Insta360 / Carnegie Mellon University, ETH Zurich, Georgia Tech, Harvard University, University of California, University of Illinois Urbana-Champaign

Published on: 2026-03-06 12 authors
ViewFusion: Structured Spatial Thinking Chains for Multi-View Reasoning

Hong Kong University of Science and Technology, University of California, University of Queensland

Published on: 2026-03-06 5 authors
Sensitivity-Aware Retrieval-Augmented Intent Clarification

University of Amsterdam

Published on: 2026-03-06 1 author
Agnostic learning in (almost) optimal time via Gaussian surface area

ETH Zurich, University of Amsterdam

Published on: 2026-03-06 3 authors
Improved high-dimensional estimation with Langevin dynamics and stochastic weight averaging

Harvard University, Princeton University, University of California

Published on: 2026-03-06 3 authors
ResearchEnvBench: Benchmarking Agents on Environment Synthesis for Research Code Execution

Fudan University, Jilin University, Nanjing University, OpenMOSS, Shanghai Innovation Institution, Shanghai Key Laboratory of Multimodal Embodied AI, Wuhan University

Published on: 2026-03-06 10 authors
StruVis: Enhancing Reasoning-based Text-to-Image Generation via Thinking with Structured Vision

AntGroup / East China Normal University, Hong Kong University of Science and Technology, Shanghai Jiao Tong University

Published on: 2026-03-06 10 authors
ViroGym: Realistic Large-Scale Benchmarks for Evaluating Viral Proteins

GSK / KTH Royal Institute of Technology, Technical University of Munich, University of Washington

Published on: 2026-03-06 5 authors
Occlusion-Aware SORT: Observing Occlusion for Robust Multi-Object Tracking

Chinese Academy of Sciences, Sichuan University

Published on: 2026-03-06 5 authors
Ensemble Learning with Sparse Hypercolumns

Dublin City University

Published on: 2026-03-06 6 authors
Heterogeneous Decentralized Diffusion Models

Bagel Lab

Published on: 2026-03-06 4 authors
Improved Constrained Generation by Bridging Pretrained Generative Models

Inverted AI / University of British Columbia

Published on: 2026-03-06 5 authors
FontUse: A Data-Centric Approach to Style- and Use-Case-Conditioned In-Image Typography

University of Tsukuba

Published on: 2026-03-06 3 authors
Stabilizing Reinforcement Learning for Diffusion Language Models

Huawei / Hong Kong University of Science and Technology, The Chinese University of Hong Kong

Published on: 2026-03-06 8 authors
Learning to Generate via Understanding: Understanding-Driven Intrinsic Rewarding for Unified Multimodal Models

Baidu / Chinese Academy of Sciences, Peking University, Sun Yat-sen University, University of Chinese Academy of Sciences

Published on: 2026-03-06 9 authors
GenHOI: Towards Object-Consistent Hand-Object Interaction with Temporally Balanced and Spatially Selective Object Injection

Baidu / Northwestern Polytechnical University, Sun Yat-sen University

Published on: 2026-03-06 12 authors
Devil is in Narrow Policy: Unleashing Exploration in Driving VLA Models

Lenovo / Beihang University, Communication University of China, Tsinghua University

Published on: 2026-03-06 13 authors
Probing Visual Concepts in Lightweight Vision-Language Models for Automated Driving

University of Limerick

Published on: 2026-03-06 6 authors
TempoSyncDiff: Distilled Temporally-Consistent Diffusion for Low-Latency Audio-Driven Talking Head Generation

Gargi Memorial Institute of Technology, Variable Energy Cyclotron Centre

Published on: 2026-03-06 2 authors
Transforming Omnidirectional RGB-LiDAR data into 3D Gaussian Splatting

State University of New York

Published on: 2026-03-06 3 authors
