Papers
-
Bilevel Autoresearch: Meta-Autoresearching Itself
-
Mecha-nudges for Machines
-
Similarity-Aware Mixture-of-Experts for Data-Efficient Continual Learning
-
Targeted Adversarial Traffic Generation : Black-box Approach to Evade Intrusion Detection Systems in IoT Networks
-
SIGMA: A Physics-Based Benchmark for Gas Chimney Understanding in Seismic Images
-
Evaluating LLM-Based Test Generation Under Software Evolution
-
3DCity-LLM: Empowering Multi-modality Large Language Models for 3D City-scale Perception and Understanding
-
Code Review Agent Benchmark
-
DetPO: In-Context Learning with Multi-Modal LLMs for Few-Shot Object Detection
-
CSTS: A Canonical Security Telemetry Substrate for AI-Native Cyber Detection
-
End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions
-
RealMaster: Lifting Rendered Scenes into Photorealistic Video
-
InverFill: One-Step Inversion for Enhanced Few-Step Diffusion Inpainting
-
Byzantine-Robust and Differentially Private Federated Optimization under Weaker Assumptions
-
UniFunc3D: Unified Active Spatial-Temporal Grounding for 3D Functionality Segmentation
-
VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs
-
ReqFusion: A Multi-Provider Framework for Automated PEGS Analysis Across Software Domains
-
SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning
-
Failure of contextual invariance in gender inference with large language models
-
TETO: Tracking Events with Teacher Observation for Motion Estimation and Frame Interpolation
-
One View Is Enough! Monocular Training for In-the-Wild Novel View Generation
-
AgentRVOS: Reasoning over Object Tracks for Zero-Shot Referring Video Object Segmentation
-
Foveated Diffusion: Efficient Spatially Adaptive Image and Video Generation
-
VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions
-
Estimating Flow Velocity and Vehicle Angle-of-Attack from Non-invasive Piezoelectric Structural Measurements Using Deep Learning
-
WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG
-
DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models
-
UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation
-
MedObvious: Exposing the Medical Moravec's Paradox in VLMs via Clinical Triage
-
OccAny: Generalized Unconstrained Urban 3D Occupancy
-
LongTail Driving Scenarios with Reasoning Traces: The KITScenes LongTail Dataset
-
Environment Maps: Structured Environmental Representations for Long-Horizon Agents
-
LLMORPH: Automated Metamorphic Testing of Large Language Models
-
LLMLOOP: Improving LLM-Generated Code and Tests through Automated Iterative Feedback Loops
-
M3T: Discrete Multi-Modal Motion Tokens for Sign Language Production
-
Revisiting Real-Time Digging-In Effects: No Evidence from NP/Z Garden-Paths
-
Evaluating a Multi-Agent Voice-Enabled Smart Speaker for Care Homes: A Safety-Focused Framework
-
A Theory of LLM Information Susceptibility
-
Ukrainian Visual Word Sense Disambiguation Benchmark
-
Steering Code LLMs with Activation Directions for Language and Library Control
-
Stochastic Ray Tracing for the Reconstruction of 3D Gaussian Splatting
-
Can LLM Agents Be CFOs? A Benchmark for Resource Allocation in Dynamic Enterprise Environments
-
LLM Inference at the Edge: Mobile, NPU, and GPU Performance Efficiency Trade-offs Under Sustained Load
-
Swiss-Bench SBP-002: A Frontier Model Comparison on Swiss Legal and Regulatory Tasks
-
λSplit: Self-Supervised Content-Aware Spectral Unmixing for Fluorescence Microscopy
-
Foundation Model Embeddings Meet Blended Emotions: A Multimodal Fusion Approach for the BLEMORE Challenge
-
Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages
-
Boost Like a (Var)Pro: Trust-Region Gradient Boosting via Variable Projection
-
Probing Ethical Framework Representations in Large Language Models: Structure, Entanglement, and Methodological Challenges
-
GTO Wizard Benchmark
MongoDB - Build AI That Scales
