Papers
- Less Gaussians, Texture More: 4K Feed-Forward Textured Splatting
- ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
- ArtHOI: Taming Foundation Models for Monocular 4D Reconstruction of Hand-Articulated-Object Interactions
- Vision Transformers and Graph Neural Networks for Charged Particle Tracking in the ATLAS Muon Spectrometer
- Beyond identifiability: Learning causal representations with few environments and finite samples
- Resolving the Robustness-Precision Trade-off in Financial RAG through Hybrid Document-Routed Retrieval
- End-to-end Feature Alignment: A Simple CNN with Intrinsic Class Attribution
- LEMON: a foundation model for nuclear morphology in Computational Pathology
- Do All Vision Transformers Need Registers? A Cross-Architectural Reassessment
- RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation
- ExVerus: Verus Proof Repair via Counterexample Reasoning
- MAGNET: Autonomous Expert Model Generation via Decentralized Autoresearch and BitNet Training
- Geo$^\textbf{2}$: Geometry-Guided Cross-view Geo-Localization and Image Synthesis
- Doctorina MedBench: End-to-End Evaluation of Agent-Based Medical AI
- ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?
- Fus3D: Decoding Consolidated 3D Geometry from Feed-forward Geometry Transformer Latents
- A Neural Score-Based Particle Method for the Vlasov-Maxwell-Landau System
- Gradient-Informed Training for Low-Resource Multilingual Speech Translation
- A Compression Perspective on Simplicity Bias
- GazeQwen: Lightweight Gaze-Conditioned LLM Modulation for Streaming Video Understanding
- Self-Organized Optical Pathways in Optofluidic Photonic Crystals
- Incorporating contextual information into KGWAS for interpretable GWAS discovery
- In-Context Molecular Property Prediction with LLMs: A Blinding Study on Memorization and Knowledge Conflicts
- On the Expressive Power of Contextual Relations in Transformers
- Why Safety Probes Catch Liars But Miss Fanatics
- Methods for Knowledge Graph Construction from Text Collections: Development and Applications
- Dynamic LIBRAS Gesture Recognition via CNN over Spatiotemporal Matrix Representation
- GUIDE: A Benchmark for Understanding and Assisting Users in Open-Ended GUI Tasks
- Seeing Through Smoke: Surgical Desmoking for Improved Visual Perception
- Learning to Recorrupt: Noise Distribution Agnostic Self-Supervised Image Denoising
- PiCSRL: Physics-Informed Contextual Spectral Reinforcement Learning
- Speech-Synchronized Whiteboard Generation via VLM-Driven Structured Drawing Representations
- DRiffusion: Draft-and-Refine Process Parallelizes Diffusion Models with Ease
- Spectral Coherence Index: A Model-Free Metric for Protein Structural Ensemble Quality Assessment
- Automated Quality Assessment of Blind Sweep Obstetric Ultrasound for Improved Diagnosis
- World Reasoning Arena
- Polarization-Based Eye Tracking with Personalized Siamese Architectures
- Few Shots Text to Image Retrieval: New Benchmarking Dataset and Optimization Methods
- THFM: A Unified Video Foundation Model for 4D Human Perception and Beyond
- Data-Driven Plasticity Modeling via Acoustic Profiling
- On Integrating Resilience and Human Oversight into LLM-Assisted Modeling Workflows for Digital Twins
- Decoding Defensive Coverage Responsibilities in American Football Using Factorized Attention Based Transformer Models
- Shared Representation for 3D Pose Estimation, Action Classification, and Progress Prediction from Tactile Signals
- Parameter-Free Dynamic Regret for Unconstrained Linear Bandits
- Preventing Data Leakage in EEG-Based Survival Prediction: A Two-Stage Embedding and Transformer Framework
- Good Scores, Bad Data: A Metric for Multimodal Coherence
- Personalizing Mathematical Game-based Learning for Children: A Preliminary Study
- Density-aware Soft Context Compression with Semi-Dynamic Compression Ratio
- DiReCT: Disentangled Regularization of Contrastive Trajectories for Physics-Refined Video Generation
- DenseSwinV2: Channel Attentive Dual Branch CNN Transformer Learning for Cassava Leaf Disease Classification
