Papers
-
Orion: Characterizing and Programming Apple's Neural Engine for LLM Training and Inference
-
VS3R: Robust Full-frame Video Stabilization via Deep 3D ReconstructionHunan University
-
Don't Freeze, Don't Crash: Extending the Safe Operating Range of Neural Navigation in Dense CrowdsPurdue University
-
Evolving Medical Imaging Agents via Experience-driven Self-skill DiscoveryHokkaido University, Ricken, Southwest Jiaotong University, The University of Tokyo, Westlake University
-
ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning
-
TumorChain: Interleaved Multimodal Chain-of-Thought Reasoning for Traceable Clinical Tumor AnalysisDAMO Academy
-
PatchCue: Enhancing Vision-Language Model Reasoning with Patch-Based Visual Cues
-
PolyBlocks: A Compiler Infrastructure for AI Chips and Programming Frameworks
-
Shifting Adaptation from Weight Space to Memory Space: A Memory-Augmented Agent for Medical Image SegmentationHarvard Medical School, New Jersey Institute of Technology, Northeastern University, University of California, University of Georgia
-
Stochastic Event Prediction via Temporal Motif TransitionsUniversity at Buffalo
-
Systematic Evaluation of Novel View Synthesis for Video Place RecognitionFordham University
-
ROSE: Reordered SparseGPT for More Accurate One-Shot Large Language Models PruningWestlake University
-
Confidence Before Answering: A Paradigm Shift for Efficient LLM Uncertainty Estimation
-
CylinderSplat: 3D Gaussian Splatting with Cylindrical Triplanes for Panoramic Novel View SynthesisNanjing University of Science and Technology, ShanghaiTech University
-
VerChol -- Grammar-First Tokenization for Agglutinative Languages
-
Computational Pathology in the Era of Emerging Foundation and Agentic AI -- International Expert Perspectives on Clinical Integration and Translational ReadinessAntGroup / Carnegie Mellon University, Duke University, Emory University, Georgia Institute of Technology, Harvard Medical School, Harvard University, Oncode Institute, Qingdao City University, Shanghai Jiao Tong University, Sichuan University, Stanford University, Technische Universität Dresden, Tsinghua University, University of Chicago, University of Texas, University of Warwick
-
HERO: Hierarchical Embedding-Refinement for Open-Vocabulary Temporal Sentence Grounding in VideosHangzhou Dianzi University, Tsinghua University
-
Reconstruct! Don't Encode: Self-Supervised Representation Reconstruction Loss for High-Intelligibility and Low-Latency Streaming Neural Audio CodecJohns Hopkins University, University of Southern California
-
PixARMesh: Autoregressive Mesh-Native Single-View Scene ReconstructionUniversity of California
-
Lost in Stories: Consistency Bugs in Long Story Generation by LLMs
-
Building an Ensemble LLM Semantic Tagger for UN Security Council Resolutions
-
InnoAds-Composer: Efficient Condition Composition for E-Commerce Poster GenerationChongqing University of Posts and Telecommunications, Hong Kong University of Science and Technology, Zhejiang University
-
Mitigating Bias in Concept Bottleneck Models for Fair and Interpretable Image ClassificationMassachusetts Institute of Technology
-
Reference-guided Policy Optimization for Molecular Optimization via LLM ReasoningAlibaba / Central Michigan University, DAMO Academy, Hong Kong Baptist University, Shanghai Jiao Tong University
-
Calibrated Credit Intelligence: Shift-Robust and Fair Risk Scoring with Bayesian Uncertainty and Gradient Boosting
-
LUMINA: LLM-Guided GPU Architecture Exploration via Bottleneck Analysis
-
CollabOD: Collaborative Multi-Backbone with Cross-scale Vision for UAV Small Object DetectionShenzhen ATC Technology / Hong Kong University of Science and Technology, The University of International Business and Economics Beijing, The University of Sydney
-
Beyond Geometry: Artistic Disparity Synthesis for Immersive 2D-to-3D
-
Pano3DComposer: Feed-Forward Compositional 3D Scene Generation from Single Panoramic ImageSun Yat-sen University
-
InfoGatherer: Principled Information Seeking via Evidence Retrieval and Strategic QuestioningCanadian Institute for Advanced Research, Dalhousie University, University of British Columbia, University of Washington, Vector Institute
-
The World Won't Stay Still: Programmable Evolution for Agent Benchmarks
-
CORE-Seg: Reasoning-Driven Segmentation for Complex Lesions via Reinforcement LearningAgency for Science, Technology and Research, Singapore, Nanjing University of Science and Technology, Southeast University
-
DeepFact: Co-Evolving Benchmarks and Agents for Deep Research Factuality
-
Stock Market Prediction Using Node Transformer Architecture Integrated with BERT Sentiment AnalysisUniversity of Ottawa
-
Design Experiments to Compare Multi-armed Bandit AlgorithmsThe Chinese University of Hong Kong, University of Toronto
-
BlackMirror: Black-Box Backdoor Detection for Text-to-Image Models via Instruction-Response DeviationBeijing Institute of Technology, Chinese Academy of Sciences, Sun Yat-sen University, University of Chinese Academy of Sciences
-
Learning Next Action Predictors from Human-Computer InteractionHasso Plattner Institute, New York University, Stanford University
-
Weak-SIGReg: Covariance Regularization for Stable Deep LearningKreasof AI
-
RAC: Rectified Flow Auto CoderNanyang Technological University, Rutgers University, University of Wisconsin-Madison
-
Towards Driver Behavior Understanding: Weakly-Supervised Risk Perception in Driving Scenes
-
Addressing the Ecological Fallacy in Larger LMs with Human ContextStony Brook University, Vanderbilt University
-
Beyond Static Frames: Temporal Aggregate-and-Restore Vision Transformer for Human Pose EstimationZhejiang Gongshang University
-
A Persistent-State Dataflow Accelerator for Memory-Bound Linear Attention Decode on FPGAUniversity of Southern California
-
FTSplat: Feed-forward Triangle Splatting NetworkNankai University
-
Implicit Style Conditioning: A Structured Style-Rewrite Framework for Low-Resource Character ModelingGuangdong University of Finance
-
OD-RASE: Ontology-Driven Risk Assessment and Safety Enhancement for Autonomous DrivingChubu University
-
Facial Expression Recognition Using Residual Masking NetworkHo Chi Minh City University of Technology
-
SLER-IR: Spherical Layer-wise Expert Routing for All-in-One Image RestorationUniversity of California San Diego
-
XAI for Coding Agent Failures: Transforming Raw Execution Traces into Actionable InsightsIslington College
-
Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew EstimationHo Chi Minh City University of Technology, Vietnam National University Ho Chi Minh City
