Papers
-
Var-JEPA: A Variational Formulation of the Joint-Embedding Predictive Architecture -- Bridging Predictive and Generative Self-Supervised Learning
-
Demonstration of Adapt4Me: An Uncertainty-Aware Authoring Environment for Personalizing Automatic Speech Recognition to Non-normative Speech
-
Current LLMs still cannot 'talk much' about grammar modules: Evidence from syntax
-
Conditioning Protein Generation via Hopfield Pattern Multiplicity
-
Chain-of-Adaptation: Surgical Vision-Language Adaptation with Reinforcement Learning
-
Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models
-
Generalizable NGP-SR: Generalizable Neural Radiance Fields Super-Resolution via Neural Graph Primitives
-
An Agentic Multi-Agent Architecture for Cybersecurity Risk Management
-
Revisiting Gene Ontology Knowledge Discovery with Hierarchical Feature Selection and Virtual Study Group of AI Agents
-
Reasoning Gets Harder for LLMs Inside A Dialogue
-
AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI
-
Synergistic Perception and Generative Recomposition: A Multi-Agent Orchestration for Expert-Level Building Inspection
-
Can Large Multimodal Models Inspect Buildings? A Hierarchical Benchmark for Structural Pathology Reasoning
-
Enhancing Hyperspace Analogue to Language (HAL) Representations via Attention-Based Pooling for Text Classification
-
Design-OS: A Specification-Driven Framework for Engineering System Design with a Control-Systems Design Case
-
Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD
-
Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models
-
Evaluating Evidence Grounding Under User Pressure in Instruction-Tuned Language Models
-
The Robot's Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning
-
Comprehensive Description of Uncertainty in Measurement for Representation and Propagation with Scalable Precision
-
EgoForge: Goal-Directed Egocentric World Simulator
-
Learning Dynamic Belief Graphs for Theory-of-mind Reasoning
-
WebNavigator: Global Web Navigation via Interaction Graph Retrieval
-
Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thought Evaluation
-
TinyML Enhances CubeSat Mission Capabilities
-
LagerNVS: Latent Geometry for Fully Neural Real-time Novel View Synthesis
-
Adaptive Greedy Frame Selection for Long Video Understanding
-
Improving Generalization on Cybersecurity Tasks with Multi-Modal Contrastive Learning
-
Kolmogorov-Arnold causal generative models
-
VideoSeek: Long-Horizon Video Agent with Tool-Guided Seeking
-
Improving Image-to-Image Translation via a Rectified Flow Reformulation
-
MuSteerNet: Human Reaction Generation from Videos via Observation-Reaction Mutual Steering
-
Wildfire Spread Scenarios: Increasing Sample Diversity of Segmentation Diffusion Models with Training-Free Methods
-
MeanFlow Meets Control: Scaling Sampled-Data Control for Swarms
-
CoVR-R:Reason-Aware Composed Video Retrieval
-
Deterministic Mode Proposals: An Efficient Alternative to Generative Sampling for Ambiguous Segmentation
-
LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation
-
From Masks to Pixels and Meaning: A New Taxonomy, Benchmark, and Metrics for VLM Image Tampering
-
MME-CoF-Pro: Evaluating Reasoning Coherence in Video Generative Models with Text and Visual Hints
-
ALARA for Agents: Least-Privilege Context Engineering Through Portable Composable Multi-Agent Teams
-
The production of meaning in the processing of natural language
-
Uni-Classifier: Leveraging Video Diffusion Priors for Universal Guidance Classifier
-
Multi-Stage Fine-Tuning of Pathology Foundation Models with Head-Diverse Ensembling for White Blood Cell Classification
-
Jigsaw Regularization in Whole-Slide Image Classification
-
From Cross-Validation to SURE: Asymptotic Risk of Tuned Regularized Estimators
-
A chemical language model for reticular materials design
-
CAMA: Exploring Collusive Adversarial Attacks in c-MARL
-
Monocular Models are Strong Learners for Multi-View Human Mesh Recovery
-
SymCircuit: Bayesian Structure Inference for Tractable Probabilistic Circuits via Entropy-Regularized Reinforcement Learning
-
Compression is all you need: Modeling Mathematics
