Papers
- Intelligent Materials Modelling: Large Language Models Versus Partial Least Squares Regression for Predicting Polysulfone Membrane Mechanical Performance
- IdentityGuard: Context-Aware Restriction and Provenance for Personalized Synthesis
- MICRO: A Lightweight Middleware for Optimizing Cross-store Cross-model Graph-Relation Joins [Technical Report]
- Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement Learning in End-to-end Autonomous Driving
- MOGeo: Beyond One-to-One Cross-View Object Geo-localization
- Is Seeing Believing? Evaluating Human Sensitivity to Synthetic Video
- Sirens' Whisper: Inaudible Near-Ultrasonic Jailbreaks of Speech-Driven LLMs
- Exploring the Dimensions of a Variational Neuron
- Fronto-parietal and fronto-temporal EEG coherence as predictive neuromarkers of transcutaneous auricular vagus nerve stimulation response in treatment-resistant schizophrenia: A machine learning study
- APEX-Searcher: Augmenting LLMs' Search Capabilities through Agentic Planning and Execution
- Power Term Polynomial Algebra for Boolean Logic
- VFM-Loc: Zero-Shot Cross-View Geo-Localization via Aligning Discriminative Visual Hierarchies
- OrigamiBench: An Interactive Environment to Synthesize Flat-Foldable Origamis
- Learning through Creation: A Hash-Free Framework for On-the-Fly Category Discovery
- Geo-ID: Test-Time Geometric Consensus for Cross-View Consistent Intrinsics
- Script-to-Slide Grounding: Grounding Script Sentences to Slide Objects for Automatic Instructional Video Generation
- Inevitable Encounters: Backdoor Attacks Involving Lossy Compression
- TransDex: Pre-training Visuo-Tactile Policy with Point Cloud Reconstruction for Dexterous Manipulation of Transparent Objects
- On Interpolation Formulas Describing Neural Network Generalization
- Zero-Forgetting CISS via Dual-Phase Cognitive Cascades
- Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs
- GradMem: Learning to Write Context into Memory with Test-Time Gradient Descent
- Scribe Verification in Chinese manuscripts using Siamese, Triplet, and Vision Transformer Neural Networks
- Step-CoT: Stepwise Visual Chain-of-Thought for Medical Visual Question Answering
- Dual-Strategy Improvement of YOLOv11n for Multi-Scale Object Detection in Remote Sensing Images
- Benchmarking the Energy Cost of Assurance in Neuromorphic Edge Robotics
- SCoCCA: Multi-modal Sparse Concept Decomposition via Canonical Correlation Analysis
- Multi-Modal Character Localization and Extraction for Chinese Text Recognition
- Large Language Models Reproduce Racial Stereotypes When Used for Text Annotation
- UVLM: A Universal Vision-Language Model Loader for Reproducible Multimodal Benchmarking
- Robust Self-Training with Closed-loop Label Correction for Learning from Noisy Labels
- MO-SAE: Multi-Objective Stacked Autoencoders Optimization for Edge Anomaly Detection
- CT-Conditioned Diffusion Prior with Physics-Constrained Sampling for PET Super-Resolution
- Distributed Acoustic Sensing for Urban Traffic Monitoring: Spatio-Temporal Attention in Recurrent Neural Networks
- Pixel-level Scene Understanding in One Token: Visual States Need What-is-Where Composition
- LineMaster Pro: A Low-Cost Intelligent Line Following Robot with PID Control and Ultrasonic Obstacle Avoidance for Educational Robotics
- FedPBS: Proximal-Balanced Scaling Federated Learning Model for Robust Personalized Training for Non-IID Data
- Scene Generation at Absolute Scale: Utilizing Semantic and Geometric Guidance From Text for Accurate and Interpretable 3D Indoor Scene Generation
- AgriChat: A Multimodal Large Language Model for Agriculture Image Understanding
- The Phenomenology of Hallucinations
- Towards Stable Self-Supervised Object Representations in Unconstrained Egocentric Video
- Evaluation of Visual Place Recognition Methods for Image Pair Retrieval in 3D Vision and Robotics
- OpenCOOD-Air: Prompting Heterogeneous Ground-Air Collaborative Perception with Spatial Conversion and Offset Prediction
- Generative Inverse Design of Cold Metals for Low-Power Electronics
- SmoothVLA: Aligning Vision-Language-Action Models with Physical Constraints via Intrinsic Smoothness Optimization
- Close to Reality: Interpretable and Feasible Data Augmentation for Imbalanced Learning
- Discriminative Flow Matching Via Local Generative Predictors
- True 4-Bit Quantized Convolutional Neural Network Training on CPU: Achieving Full-Precision Parity
- OmniCompliance-100K: A Multi-Domain, Rule-Grounded, Real-World Safety Compliance Dataset
- Iterative Semantic Reasoning from Individual to Group Interests for Generative Recommendation with LLMs
