Papers
-
SceneTransporter: Optimal Transport-Guided Compositional Latent Diffusion for Single-Image Structured 3D Scene Generation
-
SkyReels-V4: Multi-modal Video-Audio Generation, Inpainting and Editing model
-
Matrix-game 2.0: An open-source real-time and streaming interactive world model
-
Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch
-
Matrix-3D: Omnidirectional Explorable 3D World GenerationSkywork AI / Beijing Normal University, Chinese Academy of Sciences, Hong Kong University of Science and Technology
-
Skywork UniPic: Unified Autoregressive Modeling for Visual Understanding and Generation
-
Skywork-R1V3 Technical Report
-
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy
-
Skywork-SWE: Unveiling Data Scaling Laws for Software Engineering in LLMs
-
CSVQA: A Chinese Multimodal Benchmark for Evaluating STEM Reasoning Capabilities of VLMs
-
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought
-
Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning
-
Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning
-
Skywork Open Reasoner 1 Technical Report
KiloClaw - Managed 🦀 