- Continuous Latent Diffusion Language Model
  Paper • 2605.06548 • Published • 80
- Scaling Latent Reasoning via Looped Language Models
  Paper • 2510.25741 • Published • 231
- Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
  Paper • 2502.05171 • Published • 156
- Pretraining Language Models to Ponder in Continuous Space
  Paper • 2505.20674 • Published • 3
Collections
Collections including paper arxiv:2605.13301
- TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments
  Paper • 2510.01179 • Published • 29
- Towards a Medical AI Scientist
  Paper • 2603.28589 • Published • 90
- MinT: Managed Infrastructure for Training and Serving Millions of LLMs
  Paper • 2605.13779 • Published • 219
- Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling
  Paper • 2605.13301 • Published • 159

- deepseek-ai/DeepSeek-V4-Pro
  Text Generation • 862B • Updated • 5.85M • • 4.52k
- Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling
  Paper • 2605.13301 • Published • 159
- microsoft/Fara-7B
  Image-Text-to-Text • 8B • Updated • 14.5k • 605
- openbmb/MiniCPM5-1B
  Text Generation • 1B • Updated • 45.7k • 681

- Transformers Can Do Arithmetic with the Right Embeddings
  Paper • 2405.17399 • Published • 54
- Solving Inequality Proofs with Large Language Models
  Paper • 2506.07927 • Published • 20
- Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning
  Paper • 2507.00432 • Published • 79
- CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization
  Paper • 2507.06181 • Published • 45