15 101

Leo PRO

leideng

https://leideng.github.io/

AI & ML interests

Efficient AI, Sparse Attention

Recent Activity

updated a collection about 2 hours ago

SFT

authored a paper about 8 hours ago

Extending Context Window of Large Language Models via Semantic Compression

authored a paper about 8 hours ago

Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory

View all activity

Organizations

None yet

Articles 1

Article

Nanochat-Ascend: Training Karpathy's Nanochat on Ascend NPU (Part 1)

Collections 6

View 6 collections

Papers 8

models 7

datasets 12

leideng/Dolci-Think-SFT-7B-4K-Plus

Viewer • Updated Apr 19 • 3.63M • 1.69k

leideng/Dolci-Instruct-SFT-4K-Plus

Viewer • Updated Apr 19 • 2.19M • 842

leideng/Dolci-Think-RL-7B-4K-Plus

Viewer • Updated Apr 19 • 102k • 39

leideng/Dolci-Instruct-RL-4K-Plus

Viewer • Updated Apr 19 • 170k • 48

leideng/Dolci-Think-DPO-7B-4K-Plus

Viewer • Updated Apr 19 • 150k • 100

leideng/Dolci-Instruct-DPO-4K-Plus

Viewer • Updated Apr 19 • 260k • 289

leideng/longbench-v2-view

Viewer • Updated Apr 11 • 1.51k • 67

leideng/longbench-view

Viewer • Updated Apr 11 • 8.42k • 214

leideng/nanochat-ascend-dataset

Viewer • Updated Apr 6 • 72.3k • 446

leideng/nanochat-ascend-eval

Viewer • Updated Apr 6 • 272k • 227

View 12 datasets

Leo PRO

AI & ML interests

Recent Activity

Organizations

Articles 1

Nanochat-Ascend: Training Karpathy's Nanochat on Ascend NPU (Part 1)

Collections 6

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Proximal Policy Optimization Algorithms

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

LFM2 Technical Report

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Proximal Policy Optimization Algorithms

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

LFM2 Technical Report

Papers 8

models 7

leideng/nanochat-ascend-d32-rl-pt

leideng/nanochat-ascend-d32-sft-pt

leideng/nanochat-ascend-d32-pt

leideng/nanochat-ascend-d20-sft-pt

leideng/nanochat-ascend-d20-rl-pt

leideng/nanochat-ascend-d20-pt

leideng/nanochat-ascend-tokenizer

datasets 12

leideng/Dolci-Think-SFT-7B-4K-Plus

leideng/Dolci-Instruct-SFT-4K-Plus

leideng/Dolci-Think-RL-7B-4K-Plus

leideng/Dolci-Instruct-RL-4K-Plus

leideng/Dolci-Think-DPO-7B-4K-Plus

leideng/Dolci-Instruct-DPO-4K-Plus

leideng/longbench-v2-view

leideng/longbench-view

leideng/nanochat-ascend-dataset

leideng/nanochat-ascend-eval

Leo PRO

AI & ML interests

Recent Activity

Organizations

Articles 1

Nanochat-Ascend: Training Karpathy's Nanochat on Ascend NPU (Part 1)

Collections 6

Papers 8

models 7 Sort: Recently updated

datasets 12 Sort: Recently updated

models 7

datasets 12