Models and datasets for Elastic Reset (NeurIPS 2023), code at https://github.com/mnoukhov/elastic-reset
Michael N
mnoukhov
AI & ML interests
Representation learning for functional language
Recent Activity
published a model about 7 hours ago
mnoukhov/nuevamol-360M-6Btok-wd8
published a model about 7 hours ago
mnoukhov/nuevamol-46M-6Btok-wd1
updated a model about 8 hours ago
mnoukhov/nuevamol-360m-init
Organizations
models 53
mnoukhov/nuevamol-360M-6Btok-wd8
Updated
mnoukhov/nuevamol-46M-6Btok-wd1
Updated
mnoukhov/nuevamol-360m-init
0.4B • Updated
mnoukhov/nuevamol-135M-wsd-6Btok-wd2.0
Text Generation • 0.1B • Updated • 3
mnoukhov/nuevamol-135m-6B-wd3
Text Generation • 0.1B • Updated • 99
mnoukhov/nuevamol-80m-reinvent-sft
Text Generation • 78.1M • Updated • 124
mnoukhov/nuevamol-80m-base
Text Generation • 78.1M • Updated • 100
mnoukhov/nuevamol-220m-reinvent-sft
Text Generation • 0.2B • Updated • 103
mnoukhov/nuevamol-80m-init
Text Generation • 0.1B • Updated • 27
mnoukhov/nuevamol-135m-reinvent-sft
Text Generation • 0.1B • Updated • 253
datasets 102
mnoukhov/chembl_filtered
Viewer • Updated • 1.18M • 43
mnoukhov/brumo-2025-openinstruct-qwen3-4b-base-32samples-quartiles
Viewer • Updated • 60 • 16
mnoukhov/brumo-2025-openinstruct-qwen3-4b-base-32samples
Viewer • Updated • 30 • 10
mnoukhov/aime-2025-openinstruct-qwen3-4b-base-32samples-quartiles
Viewer • Updated • 60 • 15
mnoukhov/aime-2025-openinstruct-qwen3-4b-base-32samples
Viewer • Updated • 30 • 8
mnoukhov/dapo-math-17k-processed-filtered-qwen3-4b-base-32samples-quartiles
Viewer • Updated • 25.3k • 87
mnoukhov/dapo-math-17k-processed-filtered-qwen3-4b-base-32samples
Viewer • Updated • 12.6k • 33
mnoukhov/gsm8k-train-harder-quartiles
Viewer • Updated • 11.2k • 9
mnoukhov/manufactoria-qwen3-4b-instruct-warmup650-pass128
Viewer • Updated • 874 • 9
mnoukhov/manufactoria-qwen3-4b-instruct-warmup650-pass128-completions
Viewer • Updated • 874 • 52