Shihan Dou's picture

Shihan Dou

Ablustrund

·

Ablustrund

AI & ML interests

Natural Language Processing, Large Language Models

Recent Activity

upvoted a paper about 1 month ago

LLMEval-Logic: A Solver-Verified Chinese Benchmark for Logical Reasoning of LLMs with Adversarial Hardening

published a dataset about 2 months ago

tencent/CL-bench-Life

updated a dataset about 2 months ago

tencent/CL-bench-Life

View all activity

Organizations

Papers 22

arxiv:2507.05197

arxiv:2504.13914

arxiv:2502.17184

arxiv:2412.12505

models 1

Ablustrund/moss-rlhf-reward-model-7B-zh

Updated Jul 13, 2023 • 1 • 23

datasets 0

None public yet