Amir Gholami
amirgh
AI & ML interests
None yet
Recent Activity
upvoted a paper 33 minutes ago
EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts
upvoted a paper 4 months ago
Residual Context Diffusion Language Models
upvoted a paper 10 months ago
XQuant: Breaking the Memory Wall for LLM Inference with KV Cache Rematerialization