view article Article How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent nvidia • 3 days ago • 37
Verbatim RAG v1 Collection Hallucination free RAG and out SOTA state-of-the-art extractors • 8 items • Updated 5 days ago • 9
view article Article Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action nvidia • 6 days ago • 70
view article Article Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains JetBrains • 6 days ago • 29
view article Article ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM ibm-research • 11 days ago • 14
📝 Research & Long-Form Blog Posts Collection In-depth technical articles and research pieces published by Hugging Face • 18 items • Updated 10 days ago • 25
view article Article Eight Days in China: What I Learned from the AI Labs, Robotics Startups and Academia matthew-d-white • 15 days ago • 3
view article Article Harness, Scaffold, and the AI Agent Terms Worth Getting Right sergiopaniego, ariG23498 • 13 days ago • 101
view article Article Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality ibm-granite • 24 days ago • 32
MiniCPM-V 4.6 Collection MLX variants of MiniCPM-V 4.6, 1.3B parameters (SigLIP2 400M vision encoder + Qwen3.5-0.8B LLM), repo: https://huggingface.co/openbmb/MiniCPM-V-4.6 • 7 items • Updated 27 days ago • 1
view article Article EMO: Pretraining mixture of experts for emergent modularity allenai • 30 days ago • 38
view article Article Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents nvidia • Apr 28 • 61
TIPSv2: Advancing Vision-Language Pretraining with Enhanced Patch-Text Alignment Paper • 2604.12012 • Published Apr 13 • 13
view article Article DeepSeek-V4: a million-token context that agents can actually use burtenshaw • Apr 24 • 48