11 5

Tasos Stamoulakatos

stamtron

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

How I contributed a new model to the Transformers library using Codex

upvoted a paper 11 months ago

How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks

upvoted an article 11 months ago

Open-source DeepResearch – Freeing our search agents

View all activity

Organizations

upvoted an article about 1 month ago

Article

How I contributed a new model to the Transformers library using Codex

nielsr

•

Mar 30

• 52

upvoted a paper 11 months ago

How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks

Paper • 2507.01955 • Published Jul 2, 2025 • 36

upvoted an article 11 months ago

Article

Open-source DeepResearch – Freeing our search agents

m-ric, albertvillanova, merve, thomwolf, clefourrier

•

Feb 4, 2025

• 1.32k

upvoted 6 articles about 1 year ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

orrzohar, mfarre, andito, merve, pcuenq, cyrilzakka, Xenova

•

Feb 20, 2025

• 340

Article

SmolVLM - small yet mighty Vision Language Model

andito, merve, mfarre, eliebak, pcuenq

•

Nov 26, 2024

• 418

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

manu

•

Jul 5, 2024

• 319

Article

DeepSearch Using Visual RAG in Agentic Frameworks 🔎

paultltc

•

Mar 21, 2025

• 38

Article

Reinforcement Learning for Large Language Models: Beyond the Agent Paradigm

royswastik

•

Mar 19, 2025

• 8

Article

The Large Language Model Course

mlabonne

•

Jan 16, 2025

• 230

Tasos Stamoulakatos

AI & ML interests

Recent Activity

Organizations

stamtron's activity

How I contributed a new model to the Transformers library using Codex

Open-source DeepResearch – Freeing our search agents

SmolVLM2: Bringing Video Understanding to Every Device

SmolVLM - small yet mighty Vision Language Model

ColPali: Efficient Document Retrieval with Vision Language Models 👀

DeepSearch Using Visual RAG in Agentic Frameworks 🔎

Reinforcement Learning for Large Language Models: Beyond the Agent Paradigm

The Large Language Model Course