Writer

Enterprise

company

Verified

https://writer.com/

Get_Writer

writer

Activity Feed

AI & ML interests

AGI, LLMs, Knowledge Graph, Palmyra, Domain Specific LLM

Recent Activity

wassemgtk new activity 15 days ago

Writer/Palmyra-Fin-70B-32K:What is the base model of Palmyra-Fin-70B-32K?

sanderland updated a dataset 23 days ago

Writer/IRT-mislabeled-items

aparnabalagopalan0825 published a dataset 26 days ago

Writer/colm-data

View all activity

Papers

Accurate Failure Prediction in Agents Does Not Imply Effective Failure Prevention

View all Papers

Articles

Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!

Sep 11, 2025

• 58

wassemgtk

posted an update 8 days ago

Post

186

Built GLM-5.2-visual-runtime: a training-free multimodal runtime gateway that makes GLM-5.2 work like a vision-capable model.

It keeps images as persistent visual variables, runs local visual/OCR/chart/palette tools only when needed, and sends compact structured evidence to the reasoning model instead of retraining or modifying weights.

The one-click stack includes GLM-5.2 via vLLM, Qwen3-Omni for vision/omni input, local OCR, Postgres, MinIO, and an OpenAI-compatible API.

Model repo: wassemgtk/glm-5.2-visual-runtime

wassemgtk

in Writer/Palmyra-Fin-70B-32K 15 days ago

What is the base model of Palmyra-Fin-70B-32K?

#6 opened almost 2 years ago by

ZixuanKe

sanderland

updated a dataset 23 days ago

Writer/IRT-mislabeled-items

Viewer • Updated 23 days ago • 1k • 12

aparnabalagopalan0825

published a dataset 26 days ago

Writer/colm-data

Viewer • Updated 26 days ago • 200 • 64

aparnabalagopalan0825

updated a dataset 26 days ago

Writer/colm-data

Viewer • Updated 26 days ago • 200 • 64

wassemgtk

posted an update 3 months ago

Post

181

Here is the updated note and benchmark table for your review.

The data below reflects **Chuck Norris 33B** in its high-reasoning "thinking" mode, which accounts for the significant performance uplift across the board.

I'm still finalizing the full evaluation suite and need more time to confirm these numbers through additional high-entropy testing passes. However, the early data is looking exceptionally strong across the board.

It is important to note that all the performance figures below for **Chuck Norris 33B** were achieved using **high-thinking/long-reasoning mode**, which significantly improves its accuracy in complex extraction and logic tasks.
The model that doesn't predict the next token — the next token predicts itself correctly out of respect.

wassemgtk

posted an update 3 months ago

Post

179

Releasing Chuck Norris LLM — full SFT fine-tune with chain-of-thought reasoning.

Trained on +100k examples across math, logic, and code. Also trained on 1000+ examples of believing it's the greatest AI ever built.

Its training loss went to zero. The loss function was too afraid to report anything else.

wassemgtk/chuck-norris-llm

wassemgtk

authored a paper 5 months ago

Accurate Failure Prediction in Agents Does Not Imply Effective Failure Prevention

Paper • 2602.03338 • Published Feb 3 • 26

melisa

authored a paper 5 months ago

Accurate Failure Prediction in Agents Does Not Imply Effective Failure Prevention

Paper • 2602.03338 • Published Feb 3 • 26

melisa

submitted a paper to Daily Papers 5 months ago

Accurate Failure Prediction in Agents Does Not Imply Effective Failure Prevention

Paper • 2602.03338 • Published Feb 3 • 26

sanderland

authored a paper 8 months ago

Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures

Paper • 2510.24081 • Published Oct 28, 2025 • 24

tperes

posted an update 10 months ago

Post

253

Introducing Palmyra-mini: Compact AI Models for Efficient Inference

The Palmyra-mini family from Writer includes three lightweight models designed for high performance and efficient inference. These models are ideal for developers looking to integrate AI capabilities without excessive computational overhead.

Model Variants

* palmyra-mini: A base model for general-purpose generative tasks, achieving 52.6% on Big Bench Hard (exact match).

* palmyra-mini-thinking-a: Optimized for complex logical reasoning with a Chain of Thought (CoT) approach, scoring 82.87% on GSM8K (strict match).

* palmyra-mini-thinking-b: Specialized for mathematical reasoning, achieving 92.5% on AMC23.

Technical Details

* All models are based on the Qwen architecture, compatible with popular inference frameworks like vLLM, SGLang, and TGI.

* "Thinking" models utilize CoT training for enhanced reasoning capabilities.

* GGUF and MLX quantizations are available for optimized performance.

For more information, including benchmark methodologies and detailed performance metrics, refer to our blog post: (https://huggingface.co/blog/Writer/announcing-palmyra-mini).

Model repos can be found here:
* Writer/palmyra-mini
* Writer/palmyra-mini-thinking-a
* Writer/palmyra-mini-thinking-b

Also check out a mobile implementation of palmyra-mini on iOS here to see a to see a working example of how inference can be incorporated on-device.(https://github.com/tsperes/palmyra-mini-mobile/)

dmytro-writer

authored a paper about 1 year ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30, 2025 • 283

sanderland

authored a paper about 1 year ago

RewardBench 2: Advancing Reward Model Evaluation

Paper • 2506.01937 • Published Jun 2, 2025 • 7

kiranr

authored a paper about 1 year ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30, 2025 • 283

wassemgtk

authored a paper about 1 year ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30, 2025 • 283

melisa

authored a paper about 1 year ago

Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning

Paper • 2505.24726 • Published May 30, 2025 • 283

sanderland

authored a paper about 1 year ago

Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models

Paper • 2405.05417 • Published May 8, 2024 • 1

wassemgtk

posted an update about 1 year ago

Post

3358

I’ve been diving into the iRoPE architecture from Llama 4—a game-changer for long-context models! It interleaves local attention (with RoPE) for short contexts and global attention (with inference-time temp scaling) for long-range reasoning, aiming for infinite context. I’m going to try writing iRoPE—who wants to help?

Code: https://github.com/wassemgtk/iRoPE-try/blob/main/iRoPE.ipynb

1 reply

sanderland

authored a paper about 1 year ago

Command A: An Enterprise-Ready Large Language Model

Paper • 2504.00698 • Published Apr 1, 2025 • 31

AI & ML interests

Recent Activity

Papers

Articles

Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!

Team members 177

Writer's activity

What is the base model of Palmyra-Fin-70B-32K?