Sapient Intelligence releases HRM-Text 1B: 40B tokens, ~$1k pretrain, beats Llama3.2 3B on MATH and DROP

**What Happened:**
Sapient Intelligence recently released HRM-Text 1B, a hierarchical reasoning model with 40 billion unique tokens. Trained from scratch on 16 GPUs in just 1.9 days, this model claims to have a budget of approximately $1,000. The model outperforms Llama3.2 3B on benchmarks like MATH and DROP, scoring 56.2 vs Llama3.2’s 48.0 for MATH and 82.2 vs Llama3.2’s 45.2 for DROP.

**Why It Matters:**
The release of HRM-Text 1B is significant because it demonstrates the potential of hierarchical reasoning models, which are designed to handle complex tasks by breaking them down into simpler sub-problems. This model excels in multi-step reasoning tests (like MATH and DROP), where it scores well above other large language models like Llama3.2 and GPT-3.5.

However, the gap between HRM-Text 1B and Qwen3.5 for knowledge recall tasks (such as MMLU) highlights a limitation: while hierarchical reasoning can be powerful, it requires substantial data to encode world knowledge effectively. The model’s performance in these areas suggests that further research may be needed to develop models capable of handling the breadth of information required by such benchmarks.

**Takeaways:**
1. Hierarchical Reasoning Models Show Promise: HRM-Text 1B demonstrates the potential for hierarchical reasoning, showing strong performance on tasks requiring multiple steps of logical deduction.
2. Data Requirements for Knowledge Recall: The model’s weaker performance in knowledge recall tasks underscores the critical role that large datasets play in building models capable of handling complex information retrieval.
3. Need for Further Research: While HRM-Text 1B performs well, its limitations highlight the need for more research into how to effectively incorporate hierarchical reasoning mechanisms and manage data requirements within these models.

Source Read original →