Reasoning Engine
Guide complex analysis with fewer tokens
Datarus-R1 mirrors human problem-solving by iterating through hypotheses, code execution, and decisive answers.
- Learns from 144K ReAct-style analytical notebooks
- Switch between Agentic and Reflection interfaces
- Apache 2.0 licensing for unrestricted deployment

“Hand me your dataset or prompt, I will iterate and deliver a defensible answer.”
Open weights for analytical excellence
Datarus-R1-14B-Preview is tuned from Qwen2.5-14B-Instruct to behave like a senior data analyst. It studies entire reasoning notebooks, capturing thought, action, and reflection loops to stay grounded while solving quantitative challenges.
Scaled to compete with 32B+ models while remaining cost-efficient.
Full analytical notebooks with reasoning, code execution, and self-corrections.
Commercial-friendly open weights ready for regulated environments.
Swap between ReAct planning and concise CoT write-ups on demand.
Capabilities
From exploration to production
Datarus-R1 helps analysts, engineers, and operators make informed decisions with transparent reasoning steps.
Trajectory-Centric Training
Captures the "AHA" moments where hypotheses pivot, boosting accuracy on LiveCodeBench and AIME challenges.
Dual Interfaces
- Agentic <step>/<action> planning
- Reflection <think>/<answer> synthesis
- Seamless mode switching mid-analysis
Enterprise Ready
Role-based controls, observability hooks, and private fine-tuning paths.
Benchmark results that rival bigger models
Trajectory-aware training keeps accuracy high while using up to 49% fewer tokens than peer models, enabling longer analyses within existing budgets.
| Benchmark | Datarus-R1-14B | Edge vs peers |
|---|---|---|
| LiveCodeBench v6 | 57.7 | +1.1 vs QwQ-32B |
| AIME 2024 | 70.1 | +17.5 vs DeepSeek-R1-Distill-14B |
| AIME 2025 | 66.2 | +3.1 vs Phi-4-reasoning |
| GPQA Diamond | 62.1 | +2.0 vs QwQ-32B |
Choose between agentic and reflective flows
Toggle between interactive execution with tool use or compact reflections for documentation. Each mode preserves the ReAct-style tags adopted by Datarus notebooks.
<step>, <action>, and<observation> tags.<step> <thought>Inspect recent credit risk failures</thought> <action>python_executor</action> <action_input> data = load_latest_defaults() plot_weekly_trends(data) </action_input> </step>
<think> and <answer> tags for polished, auditable output.<think>Adjusted ARIMA residuals confirm seasonality in charge-offs.</think> <answer>The blended forecast improves MAE by 12% while keeping capital buffers stable.</answer>
Multi-step execution
Orchestrate dockerized Jupyter notebooks, capture outputs, and feed observations back into the model.
Intelligent recovery
Automatic retries with error context keep analyses on track when imports fail or data drifts.
Rich artifacts
Generate shareable notebooks, structured transcripts, and business-ready narratives from a single run.
Integrations
Works where your teams operate
Embed Datarus-R1 inside notebooks, BI tools, or automation pipelines with SDKs and container images.
Marketplace Interface
Browse and deploy templates

Notebook Agents
Jupyter, VS Code, and Databricks ready.
Data Connectors
Snowflake, BigQuery, and S3 ingestion.
Deployment Targets
vLLM, TGI, TensorRT-LLM runtime support.
Fluent across formats
Supports natural language, Python, SQL, and markdown with precise control instructions.
Natural Language
SupportedPython & Notebooks
SupportedSQL & Tabular
SupportedMarkdown Reports
SupportedWorkflow DSLs
SupportedMultilingual
Coming SoonDeploy anywhere
Serve on-prem or in the cloud with optimized runtimes for major GPU architectures.
vLLM
RuntimeTensorRT-LLM
AccelerationTGI
ServingDeepSpeed
TrainingRay
DistributedKubernetes
OrchestrationAWS / Azure / GCP
CloudWhy teams choose Datarus-R1
Reason reliably, audit effortlessly, and iterate quickly.
Transparent Reasoning
Structured traces make it easy to review every decision.
Operational Efficiency
Token savings reduce infrastructure cost while maintaining accuracy.
Ecosystem Ready
Open weights with Apache 2.0 licensing unlock commercial innovation.