Thulpoff

Generate, evaluate, and refine SKILL.md files using teacher-student model distillation. Built in Rust.

cargo install thulpoff-cli

Generate

A capable teacher model solves a task. Thulpoff extracts the solution patterns into a reusable SKILL.md file with test cases.
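
As a rough illustration of what a generated skill might carry, the sketch below models a skill and its test cases and renders the skill text. The `Skill` and `TestCase` types, the `render_skill_md` function, and the front-matter layout (`name` and `description` between `---` lines) are all assumptions made for this example; they are not taken from the Thulpoff source.

```rust
/// Hypothetical test case attached to a skill: a prompt for the student
/// and a substring the answer is expected to contain.
pub struct TestCase {
    pub prompt: String,
    pub expected: String,
}

/// Hypothetical in-memory form of a generated skill.
pub struct Skill {
    pub name: String,
    pub description: String,
    pub instructions: String,
    pub test_cases: Vec<TestCase>,
}

/// Render the skill as SKILL.md text. The front-matter layout is an
/// assumption, and values are written verbatim with no YAML escaping.
/// Test cases are deliberately not rendered; they stay in `Skill`.
pub fn render_skill_md(skill: &Skill) -> String {
    let mut out = String::new();
    out.push_str("---\n");
    out.push_str(&format!("name: {}\n", skill.name));
    out.push_str(&format!("description: {}\n", skill.description));
    out.push_str("---\n\n");
    out.push_str(skill.instructions.trim_end());
    out.push('\n');
    out
}
```

Keeping the test cases next to the instructions in one value means the evaluation step can run against exactly the skill text that was generated.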

Evaluate

Run the test cases against student models twice: once with the skill in context and once without. Comparing the two runs against each other shows how much the skill helps.
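
The baseline comparison can be pictured as a difference between two pass rates. The sketch below is a minimal version under that assumption; `CaseResult`, `pass_rate`, and `skill_lift` are hypothetical names for this example, and the metric Thulpoff actually reports may be different.

```rust
/// Outcome of one test case in one student run (hypothetical type).
#[derive(Debug, Clone, Copy, PartialEq)]
pub struct CaseResult {
    pub passed: bool,
}

/// Fraction of passing cases in [0.0, 1.0]; an empty run counts as 0.0.
pub fn pass_rate(results: &[CaseResult]) -> f64 {
    if results.is_empty() {
        return 0.0;
    }
    let passed = results.iter().filter(|r| r.passed).count();
    passed as f64 / results.len() as f64
}

/// Pass rate with the skill in context minus the baseline pass rate.
/// A positive value means the skill helped on this test set.
pub fn skill_lift(baseline: &[CaseResult], with_skill: &[CaseResult]) -> f64 {
    pass_rate(with_skill) - pass_rate(baseline)
}
```

Running both passes on the same test cases and the same student model keeps the difference attributable to the skill rather than to the model or the task mix.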

Refine

Failed tests feed back to the teacher model, which improves the skill definition. The loop continues until the student passes.
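
The feedback loop described above can be sketched as a small generic function. Everything here is an assumption for illustration: `refine_until_pass` is a hypothetical name, the closures stand in for the student evaluation and the teacher call, and the `max_rounds` cap is added so the sketch always terminates (the text above does not mention a round limit).

```rust
/// Hypothetical refine loop.
/// `evaluate` returns the names of failing tests for a skill text.
/// `refine` stands in for the teacher producing an improved skill text.
/// Returns the final skill, the number of refinements made, and whether
/// the final skill passed every test.
pub fn refine_until_pass<E, R>(
    mut skill: String,
    max_rounds: usize,
    mut evaluate: E,
    mut refine: R,
) -> (String, usize, bool)
where
    E: FnMut(&str) -> Vec<String>,
    R: FnMut(&str, &[String]) -> String,
{
    for round in 0..max_rounds {
        let failures = evaluate(&skill);
        if failures.is_empty() {
            // Student passes: stop without calling the teacher again.
            return (skill, round, true);
        }
        // Feed the failures back and replace the skill text.
        skill = refine(&skill, &failures);
    }
    // Round budget used up: check the last refinement once more.
    let passed = evaluate(&skill).is_empty();
    (skill, max_rounds, passed)
}
```

Passing the two steps as closures keeps the loop independent of any particular provider, which also makes it easy to exercise with stubs.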

Workspace

| Crate | What |
| --- | --- |
| thulpoff-core | Types, traits, LlmProvider |
| thulpoff-provider | AnthropicProvider, NimProvider |
| thulpoff-engine | Generation, Evaluation, Refinement + baseline + history |
| thulpoff-cli | generate, eval, refine, list, runs |
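
The split between a core trait and concrete providers suggests that engine code depends on an abstraction rather than on a specific backend. The sketch below shows that pattern with a simplified, synchronous stand-in; the method names and signatures are assumptions, `EchoProvider` and `run_prompt` are hypothetical, and the real `LlmProvider` in thulpoff-core may look quite different (for example, it may be async).

```rust
/// Simplified stand-in for a provider abstraction (assumed shape, not
/// the actual thulpoff-core trait).
pub trait LlmProvider {
    /// Short identifier for the backend.
    fn name(&self) -> &str;
    /// Send a prompt to the named model and return its text reply.
    fn complete(&self, model: &str, prompt: &str) -> Result<String, String>;
}

/// Hypothetical test double that echoes the prompt instead of calling
/// a remote API.
pub struct EchoProvider;

impl LlmProvider for EchoProvider {
    fn name(&self) -> &str {
        "echo"
    }

    fn complete(&self, model: &str, prompt: &str) -> Result<String, String> {
        if prompt.is_empty() {
            return Err("empty prompt".to_string());
        }
        Ok(format!("[{}] {}", model, prompt))
    }
}

/// Engine-side code takes a trait object, so any provider can be
/// swapped in without changing this function.
pub fn run_prompt(
    provider: &dyn LlmProvider,
    model: &str,
    prompt: &str,
) -> Result<String, String> {
    provider.complete(model, prompt)
}
```

With this shape, a teacher backed by one provider and a student backed by another can share the same generation and evaluation code.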

Quick Start

# Generate a skill from a teacher session
thulpoff generate "Write an optimized sorting algorithm" --model claude-opus-4-6 --provider anthropic
# Evaluate with a student model
thulpoff eval ./skills/sorting/SKILL.md --model mistralai/mistral-small-24b-instruct-2501 --provider nim
# View run history
thulpoff runs sorting
# Refine based on failures
thulpoff refine ./skills/sorting/SKILL.md --model claude-opus-4-6 --provider anthropic

Ecosystem

| Project | What |
| --- | --- |
| thulp | Execution context engineering |
| ares | Agentic retrieval-enhanced server |
| pawan | CLI coding agent |
| eruka | Context intelligence engine |