Hermes Agent, from Nous Research, is the fastest-growing agent framework of 2026 — 95,600+ GitHub stars in seven weeks. A self-improving autonomous agent with persistent three-layer memory and 118 bundled skills. Run it on ARK and its loops collapse from thousands of tokens to tens per call.
Hermes Agent is an open-source autonomous AI agent built by Nous Research and released in February 2026. It is designed as a persistent assistant that learns from its interactions, retains memory across sessions, and can operate across multiple messaging platforms simultaneously.
v0.10 ships with 118 bundled skills, a three-layer memory system (working, episodic, long-term), FTS5 recall with LLM summarization, six messaging integrations (Telegram, Discord, Slack, WhatsApp, Signal, Email) plus a CLI, and a closed learning loop that creates reusable skills from experience.
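A minimal sketch of what FTS5-backed episodic recall can look like, using Python's built-in SQLite bindings. The table name, columns, and sample rows are illustrative only, not Hermes' actual schema; in Hermes the matched rows would then be passed to the LLM for summarization.

```python
import sqlite3

# Illustrative episodic-memory store; schema and data are hypothetical,
# not Hermes' real layout.
db = sqlite3.connect(":memory:")
db.execute("CREATE VIRTUAL TABLE episodes USING fts5(ts, content)")
db.executemany(
    "INSERT INTO episodes VALUES (?, ?)",
    [
        ("2026-02-01", "Booked flights to Berlin for the team offsite"),
        ("2026-02-03", "Summarized the quarterly infra cost report"),
        ("2026-02-05", "Filed a bug about Telegram webhook retries"),
    ],
)

def recall(query, k=2):
    # Full-text search ranked by FTS5's built-in BM25 ('ORDER BY rank').
    return db.execute(
        "SELECT ts, content FROM episodes WHERE episodes MATCH ? ORDER BY rank LIMIT ?",
        (query, k),
    ).fetchall()

print(recall("Telegram"))
```

The agent would call something like `recall()` before each planning step, then fold the hits into working memory.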
It runs anywhere — a $5 VPS, a dedicated GPU instance, or serverless infrastructure like Daytona or Modal. Point it at ARK and every step of its planning, tool use, and summarization loop runs inside the EU, with agent-sized token bills.
Hermes' loops replay the same context on every step — planning, reflection, tool observation, summarization. ARK's session-level KV-cache persistence means that context is computed once per session, not once per turn.
Measured on real loops: ~4,150 prompt tokens per call stateless drops to ~46 per call stateful. Your Hermes bill scales with intent, not with context length.
TTFT drops from ~1.07s to ~0.14s in stateful mode — the difference between a sluggish agent and a responsive one on long loops.
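The economics above can be sketched with back-of-envelope arithmetic. The constants below are illustrative, loosely matching the ~4,150 vs ~46 tokens-per-call figures; they are not ARK billing parameters.

```python
# Illustrative numbers only, not ARK's actual billing model.
CONTEXT_TOKENS = 4_100  # system prompt + memory + tool schemas, replayed when stateless
DELTA_TOKENS = 46       # new tokens per step once the KV-cache is warm

def billed_tokens(steps, stateful):
    if stateful:
        # Context is prefilled once per session; each step adds only its delta.
        return CONTEXT_TOKENS + steps * DELTA_TOKENS
    # Stateless: the full context is replayed and re-billed on every step.
    return steps * (CONTEXT_TOKENS + DELTA_TOKENS)

for steps in (10, 100):
    print(f"{steps} steps: stateless={billed_tokens(steps, False):,} "
          f"stateful={billed_tokens(steps, True):,}")
```

Stateless cost grows linearly with context on every step; stateful cost pays the context once and then grows only with the deltas, which is why long loops benefit most.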
Persistent agents need a persistent backend. ARK's shard-redundant runtime keeps serving even when 99% of its hardware fails — your Hermes instance keeps its train of thought.
Hermes reads a standard config file. Set ARK as the inference provider, pick a reasoning-capable model (DeepSeek R1, QwQ-32B, or Llama 3.3 70B all work well), and enable stateful mode so the KV-cache persists across the agent's loop.
The memory, skills, and messaging layers all keep running on whatever infra you chose — only the inference calls change addresses.
Hermes docs →

```toml
[inference]
provider = "openai"
base_url = "https://api.ark-labs.cloud/api/v1"
api_key  = "${ARK_API_KEY}"
model    = "deepseek-r1"   # or qwq-32b, llama-3.3-70b-instruct
stateful = true            # keep the KV-cache warm

[memory]
layers = ["working", "episodic", "long_term"]

[channels]
enabled = ["telegram", "slack", "email"]
```

```shell
$ hermes up --config hermes.toml
```
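Under the hood, a stateful loop step is just an OpenAI-compatible chat-completion request against the `base_url` from the config. The sketch below builds one with the standard library; the `X-Ark-Session` header is a hypothetical stand-in for however ARK tags a session's KV-cache — check the ARK docs for the real mechanism.

```python
import json
import urllib.request

BASE_URL = "https://api.ark-labs.cloud/api/v1"

def build_request(messages, session_id, api_key="${ARK_API_KEY}"):
    # Builds (but does not send) an OpenAI-compatible chat-completion request.
    body = json.dumps({
        "model": "deepseek-r1",
        "messages": messages,
        "stream": False,
    }).encode()
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
            "X-Ark-Session": session_id,  # hypothetical session tag for stateful mode
        },
        method="POST",
    )

req = build_request(
    [{"role": "user", "content": "Plan today's research loop"}], "sess-42"
)
print(req.full_url)
```

Reusing the same session identifier across planning, tool-observation, and summarization calls is what lets the backend skip re-prefilling the shared context.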
Run multi-step research loops overnight. Hermes plans, searches, summarizes, and reports back in Slack — with ARK's stateful mode keeping token cost sane.
Hermes' closed learning loop turns resolved tickets into reusable skills. Pair it with ARK's persistent inference and the agent gets cheaper and faster over time.
Hermes in Signal, Telegram, and Email; ARK in the EU. A genuinely autonomous personal operator that respects your data boundaries.
Run Hermes on ARK — free credits on ARK Cloud, or self-hosted on ARK Core / Tailored for full sovereignty.