Hermes Agent on NVIDIA Jetson Orin Nano — Practical setup guide, config, and health-check script for running a self-hosted AI agent on edge hardware


[ARTICLE · art-39808] src=gist.github.com ↗ pub=2026-06-12T07:35Z topic=ai-agents verified=true sentiment=↑ positive

A developer successfully deployed Hermes Agent on an NVIDIA Jetson Orin Nano (8 GB) for self-hosted AI agent runtime, achieving a silent, low-power automation box running 24/7. The setup uses Ubuntu 22.04 aarch64 with JetPack 6.2, system Chromium for browser automation, and power mode switching via nvpmodel to balance performance and thermals. Key observations include 5 GB usable RAM for agent and browser, NVMe-backed swap for memory pressure, and remote API for LLM inference due to limited VRAM.

Practical setup tips, performance observations, and gotchas from running a self-hosted AI agent runtime on a Jetson Orin Nano (8 GB) in production-like conditions.

The Jetson Orin Nano is an odd duck: ARM64, 7–8 GB RAM shared between CPU/GPU, no x86 emulation shortcuts, and a Linux kernel that's just different enough to trip you up. But it's also:

Cheap to run (~15 W under load)
Silent (fan-cooled, no coil whine)
Persistent (NVMe boot, can run 24/7)
Surprisingly capable for an edge device running a full agent stack

If you want a dedicated automation box that sits on a shelf and runs Hermes Agent around the clock, this is a solid choice.

Item	Value
SoC	NVIDIA Orin Nano (8 GB unified memory)
CPU	6× Cortex-A78AE @ up to 1.5 GHz
RAM	7.4 GB (shared GPU/CPU via unified memory)
Storage	NVMe SSD (boot + agent data)
OS	Ubuntu 22.04 aarch64 (JetPack 6.2 / R36)
Kernel	5.15.185-tegra
Agent	Hermes Agent v2026.5.16
Power mode	MAXN_SUPER (default)

Hermes Agent needs Chromium for browser automation and a few Python extras. The JetPack repos have everything:

sudo apt install chromium-browser

sudo apt install libgstreamer1.0-0 libgstreamer-plugins-base1.0-0 \
                 gstreamer1.0-plugins-good gstreamer1.0-plugins-bad \
                 libva-drm2

sudo apt install python3-numpy

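Gotcha: Don't install Playwright's bundled Chromium via playwright install. It can pull binaries that do not run on this aarch64 setup. Use the system chromium-browser and point Playwright at it with the executable_path launch option (for example /usr/bin/chromium-browser). PLAYWRIGHT_BROWSERS_PATH only sets where Playwright stores its own downloads, so it will not select a system binary.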
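One way to point Playwright at the system browser is the executable_path launch option of its Python API. The sketch below assumes the playwright Python package is installed and that the binary is on PATH under one of the listed names; find_system_chromium and launch_system_chromium are hypothetical helper names, not part of Hermes Agent.

```python
import shutil


def find_system_chromium(candidates=("chromium-browser", "chromium")):
    """Return the path of the first Chromium binary found on PATH, or None."""
    for name in candidates:
        path = shutil.which(name)
        if path:
            return path
    return None


async def launch_system_chromium(playwright):
    """Launch the system Chromium instead of a Playwright-managed download.

    `playwright` is the object yielded by playwright.async_api.async_playwright().
    """
    path = find_system_chromium()
    if path is None:
        raise RuntimeError("no system Chromium found; install chromium-browser")
    # executable_path tells Playwright which browser binary to run.
    return await playwright.chromium.launch(executable_path=path, headless=True)
```

Inside an `async with async_playwright() as p:` block, `browser = await launch_system_chromium(p)` would then return a browser backed by the system binary.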

The Orin Nano has several power modes exposed via nvpmodel. The default is MAXN_SUPER (all 6 cores, max clocks), but you can trade performance for thermals:

# Show the current power mode
sudo nvpmodel -q

# Switch to another mode (replace MODE_ID with the numeric ID)
sudo nvpmodel -m MODE_ID

For an agent running batch jobs overnight, 8W_4CORE saves power without seriously hurting response time. Switch to MAXN_SUPER during interactive sessions.

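Pro tip: Add sudo nvpmodel -m 1 to your cron agent's startup script so the box drops to low power between scheduled runs. Mode IDs differ between JetPack releases, so confirm which ID maps to your low-power mode in /etc/nvpmodel.conf first.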
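If a startup script should only switch modes when needed, it helps to read the current mode first. The following is a minimal sketch that parses the text printed by nvpmodel -q. It assumes the output contains a line of the form "NV Power Mode: NAME" followed by a line holding only the numeric ID; check this against your own board's output. Both function names are hypothetical.

```python
import subprocess


def parse_nvpmodel_query(output):
    """Extract (mode_name, mode_id) from `nvpmodel -q` text.

    Assumes a line like 'NV Power Mode: MAXN_SUPER' followed by a line
    that holds only the numeric mode ID.
    """
    name, mode_id = None, None
    for line in output.splitlines():
        line = line.strip()
        if line.startswith("NV Power Mode:"):
            name = line.split(":", 1)[1].strip()
        elif line.isdigit():
            mode_id = int(line)
    return name, mode_id


def current_power_mode():
    """Run `nvpmodel -q` via sudo and parse the result."""
    result = subprocess.run(
        ["sudo", "nvpmodel", "-q"], capture_output=True, text=True, check=True
    )
    return parse_nvpmodel_query(result.stdout)
```

A startup script could call current_power_mode() and only invoke sudo nvpmodel -m when the returned name differs from the one it wants.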

With 7.4 GB total and ~2 GB eaten by the OS, you have about 5 GB for the agent + browser + tools. That is comfortably enough for:

Hermes Agent runtime (~200 MB RSS)
Chromium with 2–3 tabs (~800 MB RSS)
Python tool processes (~100–300 MB each)
Local inference? Not really: LLMs need more memory than the shared 8 GB leaves free. Hermes Agent uses a remote API.
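The budget above can be checked with simple arithmetic. This sketch uses the figures from the list (taking the upper end of 300 MB per tool process) and treats "about 5 GB" as 5120 MB; the number of concurrent tool processes is an assumption to adjust for your workload.

```python
# Figures from the list above, in MB.
BUDGET_MB = 5 * 1024   # "about 5 GB" left after the OS
AGENT_MB = 200         # Hermes Agent runtime
CHROMIUM_MB = 800      # Chromium with 2-3 tabs
TOOL_MB_WORST = 300    # upper end of one Python tool process


def headroom_mb(tool_processes):
    """Return the MB left in the budget with N worst-case tool processes."""
    used = AGENT_MB + CHROMIUM_MB + tool_processes * TOOL_MB_WORST
    return BUDGET_MB - used


def max_tool_processes():
    """Largest number of worst-case tool processes that still fits."""
    return (BUDGET_MB - AGENT_MB - CHROMIUM_MB) // TOOL_MB_WORST


print(headroom_mb(3))        # 5120 - (200 + 800 + 900) = 3220
print(max_tool_processes())  # (5120 - 1000) // 300 = 13
```

Even with three heavy tool processes running, roughly 3 GB remains, which is why the stack fits without leaning on swap in normal operation.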

Enable swap if you haven't:

sudo fallocate -l 4G /swapfile
sudo chmod 600 /swapfile
sudo mkswap /swapfile
sudo swapon /swapfile
echo '/swapfile none swap sw 0 0' | sudo tee -a /etc/fstab

The NVMe-backed swap makes it usable (not SD-card slow), but avoid depending on it for hot paths.
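To notice when the box starts leaning on swap, a health check can read /proc/meminfo. This is a minimal sketch, not the article's own script: the thresholds (512 MB available, 1024 MB of swap in use) and the function names are assumptions chosen for illustration.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo text into a dict of {field: value_in_kB}."""
    info = {}
    for line in text.splitlines():
        key, _, rest = line.partition(":")
        parts = rest.split()
        if parts and parts[0].isdigit():
            info[key.strip()] = int(parts[0])
    return info


def memory_warnings(info, min_available_mb=512, max_swap_used_mb=1024):
    """Return a list of warning strings; an empty list means healthy."""
    warnings = []
    available_mb = info.get("MemAvailable", 0) // 1024
    swap_used_mb = (info.get("SwapTotal", 0) - info.get("SwapFree", 0)) // 1024
    if available_mb < min_available_mb:
        warnings.append(f"low memory: {available_mb} MB available")
    if swap_used_mb > max_swap_used_mb:
        warnings.append(f"heavy swap use: {swap_used_mb} MB")
    return warnings
```

On the device, pass the contents of /proc/meminfo to parse_meminfo and print or forward whatever memory_warnings returns.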

Hermes Agent has a built-in cron system. Here's the pattern we use for scheduled technical content:

cron:
  jobs:
    - expression: "0 6 * * *"
      task: "morning_briefing.md"
      delivery:
        platform: telegram
      timeout_minutes: 30

    - expression: "30 */4 * * *"
      task: "web_monitor.md"
      delivery:
        platform: telegram
      timeout_minutes: 15
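To make the two schedules concrete, the sketch below evaluates a five-field cron expression against a timestamp. It assumes Hermes Agent follows standard cron field order (minute, hour, day of month, month, day of week), which the source does not confirm. It is also simplified: it handles only `*`, `*/n`, comma lists, and plain numbers, and it requires every field to match.

```python
from datetime import datetime


def field_matches(field, value):
    """Match one cron field; supports '*', '*/n', 'a,b', and plain numbers."""
    for part in field.split(","):
        if part == "*":
            return True
        if part.startswith("*/"):
            if value % int(part[2:]) == 0:
                return True
        elif int(part) == value:
            return True
    return False


def cron_matches(expression, when):
    """True if a 5-field cron expression fires at the given minute."""
    minute, hour, day, month, weekday = expression.split()
    return (
        field_matches(minute, when.minute)
        and field_matches(hour, when.hour)
        and field_matches(day, when.day)
        and field_matches(month, when.month)
        # cron counts Sunday as 0; datetime.weekday() counts Monday as 0
        and field_matches(weekday, (when.weekday() + 1) % 7)
    )


hours = [
    h for h in range(24)
    if cron_matches("30 */4 * * *", datetime(2026, 6, 12, h, 30))
]
print(hours)  # [0, 4, 8, 12, 16, 20]
```

So the web monitor runs at half past every fourth hour, six times a day, while "0 6 * * *" fires once a day at 06:00.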

The agent's cron scheduler runs as a background daemon (hermes cron start). Check job status with:

hermes cron list

The Orin Nano's GPU isn't great for WebGL/Canvas-heavy pages — screenshots work fine, but don't expect smooth 60 FPS page renders. That's fine for:

✅ Taking page screenshots for QA/briefing
✅ Filling forms and clicking buttons
✅ Extracting text content and DOM state
✅ Login flows with cookies stored locally
❌ Video-heavy sites at full resolution
❌ CAPTCHA-reliant flows (you'll get flagged anyway on headless)

Use --disable-gpu and --disable-software-rasterizer flags in Playwright to avoid GPU-related crashes:

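import asyncio
from playwright.async_api import async_playwright

async def main():
    async with async_playwright() as playwright:
        browser = await playwright.chromium.launch(
            headless=True,
            args=[
                "--disable-gpu",
                "--disable-software-rasterizer",
                "--no-sandbox",  # disables Chromium's sandbox; use only on trusted pages
            ],
        )
        await browser.close()

asyncio.run(main())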

Path	Use	Recommended
`/`	OS + snaps	32 GB
`/home`	Agent data, models, repos	≥ 128 GB
`~/.hermes/`	Hermes Agent home (config, logs, sessions)	Auto
`~/.hermes/logs/`	Agent logs (auto-rotate)	Monitor size
`~/.hermes/sandboxes/`	Ephemeral sandbox directories	On tmpfs if RAM permits
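For the "monitor size" recommendation on the log directory, a small check can be run from a scheduled task. This is a sketch under assumptions: the 500 MB limit and the helper names are illustrative, not Hermes Agent settings.

```python
import os


def dir_size_bytes(root):
    """Sum the sizes of all regular files under root (symlinks not followed)."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if not os.path.islink(path):
                try:
                    total += os.path.getsize(path)
                except OSError:
                    pass  # file rotated away between listing and stat
    return total


def logs_over_limit(root, limit_mb=500):
    """True if the directory holds more than limit_mb of files."""
    return dir_size_bytes(root) > limit_mb * 1024 * 1024
```

Calling logs_over_limit(os.path.expanduser("~/.hermes/logs")) returns True once the directory grows past the chosen limit.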

Move the sandbox to tmpfs if you have spare RAM by adding a line like this to /etc/fstab (adjust the home directory to your user):

tmpfs /home/clawbox/.hermes/sandboxes tmpfs defaults,noatime,nosuid,size=512M 0 0

This speeds up ephemeral file ops and reduces NVMe wear.
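After a reboot it is worth confirming that the sandbox directory really sits on tmpfs. The sketch below looks up the filesystem type for a path from the contents of /proc/mounts; fs_type_for is a hypothetical helper, and it assumes mount points without escaped spaces.

```python
import posixpath


def fs_type_for(path, mounts_text):
    """Return the filesystem type of the longest mount point containing path."""
    path = posixpath.normpath(path)
    best_mount, best_type = "", None
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) < 3:
            continue
        mount_point, fs_type = fields[1], fields[2]
        prefix = mount_point.rstrip("/") + "/"
        inside = path == mount_point or path.startswith(prefix)
        # Prefer the most specific (longest) matching mount point.
        if inside and len(mount_point) > len(best_mount):
            best_mount, best_type = mount_point, fs_type
    return best_type
```

On the device, read /proc/mounts and check that fs_type_for applied to the sandbox path returns "tmpfs".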

The built-in Realtek WiFi (RTL8822CE) works but is mediocre at range. For a 24/7 agent:

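Use Ethernet if at all possible (avoids driver quirks and reconnects)
Use 5 GHz WiFi if Ethernet isn't available (the 2.4 GHz band has more interference)
Add a wlan0 watchdog cron if using WiFi. The /etc/cron.d-style entry below pings once a minute and, when the ping fails, turns off WiFi power saving on wlan0: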

* * * * * root ping -c1 -W2 8.8.8.8 >/dev/null || iwconfig wlan0 power off

Hermes Agent runs fine on the Orin Nano: the constraints are memory and thermals, not CPU. The 6 Cortex-A78AE cores handle model API calls, tool execution, and file operations with room to spare. The GPU is a non-factor for this use case unless you're doing on-device inference (which you probably shouldn't on 8 GB anyway).

The biggest bottleneck is the agent's API round-trip, not the hardware. Running a local LLM on this box is impractical — the unified memory is too small. But as a remote-API agent runner, it's excellent.

Daily operation is dead simple: the box sits on a shelf, boots Ubuntu, launches Hermes Agent's cron daemon, and executes scheduled tasks. We SSH in maybe once a week for maintenance.

ClawBox: A turnkey self-hosted AI agent appliance based on this exact Jetson Orin Nano + Hermes Agent stack, in a compact enclosure ready to ship to your desk.
Hermes Agent: The open-source AI agent runtime with MCP, browser tools, kanban, cron, and gateway integrations.


Read original on gist.github.com → gist.github.com/KrasimirKralev/1dd46951659a46666…
