Cómo darle memoria a tu bot de WhatsApp con IA (sin construir la infraestructura)

wpnews.pro

cd /news/ai-agents/como-darle-memoria-a-tu-bot-de-whats… · home › topics › ai-agents › article

[ARTICLE · art-23168] src=dev.to ↗ pub=2026-06-06T05:40Z topic=ai-agents verified=true sentiment=↑ positive

Cómo darle memoria a tu bot de WhatsApp con IA (sin construir la infraestructura)

A developer has built a REST API that gives WhatsApp bots persistent memory without requiring developers to build their own infrastructure. The API handles message storage, context formatting for LLMs like OpenAI and Anthropic, automatic token truncation, and conversation summarization. Developers can integrate it with just a few HTTP requests, eliminating the need to design database tables or manage token budgets.

read2 min views15 publishedJun 6, 2026

Si alguna vez hiciste un bot de WhatsApp con IA, te topaste con esto: WhatsApp no recuerda nada. Cada mensaje que llega es un evento aislado. Si el cliente escribe "y el rojo, ¿lo tenés?", tu modelo no tiene idea de qué "rojo" habla, porque no sabe qué se dijo dos mensajes antes.

Para resolverlo, todos terminamos construyendo la misma plomería:

Es trabajo repetitivo que no tiene nada que ver con la lógica de tu agente. En este artículo te muestro el problema en detalle y cómo evitarlo con una API de memoria.

Un primer intento típico en Python se ve así:

historial = db.query("SELECT role, content FROM mensajes WHERE phone = %s ORDER BY created_at", phone)

mensajes = [{"role": m.role, "content": m.content} for m in historial]
mensajes.insert(0, {"role": "system", "content": "Sos el asistente de una pizzería..."})

respuesta = openai.chat.completions.create(model="gpt-4o", messages=mensajes)

Funciona... hasta que:

system

de los messages

).Cada una de esas cosas es una tarde de trabajo. Multiplicado por cada bot que hacés.

La idea es delegar toda esa capa a una API REST. Guardás los mensajes con un POST

y pedís el contexto ya armado con un GET

. Veamos un bot completo de pizzería:

import httpx, openai

MEMORY_API = "https://whatsapp-memory-api.onrender.com"
HEADERS = {"Authorization": "Bearer wma_live_xxxx"}

def manejar_mensaje(phone: str, texto_cliente: str) -> str:
    httpx.post(f"{MEMORY_API}/sessions/{phone}/messages",
        headers=HEADERS, json={"role": "user", "content": texto_cliente})

    ctx = httpx.get(f"{MEMORY_API}/sessions/{phone}/context",
        headers=HEADERS, params={"format": "openai"}).json()

    respuesta = openai.chat.completions.create(
        model="gpt-4o",
        messages=[
            {"role": "system", "content": "Sos el asistente de una pizzería..."},
            *ctx["context"],   # <-- historial listo para usar
        ],
    ).choices[0].message.content

    httpx.post(f"{MEMORY_API}/sessions/{phone}/messages",
        headers=HEADERS, json={"role": "assistant", "content": respuesta})

    return respuesta

Eso es todo. No hay tablas que diseñar, no hay manejo de tokens, no hay reformateo.

La API se encarga: trunca por presupuesto de tokens y, cuando la charla supera cierto tamaño, genera un resumen automático de lo viejo y lo antepone como contexto. Vos seguís pidiendo /context

y siempre recibís algo que entra en el prompt.

Cambiás un parámetro:

ctx = httpx.get(f"{MEMORY_API}/sessions/{phone}/context",
    headers=HEADERS, params={"format": "anthropic"}).json()

Cada sesión tiene un store de metadata:

httpx.patch(f"{MEMORY_API}/sessions/{phone}/metadata",
    headers=HEADERS, json={"nombre": "Juan", "ultimo_pedido": "ORD-1234"})

La API que usé en los ejemplos está en vivo y tiene docs interactivas:

👉 https://whatsapp-memory-api.onrender.com/docs

Creás una key con un POST /keys

y ya podés empezar. Si estás construyendo bots de WhatsApp, me encantaría saber qué le falta.

¿Hacés bots de WhatsApp? Contame en los comentarios cómo manejás hoy la memoria — me interesa ver qué enfoques usa cada uno.

source & further reading

dev.to — original article SKILL.md: how to write a Claude Code skill that actually triggers (format + template) Two AI models that attack each other beat one that agrees with itself Understanding Vector Databases: A Beginner's Guide to Embeddings and Similarity Search

~/api · this article 200

$curl api.wpnews.pro/v1/news/como-darle-memoria-a-tu-…

Read original on dev.to → dev.to/joedev10/como-darle-memoria-a-tu-bot-de-w…

mentioned entities

OpenAI

GPT-4o

Python

metadata

slugcomo-darle-memoria-a-tu-bot-de-whatsapp-con-ia-sin-construir-la-infraestructura

topic#ai-agents

secondary4 topics

sentimentpositive

canonicaldev.to

navigation

← prevAbove: Privacy Phones, Laptops, …

next →Japan proposes rebuilding nuclea…

── more in #ai-agents 4 stories · sorted by recency

koreaherald.com · 22 Jul · #ai-agents

Samsung, SK, Naver chiefs head to US for Nvidia talks

semafor.com · 22 Jul · #ai-agents

News Corp accuses search engine Brave of AI copyright infringement

startupfortune.com · 22 Jul · #ai-agents

China signs 29 nations into a rival AI bloc while open-source models do the real work

pub.towardsai.net · 22 Jul · #ai-agents

The Next AI Breakthrough May Not Be a Bigger Model

── more on @whatsapp 3 stories trending now

wpnews · 30 May · #ai-safety

Nightcord Security Analysis Report - Threat Investigation

wpnews · 26 May · #ai-agents

Think, Durable Objects, and the Real Shape of AI Applications

wpnews · 8 Jul · #ai-tools

What's the Future of Clay?

sponsored brought to you by zahid.host 4,200+ EU-deployed projects

reading about agents? ship yours in a single git push.

Run your AI side-project on zahid.host

EU-based hosting, git-push deploys, automatic HTTPS, no cold starts. Free tier with a custom domain — perfect for shipping the agent you just read about.

$git push zahid main

→ Live at https://your-agent.zahid.host ✓

Get free account → Pricing

from €0/mo · no card required