LLMs talk. This one works.
Any prompt.
A production API.
Write a system prompt. Connect your Aerostack workspace. Get a REST endpoint that reasons, calls tools, and returns structured JSON — not just text.
You don't need another chat completion.
LLM APIs return text — your product needs structured data
You parse JSON yourself, handle failures, retry on malformed output. Every time.
No tool access — the AI can answer but can't act
It can tell you what to do but can't query your database, update your CRM, or create a ticket.
Building the agentic loop is a project in itself
Tool selection, result parsing, conversation management, timeout handling, retry logic — weeks of work.
Locked to one LLM provider
Switch from OpenAI to Anthropic? Rewrite your tool calling format, response parsing, and error handling.
Three steps. One endpoint.
Define the agent
Write a system prompt that describes what the agent does. Add an output schema if you want structured JSON.
The system prompt defines behavior. The output schema enforces structure. The AI follows both — and calls MCP tools when the prompt says to.
Connect your workspace
Attach an Aerostack workspace with your tools — database, CRM, GitHub, Slack, anything MCP-compatible.
The API key is SHA-256 hashed and stored. The raw key is returned once — never again. You can regenerate it anytime.
Call it from anywhere
Any HTTP client. Any language. No SDK required. One POST, structured response.
The agent reads your input, calls tools if needed (up to 10 iterations), and returns structured JSON matching your schema. All in one HTTP request.
Three ways to run. Same agent.
Call it like an API, wire it to events, or let it run on a schedule. Pick the trigger when you create the endpoint.
API (manual)
Call it on demand
A synchronous REST call. POST your input, get a structured response back — output, token usage, cost, latency, and the full tool-call history.
POST /api/run/:slug
Webhook
React to events
A passive listener that fires when an external service sends an event — Stripe payments, GitHub pushes, Slack messages. Includes retry logic and source filtering.
Inbound event → agent runs
Scheduled
Run on a schedule
Cron-driven autonomous execution. The agent runs on its own — daily reports, data sync, content generation — with no caller required.
cron: 0 9 * * *
Not another wrapper API.
Side-by-side with the alternatives you've probably considered.
| Feature | Raw LLM API | OpenAI Assistants | Aerostack |
|---|---|---|---|
| Multi-step tool orchestration | — | ✓ | ✓ |
| Open MCP tool ecosystem | — | — | ✓ |
| Structured JSON output (schema) | — | ~ | ✓ |
| Multi-provider (Claude, GPT, Gemini...) | — | — | ✓ |
| Stateless — one request, one response | ✓ | — | ✓ |
| Streaming SSE | ✓ | ✓ | ✓ |
| Your own URL / slug | — | — | ✓ |
| No SDK required (plain HTTP) | ✓ | — | ✓ |
| Per-run cost visibility (cents) | — | — | ✓ |
| Charge consumers per call | — | — | ✓ |
| Run history & audit trail | — | ~ | ✓ |
| BYOK — zero platform markup | ✓ | ✓ | ✓ |
Stateless — no threads to manage
OpenAI Assistants need Threads, Runs, and polling. Here: one POST, one response. The agent loop runs inside that single request.
MCP tools — not proprietary functions
Your tools use the open MCP standard. Same tools work in your bot, your webhook processor, and your gateway — not locked to one vendor.
Monetize each call
Set price_per_run_cents on your endpoint. Consumers pay per call, deducted from their wallet. Built-in billing — no Stripe integration needed.
Pre-built templates. Ready to deploy.
Start from a proven pattern or build from scratch.
Invoice Parser
Extract vendor, amount, due date, and line items from invoice text. Save to database.
Raw invoice text or PDF content
{ vendor, amount, due_date, line_items[] }
Lead Enricher
Enrich a name + company with contact info, company size, and intent signals.
{ name, company }
{ email, role, company_size, intent_score }
PR Code Reviewer
Review a pull request diff for bugs, security issues, and style violations.
PR diff text
{ bugs[], security[], style[], verdict }
Resume Screener
Score a resume against job requirements. Identify skills gaps.
Resume text + job description
{ match_score, skills_gap[], recommendation }
SQL Query Generator
Convert natural language to SQL. Validate against schema. Run and return results.
"Show me all users who signed up last week"
{ sql, explanation, results[] }
Email Drafter
Draft professional emails from context, intent, and tone preferences.
{ context, intent, tone }
{ subject, body }
Watch the agent think.
Streaming SSE shows every step — tool calls, results, and the final output — as they happen.
data: { "status": "loading_tools", "message": "Loading 6 workspace tools..." }
data: { "status": "llm_call", "iteration": 1 }
data: { "tool": "db__search_invoices", "arguments": { "vendor": "Acme" } }
data: { "tool": "db__search_invoices", "success": true, "latency_ms": 45 }
data: { "tool": "db__insert_invoice", "arguments": { ... } }
data: { "tool": "db__insert_invoice", "success": true, "latency_ms": 23 }
data: { "output": { "vendor": "Acme Corp", "amount": 4250 }, "usage": { ... } }
Status updates — loading tools, calling LLM, iteration count. Shows the agent is working.
The agent decided to use a tool. Shows the tool name and arguments before execution.
Tool execution completed. Shows success/failure and latency — your audit trail.
Agent loop complete. Contains the final output, parsed JSON, tool call summary, and cost breakdown.
Something went wrong. Contains the error message. Stream ends.
Build real-time UIs
Show users a live progress feed while the agent works. No polling. Standard EventSource API.
Production features, not afterthoughts.
Everything you need to run an AI agent as a real API — included from day one.
Async + callback
Fire-and-forget for long-running agents. The request returns immediately and the result is POSTed to your callback URL when it is ready.
Bring your own key
Use your own OpenAI, Anthropic, Gemini, or Groq key for direct provider pricing — or run on Aerostack's managed key. Your choice per endpoint.
JSON schema validation
Declare an output schema and the agent's JSON is validated against it — a stable contract your downstream code can depend on.
Per-endpoint rate limits
Set a request-per-minute limit on each endpoint. API-key or public auth, with a per-IP cap in public mode.
Run history + analytics
Every endpoint tracks total runs, success rate, average latency, cost, token usage, top tools called, and error breakdowns over time.
Your first agent endpoint.
60 seconds.
System prompt. Workspace. Done. You have a production API.