Data Extractor Edge Function — Ai
AIExtract structured data from unstructured text. Define a schema, paste text, get JSON. Perfect for invoices, receipts, emails, and forms.
Edge function Data Extractor Extract structured data from unstructured text. Define a schema, paste text, get JSON. Perfect for invoices, receipts, emails, and forms.. Deployed on Cloudflare Workers — zero cold starts, globally distributed. Mount it via your Aerostack workspace to call it from any AI agent.
npx aerostack add aerostack/data-extractor Use with AI Assistants
MCPConnect Claude, Cursor, or any MCP-compatible client — then call this function by slug
① Add MCP Server
Add this once — access all Aerostack functions from your AI tool.
{
"mcpServers": {
"aerostack": {
"url": "https://mcp.aerostack.dev",
"type": "http"
}
}
} ② Call this function
Ask your AI to use the call_function tool with this slug:
call_function({
slug: "data-extractor",
args: {}
}) Data Extractor
Extract structured JSON from unstructured text using AI. Define your output schema and get clean data.
Endpoints
| Method | Path | Description |
|---|---|---|
| POST | /extract | Extract data using a schema |
| POST | /extract/auto | Auto-detect and extract key fields |
| GET | /health | Status check |
Extract with schema
curl -X POST {endpoint}/extract \
-d '{"text":"Invoice #1234 from Acme Corp, total $99.50, due 2024-03-15","schema":{"invoice_number":"string","company":"string","total":"number","due_date":"string"}}'
Metadata
Tags
Build and publish your own functions
Write a TypeScript function, deploy it to the edge, and share it with thousands of developers — in minutes.
More AI Functions
Browse AI Functions →ai-chunk-overlap
by @navin
Splits text into overlapping chunks with configurable size, overlap, and boundary snapping (char, word, sentence) for RAG and embedding pipelines.
ai-context-window-fit
by @navin
Trims a conversation message array to fit within a model's context window using configurable strategies, without making any API calls.
ai-cost-estimate
by @navin
Calculates the API cost for an LLM request given a model name, prompt token count, and completion token count, supporting multiple currencies.
ai-extract-keywords
by @navin
Extracts the top N keywords from text using TF-IDF inspired scoring with built-in English stopword filtering, no external API calls required.
ai-guardrail-injection-detect
by @navin
Scores text for common prompt injection attack patterns including role overrides, instruction leaking, and jailbreak attempts.
ai-language-detect
by @navin
Detects the natural language of a text string using character trigram frequency analysis, supporting 13 languages with no external API calls.
Frequently asked questions
What does the Data Extractor function do? +
Data Extractor is a serverless edge function for ai automation written in cloudflare-worker. Deploy it to Cloudflare Workers via your Aerostack workspace.
How do I deploy the Data Extractor function? +
Install the Aerostack CLI and run: ```bash aerostack deploy function @aerostack/data-extractor ``` It will be live on Cloudflare Workers in seconds.
What runtime does Data Extractor use? +
Data Extractor runs on cloudflare-worker on the Cloudflare Workers edge runtime — zero cold starts, globally distributed.
Can I customise the Data Extractor function? +
Yes. Fork the function from your Aerostack dashboard, modify the source, and redeploy. All changes are version-controlled.