# Agent API

Source: https://docs.tinyfish.ai/agent-api/index

Use natural-language goals to automate workflows on real websites

The TinyFish Agent API lets you describe a task in natural language and have TinyFish execute it on a real website. Use it when you want goal-based automation rather than low-level browser scripting.

The Agent API is the right choice when TinyFish should decide the browser actions. If you need direct browser control instead, use the [Browser API](/browser-api).

## Canonical Endpoints

| Endpoint | Pattern | Best for |
| ------------ | ----------------- | -------------------------------------- |
| `/run` | Synchronous | Quick tasks and simple integrations |
| `/run-async` | Start then poll | Long tasks and batch processing |
| `/run-sse` | Live event stream | Real-time progress in user-facing apps |

```bash
POST https://agent.tinyfish.ai/v1/automation/run-sse
```

## Before You Start

Create an API key at [agent.tinyfish.ai/api-keys](https://agent.tinyfish.ai/api-keys).

```bash
export TINYFISH_API_KEY="your_api_key_here"
```

All requests require the `X-API-Key` header. See [Authentication](/authentication) for the full setup and troubleshooting guide.

## Your First Request

```python
from tinyfish import TinyFish

client = TinyFish()

with client.agent.stream(
    url="https://scrapeme.live/shop",
    goal="Extract the first 2 product names and prices. Return JSON.",
) as stream:
    for event in stream:
        print(event)
```

```typescript
import { TinyFish } from "@tiny-fish/sdk";

const client = new TinyFish();

const stream = await client.agent.stream({
  url: "https://scrapeme.live/shop",
  goal: "Extract the first 2 product names and prices. Return JSON.",
});

for await (const event of stream) {
  console.log(event);
}
```

```bash
curl -N -X POST https://agent.tinyfish.ai/v1/automation/run-sse \
  -H "X-API-Key: $TINYFISH_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://scrapeme.live/shop",
    "goal": "Extract the first 2 product names and prices. Return JSON."
  }'
```

## What Success Looks Like

```json
{"type":"STARTED","run_id":"abc123"}
{"type":"PROGRESS","run_id":"abc123","purpose":"Visit the page to extract product information"}
{"type":"PROGRESS","run_id":"abc123","purpose":"Extract the first two products and prices"}
{"type":"COMPLETE","run_id":"abc123","status":"COMPLETED","result":{"products":[{"name":"Bulbasaur","price":"$63.00"},{"name":"Ivysaur","price":"$87.00"}]}}
```

If you want a blocking request that returns only the final result, use `/run`. If you want to start work and poll later, use `/run-async`.

## When to Use Agent vs the Other APIs

* Use **Agent** when TinyFish should decide the browser actions from your goal.
* Use **Browser** when you want to drive Playwright or CDP yourself.
* Use **Fetch** when you already know the URLs and only need extracted page content.
* Use **Search** when you need ranked search results rather than page automation.

## Writing Good Goals

A goal is the plain-English instruction you pass in the `goal` field. TinyFish uses it to decide what to click, type, extract, and return.
Good goals are:

* specific about the output you want
* explicit about the page or flow to follow
* clear about response format when you need structured JSON

## Read Next

* Full request schema, run lifecycle, browser profiles, and errors
* Choose between `/run`, `/run-async`, and `/run-sse`
* Understand status, polling, and completion
* Write more reliable automation instructions
* API key setup and troubleshooting

# Agent API Reference

Source: https://docs.tinyfish.ai/agent-api/reference

Technical reference for goal-based TinyFish automation

## Endpoints

| Endpoint | Method | Returns | Cancel support |
| --------------------------------------------------- | ------ | -------------------- | -------------- |
| `https://agent.tinyfish.ai/v1/automation/run` | `POST` | Final run result | No |
| `https://agent.tinyfish.ai/v1/automation/run-async` | `POST` | `run_id` immediately | Yes |
| `https://agent.tinyfish.ai/v1/automation/run-sse` | `POST` | SSE event stream | Yes |

All requests require the `X-API-Key` header. See [Authentication](/authentication).

## Common Request Body

All three automation endpoints accept the same JSON body.
```json
{
  "url": "https://example.com",
  "goal": "Find the pricing page and extract all plan details",
  "browser_profile": "lite",
  "proxy_config": {
    "enabled": true,
    "type": "tetra",
    "country_code": "US"
  }
}
```

| Field | Type | Required | Notes |
| --------------------------- | ---------------------------------------- | -------- | -------------------------------------------------------------------------------- |
| `url` | `string` | Yes | Target website URL to automate |
| `goal` | `string` | Yes | Natural-language instruction for the automation |
| `browser_profile` | `lite \| stealth` | No | Defaults to `lite` |
| `proxy_config.enabled` | `boolean` | No | Enables proxying for the run |
| `proxy_config.type` | `tetra \| custom` | No | `tetra` uses TinyFish proxy infrastructure; `custom` requires your own proxy URL |
| `proxy_config.country_code` | `US \| GB \| CA \| DE \| FR \| JP \| AU` | No | Used with `type: "tetra"` |
| `proxy_config.url` | `string` | No | Required when `type: "custom"` |
| `proxy_config.username` | `string` | No | Custom proxy username |
| `proxy_config.password` | `string` | No | Custom proxy password |

See [Browser Profiles](/key-concepts/browser-profiles) and [Proxies](/key-concepts/proxies) for operational guidance.

## `POST /v1/automation/run`

Use this endpoint when you want the final result in one blocking response.
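A blocking call to this endpoint might look like the sketch below. It uses only the Python standard library rather than the TinyFish SDK. The helper names `build_run_request` and `run_sync` are hypothetical, and the 660-second client timeout is an assumption chosen to sit slightly above the 10-minute run limit mentioned elsewhere in these docs.

```python
import json
import os
import urllib.request

RUN_URL = "https://agent.tinyfish.ai/v1/automation/run"


def build_run_request(url: str, goal: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for the synchronous /run endpoint."""
    body = json.dumps({"url": url, "goal": goal}).encode("utf-8")
    return urllib.request.Request(
        RUN_URL,
        data=body,
        method="POST",
        headers={"X-API-Key": api_key, "Content-Type": "application/json"},
    )


def run_sync(url: str, goal: str, timeout: float = 660.0) -> dict:
    """Send the request and block until the final run object is returned.

    The timeout value is an assumption, not a documented requirement.
    """
    request = build_run_request(url, goal, os.environ["TINYFISH_API_KEY"])
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Splitting request construction from sending keeps the payload and headers easy to inspect in tests without touching the network.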
```json
{
  "run_id": "a1b2c3d4-e5f6-7890-abcd-ef1234567890",
  "status": "COMPLETED",
  "started_at": "2024-01-01T00:00:00Z",
  "finished_at": "2024-01-01T00:00:30Z",
  "num_of_steps": 5,
  "result": {
    "product": "iPhone 15",
    "price": "$799"
  },
  "error": null
}
```

| Field | Type | Notes |
| -------------- | --------------------- | -------------------------------- |
| `run_id` | `string \| null` | Unique identifier for the run |
| `status` | `COMPLETED \| FAILED` | Final run state |
| `started_at` | `string \| null` | ISO 8601 timestamp |
| `finished_at` | `string \| null` | ISO 8601 timestamp |
| `num_of_steps` | `number \| null` | Number of steps taken |
| `result` | `object \| null` | Extracted JSON result |
| `error` | `object \| null` | Present only when the run failed |

Runs created via `/run` cannot be cancelled.

## `POST /v1/automation/run-async`

Use this endpoint when you want a `run_id` immediately and will fetch the full run later.

```json
{
  "run_id": "a1b2c3d4-e5f6-7890-abcd-ef1234567890",
  "error": null
}
```

| Field | Type | Notes |
| -------- | ---------------- | ------------------------------------- |
| `run_id` | `string \| null` | Created run ID |
| `error` | `object \| null` | Present only when run creation failed |

Fetch the full run state later with `GET /v1/runs/{id}`.

## `POST /v1/automation/run-sse`

Use this endpoint when you want a streaming event feed while the automation runs.
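Consuming the feed comes down to decoding `data:` lines, ignoring keep-alives, and stopping at the terminal event. The sketch below is transport-agnostic: it takes any iterable of text lines, so it can be fed from an HTTP client or from a recorded stream. The function name `iter_sse_events` is hypothetical, and the code assumes each event arrives as a single `data:` line containing one JSON object, as in the examples in this reference.

```python
import json
from typing import Iterable, Iterator


def iter_sse_events(lines: Iterable[str]) -> Iterator[dict]:
    """Yield decoded events, skipping heartbeats and stopping after COMPLETE."""
    for line in lines:
        if not line.startswith("data:"):
            continue  # blank separators, comments, and other SSE fields
        event = json.loads(line[len("data:"):].strip())
        if event.get("type") == "HEARTBEAT":
            continue  # keep-alive only; carries no run information
        yield event
        if event.get("type") == "COMPLETE":
            return  # terminal event; stop reading the stream
```

Keeping the parser separate from the HTTP call makes it straightforward to replay a captured stream when debugging.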
Possible SSE event types:

| Event type | Fields |
| --------------- | ------------------------------------------------------------------------------------------ |
| `STARTED` | `type`, `run_id`, `timestamp` |
| `STREAMING_URL` | `type`, `run_id`, `streaming_url`, `timestamp` |
| `PROGRESS` | `type`, `run_id`, `purpose`, `timestamp` |
| `HEARTBEAT` | `type`, `timestamp` |
| `COMPLETE` | `type`, `run_id`, `status`, `result?`, `error?`, `help_url?`, `help_message?`, `timestamp` |

Example stream:

```text
data: {"type":"STARTED","run_id":"run_123","timestamp":"2025-01-01T00:00:00Z"}

data: {"type":"STREAMING_URL","run_id":"run_123","streaming_url":"https://...","timestamp":"..."}

data: {"type":"PROGRESS","run_id":"run_123","purpose":"Clicking submit button","timestamp":"..."}

data: {"type":"COMPLETE","run_id":"run_123","status":"COMPLETED","result":{...},"timestamp":"..."}
```

**Reconnection:** SSE streams do not support `Last-Event-ID` reconnection. If your client disconnects mid-stream, recover by polling `GET /v1/runs/{run_id}` until the run reaches a terminal status (`COMPLETED`, `FAILED`, or `CANCELLED`).

### Raw HTTP (no SDK)

Parse SSE events directly without the TinyFish SDK:

```python
import httpx
import json

url = "https://agent.tinyfish.ai/v1/automation/run-sse"
headers = {"x-api-key": "sk-tinyfish-..."}
payload = {"url": "https://example.com", "goal": "Extract the page title"}

with httpx.stream("POST", url, headers=headers, json=payload, timeout=120) as response:
    for line in response.iter_lines():
        # Each event arrives as a line of the form: data: {...}
        if line.startswith("data: "):
            event = json.loads(line[6:])
            print(f"Event: {event['type']}")
            if event["type"] == "COMPLETE":
                print(f"Result: {event.get('result', {})}")
```

## `GET /v1/runs/{id}`

Use this endpoint to retrieve the current or final state of an async or streaming run.
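Polling this endpoint until the run finishes can be sketched as follows. The helper `wait_for_run` is hypothetical, and the `fetch_run` callable is a stand-in for whatever performs the actual `GET /v1/runs/{id}` request in your code (the SDK, `httpx`, or `urllib`). The 2-second interval and 600-second ceiling are assumptions, not documented values.

```python
import time
from typing import Callable

TERMINAL_STATUSES = {"COMPLETED", "FAILED", "CANCELLED"}


def wait_for_run(
    fetch_run: Callable[[str], dict],
    run_id: str,
    interval: float = 2.0,
    max_wait: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Poll until the run reaches a terminal status or max_wait seconds of sleeping elapse."""
    waited = 0.0
    while True:
        run = fetch_run(run_id)
        if run.get("status") in TERMINAL_STATUSES:
            return run
        if waited >= max_wait:
            raise TimeoutError(f"run {run_id} still {run.get('status')} after {waited:.0f}s")
        sleep(interval)
        waited += interval
```

Injecting `fetch_run` and `sleep` keeps the loop independent of any particular HTTP client and lets it be exercised without real delays.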
| Field | Type | Notes |
| ---------------- | -------------------------------------------------------- | ----------------------------------------- |
| `run_id` | `string` | Unique run identifier |
| `status` | `PENDING \| RUNNING \| COMPLETED \| FAILED \| CANCELLED` | Current run state |
| `goal` | `string` | Original goal text |
| `created_at` | `string` | ISO 8601 timestamp |
| `started_at` | `string \| null` | ISO 8601 timestamp |
| `finished_at` | `string \| null` | ISO 8601 timestamp |
| `num_of_steps` | `integer \| null` | Number of steps taken |
| `result` | `object \| null` | Extracted JSON result |
| `error` | `object \| null` | Error details if failed |
| `streaming_url` | `string \| null` | Live browser stream URL while running |
| `browser_config` | `object \| null` | Proxy settings used for the run |
| `video_url` | `string \| null` | Presigned run recording URL, if available |
| `steps` | `array` | Recorded step events for the run |

`error` may include:

| Field | Type | Notes |
| -------------- | --------------------------------------------------------------- | ---------------------------------- |
| `code` | `string` | Machine-readable error code |
| `message` | `string` | Human-readable failure description |
| `category` | `SYSTEM_FAILURE \| AGENT_FAILURE \| BILLING_FAILURE \| UNKNOWN` | Error class |
| `retry_after` | `number \| null` | Suggested retry delay in seconds |
| `help_url` | `string` | Troubleshooting link |
| `help_message` | `string` | Human-readable guidance |

## `POST /v1/runs/{id}/cancel`

Only runs created via `/run-async` or `/run-sse` can be cancelled.
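A cancel request can race with the run finishing on its own, so the `status` in the response should be read rather than assumed. The sketch below builds the cancel URL and maps the response to a short label. Both helper names and the returned labels are hypothetical; only the URL path and the status values come from this reference.

```python
from urllib.parse import quote

BASE_URL = "https://agent.tinyfish.ai"


def cancel_url(run_id: str) -> str:
    """Return the cancel endpoint for a run, escaping the ID for use in a path."""
    return f"{BASE_URL}/v1/runs/{quote(run_id, safe='')}/cancel"


def cancel_outcome(response: dict) -> str:
    """Map a cancel response to a label the caller can branch on (labels are illustrative)."""
    status = response.get("status")
    if status == "CANCELLED":
        return "cancelled"
    if status in ("COMPLETED", "FAILED"):
        return "already_finished"  # the run reached a terminal state before the cancel landed
    return "unknown"
```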
```json
{
  "run_id": "a1b2c3d4-e5f6-7890-abcd-ef1234567890",
  "status": "CANCELLED",
  "cancelled_at": "2026-01-14T10:30:55Z",
  "message": null
}
```

| Field | Type | Notes |
| -------------- | ---------------------------------- | -------------------------------------- |
| `run_id` | `string` | Run identifier |
| `status` | `CANCELLED \| COMPLETED \| FAILED` | Actual status after the cancel attempt |
| `cancelled_at` | `string \| null` | ISO 8601 timestamp |
| `message` | `string \| null` | Idempotency or terminal-state message |

## Error Codes

Common HTTP-level errors across automation endpoints:

| Status | Meaning |
| ------ | ----------------------------------------------- |
| `400` | Invalid request body or missing required fields |
| `401` | Missing or invalid API key |
| `429` | Rate limit exceeded |
| `500` | Internal server error |

The `COMPLETE` SSE event or `GET /v1/runs/{id}` may also include run-level failures such as `TASK_FAILED`, `SITE_BLOCKED`, `MAX_STEPS_EXCEEDED`, `TIMEOUT`, or `INSUFFICIENT_CREDITS`.

## Related

* First request, endpoint selection, and goal-writing basics
* Statuses, polling, and lifecycle behavior
* Improve automation reliability
* API key setup and troubleshooting

# AI Integration Guide

Source: https://docs.tinyfish.ai/ai-integration

Add TinyFish Web Agent as a tool for your AI agent or LLM application

This guide helps you integrate TinyFish Web Agent as a tool in AI agents, chatbots, and LLM-powered applications.

**For AI agents:** A machine-readable capability reference is available at [/skills.md](/skills.md).

***

## When to Use TinyFish Web Agent

TinyFish Web Agent excels at tasks that require a real browser with full JavaScript execution.
| Use Case | Example |
| ------------------------------- | ---------------------------------------------------- |
| **Multi-step workflows** | Login → navigate to dashboard → extract account data |
| **JavaScript-rendered content** | SPAs, infinite scroll, lazy-loaded content |
| **Interactive elements** | Click dropdowns, dismiss modals, paginate results |
| **Authenticated sessions** | Access content behind login walls |
| **Bot-protected sites** | Cloudflare, DataDome protected pages |

***

## Writing Goals for AI

When your AI generates goals for TinyFish Web Agent, follow these patterns for reliable results.

### Specify Output Schema

Define the exact JSON structure you want returned. This helps TinyFish Web Agent format data consistently.

```
Extract product data and return as JSON matching this structure:
{
  "product_name": "string",
  "price": number or null,
  "in_stock": boolean
}
```

### Include Termination Conditions

Prevent infinite loops by specifying when the automation should stop.

```
Stop when ANY of these is true:
- You have extracted 20 items
- No more "Load More" button exists
- You have processed 5 pages
- The page shows a login prompt
```

### Handle Edge Cases

Tell TinyFish Web Agent how to handle unexpected states like missing data or blocked access.

```
If price shows "Contact Us" or "Request Quote":
  Set price to null
  Set price_type to "contact_required"

If a CAPTCHA appears:
  Stop immediately
  Return partial results with an error flag
```

### Request Structured Errors

Ask for error details in a parseable format so your agent can decide what to do next.

```
If extraction fails, return:
{
  "success": false,
  "error_type": "timeout" or "blocked" or "not_found",
  "error_message": "Description of what went wrong",
  "partial_results": [any data captured before failure]
}
```

***

## Parsing Results

When your AI receives results from TinyFish Web Agent, handle both success and failure cases.
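The three outcomes this section distinguishes (success, goal failure, infrastructure failure) can be told apart with a small classifier. This is a minimal sketch: the function name and return labels are hypothetical, and detecting a goal failure relies on the goal having asked for the `"success": false` convention from the structured-errors pattern. Results that do not follow that convention are treated as successes.

```python
def classify_outcome(run: dict) -> str:
    """Classify a run object into one of the outcome labels used by the caller."""
    status = run.get("status")
    if status == "FAILED":
        return "infrastructure_failure"  # the browser or run itself failed
    if status == "COMPLETED":
        result = run.get("result")
        # Assumes the goal requested {"success": false, ...} on extraction failure.
        if isinstance(result, dict) and result.get("success") is False:
            return "goal_failure"
        return "success"
    return "not_finished"  # PENDING, RUNNING, CANCELLED, or an unexpected value
```

An agent could then retry with a different goal on `goal_failure` and retry with stealth mode or a proxy on `infrastructure_failure`.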
### Success Response

The automation completed and the goal was achieved.

```json
{
  "status": "COMPLETED",
  "result": {
    "products": [
      { "name": "Widget", "price": 29.99, "in_stock": true }
    ]
  }
}
```

### Goal Failure

The browser worked, but the goal wasn't achieved. Try a different approach or inform the user.

```json
{
  "status": "COMPLETED",
  "result": {
    "success": false,
    "error_type": "not_found",
    "error_message": "No products found on this page"
  }
}
```

### Infrastructure Failure

The browser itself failed. Retry with stealth mode or a proxy.

```json
{
  "status": "FAILED",
  "error": {
    "message": "Navigation timeout"
  }
}
```

***

## Related

* Write goals that succeed
* Connect with Claude and Cursor

# Anti-Bot Guide

Source: https://docs.tinyfish.ai/anti-bot-guide

Diagnose and bypass bot detection when automating protected websites

You've sent a run, it came back `COMPLETED`, but the result is empty or wrong. Or maybe it outright `FAILED`. Before you start rewriting your goal, check whether the site is blocking you: bot detection is the most common cause of silent failures, and the fix is usually two lines of code.

This guide walks through the full process: confirm the problem, apply the right configuration, and tune your goal to behave more like a human.

Examples use the Python SDK. The same parameters work across all SDKs and the REST API. See [API Reference](/api-reference) for TypeScript and cURL equivalents.

## Installation

```bash
pip install tinyfish
```

Set your API key as an environment variable so you don't have to pass it explicitly:

```bash
export TINYFISH_API_KEY="your-api-key"
```

***

## Step 1: Confirm Anti-Bot Is the Problem

Don't assume. Sites can fail for lots of reasons: slow JavaScript, unexpected layout changes, ambiguous goals. Anti-bot has specific fingerprints. Look for them first.
### Get the streaming URL and watch the browser

Every run produces a `streaming_url`, a live browser preview you can open while the run is happening or replay afterward. This is the fastest way to see exactly what the agent encountered.

Use `agent.stream()` to capture it as soon as it's available:

```python
from tinyfish import TinyFish, CompleteEvent

client = TinyFish()

with client.agent.stream(
    goal="Extract the product name and price",
    url="https://example.com/products",
    on_streaming_url=lambda e: print(f"Watch live: {e.streaming_url}"),
    on_progress=lambda e: print(f"  > {e.purpose}"),
) as stream:
    for event in stream:
        if isinstance(event, CompleteEvent):
            print("Status:", event.status)
            print("Result:", event.result_json)
```

The `on_progress` callback shows each step the agent took; if it got stuck on a challenge page, you'll see it stop there.

If you already started a run with `agent.queue()`, retrieve the streaming URL from the run object:

```python
run = client.runs.get("run_abc123")
print(run.streaming_url)  # open this in your browser
```

### What to look for in the browser preview

Open `streaming_url` in your browser. What you see tells you what happened:

| What you see | Likely cause |
| ---------------------------------------------- | --------------------------------------------- |
| Cloudflare challenge / "Checking your browser" | Cloudflare bot detection |
| DataDome popup or redirect | DataDome protection |
| Blank page or infinite spinner | IP-based block or JS fingerprinting |
| CAPTCHA (reCAPTCHA, hCaptcha) | CAPTCHA gate; cannot be solved automatically |
| "Access Denied" or 403 page | IP or User-Agent block |
| Login page when you expected content | Session-based bot detection |

### Check the result: `COMPLETED` doesn't mean it worked

A `COMPLETED` status only means the run finished. The `result` field is a better indicator of whether the agent actually achieved the goal.
```python
from tinyfish import TinyFish, RunStatus, CompleteEvent

client = TinyFish()

with client.agent.stream(
    goal="Extract the product name and price",
    url="https://example.com/products",
) as stream:
    for event in stream:
        if isinstance(event, CompleteEvent):
            if event.status == RunStatus.COMPLETED:
                # Anti-bot shows up here as a missing result, null fields,
                # or explicit failure flags
                result = event.result_json
                if not result or result.get("status") == "failure" or not any(result.values()):
                    print("Blocked: result is empty despite COMPLETED status")
            elif event.status == RunStatus.FAILED:
                print("Run failed:", event.error.message if event.error else "unknown")
```

**Anti-bot signatures in the result:**

* Fields are all `null` or empty arrays AND the streaming view shows the target content was never loaded
* `result.reason` mentions "access denied", "blocked", or "could not find"

If the streaming view shows a challenge page and the result is empty or a failure, you've confirmed anti-bot. Move to Step 2.

***

## Step 2: Enable Stealth Mode and Proxy

Apply both together. Stealth changes the browser fingerprint; the proxy changes the IP. Sites that use anti-bot services correlate both signals, so changing only one often isn't enough.

### Switch to stealth browser

```python
from tinyfish import TinyFish, BrowserProfile

client = TinyFish()

response = client.agent.run(
    goal="Extract the product name and price",
    url="https://protected-site.com/products",
    browser_profile=BrowserProfile.STEALTH,  # was BrowserProfile.LITE or omitted
)
```

`BrowserProfile.STEALTH` is a modified browser with anti-detection techniques. The default (`BrowserProfile.LITE`) is faster but doesn't include these measures.
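For callers using the REST API instead of the SDK, the same stealth and proxy settings map onto the `browser_profile` and `proxy_config` fields of the request body. The sketch below builds that body and rejects country codes outside the documented list. The helper name `hardened_body` is hypothetical, and the choice to validate on the client side is an assumption made for illustration.

```python
SUPPORTED_COUNTRIES = {"US", "GB", "CA", "DE", "FR", "JP", "AU"}


def hardened_body(url: str, goal: str, country_code: str = "US") -> dict:
    """Build a request body with the stealth profile and a TinyFish (tetra) proxy."""
    if country_code not in SUPPORTED_COUNTRIES:
        raise ValueError(f"unsupported proxy country: {country_code!r}")
    return {
        "url": url,
        "goal": goal,
        "browser_profile": "stealth",
        "proxy_config": {
            "enabled": True,
            "type": "tetra",
            "country_code": country_code,
        },
    }
```

The resulting dictionary can be serialized as JSON and posted to any of the three automation endpoints, since they share one request body.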
### Add a proxy

```python
from tinyfish import TinyFish, BrowserProfile, ProxyConfig, ProxyCountryCode

client = TinyFish()

response = client.agent.run(
    goal="Extract the product name and price",
    url="https://protected-site.com/products",
    browser_profile=BrowserProfile.STEALTH,
    proxy_config=ProxyConfig(
        enabled=True,
        country_code=ProxyCountryCode.US,  # match the site's expected audience
    ),
)
```

**Choosing a country:** Pick the country where the site's primary users are. Available values:

| Enum | Country |
| --------------------- | -------------- |
| `ProxyCountryCode.US` | United States |
| `ProxyCountryCode.GB` | United Kingdom |
| `ProxyCountryCode.CA` | Canada |
| `ProxyCountryCode.DE` | Germany |
| `ProxyCountryCode.FR` | France |
| `ProxyCountryCode.JP` | Japan |
| `ProxyCountryCode.AU` | Australia |

### Verify what proxy was actually used

After a run, `browser_config` on the run object confirms what was applied:

```python
run = client.runs.get("run_abc123")
print(run.browser_config.proxy_enabled)       # True/False
print(run.browser_config.proxy_country_code)  # "US" or None
```

### Full example with both applied

```python
from tinyfish import TinyFish, BrowserProfile, ProxyConfig, ProxyCountryCode, CompleteEvent, RunStatus

client = TinyFish()

with client.agent.stream(
    goal="Extract the product name and price",
    url="https://protected-site.com/products",
    browser_profile=BrowserProfile.STEALTH,
    proxy_config=ProxyConfig(enabled=True, country_code=ProxyCountryCode.US),
    on_streaming_url=lambda e: print(f"Watch: {e.streaming_url}"),
    on_progress=lambda e: print(f"  > {e.purpose}"),
) as stream:
    for event in stream:
        if isinstance(event, CompleteEvent):
            if event.status == RunStatus.COMPLETED:
                print("Result:", event.result_json)
            else:
                print("Failed:", event.error.message if event.error else "unknown")
```

Watch the streaming view again after this change. If the actual page loads instead of a challenge screen, you're through.
Move to Step 3 to make the run more reliable at scale.

TinyFish cannot solve CAPTCHAs (reCAPTCHA, hCaptcha, etc.). The configurations above (stealth mode, proxies, and human-like goal patterns) reduce the likelihood of CAPTCHAs being triggered, but if a site serves one, it's a hard limit for now. We're actively working on expanding our anti-detection capabilities.

***

## Step 3: Guide the Agent to Behave More Like a Human

Stealth and proxy get you past the door. But some sites layer behavioral analysis on top of fingerprinting: they watch for robotic patterns like instant form submissions, missing cookie consent dismissals, or zero mouse dwell time. Your goal controls a lot of this behavior.

### Handle cookie and consent banners

Bot detection systems often look at whether a user interacted with a consent banner before the main content. Always dismiss it explicitly:

```python
goal = """
Close any cookie consent or GDPR banner that appears before doing anything else.
Then extract the product name, current price, and availability status.
Return as JSON: { "name": string, "price": number, "available": boolean }
"""
```

### Add deliberate pauses at suspicious checkpoints

Sites with aggressive behavioral detection (checkout pages, login flows) flag runs that move too fast:

```python
goal = """
1. Wait for the page to fully load before interacting with anything.
2. Close any cookie banner.
3. Wait for the banner to disappear before proceeding.
4. Scroll down to view the pricing section.
5. Wait for the pricing section to fully render, then extract all plan names and monthly prices.

Return as JSON array: [{ "plan": string, "price_monthly": number }]
"""
```

### Describe elements visually, not by selector

Automation-aware selectors are sometimes deliberately changed to trip scrapers.
Visual descriptions are more resilient:

```python
# Fragile: may be intentionally changed by the site
goal = "Click the button with id='add-to-cart-btn'"

# Resilient: describes what a human would see
goal = "Click the blue 'Add to Cart' button directly below the product price"
```

### Use numbered steps for multi-step flows

For login flows or multi-page workflows, numbered steps give the agent explicit decision points rather than leaving it to guess:

```python
goal = """
1. Wait for the page to fully load (spinner should disappear).
2. If a cookie consent banner is visible, click 'Accept' or 'Accept All'.
3. Locate the search bar at the top of the page and type "running shoes".
4. Wait for autocomplete suggestions to appear, then press Enter.
5. Wait for results to load.
6. Extract the first 10 results: product name, price, and product URL.

Stop after 10 results. Do not paginate.
Return as JSON array.
"""
```

### Add explicit fallback instructions

Protected sites sometimes show intermediate pages (challenge passed, now redirecting). Tell the agent how to handle them:

```python
goal = """
Extract the product price from this page.

If a loading screen or redirect page appears, wait for it to complete before extracting.
If an 'Access Denied' page appears, return { "error": "access_denied" }.
If the price shows 'Contact Us', return { "price": null, "contact_required": true }.

Return: { "price": number or null, "currency": string }
"""
```

***

## Putting It All Together

A complete hardened run for a protected site:

```python
from tinyfish import (
    TinyFish,
    BrowserProfile,
    ProxyConfig,
    ProxyCountryCode,
    CompleteEvent,
    RunStatus,
)

client = TinyFish()

with client.agent.stream(
    url="https://protected-site.com/pricing",
    browser_profile=BrowserProfile.STEALTH,
    proxy_config=ProxyConfig(enabled=True, country_code=ProxyCountryCode.US),
    goal="""
        1. Wait for the page to fully load.
        2. Close any cookie consent or GDPR banner that appears.
        3. Wait 1 second before proceeding.
        4. Locate the pricing section; it typically shows plan names in a grid or table.
        5. For each plan, extract: plan name, monthly price, and annual price if shown.

        If a Cloudflare or security check page appears, wait for it to complete automatically.
        If you see an 'Access Denied' or CAPTCHA page, return { "error": "blocked" }.
        Do not click any purchase or checkout buttons.

        Return as JSON array:
        [{ "plan": "Pro", "monthly_price": 49, "annual_price": 39 }]
    """,
    on_streaming_url=lambda e: print(f"Watch run: {e.streaming_url}"),
    on_progress=lambda e: print(f"  > {e.purpose}"),
) as stream:
    for event in stream:
        if isinstance(event, CompleteEvent):
            if event.status == RunStatus.COMPLETED:
                print("Result:", event.result_json)
            else:
                print("Failed:", event.error.message if event.error else "unknown")
```

***

## Decision Tree

```
Run returned empty or wrong result?
│
├── Open streaming_url (from on_streaming_url callback or runs.get())
│   ├── Challenge / "Checking your browser" page → Anti-bot confirmed
│   ├── Access Denied / 403 → Anti-bot confirmed
│   ├── Blank page → Likely anti-bot (fingerprint-based)
│   └── Page loaded but result wrong → Goal issue, not anti-bot
│
└── Anti-bot confirmed?
    ├── Add browser_profile=BrowserProfile.STEALTH
    ├── Add proxy_config=ProxyConfig(enabled=True, country_code=ProxyCountryCode.US)
    ├── Re-run and watch stream again
    │   ├── Page loads → Add goal hardening (Step 3) for reliability at scale
    │   └── Still blocked → Site likely requires CAPTCHA (hard limit)
    └── Done
```

***

## Creative Solutions and Iteration

When a site is actively defended, the most effective approach is often to rethink the workflow rather than force the original one.

**Watch yourself do it first.** Before writing your goal, navigate to the target site yourself and think through what a human would actually do. Use that as your script.
The streaming view is also useful here: watch a run or two to understand exactly what the agent encounters before committing to a final goal.

**Start at the front door.** Linking directly to a filtered search results page or a deep URL can look robotic. Starting at `target.com` and navigating to your destination (searching for a product, clicking through a category) often succeeds where a direct deep link fails.

**Go to the source.** If the formatted data you need lives behind anti-bot and paywalls, ask whether the underlying raw data is available elsewhere. Aggregator sites are often heavily protected; their primary sources may not be. Synthesizing from multiple simpler sources is frequently more reliable than fighting for one complex one.

**Check for a public API or feed first.** Some sites that actively block scraping also publish APIs, RSS feeds, or sitemaps. Five minutes checking saves a lot of iteration.

**Keep dwell time low.** The longer an agent stays on a site, the higher the likelihood of detection. Balancing human-like navigation with speed matters: break large workflows into focused, smaller tasks that can be handled by multiple agents running in parallel. You get both the human-like pacing and the throughput. Scale is one of TinyFish's superpowers.

**Time your runs intentionally.** Anti-bot systems are sensitive to traffic volume. Running during off-peak hours for your target site (for US-based sites, late morning to early afternoon PST often works well) can reduce the likelihood of triggering rate-based challenges. If you're testing a new workflow, start with a single run during a quiet period before scaling up.

**Vary your entry points at scale.** If you're running hundreds of batch jobs against the same site, uniform traffic patterns can themselves become a fingerprint, even across different IPs.
Mixing up how runs navigate to their destination (some via homepage search, some via category pages, some direct) makes the aggregate traffic look more organic.

Runs have a 10-minute timeout, so this also naturally encourages breaking complex workflows into smaller, parallelizable pieces. Design goals to complete their core task well within that limit.

**Manage concurrency intentionally.** TinyFish does not throttle runs by domain; if you enqueue a large batch against the same site, the runs will fire in parallel up to your account's concurrency limit. For sensitive sites, consider staggering your jobs in your own queuing logic rather than enqueuing everything at once.

***

## What's Coming

TinyFish continuously improves browser behavior and anti-detection performance across the web. If a site blocked you on a previous project, it's worth trying again: the same run that failed before may work without any changes on your end.

**Authenticated sessions (in beta):** TinyFish is adding a first-class Auth tool for logging into sites as part of a run. Beyond unlocking gated content, authenticated sessions naturally bypass many anti-bot measures, since logged-in users are treated very differently by most protection systems. [Contact us](mailto:support@tinyfish.io) to request early access.

***

## Need Help?

If you're stuck on a specific site, share the URL and your current configuration with us; we can often diagnose the issue quickly.

* **Email:** [support@tinyfish.io](mailto:support@tinyfish.io)
* **Discord:** [discord.gg/tinyfish](https://discord.gg/tinyfish)

# Run browser automation synchronously

Source: https://docs.tinyfish.ai/api-reference/automation/run-browser-automation-synchronously

https://agent.tinyfish.ai/v1/openapi.json post /v1/automation/run

Execute a browser automation task synchronously and wait for completion. Returns the final result once the automation finishes (success or failure). Use this endpoint when you need the complete result in a single response.
Note: Runs created via this endpoint cannot be cancelled. If you need cancellation support, use `/v1/automation/run-async` or `/v1/automation/run-sse` instead.

# Run browser automation with SSE streaming

Source: https://docs.tinyfish.ai/api-reference/automation/run-browser-automation-with-sse-streaming

https://agent.tinyfish.ai/v1/openapi.json post /v1/automation/run-sse

Execute a browser automation task with Server-Sent Events (SSE) streaming. Returns a real-time event stream with automation progress, browser streaming URL, and final results.

# Start automation asynchronously

Source: https://docs.tinyfish.ai/api-reference/automation/start-automation-asynchronously

https://agent.tinyfish.ai/v1/openapi.json post /v1/automation/run-async

Creates and enqueues an automation run, returning the run_id immediately without waiting for completion. Use this for long-running automations where you want to poll for results separately.

# Start multiple automations asynchronously

Source: https://docs.tinyfish.ai/api-reference/automation/start-multiple-automations-asynchronously

https://agent.tinyfish.ai/v1/openapi.json post /v1/automation/run-batch

Creates and enqueues multiple automation runs in a single request, returning run_ids immediately without waiting for completion. Maximum 100 runs per request.

**Atomic creation:** Run creation is all-or-nothing. Either all runs are created successfully, or none are (returns error).

**Idempotency:** This endpoint does not currently support idempotency keys. Retrying a failed request may create duplicate runs.

# Create a remote browser session

Source: https://docs.tinyfish.ai/api-reference/browser/create-a-remote-browser-session

https://agent.tinyfish.ai/v1/openapi.json post /v1/browser

Creates a remote tf-browser session and returns CDP connection details. Optionally accepts a target URL to select the best proxy for that domain. Connect to the browser via CDP WebSocket at `cdp_url`.
Use `timeout_seconds` to set a custom inactivity timeout (5–86400 seconds). If omitted, null, or greater than your plan maximum, the plan maximum is used (15 min on free tier, 60 min on paid). # List browser session usage Source: https://docs.tinyfish.ai/api-reference/browser/list-browser-session-usage https://agent.tinyfish.ai/v1/openapi.json get /v1/browser/usage List Tetra browser session usage for the authenticated user. Returns session telemetry including duration, data transfer, mode, and status. # Fetch and extract content from URLs Source: https://docs.tinyfish.ai/api-reference/fetch/fetch-and-extract-content-from-urls https://agent.tinyfish.ai/v1/openapi.json post /v1/fetch Renders web pages using a real browser (including JavaScript-heavy sites) and returns clean extracted content in your preferred format. Submit up to 10 URLs, get back structured content. Per-URL failures appear in `errors[]` and do not fail the entire request. # Cancel multiple runs by IDs Source: https://docs.tinyfish.ai/api-reference/runs/cancel-multiple-runs-by-ids https://agent.tinyfish.ai/v1/openapi.json post /v1/runs/batch/cancel Cancel multiple runs by their IDs in a single request. Returns per-run results including cancelled runs, already-terminal runs, and not-found IDs. Maximum 100 IDs per request. Idempotent: calling twice returns consistent results. # Cancel run by ID Source: https://docs.tinyfish.ai/api-reference/runs/cancel-run-by-id https://agent.tinyfish.ai/v1/openapi.json post /v1/runs/{id}/cancel Cancel a run by ID. Only runs created via `/v1/automation/run-async` or `/v1/automation/run-sse` can be cancelled. Runs created via the synchronous `/v1/automation/run` endpoint cannot be cancelled. # Get multiple runs by IDs Source: https://docs.tinyfish.ai/api-reference/runs/get-multiple-runs-by-ids https://agent.tinyfish.ai/v1/openapi.json post /v1/runs/batch Retrieve multiple runs by their IDs in a single request. 
Returns found runs and lists any IDs that were not found or not owned. Maximum 100 IDs per request. # Get run by ID Source: https://docs.tinyfish.ai/api-reference/runs/get-run-by-id https://agent.tinyfish.ai/v1/openapi.json get /v1/runs/{id} Get detailed information about a specific automation run by its ID. # List and search runs Source: https://docs.tinyfish.ai/api-reference/runs/list-and-search-runs https://agent.tinyfish.ai/v1/openapi.json get /v1/runs List automation runs with optional filtering by status, goal text, and date range. Returns paginated results with total count. Default sort order is newest first. # List search usage Source: https://docs.tinyfish.ai/api-reference/search/list-search-usage https://agent.tinyfish.ai/v1/openapi.json get /v1/search/usage List search usage records for the authenticated user. Returns paginated results with query details, status, and result counts. # Search the web Source: https://docs.tinyfish.ai/api-reference/search/search-the-web https://agent.tinyfish.ai/v1/openapi.json get /v1/search Search the web and get structured results. Returns ranked results with titles, snippets, and URLs. **Location and language resolution:** - If `location` is set but `language` is not, the language auto-resolves to the most predominantly used language in that country. - If `language` is set but `location` is not, the location auto-resolves to the country where that language is most predominantly used. - If neither `location` nor `language` is set, defaults to `us` and `en`. # Connect a vault provider Source: https://docs.tinyfish.ai/api-reference/vault/connect-a-vault-provider https://agent.tinyfish.ai/v1/openapi.json post /v1/vault/connections Connect a supported password manager and immediately sync display-safe credential metadata. 
# Disable a vault item Source: https://docs.tinyfish.ai/api-reference/vault/disable-a-vault-item https://agent.tinyfish.ai/v1/openapi.json patch /v1/vault/items/{itemId}/disable Disable a vault item for agent use. # Disconnect a vault provider Source: https://docs.tinyfish.ai/api-reference/vault/disconnect-a-vault-provider https://agent.tinyfish.ai/v1/openapi.json delete /v1/vault/connections/{connectionId} Disconnect a vault provider and remove its stored enabled items. # Enable a vault item Source: https://docs.tinyfish.ai/api-reference/vault/enable-a-vault-item https://agent.tinyfish.ai/v1/openapi.json patch /v1/vault/items/{itemId}/enable Enable a vault item for agent use. # List vault connections Source: https://docs.tinyfish.ai/api-reference/vault/list-vault-connections https://agent.tinyfish.ai/v1/openapi.json get /v1/vault/connections List all connected vault providers for the authenticated user. # List vault items Source: https://docs.tinyfish.ai/api-reference/vault/list-vault-items https://agent.tinyfish.ai/v1/openapi.json get /v1/vault/items List all vault items currently available from connected providers. # Sync vault items Source: https://docs.tinyfish.ai/api-reference/vault/sync-vault-items https://agent.tinyfish.ai/v1/openapi.json post /v1/vault/items/sync Sync items from connected providers and return merged item state plus sync counters. # Authentication Source: https://docs.tinyfish.ai/authentication How to authenticate with the TinyFish API TinyFish uses different authentication methods depending on how you're accessing the API: | Access Method | Auth Type | When to Use | | --------------- | --------- | ----------------------------------- | | REST API | API Key | Direct HTTP requests from your code | | MCP Integration | OAuth 2.1 | AI assistants (Claude, Cursor) | *** ## REST API Authentication All REST API requests require an API key passed in the `X-API-Key` header. 
### Getting Your API Key

1. Visit [agent.tinyfish.ai/api-keys](https://agent.tinyfish.ai/api-keys)
2. Click "Create API Key"
3. Copy and store your key securely

API keys are shown only once. Store them securely and never commit them to version control.

### Using Your API Key

Pass your API key when making requests. The Python and TypeScript SDKs read `TINYFISH_API_KEY` from your environment automatically:

```python Python theme={null}
from tinyfish import TinyFish

client = TinyFish()  # Reads TINYFISH_API_KEY from environment

result = client.agent.run(
    url="https://example.com",
    goal="Extract the page title",
)
```

```typescript TypeScript theme={null}
import { TinyFish } from "@tiny-fish/sdk";

const client = new TinyFish(); // Reads TINYFISH_API_KEY from environment

const response = await client.agent.run({
  url: "https://example.com",
  goal: "Extract the page title",
});
```

```bash cURL theme={null}
curl -X POST https://agent.tinyfish.ai/v1/automation/run \
  -H "X-API-Key: $TINYFISH_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com", "goal": "Extract the page title"}'
```

### Environment Variables

Store your API key in an environment variable:

```bash theme={null}
# Add to your shell profile (.bashrc, .zshrc, etc.)
export TINYFISH_API_KEY="your_api_key_here"
```

For Node.js projects, use a `.env` file:

```bash theme={null}
# .env
TINYFISH_API_KEY=your_api_key_here
```

Add `.env` to your `.gitignore` to prevent accidental commits.

***

## MCP Authentication

The MCP endpoint uses OAuth 2.1 for secure authentication with AI assistants.

### How It Works

1. Add the TinyFish MCP server to your AI client configuration. See the [MCP Integration guide](/mcp-integration) for setup instructions.
2. When you first use the tool, a browser window opens for authentication.
3. Log in with your TinyFish account.
4. Authorization is cached for future sessions.

You need a TinyFish account with an active subscription or credits. [Sign up here](https://agent.tinyfish.ai/api-keys).
***

## Error Responses

Authentication errors return standard HTTP status codes with a JSON error body. See [Error Codes](/error-codes) for the full reference.

The request is missing the `X-API-Key` header.

```json theme={null}
{
  "error": {
    "code": "MISSING_API_KEY",
    "message": "X-API-Key header is required"
  }
}
```

**How to fix:**

* Add the `X-API-Key` header to your request
* Check the header name is exactly `X-API-Key` (case-sensitive)

The API key in the request is not valid.

```json theme={null}
{
  "error": {
    "code": "INVALID_API_KEY",
    "message": "The provided API key is invalid"
  }
}
```

**How to fix:**

* Verify your API key is correct
* Ensure no extra whitespace around the key
* Check if the key has been revoked or regenerated

```bash theme={null}
# Debug: Check your key is set
echo "$TINYFISH_API_KEY"

# Debug: Test authentication (-i prints the response status line and headers)
curl -i -X POST https://agent.tinyfish.ai/v1/automation/run \
  -H "X-API-Key: $TINYFISH_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com", "goal": "test"}'
```

Authentication succeeded, but you lack credits or an active subscription.

```json theme={null}
{
  "error": {
    "code": "FORBIDDEN",
    "message": "Insufficient credits or no active subscription"
  }
}
```

**How to fix:**

* Check your account at [agent.tinyfish.ai/api-keys](https://agent.tinyfish.ai/api-keys)
* Add credits or upgrade your plan

***

## Security Best Practices

* Never hardcode API keys in source code
* Regenerate keys periodically and after team changes
* Use separate keys for development and production
* Review API usage in your dashboard for anomalies

***

## Related

* Run your first automation
* Full error code reference

# Browser API
Source: https://docs.tinyfish.ai/browser-api/index
Create a remote browser session and control it programmatically

The Browser API creates a remote browser session and returns a WebSocket connection URL.
Use it when you need direct, low-level browser control — scripting page interactions, running your own automation framework, or tasks that go beyond the [automation run API](/key-concepts/endpoints). ```bash theme={null} POST https://api.browser.tinyfish.ai ``` *** ## Before You Start Create a key at [agent.tinyfish.ai/api-keys](https://agent.tinyfish.ai/api-keys). ```bash theme={null} export TINYFISH_API_KEY="your_api_key_here" ``` ```bash Python theme={null} pip install playwright httpx playwright install chromium ``` ```bash Node theme={null} npm install playwright npx playwright install chromium ``` All requests require the `X-API-Key` header. See [Authentication](/authentication) for the full setup and troubleshooting guide. ## Your First Request Session creation typically takes 10-30 seconds. Set your HTTP client timeout to at least 60 seconds. ```python Python theme={null} import httpx response = httpx.post( "https://api.browser.tinyfish.ai", headers={"X-API-Key": "your_api_key_here"}, json={"url": "https://www.tinyfish.ai"}, timeout=60, ) response.raise_for_status() session = response.json() print(session["session_id"]) print(session["cdp_url"]) ``` ```typescript TypeScript theme={null} const res = await fetch("https://api.browser.tinyfish.ai", { method: "POST", headers: { "X-API-Key": "your_api_key_here", "Content-Type": "application/json", }, body: JSON.stringify({ url: "https://www.tinyfish.ai" }), signal: AbortSignal.timeout(60_000), }); if (!res.ok) throw new Error(`Session creation failed: ${res.status}`); const session = await res.json(); console.log(session.session_id); console.log(session.cdp_url); ``` ```bash cURL theme={null} curl -X POST https://api.browser.tinyfish.ai \ -H "X-API-Key: $TINYFISH_API_KEY" \ -H "Content-Type: application/json" \ -d '{"url": "https://www.tinyfish.ai"}' ``` ## What Success Looks Like ```json theme={null} { "session_id": "br-a1b2c3d4-e5f6-7890-abcd-ef1234567890", "cdp_url": "wss://example.tinyfish.io/cdp", 
"base_url": "https://example.tinyfish.io" } ``` Then connect with Playwright using `cdp_url`: ```python Python theme={null} import asyncio from playwright.async_api import async_playwright CDP_URL = "" async def main(): async with async_playwright() as p: browser = await p.chromium.connect_over_cdp(CDP_URL) await asyncio.sleep(2) # let startup navigation settle page = browser.contexts[0].pages[0] await page.wait_for_load_state("domcontentloaded") print(await page.title()) asyncio.run(main()) ``` ```typescript TypeScript theme={null} import { chromium } from 'playwright'; const CDP_URL = ''; const browser = await chromium.connectOverCDP(CDP_URL); await new Promise(r => setTimeout(r, 2000)); // let startup navigation settle const page = browser.contexts()[0].pages()[0]; await page.waitForLoadState('domcontentloaded'); console.log(await page.title()); ``` Pass `cdp_url` (the WebSocket URL) to `connect_over_cdp`. Do not use `base_url` — it is for polling session status via `/pages`, not for Playwright connections. ## When to Use Browser vs the Other APIs * Use **Browser** when you want direct Playwright or CDP control. * Use **Agent** when TinyFish should decide the browser actions from a goal. * Use **Fetch** when you only need extracted content from one or more URLs. * Use **Search** when you need ranked search results, not a browser session. *** ## Session Lifecycle | Behavior | Details | | ---------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | **Startup navigation** | If `url` was provided at session creation, the browser navigates there immediately. The 201 response is returned before navigation completes — the page may still be loading when you connect. | | **Inactivity timeout** | Sessions automatically terminate after **1 hour of inactivity**. 
A session is considered inactive when no CDP commands are being sent. |
| **No explicit delete** | There is no endpoint to delete a session. Sessions are cleaned up automatically when the inactivity timeout elapses. |
| **Session isolation** | Each session is a fully isolated browser instance. No cookies, storage, or state is shared between sessions. |

***

## SDK Quick Start

```python Python theme={null}
from tinyfish import TinyFish

client = TinyFish()

session = client.browser.sessions.create(url="https://www.tinyfish.ai")
print(session.session_id)
print(session.cdp_url)
```

```typescript TypeScript theme={null}
import { TinyFish } from "@tiny-fish/sdk";

const client = new TinyFish();

const session = await client.browser.sessions.create({
  url: "https://www.tinyfish.ai",
});
console.log(session.session_id);
console.log(session.cdp_url);
```

***

## Read Next

* Request and response schema
* API key setup
* Configure browser behavior for automation runs
* Understand when to use Agent vs Browser
* One page that routes an agent to the right TinyFish API

# Browser API Reference
Source: https://docs.tinyfish.ai/browser-api/reference
Complete reference for the Browser API endpoint

## Endpoint

```bash theme={null}
POST https://api.browser.tinyfish.ai
```

All requests require an `X-API-Key` header. See [Authentication](/authentication).

Session creation typically takes 10-30 seconds. Set your HTTP client timeout to at least 60 seconds.

***

## Request

```json theme={null}
{
  "url": "https://www.tinyfish.ai",
  "timeout_seconds": 300
}
```

### Parameters

* `url`: Target URL the session will navigate to on startup. Bare domains (e.g. `tinyfish.ai`) are automatically prefixed with `https://`. Omit to start at `about:blank`.
* `timeout_seconds`: Inactivity timeout in seconds (5–86400). Defaults to your plan maximum.
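
The documented `timeout_seconds` range can be checked client-side before a session request is sent. The following is a minimal sketch under stated assumptions, not part of the TinyFish SDK: `build_session_request` is a hypothetical helper name, and it only assembles the JSON body shown in this section (`url`, `timeout_seconds`).

```python
from typing import Optional

# Bounds taken from the parameter description in this section (5–86400 seconds).
MIN_TIMEOUT_SECONDS = 5
MAX_TIMEOUT_SECONDS = 86400


def build_session_request(url: Optional[str] = None,
                          timeout_seconds: Optional[int] = None) -> dict:
    """Assemble a JSON body for session creation (hypothetical helper).

    Fields left as None are omitted so the documented defaults apply:
    about:blank for the URL and the plan maximum for the timeout.
    """
    body: dict = {}
    if url is not None:
        body["url"] = url
    if timeout_seconds is not None:
        # Reject values outside the documented range before any network call.
        if not MIN_TIMEOUT_SECONDS <= timeout_seconds <= MAX_TIMEOUT_SECONDS:
            raise ValueError(
                f"timeout_seconds must be between {MIN_TIMEOUT_SECONDS} "
                f"and {MAX_TIMEOUT_SECONDS}, got {timeout_seconds}"
            )
        body["timeout_seconds"] = timeout_seconds
    return body


print(build_session_request("https://www.tinyfish.ai", 300))
```

The returned dict could be passed as the `json=` argument of an HTTP client such as `httpx.post`. The helper does not try to enforce plan limits, since values above the plan maximum are documented as falling back to the plan maximum on the server side.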
***

## Response

```json theme={null}
{
  "session_id": "br-a1b2c3d4-e5f6-7890-abcd-ef1234567890",
  "cdp_url": "wss://example.tinyfish.io/cdp",
  "base_url": "https://example.tinyfish.io"
}
```

* `session_id`: Unique identifier for this session.
* `cdp_url`: WebSocket URL for browser connection. Pass this to Playwright's `connect_over_cdp` or any CDP client.
* `base_url`: HTTPS base URL for the session. Use to access session endpoints such as `/pages`.

***

## Debugging: Open DevTools Inspector

Poll `GET {base_url}/pages` and open the `devtoolsFrontendUrl` of the first non-blank page to inspect the live browser session.

```python Python theme={null}
import asyncio

import httpx


async def get_inspector_url(base_url: str) -> str:
    # Poll up to 20 times, one second apart, for a page that has navigated.
    async with httpx.AsyncClient() as client:
        for _ in range(20):
            pages = (await client.get(f"{base_url}/pages", timeout=5)).json()
            nav = next(
                (p for p in pages if p.get("url") not in ("", "about:blank", "about:newtab")),
                None,
            )
            if nav:
                return nav["devtoolsFrontendUrl"]
            await asyncio.sleep(1)
    raise TimeoutError("navigation never completed")
```

```typescript TypeScript theme={null}
async function getInspectorUrl(baseUrl: string): Promise<string> {
  // Poll up to 20 times, one second apart, for a page that has navigated.
  for (let i = 0; i < 20; i++) {
    const res = await fetch(`${baseUrl}/pages`);
    const pages: { url: string; devtoolsFrontendUrl: string }[] = await res.json();
    const nav = pages.find(p => !['', 'about:blank', 'about:newtab'].includes(p.url));
    if (nav) return nav.devtoolsFrontendUrl;
    await new Promise(r => setTimeout(r, 1000));
  }
  throw new Error('navigation never completed');
}
```

The page starts at `about:blank` and navigates asynchronously, so skip blank pages when polling to get the correct inspector URL.

***

## Session Lifecycle

| Behavior | Details |
| ---------------------- | ------- |
| **Startup navigation** | If `url` was provided at session creation, the browser navigates there immediately.
The 201 response is returned before navigation completes — the page may still be loading when you connect. | | **Inactivity timeout** | Sessions automatically terminate after **1 hour of inactivity**. A session is considered inactive when no CDP commands are being sent. | | **No explicit delete** | There is no endpoint to delete a session. Sessions are cleaned up automatically when the inactivity timeout elapses. | | **Session isolation** | Each session is a fully isolated browser instance. No cookies, storage, or state is shared between sessions. | *** ## SDK Methods ```python Python theme={null} from tinyfish import TinyFish client = TinyFish() session = client.browser.sessions.create(url="https://www.tinyfish.ai") print(session.session_id) print(session.cdp_url) ``` ```typescript TypeScript theme={null} import { TinyFish } from "@tiny-fish/sdk"; const client = new TinyFish(); const session = await client.browser.sessions.create({ url: "https://www.tinyfish.ai", }); console.log(session.session_id); console.log(session.cdp_url); ``` *** ## End-to-End Example Create a session, connect with Playwright, take a screenshot, and extract the page title. ```python Python theme={null} import asyncio from tinyfish import TinyFish from playwright.async_api import async_playwright client = TinyFish() async def main(): # 1. Create a browser session session = client.browser.sessions.create(url="https://scrapeme.live/shop") # 2. Connect via Playwright CDP async with async_playwright() as p: browser = await p.chromium.connect_over_cdp(session.cdp_url) await asyncio.sleep(2) # let startup navigation settle page = browser.contexts[0].pages[0] await page.wait_for_load_state("domcontentloaded") # 3. 
Interact with the page print(await page.title()) await page.screenshot(path="shop.png") # Session auto-cleans up after 1 hour of inactivity asyncio.run(main()) ``` ```typescript TypeScript theme={null} import { TinyFish } from "@tiny-fish/sdk"; import { chromium } from "playwright"; const client = new TinyFish(); // 1. Create a browser session const session = await client.browser.sessions.create({ url: "https://scrapeme.live/shop", }); // 2. Connect via Playwright CDP const browser = await chromium.connectOverCDP(session.cdp_url); await new Promise((r) => setTimeout(r, 2000)); // let startup navigation settle const page = browser.contexts()[0].pages()[0]; await page.waitForLoadState("domcontentloaded"); // 3. Interact with the page console.log(await page.title()); await page.screenshot({ path: "shop.png" }); // Session auto-cleans up after 1 hour of inactivity ``` *** ## Error Reference | HTTP Status | Error Code | Cause | Resolution | | ----------- | ------------------------------------- | --------------------------------------------------- | ------------------------------------------------------------------------------------------------------------- | | 400 | `INVALID_INPUT` | `url` field is not a valid URL. | Check the `details` field in the error response for specifics. | | 401 | `MISSING_API_KEY` / `INVALID_API_KEY` | Missing or invalid `X-API-Key` header. | Verify your API key at the [dashboard](https://agent.tinyfish.ai/api-keys). | | 402 | `INSUFFICIENT_CREDITS` | No credits or active subscription. | Add credits or upgrade your plan. | | 404 | `NOT_FOUND` | Browser API is not available on your plan. | Contact support to enable access. | | 500 | `INTERNAL_ERROR` | Unexpected server error. | Retry after a brief delay. If persistent, check [status.agent.tinyfish.ai](https://status.agent.tinyfish.ai). | | 502 | `INTERNAL_ERROR` | Browser infrastructure failed to start the session. | Retry — this is usually transient. 
| ## Related First request, success shape, and product routing API key setup Full list of API error codes Understand where Browser fits in the overall API surface # CLI Commands Source: https://docs.tinyfish.ai/cli/commands Full reference for TinyFish CLI commands ## `tinyfish agent run` Execute a browser automation. By default the output streams as newline-delimited JSON — one object per event. ```bash theme={null} tinyfish agent run "goal" --url example.com ``` ### Flags | Flag | Description | | ------------- | ------------------------------------------------------------- | | `--url ` | Target URL to automate (required) | | `--sync` | Wait for the run to complete and return the full result | | `--async` | Submit only — return the `run_id` immediately without waiting | | `--pretty` | Human-readable output instead of JSON | ### Output modes One JSON object per line as events arrive. Use this when you want live progress or are piping to another tool. ```bash theme={null} tinyfish agent run "Extract the pricing" --url example.com/pricing ``` ```json theme={null} {"type":"STARTED","run_id":"abc123","run_url":"https://agent.tinyfish.ai/runs/abc123"} {"type":"PROGRESS","run_id":"abc123","purpose":"Navigating to the pricing page"} {"type":"COMPLETE","run_id":"abc123","status":"COMPLETED","result":{"price":"$99"},"run_url":"https://agent.tinyfish.ai/runs/abc123"} ``` Pressing **Ctrl+C** during a streaming run cancels the run server-side before exiting. Waits for the run to finish and returns a single JSON object with the full result. Use this for scripts where you only care about the final output. ```bash theme={null} tinyfish agent run "Extract the pricing" --url example.com/pricing --sync ``` ```json theme={null} {"run_id":"abc123","run_url":"https://agent.tinyfish.ai/runs/abc123","status":"COMPLETED","result":{"price":"$99"},"num_of_steps":4} ``` Submits the run and returns immediately with the `run_id`. 
Use this when you want to fire-and-forget or manage polling yourself. ```bash theme={null} tinyfish agent run "Extract the pricing" --url example.com/pricing --async ``` ```json theme={null} {"run_id":"abc123","run_url":"https://agent.tinyfish.ai/runs/abc123","error":null} ``` *** ## `tinyfish agent run list` List recent runs. ```bash theme={null} tinyfish agent run list ``` ### Flags | Flag | Description | | ------------------- | ----------------------------------------------------------------------------- | | `--status ` | Filter by status: `PENDING`, `RUNNING`, `COMPLETED`, `FAILED`, or `CANCELLED` | | `--limit ` | Number of runs to return (default `20`, max `100`) | | `--cursor ` | Pagination cursor from a previous response | | `--pretty` | Human-readable output | ### Examples ```bash List all runs theme={null} tinyfish agent run list ``` ```bash Filter by status theme={null} tinyfish agent run list --status COMPLETED ``` ```bash Paginate theme={null} tinyfish agent run list --limit 50 --cursor ``` ```bash Human-readable theme={null} tinyfish agent run list --pretty ``` *** ## `tinyfish agent run get ` Get the full details of a specific run. ```bash theme={null} tinyfish agent run get abc123 ``` Returns the complete run object including status, result, and metadata. ### Flags | Flag | Description | | ---------- | --------------------- | | `--pretty` | Human-readable output | ### Example ```bash JSON output theme={null} tinyfish agent run get abc123 ``` ```bash Human-readable theme={null} tinyfish agent run get abc123 --pretty ``` *** ## `tinyfish agent run cancel ` Cancel a run that is `PENDING` or `RUNNING`. 
```bash theme={null} tinyfish agent run cancel abc123 ``` ```json theme={null} {"run_id":"abc123","status":"CANCELLED","cancelled_at":"2024-01-15T10:30:00Z","message":null} ``` ### Flags | Flag | Description | | ---------- | --------------------- | | `--pretty` | Human-readable output | *** ## Auth commands | Command | Description | | ---------------------- | --------------------------------------------------------- | | `tinyfish auth login` | Open the API keys page and save a key interactively | | `tinyfish auth set` | Read an API key from stdin and save it | | `tinyfish auth status` | Check whether a key is configured and where it comes from | | `tinyfish auth logout` | Remove the saved API key | # CLI Source: https://docs.tinyfish.ai/cli/index Run TinyFish automations directly from your terminal The TinyFish CLI lets you run browser automations, check results, and manage runs from your terminal — no code required. ## Installation ```bash theme={null} npm install -g @tiny-fish/cli ``` **Verify:** ```bash theme={null} tinyfish --version ``` ## Authentication ```bash theme={null} tinyfish auth login ``` Opens the API keys page in your browser. Paste your key when prompted. The key is saved to `~/.tinyfish/config.json`. **For CI/CD:** ```bash theme={null} echo $TINYFISH_API_KEY | tinyfish auth set ``` **Check status:** ```bash theme={null} tinyfish auth status ``` Returns JSON with `source` (`env`, `config`, or `none`), `key_preview` (first/last chars of the key, or `null`), and `authenticated` (`true`/`false`). Exit code 1 when not authenticated. ## Quick Start ```bash theme={null} npm install -g @tiny-fish/cli export TINYFISH_API_KEY="sk-tinyfish-..." tinyfish agent run --url "https://example.com" "Extract product data. Return JSON." 
``` ## Environment variables | Variable | Description | | ------------------ | ------------------------------------------ | | `TINYFISH_API_KEY` | API key — takes priority over saved config | ## Output By default all commands output JSON to stdout. Errors go to stderr as JSON. Exit code 1 on failure. Add `--pretty` to any command for human-readable output. # Common Patterns Source: https://docs.tinyfish.ai/common-patterns Ready-to-use code patterns for common TinyFish Web Agent use cases These patterns cover the most common ways to integrate TinyFish Web Agent into your application. ## Simple Extraction Use the synchronous endpoint for quick, one-off extractions where you need the result immediately. ```typescript theme={null} import { TinyFish, RunStatus } from "@tiny-fish/sdk"; const client = new TinyFish(); async function extractData(url: string, dataDescription: string) { const run = await client.agent.run({ url, goal: `Extract ${dataDescription}. Return as JSON.`, }); return run.status === RunStatus.COMPLETED ? run.result : null; } // Usage async function main() { const products = await extractData( "https://example.com/products", "all product names and prices" ); console.log(products); } main(); ``` *** ## Batch Processing For multiple URLs, use the async endpoint to submit all tasks at once, then poll for results. This avoids blocking while waiting for each task to complete. 
```typescript theme={null} import { TinyFish, RunStatus } from "@tiny-fish/sdk"; const client = new TinyFish(); async function processBatch(tasks: { url: string; goal: string }[]) { // Submit all tasks const responses = await Promise.all( tasks.map((task) => client.agent.queue(task)) ); // Poll for completion const maxAttempts = 150; // 5 minutes at 2s intervals const results = await Promise.all( responses.map(async (r) => { if (r.error) { throw r.error; } for (let attempt = 0; attempt < maxAttempts; attempt++) { const run = await client.runs.get(r.run_id); if ( run.status === RunStatus.COMPLETED || run.status === RunStatus.FAILED || run.status === RunStatus.CANCELLED ) { return run; } await new Promise((resolve) => setTimeout(resolve, 2000)); } throw new Error(`Run ${r.run_id} timed out after ${maxAttempts} attempts`); }) ); return results; } // Usage async function main() { const results = await processBatch([ { url: "https://example.com/page1", goal: "Extract product info" }, { url: "https://example.com/page2", goal: "Extract product info" }, ]); console.log(results); } main(); ``` *** ## Retry with Stealth Mode Some sites block automated requests. Start with lite mode for speed, then automatically retry with stealth mode if you get blocked. 
```typescript theme={null} import { TinyFish, RunStatus, BrowserProfile, ProxyCountryCode } from "@tiny-fish/sdk"; const client = new TinyFish(); async function extractWithFallback(url: string, goal: string) { // Try standard mode first let result = await client.agent.run({ url, goal, browser_profile: BrowserProfile.LITE, }); if (result.status === RunStatus.FAILED && result.error?.message.includes("blocked")) { // Retry with stealth mode result = await client.agent.run({ url, goal, browser_profile: BrowserProfile.STEALTH, proxy_config: { enabled: true, country_code: ProxyCountryCode.US }, }); } return result; } ``` *** ## Result Validation A run with `COMPLETED` status means the agent finished, but the result may still describe a failure (e.g., the site showed a captcha or access-denied page). Always validate the result content. ```python Python theme={null} def is_real_success(result): """COMPLETED status is necessary but not sufficient.""" if not result: return False result_str = str(result).lower() failure_signals = ["captcha", "blocked", "access denied", "could not", "unable to"] return not any(signal in result_str for signal in failure_signals) # Usage from tinyfish import TinyFish, RunStatus client = TinyFish() run = client.agent.run( url="https://example.com", goal="Extract pricing data. 
Return as JSON.", ) if run.status == RunStatus.COMPLETED and is_real_success(run.result): print("Success:", run.result) else: print("Needs retry or manual review") ``` ```typescript TypeScript theme={null} import { TinyFish, RunStatus } from "@tiny-fish/sdk"; function isRealSuccess(result: unknown): boolean { if (!result) return false; let resultStr: string; try { resultStr = JSON.stringify(result).toLowerCase(); } catch { resultStr = String(result).toLowerCase(); } const failureSignals = ["captcha", "blocked", "access denied", "could not", "unable to"]; return !failureSignals.some((signal) => resultStr.includes(signal)); } // Usage async function main() { const client = new TinyFish(); const run = await client.agent.run({ url: "https://example.com", goal: "Extract pricing data. Return as JSON.", }); if (run.status === RunStatus.COMPLETED && isRealSuccess(run.result)) { console.log("Success:", run.result); } else { console.log("Needs retry or manual review"); } } main(); ``` *** ## Rate Limit Handling TinyFish has concurrency limits based on your plan. The SDK automatically retries `429` and `5xx` errors with exponential backoff (up to `maxRetries` attempts, default 2). ```typescript theme={null} import { TinyFish, RateLimitError } from "@tiny-fish/sdk"; // Adjust retry behavior via client options const client = new TinyFish({ maxRetries: 3, // default is 2 }); async function main() { try { const run = await client.agent.run({ url: "https://example.com", goal: "Extract data", }); console.log(run.result); } catch (e) { if (e instanceof RateLimitError) { console.log("Rate limited after all retries exhausted"); } throw e; } } main(); ``` *** ## Cross-API Workflows Chain multiple TinyFish APIs together for complex workflows. 
### Search + Fetch Search for URLs, then fetch full content from the top results: ```python Python theme={null} from tinyfish import TinyFish client = TinyFish() # Step 1: Search for relevant pages search_results = client.search.query("best python web frameworks 2026") # Step 2: Fetch content from top 3 results urls = [r.url for r in search_results.results[:3]] fetched = client.fetch.get_contents(urls=urls, format="markdown") for page in fetched.results: print(f"{page.title}: {page.text[:200]}...") ``` ```typescript TypeScript theme={null} import { TinyFish } from "@tiny-fish/sdk"; const client = new TinyFish(); // Step 1: Search for relevant pages const searchResults = await client.search.query({ query: "best python web frameworks 2026", }); // Step 2: Fetch content from top 3 results const urls = searchResults.results.slice(0, 3).map((r) => r.url); const fetched = await client.fetch.getContents({ urls, format: "markdown" }); fetched.results.forEach((page) => console.log(`${page.title}: ${page.text.slice(0, 200)}...`) ); ``` ### Search + Agent Search for a URL, then extract structured data via the Agent API: ```python Python theme={null} from tinyfish import TinyFish, RunStatus client = TinyFish() # Step 1: Find a product page search_results = client.search.query("scrapeme pokemon shop bulbasaur") target_url = search_results.results[0].url # Step 2: Extract structured data with the Agent run = client.agent.run( url=target_url, goal="""Extract from this product page: - product_name, price (number), in_stock (boolean) Return as JSON.""", ) if run.status == RunStatus.COMPLETED: print(run.result) ``` ```typescript TypeScript theme={null} import { TinyFish, RunStatus } from "@tiny-fish/sdk"; const client = new TinyFish(); // Step 1: Find a product page const searchResults = await client.search.query({ query: "scrapeme pokemon shop bulbasaur", }); const targetUrl = searchResults.results[0].url; // Step 2: Extract structured data with the Agent const run = await 
client.agent.run({ url: targetUrl, goal: `Extract from this product page: - product_name, price (number), in_stock (boolean) Return as JSON.`, }); if (run.status === RunStatus.COMPLETED) { console.log(run.result); } ``` *** ## Related Best practices for AI agents More detailed examples # Error Codes Source: https://docs.tinyfish.ai/error-codes API error codes and how to resolve them ## Error Response Format API errors use one of two schemas depending on the HTTP status code: ```json theme={null} // 400 Bad Request — may include validation details { "error": { "code": "INVALID_INPUT", "message": "Validation failed", "details": [ { "code": "too_small", "path": ["goal"], "message": "Too small: expected string to have >=1 characters" } ] } } ``` ```json theme={null} // 401 Unauthorized and other errors — simpler schema without details { "error": { "code": "INVALID_API_KEY", "message": "The provided API key is invalid" } } ``` 400 validation errors may include a `details` array with Zod validation issues. Not all 400 errors include `details`. 401 and other errors use a simpler schema without `details`. ## Error Codes Reference ### MISSING\_API\_KEY **HTTP Status:** 401 The `X-API-Key` header was not included in the request. ```json theme={null} { "error": { "code": "MISSING_API_KEY", "message": "X-API-Key header is required" } } ``` **Solution:** Add the `X-API-Key` header to your request: ```bash theme={null} curl -H "X-API-Key: $TINYFISH_API_KEY" ... ``` *** ### INVALID\_API\_KEY **HTTP Status:** 401 The provided API key does not exist or has been revoked. ```json theme={null} { "error": { "code": "INVALID_API_KEY", "message": "The provided API key is invalid" } } ``` **Solutions:** 1. Verify your API key is correct (no extra whitespace) 2. Check if the key was deleted in the [API Keys dashboard](https://agent.tinyfish.ai/api-keys) 3. Generate a new key if needed *** ### INVALID\_INPUT **HTTP Status:** 400 The request body failed validation. 
```json theme={null} { "error": { "code": "INVALID_INPUT", "message": "Validation failed", "details": [ { "code": "invalid_string", "path": ["url"], "message": "Invalid URL" }, { "code": "too_small", "path": ["goal"], "message": "Required field missing" } ] } } ``` **Common Causes:** * `url` is missing or not a valid URL (must include `https://`) * `goal` is empty or missing * `browser_profile` is not "lite" or "stealth" * `proxy_config.country_code` is not a supported 2-letter code **Solution:** Check the `details` field for specific validation errors. *** ### RATE\_LIMIT\_EXCEEDED **HTTP Status:** 429 Too many requests in a short period. ```json theme={null} { "error": { "code": "RATE_LIMIT_EXCEEDED", "message": "Rate limit exceeded. Try again in 60 seconds." } } ``` Rate limits vary by API and plan. Search API: 5 requests/minute. Other APIs have higher limits. The `Retry-After` header is not currently returned — use exponential backoff (see example below). **Solutions:** 1. Implement exponential backoff in your code 2. Space out requests (recommended: 1-2 seconds between calls) 3. Use batch endpoints for high-volume workloads 4. Contact support for higher rate limits **Example: Exponential Backoff** ```python theme={null} import time import random from tinyfish import TinyFish, RateLimitError client = TinyFish() def call_with_backoff(fn, max_retries=5): for attempt in range(max_retries): try: return fn() except RateLimitError: if attempt == max_retries - 1: raise wait = (2 ** attempt) + random.uniform(0, 1) time.sleep(wait) ``` *** ### UNAUTHORIZED **HTTP Status:** 401 Authentication failed for a reason other than missing/invalid key. ```json theme={null} { "error": { "code": "UNAUTHORIZED", "message": "Authentication failed" } } ``` **Solutions:** 1. Check your account status at [agent.tinyfish.ai/api-keys](https://agent.tinyfish.ai/api-keys) 2. Verify your API key hasn't expired 3. 
Try generating a new API key *** ### FORBIDDEN **HTTP Status:** 403 Authentication succeeded, but you lack permission for this action. ```json theme={null} { "error": { "code": "FORBIDDEN", "message": "Insufficient credits or no active subscription" } } ``` **Common Causes:** * No remaining credits * Subscription has expired * Attempting to access a resource you don't own **Solution:** Check your account balance and subscription status at [agent.tinyfish.ai/api-keys](https://agent.tinyfish.ai/api-keys). *** ### NOT\_FOUND **HTTP Status:** 404 The requested resource does not exist. ```json theme={null} { "error": { "code": "NOT_FOUND", "message": "Run not found" } } ``` **Common Causes:** * Invalid `run_id` in `GET /v1/runs/:id` * Run was deleted or never existed * Typo in the run ID **Solution:** Verify the run ID is correct. Run IDs are returned from `/v1/automation/run-async` or can be listed via `GET /v1/runs`. *** ### INTERNAL\_ERROR **HTTP Status:** 500 An unexpected error occurred on the server. ```json theme={null} { "error": { "code": "INTERNAL_ERROR", "message": "An unexpected error occurred" } } ``` **Solutions:** 1. Retry the request after a brief delay 2. If the error persists, check [status.agent.tinyfish.ai](https://status.agent.tinyfish.ai) for outages 3. Contact support with your request details and timestamp ## Run Status vs Error Codes Error codes indicate **API-level failures** (authentication, validation, server errors). For **automation-level failures** (browser crashed, goal couldn't be achieved), check the `status` and `error` fields in the run response. See [Understanding Run Status](/faq#what-does-completed-status-mean). 
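As a rough sketch of that distinction, the helper below sorts a response into an API-level error or a run-level outcome. It works on plain dictionaries shaped like the JSON examples in this document. The name `classify_response` and the returned labels are illustrative assumptions, not part of the TinyFish SDK, and any status other than `COMPLETED`, `FAILED`, or `CANCELLED` is simply treated as unfinished.

```python
from typing import Any, Optional


def classify_response(http_status: int, body: Optional[dict[str, Any]]) -> str:
    """Sort a response into a coarse outcome label.

    Hypothetical helper: the name and labels are illustrative only.
    """
    body = body or {}

    # API-level failure: 4xx/5xx status with an {"error": {"code": ...}} envelope.
    if http_status >= 400:
        code = (body.get("error") or {}).get("code", "UNKNOWN")
        return f"api_error:{code}"

    # Automation-level outcome: the request was accepted, so inspect the run.
    status = body.get("status")
    if status == "FAILED":
        return "run_failed"
    if status == "CANCELLED":
        return "run_cancelled"
    if status == "COMPLETED":
        return "run_completed"

    # Assumption: anything else means the run has not finished yet.
    return "run_not_finished"
```

A `run_completed` label only says that the run finished. The `result` field still has to be checked to confirm that the goal was achieved.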
## HTTP Status Code Summary

| Status | Meaning           | Error Codes                                          |
| ------ | ----------------- | ---------------------------------------------------- |
| 400    | Bad Request       | `INVALID_INPUT`                                      |
| 401    | Unauthorized      | `MISSING_API_KEY`, `INVALID_API_KEY`, `UNAUTHORIZED` |
| 403    | Forbidden         | `FORBIDDEN`                                          |
| 404    | Not Found         | `NOT_FOUND`                                          |
| 429    | Too Many Requests | `RATE_LIMIT_EXCEEDED`                                |
| 500    | Server Error      | `INTERNAL_ERROR`                                     |

## Related

* API key setup and troubleshooting
* Common questions and issues

# Async Bulk Requests

Source: https://docs.tinyfish.ai/examples/bulk-requests-async

Submit multiple runs and poll for results

## Overview

The async API pattern is ideal when you want to submit multiple long-running tasks and check their status later. Instead of waiting for each run to complete, you submit all requests and get back run IDs that you can poll for completion.

## How It Works

1. Submit requests to `/v1/automation/run-async`. Each request returns a `run_id`, which you need in order to check the status of that run.
2. Poll an individual run's status with `GET /v1/runs/:id`.
3.
Or fetch all runs with `GET /v1/runs` to monitor batch progress ## Basic Example Submit multiple TinyFish Web Agent runs and poll for completion: ```python Python theme={null} import asyncio from tinyfish import AsyncTinyFish, RunStatus async def wait_for_completion(client, run_id, poll_interval=2): """Poll a run until it completes""" while True: run = await client.runs.get(run_id) if run.status in (RunStatus.COMPLETED, RunStatus.FAILED, RunStatus.CANCELLED): return run await asyncio.sleep(poll_interval) async def main(): client = AsyncTinyFish() # Define your batch of tasks tasks_to_run = [ { "url": "https://scrapeme.live/shop/", "goal": "Extract all available products on page two with their name, price, and review rating (if available)", }, { "url": "https://books.toscrape.com/", "goal": "Extract all available books on page two with their title, price, and review rating (if available)", }, ] # Step 1: Submit all tinyfish runs and collect run_ids print("Submitting tinyfish runs...") submit_tasks = [ client.agent.queue(url=task["url"], goal=task["goal"]) for task in tasks_to_run ] responses = await asyncio.gather(*submit_tasks) run_ids = [r.run_id for r in responses] print(f"Submitted {len(run_ids)} runs: {run_ids}") # Step 2: Wait for all runs to complete print("Waiting for completion...") completion_tasks = [ wait_for_completion(client, run_id) for run_id in run_ids ] results = await asyncio.gather(*completion_tasks) # Step 3: Process results for i, run in enumerate(results): print(f"Run {i + 1} ({run.run_id}):") print(f" Status: {run.status}") if run.status == RunStatus.COMPLETED: print(f" Result: {run.result}") # Run the async main function asyncio.run(main()) ``` ```typescript TypeScript theme={null} import { TinyFish, RunStatus } from "@tiny-fish/sdk"; const client = new TinyFish(); async function waitForCompletion(runId: string, pollInterval = 2000) { while (true) { const run = await client.runs.get(runId); if ( run.status === RunStatus.COMPLETED || 
run.status === RunStatus.FAILED || run.status === RunStatus.CANCELLED ) { return run; } await new Promise((r) => setTimeout(r, pollInterval)); } } async function main() { // Define your batch of tasks const tasksToRun = [ { url: "https://scrapeme.live/shop/", goal: "Extract all available products on page two with their name, price, and review rating (if available)", }, { url: "https://books.toscrape.com/", goal: "Extract all available books on page two with their title, price, and review rating (if available)", }, ]; // Step 1: Submit all tinyfish runs and collect run_ids console.log("Submitting tinyfish runs..."); const responses = await Promise.all( tasksToRun.map((task) => client.agent.queue(task)) ); const runIds = responses.map((response) => { if (response.error) { throw new Error(`Failed to queue run: ${response.error.message}`); } return response.run_id; }); console.log(`Submitted ${runIds.length} runs:`, runIds); // Step 2: Wait for all runs to complete console.log("Waiting for completion..."); const results = await Promise.all(runIds.map((id) => waitForCompletion(id))); // Step 3: Process results results.forEach((run, i) => { console.log(`Run ${i + 1} (${run.run_id}):`); console.log(` Status: ${run.status}`); if (run.status === RunStatus.COMPLETED) { console.log(` Result:`, run.result); } }); } main(); ``` ## Fire and Forget Pattern Submit tasks without waiting for completion: ```python Python theme={null} async def main(): client = AsyncTinyFish() tasks_to_run = [ {"url": "https://example.com/page1", "goal": "Extract product info"}, {"url": "https://example.com/page2", "goal": "Extract product info"}, {"url": "https://example.com/page3", "goal": "Extract product info"}, ] # Submit all tasks submit_tasks = [ client.agent.queue(url=task["url"], goal=task["goal"]) for task in tasks_to_run ] responses = await asyncio.gather(*submit_tasks) run_ids = [r.run_id for r in responses] print(f"Submitted {len(run_ids)} runs") print(f"Run IDs: {run_ids}") print("Check 
status later using client.runs.get(run_id)") asyncio.run(main()) ``` ```typescript TypeScript theme={null} import { TinyFish } from "@tiny-fish/sdk"; const client = new TinyFish(); async function main() { const tasksToRun = [ { url: "https://example.com/page1", goal: "Extract product info" }, { url: "https://example.com/page2", goal: "Extract product info" }, { url: "https://example.com/page3", goal: "Extract product info" }, ]; // Submit all tasks const responses = await Promise.all( tasksToRun.map((task) => client.agent.queue(task)) ); const runIds = responses.map((response) => { if (response.error) { throw new Error(`Failed to queue run: ${response.error.message}`); } return response.run_id; }); console.log(`Submitted ${runIds.length} runs`); console.log("Run IDs:", runIds); console.log("Check status later using client.runs.get(runId)"); } main(); ``` ## When to Use Async vs Sync | Use Case | API Pattern | Why | | ------------------- | ------------------ | ------------------------------------ | | Quick tasks (\<30s) | Sync `/run` | Simpler code, immediate results | | Long-running tasks | Async `/run-async` | Don't block, check later | | Large batches | Async `/run-async` | Submit all at once, monitor progress | | Fire and forget | Async `/run-async` | No need to wait | | Real-time feedback | SSE `/run-sse` | Stream progress events | ## Best Practices ### Polling Interval * **Short tasks (under 1 min)**: Poll every 2-3 seconds * **Medium tasks (1-5 min)**: Poll every 5-10 seconds * **Long tasks (over 5 min)**: Poll every 30-60 seconds ### Error Handling Always check run status and handle failures: ```python Python theme={null} async def process_completed_run(run): if run.status == RunStatus.COMPLETED: return run.result elif run.status == RunStatus.FAILED: print(f"Run {run.run_id} failed: {run.error}") return None elif run.status == RunStatus.CANCELLED: print(f"Run {run.run_id} was cancelled") return None ``` ## API Reference Start TinyFish Web Agent run 
asynchronously Get all runs Check individual run status Cancel a running automation ## Related Sync API for immediate results Extract data from pages # Concurrent Requests Source: https://docs.tinyfish.ai/examples/bulk-requests-sync Process multiple runs in parallel for better performance ## Overview When you need to scrape multiple pages, fill multiple forms, or process a batch of URLs, firing requests concurrently can significantly speed up your workflow. This guide shows how to run multiple TinyFish Web Agent runs in parallel using the sync API. ## Basic Example Fire multiple requests concurrently and gather results: ```python Python theme={null} import asyncio from tinyfish import AsyncTinyFish async def main(): client = AsyncTinyFish() # Define your batch of tasks - scraping multiple sites tasks_to_run = [ { "url": "https://scrapeme.live/shop/", "goal": "Extract all available products on page two with their name, price, and review rating (if available)", }, { "url": "https://books.toscrape.com/", "goal": "Extract all available books on page two with their title, price, and review rating (if available)", }, ] # Fire all requests concurrently tasks = [ client.agent.run(url=task["url"], goal=task["goal"]) for task in tasks_to_run ] # Wait for all tasks to complete results = await asyncio.gather(*tasks) # Process results for i, response in enumerate(results): print(f"Task {i + 1} result:", response.result) # Run the async main function asyncio.run(main()) ``` ```typescript TypeScript theme={null} import { TinyFish } from "@tiny-fish/sdk"; const client = new TinyFish(); async function main() { // Define your batch of tasks - scraping multiple sites const tasksToRun = [ { url: "https://scrapeme.live/shop/", goal: "Extract all available products on page two with their name, price, and review rating (if available)", }, { url: "https://books.toscrape.com/", goal: "Extract all available books on page two with their title, price, and review rating (if available)", }, ]; 
  // Fire all requests concurrently
  const results = await Promise.all(
    tasksToRun.map((task) => client.agent.run(task))
  );

  // Process results
  results.forEach((response, i) => {
    console.log(`Task ${i + 1} result:`, response.result);
  });
}

main();
```

The sync `/run` API works well for concurrent requests: there is no SSE stream to handle, so a batch reduces to a single `asyncio.gather()` or `Promise.all()` call.

## Batch Multiple Forms

Fill multiple contact forms concurrently:

```python Python theme={null}
import asyncio

from tinyfish import AsyncTinyFish


async def main():
    client = AsyncTinyFish()

    companies = [
        {"name": "Acme Corp", "url": "https://acme.com/contact"},
        {"name": "TechStart", "url": "https://techstart.io/contact"},
        {"name": "BuildIt", "url": "https://buildit.com/contact"},
    ]

    # One run per company; the goal text is customized with the company name
    tasks = [
        client.agent.run(
            url=company["url"],
            goal=f"""Fill in the contact form:
            - Name field: "John Doe"
            - Email field: "john@example.com"
            - Message field: "Interested in partnership with {company['name']}"
            Then click Submit and extract the success message.
            """,
        )
        for company in companies
    ]

    # Wait for every form submission to finish
    results = await asyncio.gather(*tasks)

    for company, response in zip(companies, results):
        print(f"{company['name']}: {response.result}")


asyncio.run(main())
```

## Gotchas and Caveats

**Concurrency Limits**: Each user account has a concurrency limit for simultaneous browser sessions. When you exceed this limit, additional requests are queued automatically rather than rejected with a 429 error.
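Because requests beyond the limit are queued rather than rejected, a client-side cap is optional, but it may make per-request timing easier to reason about. The sketch below shows one way to cap in-flight calls with `asyncio.Semaphore`. `gather_with_limit` and `fake_run` are hypothetical names, not part of the TinyFish SDK; `fake_run` stands in for a real `client.agent.run(...)` call, and the limit of 3 is an assumed value.

```python
import asyncio
from typing import Awaitable, Callable, Iterable, TypeVar

T = TypeVar("T")


async def gather_with_limit(
    factories: Iterable[Callable[[], Awaitable[T]]],
    limit: int,
) -> list[T]:
    """Run coroutine factories with at most `limit` of them in flight at once."""
    semaphore = asyncio.Semaphore(limit)

    async def guarded(factory: Callable[[], Awaitable[T]]) -> T:
        # Hold a slot for the full duration of the call.
        async with semaphore:
            return await factory()

    # asyncio.gather returns results in the same order as the inputs.
    return await asyncio.gather(*(guarded(f) for f in factories))


async def fake_run(i: int) -> int:
    # Stand-in for `client.agent.run(...)`; swap in a real call here.
    await asyncio.sleep(0.01)
    return i * 2


async def demo() -> list[int]:
    # Assumed limit of 3 concurrent sessions; use your plan's actual limit.
    factories = [lambda i=i: fake_run(i) for i in range(10)]
    return await gather_with_limit(factories, limit=3)


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

Passing zero-argument factories instead of already-created coroutines means each call is only started once a slot is free.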
### Queueing Behavior When you hit your account's concurrency cap: * **No 429 errors**: Unlike traditional rate-limited APIs, TinyFish won't reject your request with a 429 status code * **Automatic queueing**: Your request will be accepted and queued until a browser session becomes available * **Longer run times**: The total run time will include both queue wait time and execution time **Example scenario**: If your account allows 3 concurrent sessions and you fire 10 requests simultaneously: * Requests 1-3 start immediately * Requests 4-10 are queued * As each request completes, the next queued request begins * You won't get errors, but later requests will take longer to complete We're actively working on improving the queueing experience with better visibility into queue position and estimated wait times. This behavior will be enhanced in an upcoming release. ### Best Practices * **Know your limits**: Check your plan's concurrency limit in your dashboard * **Batch sizing**: Size your concurrent batches to match your concurrency limit for optimal performance * **Progress tracking**: Implement timing/logging to monitor which requests are queued vs executing * **Error handling**: Always handle potential timeouts for long-running or queued requests ## Related Extract data from pages Automate form submissions Complete API documentation # Form Filling Source: https://docs.tinyfish.ai/examples/form-filling Automate form filling and submission ## Basic Example Fill and submit a contact form: ```python Python theme={null} from tinyfish import TinyFish, EventType, RunStatus client = TinyFish() with client.agent.stream( url="https://example.com/contact", goal="""Fill in the contact form: - Name field: "John Doe" - Email field: "john@example.com" - Message field: "I am interested in your services." Then click the Submit button and extract the success message. 
""", ) as stream: for event in stream: if event.type == EventType.COMPLETE and event.status == RunStatus.COMPLETED: print("Result:", event.result_json) ``` ```typescript TypeScript theme={null} import { TinyFish, EventType, RunStatus } from "@tiny-fish/sdk"; async function main() { const client = new TinyFish(); const stream = await client.agent.stream({ url: "https://example.com/contact", goal: `Fill in the contact form: - Name field: "John Doe" - Email field: "john@example.com" - Message field: "I am interested in your services." Then click the Submit button and extract the success message. `, }); for await (const event of stream) { if (event.type === EventType.COMPLETE && event.status === RunStatus.COMPLETED) { console.log("Result:", event.result); } } } main(); ``` ```bash cURL theme={null} curl -N -X POST https://agent.tinyfish.ai/v1/automation/run-sse \ -H "X-API-Key: $TINYFISH_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "url": "https://example.com/contact", "goal": "Fill in the contact form with name John Doe and email john@example.com, then click Submit" }' ``` ```bash CLI theme={null} tinyfish agent run "Fill in the contact form: Name: John Doe, Email: john@example.com, Message: I am interested in your services. Then click Submit and return the success message." \ --url example.com/contact --pretty ``` **Output:** ```json theme={null} { "success": true, "message": "Thank you for contacting us!" 
} ``` ## Multi-Step Form Handle multi-step forms in a single goal: ```python Python theme={null} from tinyfish import TinyFish client = TinyFish() with client.agent.stream( url="https://example.com/signup", goal="""Complete the multi-step signup form: Step 1 (Personal Info): - First name: "John" - Last name: "Doe" - Email: "john@example.com" - Click "Next" Step 2 (Address): - Street: "123 Main St" - City: "San Francisco" - State: "CA" - ZIP: "94102" - Click "Next" Step 3 (Preferences): - Select "Email notifications" checkbox - Click "Submit" Extract the confirmation number from the success page. """, ) as stream: for event in stream: print(event) ``` ```typescript TypeScript theme={null} import { TinyFish } from "@tiny-fish/sdk"; async function main() { const client = new TinyFish(); const stream = await client.agent.stream({ url: "https://example.com/signup", goal: `Complete the multi-step signup form: Step 1 (Personal Info): - First name: "John" - Last name: "Doe" - Email: "john@example.com" - Click "Next" Step 2 (Address): - Street: "123 Main St" - City: "San Francisco" - State: "CA" - ZIP: "94102" - Click "Next" Step 3 (Preferences): - Select "Email notifications" checkbox - Click "Submit" Extract the confirmation number from the success page. `, }); for await (const event of stream) { console.log(event); } } main(); ``` ```bash cURL theme={null} curl -N -X POST https://agent.tinyfish.ai/v1/automation/run-sse \ -H "X-API-Key: $TINYFISH_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "url": "https://example.com/signup", "goal": "Complete the multi-step signup form: Step 1 - First name John, Last name Doe, Email john@example.com, click Next. Step 2 - Street 123 Main St, City San Francisco, State CA, ZIP 94102, click Next. Step 3 - Select Email notifications checkbox, click Submit. Extract the confirmation number." 
}' ``` ## Tips * Use stealth mode for login/signup forms * Be explicit about field values in your goal * Describe buttons by their text ("click 'Submit'") * Handle multi-step forms in one goal ## Try It Save any example above as `form.ts` `export TINYFISH_API_KEY="your_api_key" ` `npx tsx form.ts ` ## Related Extract data from pages Complete API docs # Web Scraping Source: https://docs.tinyfish.ai/examples/scraping Extract data from any website using natural language ## Basic Example Extract product data from any page: ```python Python theme={null} from tinyfish import TinyFish, EventType, RunStatus client = TinyFish() with client.agent.stream( url="https://scrapeme.live/shop/Bulbasaur/", goal="Extract the product name, price, and stock status", ) as stream: for event in stream: if event.type == EventType.COMPLETE and event.status == RunStatus.COMPLETED: print("Result:", event.result_json) ``` ```typescript TypeScript theme={null} import { TinyFish, EventType, RunStatus } from "@tiny-fish/sdk"; async function main() { const client = new TinyFish(); const stream = await client.agent.stream({ url: "https://scrapeme.live/shop/Bulbasaur/", goal: "Extract the product name, price, and stock status", }); for await (const event of stream) { if (event.type === EventType.COMPLETE && event.status === RunStatus.COMPLETED) { console.log("Result:", event.result); } } } main(); ``` ```bash cURL theme={null} curl -N -X POST https://agent.tinyfish.ai/v1/automation/run-sse \ -H "X-API-Key: $TINYFISH_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "url": "https://scrapeme.live/shop/Bulbasaur/", "goal": "Extract the product name, price, and stock status" }' ``` ```bash CLI theme={null} tinyfish agent run "Extract the product name, price, and stock status" \ --url scrapeme.live/shop/Bulbasaur/ --pretty ``` **Output:** ```json theme={null} { "name": "Bulbasaur", "price": 63, "inStock": true } ``` ## Extract Multiple Items Get all products from a category page: ```python Python 
theme={null} from tinyfish import TinyFish client = TinyFish() with client.agent.stream( url="https://scrapeme.live/shop/", goal="Extract all products on this page. For each product return: name, price, and link", ) as stream: for event in stream: print(event) ``` ```typescript TypeScript theme={null} import { TinyFish } from "@tiny-fish/sdk"; async function main() { const client = new TinyFish(); const stream = await client.agent.stream({ url: "https://scrapeme.live/shop/", goal: "Extract all products on this page. For each product return: name, price, and link", }); for await (const event of stream) { console.log(event); } } main(); ``` ```bash cURL theme={null} curl -N -X POST https://agent.tinyfish.ai/v1/automation/run-sse \ -H "X-API-Key: $TINYFISH_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "url": "https://scrapeme.live/shop/", "goal": "Extract all products on this page. For each product return: name, price, and link" }' ``` **Output:** ```json theme={null} { "products": [ { "name": "Bulbasaur", "price": 63, "link": "https://..." }, { "name": "Ivysaur", "price": 87, "link": "https://..." }, { "name": "Venusaur", "price": 105, "link": "https://..." 
} ] } ``` ## Use Stealth Mode For sites with bot protection: ```python Python theme={null} from tinyfish import TinyFish, BrowserProfile client = TinyFish() with client.agent.stream( url="https://protected-site.com", goal="Extract product data", browser_profile=BrowserProfile.STEALTH, ) as stream: for event in stream: print(event) ``` ```typescript TypeScript theme={null} import { TinyFish, BrowserProfile } from "@tiny-fish/sdk"; async function main() { const client = new TinyFish(); const stream = await client.agent.stream({ url: "https://protected-site.com", goal: "Extract product data", browser_profile: BrowserProfile.STEALTH, }); for await (const event of stream) { console.log(event); } } main(); ``` ```bash cURL theme={null} curl -N -X POST https://agent.tinyfish.ai/v1/automation/run-sse \ -H "X-API-Key: $TINYFISH_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "url": "https://protected-site.com", "goal": "Extract product data", "browser_profile": "stealth" }' ``` ## Use Proxy Route through a specific country: ```python Python theme={null} from tinyfish import TinyFish, BrowserProfile, ProxyConfig, ProxyCountryCode client = TinyFish() with client.agent.stream( url="https://geo-restricted-site.com", goal="Extract data", browser_profile=BrowserProfile.STEALTH, proxy_config=ProxyConfig(enabled=True, country_code=ProxyCountryCode.US), ) as stream: for event in stream: print(event) ``` ```typescript TypeScript theme={null} import { TinyFish, BrowserProfile, ProxyCountryCode } from "@tiny-fish/sdk"; async function main() { const client = new TinyFish(); const stream = await client.agent.stream({ url: "https://geo-restricted-site.com", goal: "Extract data", browser_profile: BrowserProfile.STEALTH, proxy_config: { enabled: true, country_code: ProxyCountryCode.US }, }); for await (const event of stream) { console.log(event); } } main(); ``` ```bash cURL theme={null} curl -N -X POST https://agent.tinyfish.ai/v1/automation/run-sse \ -H "X-API-Key: 
$TINYFISH_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "url": "https://geo-restricted-site.com", "goal": "Extract data", "browser_profile": "stealth", "proxy_config": { "enabled": true, "country_code": "US" } }' ``` ## Try It Save any example above as `scraper.ts` `export TINYFISH_API_KEY="your_api_key" ` `npx tsx scraper.ts ` ## Related Automate form submissions Complete API docs # Frequently Asked Questions Source: https://docs.tinyfish.ai/faq Common questions about TinyFish Web Agent ## General TinyFish Web Agent is an AI-powered web automation API that lets you automate any website using natural language. Instead of writing brittle selectors, you describe what you want to do, and our AI handles the rest. Any publicly accessible website. For authenticated sites, include login steps in your goal. For sites with bot protection, use stealth mode. Typically 3-10 seconds for simple pages, 30-60 seconds for complex multi-step automations. Time depends on page load speed and task complexity. Yes! Every run provides a `streaming_url` where you can watch the browser execute live for 24 hours after completion. ## API Usage The REST API uses the `X-API-Key` header: ``` X-API-Key: $TINYFISH_API_KEY ``` The MCP endpoint uses OAuth 2.1 for AI assistant integrations. Yes, configure proxy routing: ```typescript theme={null} { proxy_config: { enabled: true, country_code: "US" } } ``` Supported countries: US, GB, CA, DE, FR, JP, AU. ## Technical We use Chromium-based browsers. Choose between: * **Lite**: Standard Chromium (fast) * **Stealth**: Modified Chromium with anti-detection (slower but bypasses bot protection) We fully support SPAs (React, Vue, Angular). Pages are rendered and JavaScript is executed before extraction. Yes! The recommended approach is to connect your password manager (1Password or Bitwarden) in **Settings → Vault**. Select credentials per run and TinyFish handles login securely — the AI agent never sees your actual passwords. 
See the [Connect Your Vault](/vault-setup) guide for setup and [Vault Credentials](/key-concepts/credentials) for how it works. If you haven't connected a vault, you can still include login steps in your goal: ```typescript theme={null} goal: ` 1. Login with username "user@example.com" and password "pass123" 2. Navigate to dashboard 3. Extract account balance ` ``` Including credentials directly in goals is less secure — they appear in run logs and AI context. Use vault credentials when possible. Yes, describe pagination in your goal: ```typescript theme={null} goal: `Click "Next Page" button 5 times, extracting products from each page` ``` ## Troubleshooting `status: "COMPLETED"` means the **infrastructure succeeded** - the browser launched, navigated, and the automation finished without crashing. **It does NOT mean the goal was achieved.** You must check the `result` field to determine if the goal succeeded. **Scenario 1: Goal achieved** ```json theme={null} { "status": "COMPLETED", "result": { "products": [ { "name": "iPhone 15", "price": "$799" } ] }, "error": null } ``` The `result` contains the extracted data - goal succeeded. **Scenario 2: Infrastructure succeeded, goal failed** ```json theme={null} { "status": "COMPLETED", "result": { "status": "failure", "reason": "Could not find any products on the page", "product_price": null }, "error": null } ``` Status is COMPLETED (browser worked), but `result.status` is "failure" indicating the goal wasn't achieved. **Scenario 3: Infrastructure failed** ```json theme={null} { "status": "FAILED", "result": null, "error": { "message": "Browser crashed during execution" } } ``` The automation couldn't complete due to infrastructure issues. 
**Best Practice:** Always validate `result` content, not just `status`: ```typescript theme={null} if (run.status === "COMPLETED" && run.result) { // Check if result indicates goal failure if (run.result.status === "failure" || run.result.error) { console.log("Goal not achieved:", run.result.reason || run.result.error); } else { console.log("Data extracted:", run.result); } } else if (run.status === "FAILED") { console.log("Automation failed:", run.error?.message); } ``` Common causes: 1. **Timeout** - Site is slow or down * Solution: Retry or use stealth mode 2. **Access Denied** - Anti-bot protection * Solution: Use stealth mode + proxy 3. **Element Not Found** - Goal is too specific * Solution: Make goal more flexible (describe visually) 4. **Invalid URL** - URL is malformed * Solution: Ensure URL includes `https://` ```typescript theme={null} // Use stealth mode browser_profile: "stealth" // Add proxy proxy_config: { enabled: true, country_code: "US" } // Reduce speed (add delays in goal) goal: "Wait 3 seconds, then click button" ``` ## Best Practices **Good** (specific, actionable): ```typescript theme={null} goal: "Extract product name, price, and stock status from the product details section" ``` **Bad** (vague): ```typescript theme={null} goal: "Get data" ``` Use stealth when: * Site shows CAPTCHA * Getting "Access Denied" errors * Site uses Cloudflare or anti-bot protection Otherwise use lite mode (faster). ## Getting Help [support@tinyfish.io](mailto:support@tinyfish.io) Join our community ## Related Get started in 5 minutes Complete endpoint docs # Fetch API Source: https://docs.tinyfish.ai/fetch-api/index Render any URL and extract clean text — no external APIs required The TinyFish Fetch API renders web pages using a real browser (including JavaScript-heavy sites) and returns clean extracted text in your preferred format. Submit a URL, get back structured content. 
```bash theme={null}
POST https://api.fetch.tinyfish.ai
```

`api.fetch.tinyfish.ai` is the public Fetch API endpoint.

## Before You Start

Visit [agent.tinyfish.ai/api-keys](https://agent.tinyfish.ai/api-keys) and create a key. Store it in your environment:

```bash theme={null}
export TINYFISH_API_KEY="your_api_key_here"
```

All requests require the `X-API-Key` header. See [Authentication](/authentication) for the full setup and troubleshooting guide.

## Your First Request

```python Python theme={null}
import httpx

response = httpx.post(
    "https://api.fetch.tinyfish.ai",
    headers={"X-API-Key": "your_api_key_here"},
    json={"urls": ["https://www.tinyfish.ai/"]},
    timeout=120,
)

data = response.json()
print(data["results"][0]["title"])
print(data["results"][0]["text"])
```

```typescript TypeScript theme={null}
const response = await fetch("https://api.fetch.tinyfish.ai", {
  method: "POST",
  headers: {
    "X-API-Key": "your_api_key_here",
    "Content-Type": "application/json",
  },
  body: JSON.stringify({
    urls: ["https://www.tinyfish.ai/"],
  }),
});

const data = await response.json();
console.log(data.results[0].title);
console.log(data.results[0].text);
```

```bash cURL theme={null}
curl -X POST https://api.fetch.tinyfish.ai \
  -H "X-API-Key: $TINYFISH_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"urls": ["https://www.tinyfish.ai/"]}'
```

## What Success Looks Like

```json theme={null}
{
  "results": [
    {
      "url": "https://www.tinyfish.ai/",
      "final_url": "https://www.tinyfish.ai/",
      "title": "TinyFish | Enterprise Web Agent Infrastructure",
      "description": "TinyFish provides enterprise infrastructure for AI web agents.",
      "language": "en",
      "text": "# TinyFish | Enterprise Web Agent Infrastructure\n\nTinyFish provides enterprise infrastructure for AI web agents...\n"
    }
  ],
  "errors": []
}
```

## When to Use Fetch vs the Other APIs

* Use **Fetch** when you already know the URL and need clean extracted page content.
* Use **Search** when you need help finding the right URLs first.
* Use **Agent** when TinyFish should perform a multi-step workflow on the site.
* Use **Browser** when you need direct browser control from your own code.

***

## Fetching Multiple URLs

Submit up to 10 URLs in a single request. Each URL is processed independently, so one failure doesn't affect the others.

```python Python theme={null}
import httpx

response = httpx.post(
    "https://api.fetch.tinyfish.ai",
    headers={"X-API-Key": "your_api_key_here"},
    json={
        "urls": [
            "https://www.tinyfish.ai/",
            "https://en.wikipedia.org/wiki/Web_scraping",
            "https://docs.python.org/3/tutorial/index.html",
        ]
    },
    timeout=120,  # same as the single-URL example; httpx defaults to 5 seconds
)

data = response.json()

# Successful pages are listed in results
for result in data["results"]:
    print(result["url"], "→", result["title"])

# Per-URL failures are listed separately in errors
for error in data["errors"]:
    print("Failed:", error["url"], "–", error["error"])
```

```typescript TypeScript theme={null}
const response = await fetch("https://api.fetch.tinyfish.ai", {
  method: "POST",
  headers: {
    "X-API-Key": "your_api_key_here",
    "Content-Type": "application/json",
  },
  body: JSON.stringify({
    urls: [
      "https://www.tinyfish.ai/",
      "https://en.wikipedia.org/wiki/Web_scraping",
      "https://docs.python.org/3/tutorial/index.html",
    ],
  }),
});

const data = await response.json();
data.results.forEach((r) => console.log(r.url, "→", r.title));
data.errors.forEach((e) => console.log("Failed:", e.url, "–", e.error));
```

```bash cURL theme={null}
curl -X POST https://api.fetch.tinyfish.ai \
  -H "X-API-Key: $TINYFISH_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "urls": [
      "https://www.tinyfish.ai/",
      "https://en.wikipedia.org/wiki/Web_scraping",
      "https://docs.python.org/3/tutorial/index.html"
    ]
  }'
```

Per-URL failures (timeouts, DNS errors, anti-bot blocks) appear in `errors[]` alongside a `200` response; they do not cause the entire request to fail.

***

## Output Formats

Control the format of the `text` field with the `format` parameter. When omitted, the default is `markdown`.

Semantic HTML.

```json theme={null}
{
  "format": "html",
  "text": "

Async Fn in Traits Are Now Available

Starting with Rust 1.75, you can use async fn directly inside traits.

What Changed

Works in all stable traits
No heap allocation for simple cases