Independent Firecrawl reference

Plan Firecrawl workflows
for AI web data.

Choose the right Firecrawl endpoint.Compare cloud and self-host paths.Check source links and license tradeoffs.

Open planner Self-host checklist Official docs

Search Scrape Crawl Map Agent-ready

Unofficial guide Upstream: firecrawl/firecrawl License: AGPL-3.0

Web pages flowing through a crawler and extraction pipeline into markdown cards, JSON tables, and an agent console.

Web data workflow planner

Turn a research or extraction task into an endpoint plan.

Goal

Scale Output Deployment Compliance posture

{
  "status": "ready_for_input",
  "suggested_path": ["agent", "search", "scrape"],
  "source_of_truth": "Use official Firecrawl docs and GitHub as the canonical reference."
}

Product thesis

Firecrawl Space is a decision layer, not a shadow API.

The site helps developers choose Firecrawl workflows, compare hosted and self-hosted tradeoffs, and cite upstream facts without implying official affiliation.

Match endpoint to job

Search for discovery, scrape for one URL, crawl for site jobs, map for URL discovery, batch scrape for known lists, and agent for broader research.

Keep evidence attached

Source URLs, timestamps, output format, and limitations stay visible so AI answers remain auditable.

Separate cloud from self-host

Hosted use should go to official Firecrawl. Self-host plans need Docker, Redis, browser service, security, and AGPL review.

Reduce brand risk

Every core page states this is an independent, unofficial guide and links back to the upstream source of truth.

Endpoint map

Primary Firecrawl workflows this site explains.

Search

Find sources and return page content for research flows that start with a query.

Scrape

Convert one URL into Markdown, HTML, screenshots, or schema-shaped data.

Crawl

Run an asynchronous site crawl when the target is a whole domain or directory.

Map

Discover site URLs before deciding what to scrape or crawl.

Batch scrape

Process a known URL list with async job handling and output checks.

Agent and interact

Use agent tasks or page actions when a job needs research, navigation, or UI interaction.

Firecrawl API guide

Endpoint selection, output modes, code snippets, error handling, and production guardrails.

Self-host Firecrawl

Docker setup, environment names, Redis and browser service notes, security controls, and AGPL obligations.

Firecrawl alternatives

Compare Firecrawl against crawler libraries, scraping APIs, browser automation, and AI search tools by decision criteria.

Source-of-truth block

What this site can safely claim.

Firecrawl is an open-source TypeScript project for web search, scraping, crawling, mapping, interaction, and agent-oriented extraction. The upstream repository is AGPL-3.0, with official hosted service and docs maintained by Firecrawl.

Upstream repo: github.com/firecrawl/firecrawl
Official site: firecrawl.dev
Official docs: docs.firecrawl.dev
Relationship: Independent, unofficial reference and planning site.

Plan Firecrawl workflowsfor AI web data.