Independent Firecrawl reference

Plan Firecrawl workflows
for AI web data.

Choose the right Firecrawl endpoint.Compare cloud and self-host paths.Check source links and license tradeoffs.

Search Scrape Crawl Map Agent-ready
Unofficial guide Upstream: firecrawl/firecrawl License: AGPL-3.0
Web pages flowing through a crawler and extraction pipeline into markdown cards, JSON tables, and an agent console.

Web data workflow planner

Turn a research or extraction task into an endpoint plan.

{
  "status": "ready_for_input",
  "suggested_path": ["agent", "search", "scrape"],
  "source_of_truth": "Use official Firecrawl docs and GitHub as the canonical reference."
}

Product thesis

Firecrawl Space is a decision layer, not a shadow API.

The site helps developers choose Firecrawl workflows, compare hosted and self-hosted tradeoffs, and cite upstream facts without implying official affiliation.

1

Match endpoint to job

Search for discovery, scrape for one URL, crawl for site jobs, map for URL discovery, batch scrape for known lists, and agent for broader research.

2

Keep evidence attached

Source URLs, timestamps, output format, and limitations stay visible so AI answers remain auditable.

3

Separate cloud from self-host

Hosted use should go to official Firecrawl. Self-host plans need Docker, Redis, browser service, security, and AGPL review.

4

Reduce brand risk

Every core page states this is an independent, unofficial guide and links back to the upstream source of truth.

Endpoint map

Primary Firecrawl workflows this site explains.

Search

Find sources and return page content for research flows that start with a query.

Scrape

Convert one URL into Markdown, HTML, screenshots, or schema-shaped data.

Crawl

Run an asynchronous site crawl when the target is a whole domain or directory.

Map

Discover site URLs before deciding what to scrape or crawl.

Batch scrape

Process a known URL list with async job handling and output checks.

Agent and interact

Use agent tasks or page actions when a job needs research, navigation, or UI interaction.

Source-of-truth block

What this site can safely claim.

Firecrawl is an open-source TypeScript project for web search, scraping, crawling, mapping, interaction, and agent-oriented extraction. The upstream repository is AGPL-3.0, with official hosted service and docs maintained by Firecrawl.

Official site
firecrawl.dev
Official docs
docs.firecrawl.dev
Relationship
Independent, unofficial reference and planning site.