Name: Crawlstack
Author: Crawlstack

Question 1

What is Crawlstack?

Accepted Answer

Crawlstack is enterprise browser infrastructure for AI agents and developers. It runs a real Chrome browser — as an extension or in a Docker container — giving your agents and scripts programmatic access to authenticated, dynamic, and bot-protected sites. It includes a task scheduler, JavaScript runner, SQLite database, MCP integration, and a relay for remote control. No proxies, no cloud dependency, no separate infrastructure to manage.

Question 2

How is Crawlstack different from Scrapy, Playwright, or Puppeteer?

Accepted Answer

Tools like Playwright and Puppeteer run headless browsers on remote servers and need proxy networks to avoid detection. Crawlstack runs inside your real browser, using your actual session — so there's nothing to detect. It also replaces your entire scraping stack: storage, scheduling, and deduplication are built in. No assembly required.

Question 3

Can Crawlstack bypass Cloudflare Turnstile and other bot protection?

Accepted Answer

Yes. Because Crawlstack runs inside a real, lived-in browser with your existing cookies and browsing history, it passes the environmental checks that trip up headless scrapers automatically. For Cloudflare Turnstile specifically, Crawlstack uses the Chrome DevTools Protocol (CDP) to dispatch hardware-level mouse events that the browser reports as isTrusted: true — the same signal a physical click generates.

Question 4

Do I need to know how to code?

Accepted Answer

You need basic JavaScript — specifically, enough to select elements with document.querySelector. If you've ever pasted a snippet into the Chrome DevTools console, you already have the skills. Crawlstack scripts use standard DOM APIs plus a simple runner object for storing data and queuing tasks. No new framework to learn.

Question 5

Can I run Crawlstack in production at scale?

Accepted Answer

Yes. Crawlstack runs identically as a headless Docker container for always-on production automation. For multi-node scale, connect multiple instances to a shared libSQL or Turso database — they'll coordinate via a shared task queue, turning your instances into a distributed scraping cluster. Start local, scale when you need to.

Question 6

What database does Crawlstack use, and how do I get my data out?

Accepted Answer

Crawlstack uses SQLite via WebAssembly (persisted with OPFS), capable of ingesting up to 10,000 records per second and handling millions of rows without browser slowdown. You can export data as CSV, JSON, or raw SQLite at any time. For live pipelines, you can configure real-time webhooks, sync to a Turso cloud instance, or query via the Public API relay.

Question 7

Is Crawlstack free? How does the pricing work?

Accepted Answer

Crawlstack is free and open source. You run it on your own hardware — a laptop, a cheap VPS, or a Docker container — and pay nothing in platform fees. The only costs are whatever hosting you choose to use. There's no per-request billing, no proxy markups, and no data egress charges. A managed Crawlstack Cloud option (hosted browser nodes, zero infrastructure) is coming soon — join the waitlist on the homepage.

Question 8

Why did you build Crawlstack?

Accepted Answer

AI agents need the web. But the web is increasingly hostile to automated access — bot detection, dynamic rendering, authentication walls. We built Crawlstack to give AI agents and developers a first-class browser runtime they can control programmatically: MCP-native, self-hosted, and portable from laptop to Docker cluster without changing a line of code.

Scrape the toughest websites.Without limits, on your own infrastructure.

Install in one command

Write plain JavaScript

Up to 10,000 items per second

Turnstile solver built in

Flight recorder for every run

Move like a human

Webhooks push data in real time

MCP native from day one

Dashboard from anywhere

Distributed runs across nodes

Extract with LLMs

Semantic search for your RAG

Ready-to-use Templates

Amundi

Example.com

Mac-Point.cz

Sitemap example

StartupJobs.cz

Frequently Asked Questions

Ready to get started?