Skip to content

BNNagaraj/blazecrawl

Repository files navigation

BlazeCrawl

The Web Scraping API That Doesn't Rip You Off.

Turn any website into LLM-ready markdown. Pay-per-use. Credits never expire. Zero data retention by default.

Why BlazeCrawl?

Feature BlazeCrawl Firecrawl
Pay-per-use pricing Yes No β€” monthly tiers only
Credits roll over Yes, forever No β€” expire monthly
Zero data retention Default for all Enterprise only
Anti-bot stealth mode Included free Premium only
SDKs Python, Node, Go, Rust, C#, PHP Python, Node, Go, Rust only
Rate limit on free tier 10 concurrent 2 concurrent

API Endpoints

  • POST /api/v1/scrape β€” Scrape a single URL into markdown, HTML, or text
  • POST /api/v1/crawl β€” Crawl an entire website with depth/page limits
  • GET /api/v1/crawl/:id β€” Check crawl job status
  • POST /api/v1/map β€” Discover all URLs on a site (free, no credits consumed)
  • POST /api/v1/extract β€” AI-powered structured data extraction using Claude

Quick Start

curl -X POST https://blazecrawl-dev.web.app/api/v1/scrape \
  -H "Authorization: Bearer bc_live_xxx" \
  -H "Content-Type: application/json" \
  -d '{"url": "https://example.com", "format": "markdown"}'

Self-Hosting

See SELF_HOST.md for instructions on running BlazeCrawl on your own infrastructure.

# Quick start with Docker
cp .env.example .env
# Edit .env with your Firebase and Anthropic API keys
docker compose up

Tech Stack

  • Frontend: Next.js 16 (App Router), Tailwind CSS
  • Backend: Firebase Cloud Functions (Node.js 22), Express
  • Database: Cloud Firestore
  • Auth: Firebase Auth (Email/Password + Google OAuth)
  • AI: Anthropic Claude (for Extract API)
  • Scraping: JSDOM, Readability, Turndown

Project Structure

blazecrawl/
β”œβ”€β”€ src/
β”‚   β”œβ”€β”€ app/
β”‚   β”‚   β”œβ”€β”€ page.tsx              # Landing page
β”‚   β”‚   β”œβ”€β”€ login/page.tsx        # Auth page
β”‚   β”‚   β”œβ”€β”€ dashboard/            # Dashboard (API keys, usage, playground)
β”‚   β”‚   β”œβ”€β”€ docs/page.tsx         # API documentation
β”‚   β”‚   └── api/v1/              # Next.js API routes (local dev)
β”‚   └── lib/                     # Firebase config, auth, scraper utilities
β”œβ”€β”€ functions/
β”‚   └── index.ts                 # Cloud Functions API (production)
β”œβ”€β”€ firebase.json                # Firebase hosting + functions config
β”œβ”€β”€ firestore.rules              # Security rules
└── .env.example                 # Environment variable template

Environment Variables

Copy .env.example and fill in your values:

cp .env.example .env.local

Required variables:

  • NEXT_PUBLIC_FIREBASE_* β€” Firebase client SDK config
  • ANTHROPIC_API_KEY β€” For the Extract API endpoint

Pricing

Plan Price Pages/month Concurrent
Free $0 1,000 10
Pro $29/mo 50,000 100
Scale $99/mo Unlimited 500

All tiers: credits never expire, zero data retention, full API access.

Contributing

We welcome contributions! Please open an issue or submit a pull request.

License

AGPL-3.0 β€” See LICENSE for details.

About

πŸ”₯ The Web Scraping API That Doesn't Rip You Off β€” Turn any website into LLM-ready markdown. 10x cheaper than Firecrawl.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages