Transform any website into Markdown
Turn any website into structured JSON or Markdown. The API for developers building agents, RAG pipelines, and data products.
Built for the AI era.
Turn any website or document into clean data so you can focus on the intelligence.
RAG Datasets
Turn websites into context for your RAG pipeline.
- Clean-ready formatted
- Semantic sitemap
- Easy-ready formatted
- Best-in-class
LLM Training
Fine-tune your models on live-scraped datasets.
- Custom format support
- Large-batch requests
- Custom filtering
- Data scheduling
Knowledge Bases
Turn all your documents into a unified knowledge base.
- Text-data
- Question support
- Content linking
- Semantic search
Market Intelligence
Monitor competitors, news, and market data.
- Real-time tracking
- Content scraping
- Price monitoring
- Data-ready pipelines
Benchmarks
Gather real-world data to evaluate code performance.
- Edge-case discovery
- Quality checks
- Real-world data
- Result comparisons
Documentation
Convert structured docs into reproducible formats for AI.
- Markdown export
- Clean formatting
- Link preservation
- Version tracking
Everything you need to scale.
Reliability, security, and SDKs. We handle the infrastructure so you can focus on the intelligence.
Smart Crawling & Navigation
Intelligently navigate websites, handle dynamic content such as infinite scroll, and interact with pages that require user input.
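To make the infinite-scroll case concrete, here is a minimal sketch of one common approach: keep scrolling until the number of loaded items stops growing. The `ScrollablePage` interface, `loadAllItems`, and `fakePage` are hypothetical names used only for illustration; they are not part of the service's SDK, and a real browser driver would replace the in-memory stand-in.

```typescript
// Hypothetical minimal page interface; a real browser driver exposes far more.
interface ScrollablePage {
  itemCount(): Promise<number>;
  scrollToBottom(): Promise<void>;
}

// Scroll until the item count stops growing or maxScrolls is reached.
async function loadAllItems(page: ScrollablePage, maxScrolls = 20): Promise<number> {
  let count = await page.itemCount();
  for (let i = 0; i < maxScrolls; i++) {
    await page.scrollToBottom();
    const next = await page.itemCount();
    if (next === count) break; // nothing new appeared, so assume the end of the feed
    count = next;
  }
  return count;
}

// In-memory stand-in: each scroll reveals up to `batch` more items out of `total`.
function fakePage(total: number, batch: number): ScrollablePage {
  let loaded = Math.min(batch, total);
  return {
    itemCount: async () => loaded,
    scrollToBottom: async () => {
      loaded = Math.min(loaded + batch, total);
    },
  };
}
```

The `maxScrolls` cap is a design choice rather than a requirement: it keeps the loop bounded on feeds that never run out of content.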
await crawler.navigate({
  url: "https://example.com",
  waitFor: "networkidle",
  interactions: ["click", "scroll"]
});
Headless Browser
Get the latest data easily with our auto-managed browsers that handle all the heavy work behind the scenes.
const browser = await launcher.launch({
  headless: true,
  stealth: true
});
Clean Markdown
Transform any HTML into LLM-ready Markdown. Handles tables and code blocks, and preserves formatting.
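As a rough illustration of what HTML-to-Markdown conversion involves, the sketch below maps a handful of simple, well-formed tags to their Markdown equivalents with regular expressions. `htmlToMarkdownSketch` is a hypothetical helper, not the service's converter; it does not handle tables, nesting edge cases, or malformed markup, for which a production converter would presumably use a real HTML parser.

```typescript
// Hypothetical helper for illustration only: handles a few simple, well-formed tags.
function htmlToMarkdownSketch(html: string): string {
  return html
    // <h1>..<h6> become "#" headings of matching depth.
    .replace(/<h([1-6])[^>]*>(.*?)<\/h\1>/gi, (_m: string, level: string, text: string) =>
      `${"#".repeat(Number(level))} ${text}\n\n`)
    // Links keep their target: <a href="u">t</a> becomes [t](u).
    .replace(/<a\s+[^>]*href="([^"]*)"[^>]*>(.*?)<\/a>/gi, "[$2]($1)")
    // Inline emphasis and inline code.
    .replace(/<(strong|b)>(.*?)<\/\1>/gi, "**$2**")
    .replace(/<(em|i)>(.*?)<\/\1>/gi, "*$2*")
    .replace(/<code>(.*?)<\/code>/gi, "`$1`")
    // Paragraphs become blank-line separated text.
    .replace(/<p[^>]*>(.*?)<\/p>/gi, "$1\n\n")
    // Drop any remaining tags and trim surrounding whitespace.
    .replace(/<[^>]+>/g, "")
    .trim();
}

const sampleMarkdown = htmlToMarkdownSketch(
  '<h2>Title</h2><p>See <a href="https://example.com">the <strong>docs</strong></a>.</p>'
);
```

Here `sampleMarkdown` holds a level-two heading followed by a paragraph whose link and bold text survive the conversion.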
const markdown = await converter.toMarkdown({
  html: rawHtml,
  preserveFormatting: true
});
Real-time Data
Stream every chunk of data as it is processed and get real-time stats on processing progress.
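The chunk-plus-progress pattern can be sketched without any SDK. The `Chunk` shape below is inferred from the `chunk.progress` and `chunk.data` fields used on this page, and treating `progress` as a fraction between 0 and 1 is an assumption. `ChunkStream` is a hypothetical in-memory stand-in for a streaming client.

```typescript
// Hypothetical chunk shape, inferred from `chunk.progress` and `chunk.data`.
interface Chunk {
  progress: number; // assumed to be the fraction completed, from 0 to 1
  data: string;
}

type ChunkListener = (chunk: Chunk) => void;

// Tiny stand-in for a streaming client: listeners register with on("data", ...).
class ChunkStream {
  private listeners: ChunkListener[] = [];

  on(_event: "data", listener: ChunkListener): this {
    this.listeners.push(listener);
    return this;
  }

  // Split `text` into pieces of `size` characters and deliver them in order.
  push(text: string, size: number): void {
    if (size <= 0) throw new RangeError("size must be positive");
    for (let offset = 0; offset < text.length; offset += size) {
      const end = Math.min(offset + size, text.length);
      const chunk: Chunk = { progress: end / text.length, data: text.slice(offset, end) };
      for (const listener of this.listeners) listener(chunk);
    }
  }
}

const demoStream = new ChunkStream();
const received: string[] = [];
let lastProgress = 0;

demoStream.on("data", (chunk) => {
  lastProgress = chunk.progress; // e.g. drive a progress bar
  received.push(chunk.data);     // e.g. append to a buffer or file
});

demoStream.push("abcdefghij", 4);
```

After the final `push`, `received` holds the pieces in arrival order and `lastProgress` reflects the last chunk delivered.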
stream.on('data', (chunk) => {
  console.log('Progress:', chunk.progress);
  // handleChunk is your own callback; naming it `process` would clash with Node's global
  handleChunk(chunk.data);
});
Type-Safe Structured Data
Extract structured JSON with Zod-style schema validation and smart auto-correction.
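What validation with auto-correction might look like can be sketched in plain TypeScript, with no schema library. The `Product` fields mirror the `title`, `price`, and `inStock` example on this page. The coercion rules in `coerceProduct` (stripping currency symbols, accepting "yes" or "in stock") are assumptions made for illustration; the service's actual correction behavior is not described here.

```typescript
// Target shape, mirroring the fields used in the page's schema example.
interface Product {
  title: string;
  price: number;
  inStock: boolean;
}

// Hypothetical coercion rules: validate each field, repairing common scraped spellings.
function coerceProduct(raw: Record<string, unknown>): Product {
  const rawTitle = raw["title"];
  const title = typeof rawTitle === "string" ? rawTitle.trim() : "";
  if (title === "") throw new Error("title must be a non-empty string");

  // Accept numbers, or numeric strings such as "$1,299.00".
  const rawPrice = raw["price"];
  let price = NaN;
  if (typeof rawPrice === "number") {
    price = rawPrice;
  } else if (typeof rawPrice === "string") {
    const cleaned = rawPrice.replace(/[^0-9.-]/g, "");
    price = cleaned === "" ? NaN : Number(cleaned);
  }
  if (!Number.isFinite(price)) throw new Error("price must be numeric");

  // Accept booleans, or a few common textual spellings.
  const rawStock = raw["inStock"];
  const stockText = String(rawStock).trim();
  let inStock: boolean;
  if (typeof rawStock === "boolean") inStock = rawStock;
  else if (/^(true|yes|in stock)$/i.test(stockText)) inStock = true;
  else if (/^(false|no|out of stock)$/i.test(stockText)) inStock = false;
  else throw new Error("inStock must be boolean-like");

  return { title, price, inStock };
}

const corrected = coerceProduct({ title: " Widget ", price: "$1,299.00", inStock: "Yes" });
```

Values that cannot be repaired, such as a price of "n/a", raise an error instead of being silently defaulted, so bad records stay visible.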
const schema = z.object({
  title: z.string(),
  price: z.number(),
  inStock: z.boolean()
});
Integrations
Webhooks are delivered from across the world so you can focus on the intelligence.
webhook.on('data.ready', async (payload) => {
  await processData(payload);
  return { status: 'success' };
});
Simple, transparent pricing
Start free, then scale as you grow. Cancel anytime.
Start Free Trial
Try our service with the free credits. No commitment required.
- ✓ No credit card required
- ✓ 3 business days for business
Starter
Perfect for small projects and individual developers.
- 5,000 credits per month
- 20 requests per requests
- HTML & Markdown formats
- 15 max crawls per minute
- API access
- Web scraping
Growth
Most teams choose our unlimited credit plan.
- 50,000 credits per month
- 20 requests per requests
- HTML & Markdown + scratchpad
- API-to clipboard command
- 15 max crawls per minute
- Priority support
Pro
Built for growing businesses and services at scale.
- 90,000 credits per month
- 50 requests per requests
- HTML & Markdown + scratchpad
- API-to clipboard format
- 30 max crawls per minute
- Dedicated support
- Priority access
- Custom rate limit analysis
Ready to transform web data into actionable intelligence?