✨ Turn any website into clean, contextualized data pipelines for your AI applications ✨
Maxun is the easiest way to extract web data with no code. The modern open-source alternative to BrowseAI, Octoparse and similar tools.
Go To App • Documentation • Website • Discord • Watch Tutorials
Maxun is a no-code ecosystem for web data extraction. Its point-and-click interface lets anyone extract data from websites without writing code. In minutes, users can build automation robots that turn websites into structured APIs, LLM-ready Markdown, or spreadsheets, and run extractions at scale.
Maxun uses web robots to power everything you can do on the platform. There are two types of robots, each designed for a different job.
Extract robots emulate real user behavior and capture structured data at scale.
- Built for automation and structured data extraction
- Point-and-click interface - no coding required
- Extract from any website, including behind logins
- Record user actions (clicks, scrolls, form fills, pagination, etc.)
- Convert sites into APIs, spreadsheets, and workflows
- Scale extractions and run on schedules or via API
- Handle infinite scrolling and pagination
- Auto-adapt to website layout & structural changes
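To make "handle pagination" concrete, here is a minimal Python sketch of the general idea: keep requesting pages and following a "next" cursor until none is left. It is not Maxun's implementation. The page shape (`items`, `next`), the `fetch_page` callback, and the fake site data are all assumptions made up for illustration.

```python
from typing import Callable, Iterator, Optional

# Hypothetical page shape (an assumption, not a Maxun format):
# {"items": [...], "next": <cursor string or None>}


def iterate_items(
    fetch_page: Callable[[Optional[str]], dict],
    max_pages: int = 100,
) -> Iterator[dict]:
    """Yield items from each page, following the 'next' cursor until it runs out."""
    cursor: Optional[str] = None
    for _ in range(max_pages):  # hard cap guards against endless pagination loops
        page = fetch_page(cursor)
        yield from page.get("items", [])
        cursor = page.get("next")
        if cursor is None:
            break


# Stand-in for a real website: three fake pages keyed by cursor.
FAKE_SITE = {
    None: {"items": [{"id": 1}, {"id": 2}], "next": "p2"},
    "p2": {"items": [{"id": 3}], "next": "p3"},
    "p3": {"items": [{"id": 4}], "next": None},
}


def fake_fetch(cursor: Optional[str]) -> dict:
    return FAKE_SITE[cursor]


all_items = list(iterate_items(fake_fetch))
first_page_only = list(iterate_items(fake_fetch, max_pages=1))
```

The `max_pages` cap is a design choice worth keeping in any real pagination loop: a site that always returns a "next" link would otherwise never terminate.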
Scrape robots are built for clean content and AI workflows.
- Get clean HTML and LLM-ready Markdown from any website
- Remove scripts, styling, ads, and clutter automatically
- Perfect for RAG systems, AI summarization, embeddings, and content pipelines
- Extract main content while filtering out navigation and irrelevant elements
- Ideal for feeding clean data to large language models
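As a rough illustration of what "removing scripts, styling, and clutter" involves, the sketch below uses only Python's standard `html.parser` to drop a few noisy elements and emit Markdown-style text. It is a simplified stand-in, not how Maxun works internally. The set of skipped tags, the class name `MarkdownishExtractor`, and the sample page are assumptions chosen for the example.

```python
from html.parser import HTMLParser

# Elements treated as clutter in this sketch (an assumed list, adjust as needed).
SKIP_TAGS = {"script", "style", "noscript", "nav", "footer", "aside"}
HEADINGS = {"h1": "# ", "h2": "## ", "h3": "### "}


class MarkdownishExtractor(HTMLParser):
    """Collect visible text blocks, skipping clutter and marking headings."""

    def __init__(self):
        super().__init__()
        self.skip_depth = 0  # greater than 0 while inside a skipped element
        self.prefix = ""     # Markdown heading marker for the current block
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self.skip_depth += 1
        elif tag in HEADINGS:
            self.prefix = HEADINGS[tag]

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self.skip_depth > 0:
            self.skip_depth -= 1
        elif tag in HEADINGS:
            self.prefix = ""

    def handle_data(self, data):
        text = " ".join(data.split())  # collapse runs of whitespace
        if text and self.skip_depth == 0:
            # Limitation: inline tags such as <b> split a paragraph into
            # separate blocks; a real converter would merge them.
            self.blocks.append(self.prefix + text)


def html_to_markdown(html):
    parser = MarkdownishExtractor()
    parser.feed(html)
    parser.close()
    return "\n\n".join(parser.blocks)


SAMPLE = """
<html><head><style>body{color:red}</style><script>track()</script></head>
<body>
<nav>Home | About</nav>
<h1>Pricing</h1>
<p>Plans start at $10 per month.</p>
<footer>Copyright</footer>
</body></html>
"""

markdown = html_to_markdown(SAMPLE)
print(markdown)
```

For the sample page, only the heading and the paragraph survive; the navigation, footer, script, and style content are dropped.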
The simplest & fastest way to get started is to use the hosted version: https://app.maxun.dev. You can self-host if you prefer!
Maxun can run locally with or without Docker:
- Self Host Maxun With Docker & Portainer
- Upgrade Maxun With Docker Compose Setup
- Upgrade Maxun Without Docker Compose Setup
- ✨ Open webpages and navigate sites automatically
- ✨ Log in to secured websites and maintain sessions
- ✨ Click on buttons, links, and interactive elements
- ✨ Fill out forms with custom data
- ✨ Select from dropdowns, radios, checkboxes, dates, times, etc.
- ✨ Take screenshots - fullpage or visible sections
- ✨ Capture structured data without writing code
- ✨ Handle infinite scrolling and pagination automatically
- ✨ Run on schedules - set it and forget it
- ✨ Trigger via APIs for third-party integrations
- ✨ Extract behind login walls and authentication
- ✨ Integrate with applications like N8N, Google Sheets, Airtable, and more
- ✨ Send data to webhooks for real-time processing
- ✨ Get clean HTML from websites for AI applications
- ✨ Turn websites into LLM-ready markdown for AI pipelines
- ✨ Talk to your LLM with MCP (Model Context Protocol)
LambdaTest: GenAI-powered Quality Engineering Platform that empowers teams to test intelligently, smarter, and ship faster.
CyberYozh App: Infrastructure for developers working with multi‑accounting & automation in one place.
- ✨ Extract Data With No-Code - Point and click interface
- ✨ Two Robot Types - Extract for structured data, Scrape for clean content
- ✨ Handle Pagination & Scrolling - Automatic navigation
- ✨ Run Robots On Schedules - Set it and forget it
- ✨ Turn Websites to APIs - RESTful endpoints from any site
- ✨ Turn Websites to Spreadsheets - Direct data export
- ✨ Adapt To Website Layout Changes - Auto-recovery from site updates
- ✨ Extract Behind Login - Handle authentication seamlessly
- ✨ Integrations - Connect with your favorite tools
- ✨ MCP Support - Model Context Protocol integration
- ✨ LLM-Ready Data - Clean Markdown for AI applications
- ✨ Self-Hostable - Full control over your infrastructure
- ✨ Open Source - Transparent and community-driven
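The "websites to spreadsheets" idea boils down to flattening extracted records into rows and columns. The Python sketch below shows one way that could look with the standard `csv` module. It assumes the extracted data is already a list of dictionaries; the field names (`title`, `price`, `rating`) and the helper `rows_to_csv` are hypothetical and do not reflect Maxun's actual export format.

```python
import csv
import io


def rows_to_csv(rows):
    """Serialize a list of dicts to CSV text, using the union of keys as columns."""
    columns = []
    for row in rows:
        for key in row:
            if key not in columns:
                columns.append(key)  # keep first-seen column order

    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=columns,
        restval="",           # blank cell when a record lacks a field
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical extracted records; real robots may return different fields.
rows = [
    {"title": "Cozy loft", "price": "120"},
    {"title": "Beach house, sea view", "price": "340", "rating": "4.8"},
]

csv_text = rows_to_csv(rows)
print(csv_text)
```

Taking the union of keys means records with missing fields still line up under the right headers, which is common when a listing page shows optional details for only some items.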
Start extracting web data in minutes, not days. No code required.
Maxun can be used for a variety of use cases, including lead generation, market research, and content aggregation. View the use cases in detail here: https://www.maxun.dev/#usecases
This project is in the early stages of development. Your feedback is very important to us, and we're actively working on improvements.
This project is licensed under AGPLv3.
Star the repository, contribute if you love what we’re building, or sponsor us.
Thank you to the combined efforts of everyone who contributes!