GitHub - JWriter20/CaptchaKraken-cli: Does the main solving work for CaptchaKraken, designed as a cli to be language agnostic, as I want to make both a python and ts solver

CaptchaKraken CLI

AI-powered, fully local captcha-solving CLI that uses attention-based vision models to extract precise click coordinates for common web captchas.

Description

CaptchaKraken takes a screenshot of a captcha challenge, classifies the captcha type, highlights and numbers all interactable regions, and then plans the sequence of clicks needed to solve it.
It is designed to be:

CLI-first: run end‑to‑end solves from the command line.
Model-agnostic: pluggable attention models for coordinate extraction.
Debuggable: optional overlays and debug images to inspect detection and planning.

High-level flow:

Classify the captcha (checkbox vs image grid vs text prompt, etc.).
Detect and number all interactable elements in the captcha (checkboxes, tiles, buttons).
Plan actions using the point and detect tools to generate click coordinates.
Output the sequence of actions (clicks) that can be replayed in a browser automation stack.

Captcha support status

Checkbox captchas – end‑to‑end solving working.
Image selection / image grid captchas – end‑to‑end solving working.
Text captchas – basic plumbing present, solving still in progress.

Additional captcha types and more robust classification/solving strategies are under active development.

Name		Name	Last commit message	Last commit date
Latest commit History 67 Commits
captchaimages		captchaimages
scripts		scripts
src		src
tests		tests
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
.sparse-checkout-example		.sparse-checkout-example
LICENSE		LICENSE
PLANNER_USAGE.md		PLANNER_USAGE.md
README.md		README.md
cli.py		cli.py
generate_visualization.py		generate_visualization.py
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CaptchaKraken CLI

Description

Captcha support status

About

Uh oh!

Releases

Packages

Languages

License

JWriter20/CaptchaKraken-cli

Folders and files

Latest commit

History

Repository files navigation

CaptchaKraken CLI

Description

Captcha support status

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages