Skip to content

Scan a page using CSS with JSON output. Ruby + Sinatra app for Heroku.

License

Notifications You must be signed in to change notification settings

runway7/scanner

Repository files navigation

#Scanner, by Runway7

Scanner is a small and fast Ruby + Sinatra app (easily runs on Heroku) that can quickly scans a webpage and gives back elements based on the given CSS paths. Results are returned in JSON.

The API takes two parameters:

url: The webpage you want to scan, with the protocol (http / https). URL encoded.

path: Comma separated list of CSS3 paths you want to extract (URL encode each path before joining them with commas).

Examples:

http://scanner.runway7.net/?url=http://www.google.com&path=title,meta[name=%22description%22]

http://scanner.runway7.net/?url=http://news.ycombinator.com&path=td.title%20a

http://scanner.runway7.net/?url=http://www.reddit.com&path=.sitetable%20.thing%20.entry%20a.title

If you intend to use Scanner for production or any high-volume work, please deploy your own instance. Heroku works great.

Crafted in India by Sudhir Jonathan.

About

Scan a page using CSS with JSON output. Ruby + Sinatra app for Heroku.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages