Skip to content

Pull requests: yasserg/crawler4j

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Began work on an asynchronous crawling
#157 opened Sep 8, 2016 by lostmsu
maxPagesToFetch bug
#155 opened Aug 9, 2016 by cmacdonald
Better management of proxies
#80 opened Jul 9, 2015 by Bouki Loading…
allow parsing script tag and other html tags
#114 opened Feb 10, 2016 by code-911 Loading…
Canonical URL meta tag handling and AJAX crawling
#82 opened Jul 15, 2015 by EgbertW Loading…
Improved delay handling
#57 opened May 20, 2015 by EgbertW Loading…
Allow to differentiate between queue sizes in Frontier
#60 opened May 20, 2015 by EgbertW Loading…
Feature: seed tracking
#63 opened May 20, 2015 by EgbertW Loading…
Allow negative priorities
#61 opened May 20, 2015 by EgbertW Loading…
added custom html content filter
#168 opened Nov 7, 2016 by pdesmet Loading…
Add tests for util
#411 opened Aug 5, 2019 by romainbrenguier Loading…
Base clases provide more protected methods for subclasses
#432 opened Jan 25, 2020 by dgoiko Loading…
Timeoutable regular expressions in RobotstxtServer
#429 opened Jan 24, 2020 by dgoiko Loading…
Generic crawl controller
#434 opened Jan 25, 2020 by dgoiko Loading…
Granularity in exception
#428 opened Jan 24, 2020 by dgoiko Loading…
Cloneable CrawlConfig
#431 opened Jan 25, 2020 by dgoiko Loading…
Feature/spring boot example
#382 opened Dec 18, 2018 by s17t Loading…
ProTip! Filter pull requests by the default branch with base:master.