CHANGELOG

Unreleased

v1.36.0

feat(sync): subscribe to page updates to perform async handling of data

v1.31.0

feat(js): add init of script parsing

v1.30.5

"feat(worker): add tls support"

v1.30.3

chore(request): add custom domain redirect policy

v1.30.2

chore(glob): fix glob crawl establish

v1.30.1

chore(crawl): fix crawl asset detection and trailing start

v1.29.0

feat(fs): add temp storage resource handling (#112)
feat(url-glob): URL globbing (#113) thanks to @roniemartinez)

v1.28.5

chore(request): fix resource success handling

v1.28.0

feat(proxies): add proxy support

v1.27.2

feat(decentralization): add workload split

v1.19.36

perf(crawl): add join handle task management

v1.19.26

perf(links): add fast pre serialized url anchor link extracting and reduced memory usage
perf(links): fix case sensitivity handling
perf(crawl): reduce memory usage on link gathering
chore(crawl): remove Website.reset method and improve crawl handling resource usage ( reset not needed now )
chore(crawl): add heap usage of links visited
perf(crawl): massive scans capability to utilize more cpu
feat(timeout): add optional configuration.request_timeout duration
build(tokio): remove unused net feature
chore(docs): add missing scrape section

v1.10.7

perf(req): enable brotli
chore(tls): add ALPN tls defaults
chore(statics): add initial static media ignore
chore(robots): add shared client handling across parsers
feat(crawl): add subdomain and tld crawling

v1.6.1

perf(links): filter dup links after async batch
chore(delay): fix crawl delay thread groups
perf(page): slim channel page sending required props

v1.5.3

feat(regex): add optional regex black listing

v1.5.0

chore(bin): fix bin executable #17
feat(cli): add cli separation binary #17
feat(robots): add robots crawl delay respect and ua assign #24
feat(async): add async page body gathering
perf(latency): add connection re-use across request #25

v1.4.0

feat(cli): add cli ability (#16 thanks to @j-mendez)
feat(concurrency): dynamic concurrent cpu defaults (#15 thanks to @j-mendez)
docs: add a changelog

v1.3.1

fix(crawl): fix field type (#14 thanks to @j-mendez)

v1.3.0

feat(crawl): callback to run when link is found (#13 thanks to @j-mendez)

v1.2.0

Add User Agent configuration (#5 thanks to @Dragnucs)
Add polite delay (#6 thanks to @Dragnucs )

v1.1.3

Handle page get errors (#4 thanks to @Dragnucs)
Fix link resolution (#3 thanks to @Dragnucs)