We need deduplication to save storage when the same job is crawled repeatedly, based on a dynamically created index of the previous crawl's contents.
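
A minimal sketch of what this could look like, assuming the previous crawl's pages are stored on disk and the index is a simple URL-key to content-hash map rebuilt at the start of each run (all names and the storage layout below are hypothetical, not an existing API):

```python
# Sketch: dedupe a repeated crawl against an index built from the previous crawl.
import hashlib
from pathlib import Path


def content_hash(payload: bytes) -> str:
    """Hash the page body so identical content dedupes regardless of URL."""
    return hashlib.sha256(payload).hexdigest()


def build_index(previous_crawl_dir: Path) -> dict[str, str]:
    """Dynamically build a url-key -> content-hash index from the stored previous crawl."""
    index: dict[str, str] = {}
    for page_file in previous_crawl_dir.glob("*.html"):
        index[page_file.stem] = content_hash(page_file.read_bytes())
    return index


def should_store(url_key: str, payload: bytes, index: dict[str, str]) -> bool:
    """Store only pages that are new or whose content changed since the last crawl."""
    return index.get(url_key) != content_hash(payload)
```

With this, the new crawl only writes pages where `should_store` returns True, so unchanged pages cost no additional storage; the index itself is transient and rebuilt per run rather than persisted.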