internetarchive/heritrix3: Heritrix is the Internet Archive’s open-source, extensible, web-scale, archival-quality web crawler project.
▻https://github.com/internetarchive/heritrix3
Heritrix is the Internet Archive’s open-source, extensible, web-scale, archival-quality web crawler project.
– Le wiki de documentation: ▻https://github.com/internetarchive/heritrix3/wiki
– téléchargement: ▻http://builds.archive.org/maven2/org/archive/heritrix/heritrix