mirror of https://github.com/pirate/ArchiveBox.git synced 2025-08-26 15:54:36 +02:00

Updated Web Archiving Community (markdown)

Nick Sweeting
2019-05-08 17:05:43 -04:00
parent d3bc5c24a2
commit 753d0407a3

@@ -85,7 +85,6 @@ Indexes of archiving institutions and software maintained by other people. If t
- **[Archive.it](https://archive-it.org) commercial Wayback-Machine solution**
- **[Heritrix](https://github.com/internetarchive/heritrix3) The king of internet archiving crawlers, powers the Wayback Machine**
- **[Brozzler](https://github.com/internetarchive/brozzler) headless Chrome crawler + WARC archiver maintained by Archive.org**
- [OpenWayback](https://github.com/iipc/openwayback/wiki) Toolkit of major open-source wayback-machine components
- [WarcProx](https://github.com/internetarchive/warcprox) WARC proxy recording and playback utility
- [WarcTools](https://github.com/internetarchive/warctools) utilities for dealing with WARCs
- [Grab-Site](https://github.com/ArchiveTeam/grab-site) An easy preconfigured web crawler designed for backing up websites
@@ -137,7 +136,6 @@ Indexes of archiving institutions and software maintained by other people. If t
- **[OpenWayback](https://github.com/iipc/openwayback/wiki) Open source project developing core Wayback-Machine components**
- **[awesome-web-archiving](https://github.com/iipc/awesome-web-archiving) Large list of archiving projects and orgs**
- [OpenWayback](https://github.com/iipc/openwayback/wiki) Toolkit of major open-source wayback-machine components
- [JWARC](https://github.com/iipc/jwarc) A Java library for reading and writing WARC files.
- [More on their GitHub...](https://github.com/iipc)