1
0
mirror of https://github.com/pirate/ArchiveBox.git synced 2025-08-14 18:44:24 +02:00

Update README.md

This commit is contained in:
Nick Sweeting
2024-01-28 00:26:07 -08:00
committed by GitHub
parent b2d1083453
commit dbcbdc7691

View File

@@ -33,7 +33,7 @@ Without active preservation effort, everything on the internet eventually dissap
<img src="https://github.com/ArchiveBox/ArchiveBox/assets/511499/90f1ce3c-75bb-401d-88ed-6297694b76ae" alt="snapshot detail page" align="right" width="190px" style="float: right"/>
💾 **It saves snapshots of the URLs you feed it in several redundant formats.**
**It saves snapshots of the URLs you feed it in several redundant formats.**
It also detects any content featured *inside* each webpage & extracts it out into a folder:
- 🌐 **HTML**/**Any websites** ➡️ `original HTML+CSS+JS`, `singlefile HTML`, `screenshot PNG`, `PDF`, `WARC`, ...
- 🎥 **Social Media**/**News** ➡️ `post content TXT`, `comments`, `title`, `author`, `images`