mirror of https://github.com/pirate/ArchiveBox.git synced 2025-08-14 02:24:06 +02:00

Updated Configuration (markdown)

Nick Sweeting
2021-08-11 10:53:16 -04:00
parent db2a6eaaae
commit a3386c91c1

@@ -101,7 +101,7 @@ Some text to display in the footer of the archive index. Useful for providing s
---
#### `URL_BLACKLIST`
-**Possible Values:** [`\.(css|js|otf|ttf|woff|woff2|gstatic\.com|googleapis\.com/css)(\?.*)?$`]/`.+\.exe$`/`http(s)?:\/\/(.+)?example.com\/.*'`/...
+**Possible Values:** [`\.(css|js|otf|ttf|woff|woff2|gstatic\.com|googleapis\.com/css)(\?.*)?$`]/`.+\.exe$`/`http(s)?:\/\/(.+)?example.com\/.*`/...
A regular expression used to exclude certain URLs from archiving. You can use it if there are certain domains, extensions, or other URL patterns that you want to ignore whenever they get imported. Blacklisted URLs won't be included in the index, and their page content won't be archived.
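
To check a pattern before setting it, the filtering described above can be approximated in plain Python. This is only a sketch: the helper `is_blacklisted` and the sample URLs are hypothetical, and it assumes the pattern is applied as an unanchored, case-insensitive search over the full URL, which may differ from how ArchiveBox applies it internally.

```python
import re

# Default pattern quoted in the "Possible Values" line above.
DEFAULT_BLACKLIST = r"\.(css|js|otf|ttf|woff|woff2|gstatic\.com|googleapis\.com/css)(\?.*)?$"


def is_blacklisted(url: str, pattern: str = DEFAULT_BLACKLIST) -> bool:
    """Hypothetical helper: True if the pattern matches anywhere in the URL."""
    # Assumption: unanchored, case-insensitive search over the whole URL.
    return re.search(pattern, url, re.IGNORECASE) is not None


# Made-up sample URLs, for illustration only.
urls = [
    "https://example.com/article.html",
    "https://example.com/static/site.css?v=3",
    "https://fonts.googleapis.com/css?family=Roboto",
    "https://example.com/installer.exe",
]

# Keep only the URLs that the default pattern does not exclude.
kept = [u for u in urls if not is_blacklisted(u)]
print(kept)
```

Under these assumptions the stylesheet and the Google Fonts URL are dropped, while the `.exe` link is kept until a pattern such as `.+\.exe$` is used instead of the default.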