Mirror of https://github.com/pirate/ArchiveBox.git (synced 2025-08-22 22:25:24 +02:00)
Updated Roadmap (markdown)
12
Roadmap.md
@@ -54,9 +54,9 @@
 ---
 
 ### Major long-term changes
-- release **`pip`, `apt`, `pkg`, and `brew` packaged distributions** for installing ArchiveBox
-- add an **optional web GUI** for managing sources, adding new links, and viewing the archive
-- switch to django + **sqlite db with migrations system** & json/html export for managing archive schema changes and persistence
+- ✅ release **`pip`, `apt`, `pkg`, and `brew` packaged distributions** for installing ArchiveBox
+- ✅ add an **optional web GUI** for managing sources, adding new links, and viewing the archive
+- ✅ switch to django + **sqlite db with migrations system** & json/html export for managing archive schema changes and persistence
 - modularize internals to allow importing individual components
 - switch to sha256 of URL as unique link ID
 - support **storing multiple snapshots** of pages over time
@@ -66,12 +66,12 @@
 
 ### Smaller planned features
 - support pushing pages to multiple 3rd party services using ArchiveNow instead of just archive.org
-- body text extraction to markdown (using [fathom](https://hacks.mozilla.org/2017/04/fathom-a-framework-for-understanding-web-pages/)?)
+- ✅ body text extraction to markdown (using ~~[fathom](https://hacks.mozilla.org/2017/04/fathom-a-framework-for-understanding-web-pages/)~~ readability and mercury)
 - featured image / thumbnail extraction
 - auto-tagging links based on important/frequent keywords in extracted text (like pocket)
 - automatic article summary paragraphs from extracted text with nlp summarization library
-- full-text search of extracted text with elasticsearch/elasticlunr/ag
-- download closed-caption subtitles from Youtube and other video sites for full-text indexing of video content
+- ✅ full-text search of extracted text with ~~elasticsearch/elasticlunr/ag~~ sonic and ripgrep
+- ✅ download closed-caption subtitles from Youtube and other video sites (TODO: submit the subtitle files to the full-text search index)
 - try pulling dead sites from archive.org and other sources if original is down (https://github.com/hartator/wayback-machine-downloader)
 - And more in the [issues list](https://github.com/ArchiveBox/ArchiveBox/issues/)...
 
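The still-open roadmap item "switch to sha256 of URL as unique link ID" could look roughly like the sketch below. It is only an illustration under stated assumptions: the function name `link_id` is hypothetical, the URL is hashed exactly as given (no normalization of case, trailing slashes, or query parameters), and none of this is taken from ArchiveBox's actual code.

```python
import hashlib


def link_id(url: str) -> str:
    """Return the hex SHA-256 digest of the URL's UTF-8 bytes.

    Hypothetical helper: the URL is hashed as-is, so two spellings of
    the same page (e.g. with and without a trailing slash) get
    different IDs unless the caller normalizes them first.
    """
    return hashlib.sha256(url.encode("utf-8")).hexdigest()


if __name__ == "__main__":
    # The same URL always maps to the same 64-character hex ID.
    print(link_id("https://example.com/"))
```

A content-derived ID like this is stable across machines and re-imports, which is presumably the appeal over a timestamp or counter. Whether and how to normalize URLs before hashing is a separate design decision that the roadmap item does not specify.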