mirror of https://github.com/pirate/ArchiveBox.git synced 2025-08-27 08:14:38 +02:00

Commit Graph

  • 87b79fe5e3 cleanup sdist and bdist build process Nick Sweeting 2020-08-18 17:12:49 -04:00
  • 6af08ae9f2 Install archivebox as an npm package (README.md) Cristian Vargas 2020-08-18 16:08:03 -05:00
  • 5ff852bd07 add npmignore Nick Sweeting 2020-08-18 17:00:51 -04:00
  • 44131f05ee Update README.md Cristian Vargas 2020-08-18 15:56:55 -05:00
  • c0d9bdf52f 0.4.19 release v0.4.19 Nick Sweeting 2020-08-18 15:58:40 -04:00
  • 73408fb035 fix version parsing and attempt to npm install during pip post_install Nick Sweeting 2020-08-18 15:58:05 -04:00
  • 9d7541ba47 0.4.18 release v0.4.18 Nick Sweeting 2020-08-18 15:02:53 -04:00
  • 80863ad050 fix release script semver bump Nick Sweeting 2020-08-18 15:02:36 -04:00
  • 4673f83749 bump docs Nick Sweeting 2020-08-18 15:00:32 -04:00
  • 71788cfd26 single-file supports version cli flag now Nick Sweeting 2020-08-18 15:00:12 -04:00
  • 8b427c9d79 get VERSION from package.json instead of VERSION to avoid duplication Nick Sweeting 2020-08-18 15:00:00 -04:00
  • 8e07fe3f2e expose sub-dependency Nick Sweeting 2020-08-18 13:31:57 -04:00
  • cc0fa747ce feat: Add options to ease management of node related extractors Cristian 2020-08-18 10:34:28 -05:00
  • 73fad928df 0.4.17 release v0.4.17 Nick Sweeting 2020-08-18 09:47:32 -04:00
  • 897a101c1f Merge pull request #449 from pirate/dev Nick Sweeting 2020-08-18 09:38:09 -04:00
  • e87f1d57a3 fix linters Nick Sweeting 2020-08-18 09:22:12 -04:00
  • 61ab952dab fix parser docstring Nick Sweeting 2020-08-18 09:20:05 -04:00
  • e29bfba053 change iframe panel size and shape on details index Nick Sweeting 2020-08-18 09:17:56 -04:00
  • 430d51756b show archive dir size in link details page Nick Sweeting 2020-08-18 09:17:37 -04:00
  • 273588c75a change main link to point to link details index to reduce confusion Nick Sweeting 2020-08-18 09:17:21 -04:00
  • 6087e30d38 dont allow ui editing of db fields for now Nick Sweeting 2020-08-18 09:17:01 -04:00
  • c9b3bab84d fix pull title not working Nick Sweeting 2020-08-18 08:49:26 -04:00
  • f18d92570e wip attempt to fix timestamp unique constraint errors Nick Sweeting 2020-08-18 08:30:09 -04:00
  • b0c0a676f8 re-enable readability and singlefile by default now that its less noisy Nick Sweeting 2020-08-18 08:29:46 -04:00
  • 15efb2d5ed new generic_html parser for extracting hrefs Nick Sweeting 2020-08-18 08:29:05 -04:00
  • a682a9c478 make all parsers accept arbitrary meta kwargs Nick Sweeting 2020-08-18 08:27:47 -04:00
  • 2e2b4f8150 fix url is too long to be a path error Nick Sweeting 2020-08-18 08:23:57 -04:00
  • e3ac4c2405 htmldecode downloaded sources before parsing for links Nick Sweeting 2020-08-18 08:23:20 -04:00
  • fc0d064ddf add hints Nick Sweeting 2020-08-18 08:22:34 -04:00
  • 9d093a34f8 better dependency version check Nick Sweeting 2020-08-18 08:21:55 -04:00
  • 2f282402c7 add hint and prefix stdout utils Nick Sweeting 2020-08-18 08:21:36 -04:00
  • 2557a0da04 tweak cli output coloring Nick Sweeting 2020-08-18 08:16:18 -04:00
  • d7d53cfb12 dont show skipped extractors to reduce visual noise Nick Sweeting 2020-08-18 08:13:35 -04:00
  • 5f84a7bc6e better handle the case where json index lags behind sql index Nick Sweeting 2020-08-18 08:13:13 -04:00
  • 77d2f08a5c show more info in merge conflict error message Nick Sweeting 2020-08-18 08:12:35 -04:00
  • 718d39e242 add common code extensions to default blacklist Nick Sweeting 2020-08-18 08:12:10 -04:00
  • 3658153cf8 fix url parsing through quotes Nick Sweeting 2020-08-18 08:04:57 -04:00
  • a218ceb4e8 add test for overwrite flag Nick Sweeting 2020-08-18 04:52:56 -04:00
  • a3155b88a1 better error message when nothing left to resume Nick Sweeting 2020-08-18 04:42:39 -04:00
  • f8eb245519 better archivebox schedule CLI UX Nick Sweeting 2020-08-18 04:39:58 -04:00
  • ec5609a09a fix resume flag on archivebox update Nick Sweeting 2020-08-18 04:39:39 -04:00
  • 12f3977da5 fix source filenames missing domain Nick Sweeting 2020-08-18 04:39:29 -04:00
  • ec1cce283e include failed link updates in total count stats Nick Sweeting 2020-08-18 04:39:19 -04:00
  • 6e142e3531 match existing snapshots on url only, ignoring timestamp Nick Sweeting 2020-08-18 04:39:04 -04:00
  • f371032b71 show warning when killing archivebox during index writing Nick Sweeting 2020-08-18 04:38:29 -04:00
  • 92de20af15 better detect missing dependencies on startup Nick Sweeting 2020-08-18 04:38:13 -04:00
  • b681a477ae add overwrite flag to add command to force re-archiving Nick Sweeting 2020-08-18 04:37:54 -04:00
  • da671532a4 fix schedule cmd defaulting to day and remove week Nick Sweeting 2020-08-18 04:32:36 -04:00
  • 84a8b0d713 hide progress bars on mac Nick Sweeting 2020-08-18 03:27:52 -04:00
  • b5745d5b6c also add version egg info to git commits Nick Sweeting 2020-08-18 02:14:18 -04:00
  • 8f7ee58a3c bump version Nick Sweeting 2020-08-18 02:12:33 -04:00
  • 29c8da83d6 0.4.16 release v0.4.16 Nick Sweeting 2020-08-18 02:08:52 -04:00
  • 6ba4343db1 bump version and add croniter Nick Sweeting 2020-08-18 02:08:16 -04:00
  • 7c16944a44 Merge pull request #446 from cdvv7788/hotfix/#445 Nick Sweeting 2020-08-18 02:06:32 -04:00
  • 7638dc45ea 0.4.15 release v0.4.15 Nick Sweeting 2020-08-18 01:59:50 -04:00
  • 235eb20dbd support cron in docker Nick Sweeting 2020-08-18 01:59:04 -04:00
  • 494be09bc2 add depth flag to schedule cmd Nick Sweeting 2020-08-18 01:58:54 -04:00
  • 09ad3a5303 Merge pull request #448 from pirate/skip-invalid-urls Nick Sweeting 2020-08-18 00:53:31 -04:00
  • ec4db1f75e add chrome and firefox bookmarks export to history script Nick Sweeting 2020-08-18 00:36:46 -04:00
  • 05c71fc302 fix: Organize readability extractor so a timeout does not break the whole process Cristian 2020-08-17 08:34:40 -05:00
  • 225b63b732 skip invalid urls at all stages Nick Sweeting 2020-08-17 03:12:17 -04:00
  • 26022fc9fb autocommit docs during release Nick Sweeting 2020-08-15 01:22:29 -04:00
  • 09b624f120 didnt realize grep supported negative indexes Nick Sweeting 2020-08-14 23:46:07 -04:00
  • bdd111d548 you can always mount it manually with cli args Nick Sweeting 2020-08-14 23:38:47 -04:00
  • e13d35807a version bumped Nick Sweeting 2020-08-14 23:37:34 -04:00
  • 429f39dec1 0.4.14 release v0.4.14 Nick Sweeting 2020-08-14 13:13:50 -04:00
  • 4a1d2a46ac update docs Nick Sweeting 2020-08-14 13:13:37 -04:00
  • 58e928520a tweak log output for skipped methods Nick Sweeting 2020-08-14 13:12:50 -04:00
  • e835afcf70 Merge pull request #426 from cdvv7788/readability-extractor Nick Sweeting 2020-08-14 12:56:30 -04:00
  • 03b73bfe77 Update archivebox/extractors/readability.py Nick Sweeting 2020-08-14 12:55:22 -04:00
  • 050b717bb9 Merge branch 'master' into readability-extractor Nick Sweeting 2020-08-14 12:35:35 -04:00
  • 601e1543c8 bump singlefile version Nick Sweeting 2020-08-14 11:41:31 -04:00
  • 356cfb30cb add archivebox version to github debug output Nick Sweeting 2020-08-14 00:39:02 -04:00
  • d89a50e2fb fix yaml Nick Sweeting 2020-08-14 00:35:43 -04:00
  • abfce2a210 remove uneeded dependencies Nick Sweeting 2020-08-14 00:14:18 -04:00
  • c37b58f100 remove python-dev after build process Nick Sweeting 2020-08-13 23:55:02 -04:00
  • fedbfda668 add missing yes flag Nick Sweeting 2020-08-13 23:43:02 -04:00
  • ffcb8ffdd4 autoremove uneeded after install Nick Sweeting 2020-08-13 23:40:04 -04:00
  • 7e1b0d30b7 install compiler only during dependency setup Nick Sweeting 2020-08-13 23:35:31 -04:00
  • 15e5d4f5ff add egg info to git Nick Sweeting 2020-08-13 23:32:25 -04:00
  • 0ef2b17678 only show data locations in version output when in a data dir Nick Sweeting 2020-08-13 23:21:57 -04:00
  • a0901ba474 use BIND_ADDR config default for runserver Nick Sweeting 2020-08-13 23:21:37 -04:00
  • e7948cf161 install node dependencies from package.json Nick Sweeting 2020-08-13 22:23:27 -04:00
  • aa085cdb60 Merge pull request #440 from pirate/docker-arm Nick Sweeting 2020-08-13 19:29:46 -04:00
  • f9c617e4aa arm support for docker root 2020-08-13 19:15:47 -04:00
  • 89775ae416 feat: Enable singlefile and readability in docker by default Cristian 2020-08-12 15:35:21 -05:00
  • b7aa3df8d2 feat: Disable singlefile and readability by default Cristian 2020-08-12 14:42:21 -05:00
  • eb3528fa9f feat: Add readability output to legacy index.html Cristian 2020-08-11 12:14:13 -05:00
  • 4d44b172e6 tests: Add readability steps to CI Cristian 2020-08-11 11:58:17 -05:00
  • 5dc7e63792 feat: Update dockerfile to support readability Cristian 2020-08-11 11:52:43 -05:00
  • 2a68af1b94 tests: Add readability tests Cristian 2020-08-11 11:15:15 -05:00
  • 824e5b6b80 Merge pull request #436 from zachscheider/roadmap-links Nick Sweeting 2020-08-11 10:30:33 -04:00
  • 8aa7b34de7 tests: Add readability to ignored methods in tests Cristian 2020-08-11 08:58:49 -05:00
  • dc87d8b68c tests: Update failing tests Cristian 2020-08-11 08:48:13 -05:00
  • 0ec747f64e feat: Look in wget, singlefile or dom outputs before attempting to download the information again Cristian 2020-08-11 08:36:03 -05:00
  • a14762640e feat: Avoid running readability when the target is a file Cristian 2020-08-11 07:40:55 -05:00
  • 61e08a7c43 docs: Update docs link Cristian 2020-08-10 13:17:55 -05:00
  • b33c66a9f7 feat: Split output of readability into multiple files Cristian 2020-08-10 13:15:28 -05:00
  • 7e2b249388 feat: Initial version of readability extractor Cristian 2020-08-07 08:05:17 -05:00
  • af5da93c05 Change roadmap links to wiki Zach Scheider 2020-08-11 00:32:31 -04:00