mirror of
https://github.com/ArchiveBox/ArchiveBox.git
synced 2025-05-13 06:34:25 -04:00
Update README.md
This commit is contained in:
parent
9b26b11d95
commit
7e8d02a969
1 changed files with 9 additions and 7 deletions
12
README.md
12
README.md
|
@ -39,17 +39,19 @@ apt/brew/pip3 install archivebox
|
||||||
|
|
||||||
archivebox init # run this in an empty folder
|
archivebox init # run this in an empty folder
|
||||||
archivebox add 'https://example.com' # start adding URLs to archive
|
archivebox add 'https://example.com' # start adding URLs to archive
|
||||||
archivebox schedule --every=day --depth=1 https://example.com/rss.xml
|
curl https://example.com/rss.xml | archivebox add # or add via stdin
|
||||||
|
archivebox schedule --every=day https://example.com/rss.xml
|
||||||
```
|
```
|
||||||
|
|
||||||
For each URL added, ArchiveBox saves several types of HTML snapshot (wget, Chrome headless, singlefile), a PDF, a screenshot, a WARC archive, any git repositories, images, audio, video, subtitles, article text, [and more...](#output-formats).
|
For each URL added, ArchiveBox saves several types of HTML snapshot (wget, Chrome headless, singlefile), a PDF, a screenshot, a WARC archive, any git repositories, images, audio, video, subtitles, article text, [and more...](#output-formats).
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
archivebox server 0.0.0.0:8000 # open http://127.0.0.1:8000/
|
archivebox server 0.0.0.0:8000 # use the web UI http://127.0.0.1:8000/
|
||||||
ls ./archive/*/index.json # or browse via the filesystem
|
archivebox list 'https://example.com' # use the CLI commands (--help for more)
|
||||||
|
ls ./archive/*/index.json # or browse directly via the filesystem
|
||||||
```
|
```
|
||||||
|
|
||||||
You can then manage your snapshots via the filesystem, CLI, web UI, or Python API (alpha).
|
You can then manage your snapshots via the [filesystem](https://github.com/ArchiveBox/ArchiveBox/wiki/Usage#disk-layout), [CLI](https://github.com/ArchiveBox/ArchiveBox/wiki/Usage#CLI-Usage), [Web UI](https://github.com/ArchiveBox/ArchiveBox/wiki/Usage#UI-Usage), [SQLite DB](https://github.com/ArchiveBox/ArchiveBox/blob/dev/archivebox/core/models.py) (`./index.sqlite3`), [Python API](https://docs.archivebox.io/en/latest/modules.html) (alpha), [REST API](https://github.com/ArchiveBox/ArchiveBox/issues/496) (alpha), or [desktop app](https://github.com/ArchiveBox/electron-archivebox) (alpha).
|
||||||
|
|
||||||
At the end of the day, the goal is to sleep soundly knowing that the part of the internet you care about will be automatically preserved in multiple, durable long-term formats that will be accessible for decades (or longer).
|
At the end of the day, the goal is to sleep soundly knowing that the part of the internet you care about will be automatically preserved in multiple, durable long-term formats that will be accessible for decades (or longer).
|
||||||
|
|
||||||
|
@ -71,7 +73,7 @@ archivebox help
|
||||||
- `archivebox add/remove/update/list` to manage Snapshots in the archive
|
- `archivebox add/remove/update/list` to manage Snapshots in the archive
|
||||||
- `archivebox schedule` to pull in fresh URLs in regularly from [boorkmarks/history/Pocket/Pinboard/RSS/etc.](#input-formats)
|
- `archivebox schedule` to pull in fresh URLs in regularly from [boorkmarks/history/Pocket/Pinboard/RSS/etc.](#input-formats)
|
||||||
- `archivebox oneshot` archive single URLs without starting a whole collection
|
- `archivebox oneshot` archive single URLs without starting a whole collection
|
||||||
- `archivebox shell` call the [Python API](https://docs.archivebox.io/en/latest/modules.html) (alpha), [REST API](https://github.com/ArchiveBox/ArchiveBox/issues/496) (alpha), or browse the `index.sqlite3` DB
|
- `archivebox shell` open a REPL to use the [Python API](https://docs.archivebox.io/en/latest/modules.html) (alpha)
|
||||||
|
|
||||||
<div align="center">
|
<div align="center">
|
||||||
<br/>
|
<br/>
|
||||||
|
|
Loading…
Add table
Add a link
Reference in a new issue