mirror of
https://github.com/ArchiveBox/ArchiveBox.git
synced 2025-05-18 17:14:39 -04:00
![]() Saves HTML text nodes and selected element attributes in `htmltotext.txt` for each Snapshot. Primarily intended to be used for search indexing. |
||
---|---|---|
.. | ||
__init__.py | ||
archive_org.py | ||
dom.py | ||
favicon.py | ||
git.py | ||
headers.py | ||
htmltotext.py | ||
media.py | ||
mercury.py | ||
pdf.py | ||
readability.py | ||
screenshot.py | ||
singlefile.py | ||
title.py | ||
wget.py |