ArchiveBox

mirror of https://github.com/ArchiveBox/ArchiveBox.git synced 2025-05-27 21:24:16 -04:00

Author	SHA1	Message	Date
Nick Sweeting	3b3bdab97d	update ignore files	2023-10-20 02:45:20 -07:00
Nick Sweeting	63c276a93d	redirect add page back to snapshots list automatically	2023-10-19 18:24:19 -07:00
Nick Sweeting	22bcffe5ee	remove accidental duplicate template code	2023-10-19 18:24:05 -07:00
Nick Sweeting	85f8583d62	catch ldap configuration and packaging errors and make them non fatal	2023-10-19 18:05:59 -07:00
Nick Sweeting	53cff45ec0	switch from pipenv and setup.py to PDM for python packaging	2023-10-19 17:49:06 -07:00
Nick Sweeting	8caffc4b4e	ignore .pdm-python file	2023-10-19 17:09:48 -07:00
Nick Sweeting	b6113cc00c	use pypa build instead of setup.py build and move ldap to extras	2023-10-19 16:44:48 -07:00
Nick Sweeting	4b26ab9d1f	only load image into docker when building, and push on deploy	2023-10-19 16:29:51 -07:00
Nick Sweeting	9fa210515e	Bump required python version to 3.9 in stdeb.cfg	2023-10-19 16:22:06 -07:00
Nick Sweeting	748363e685	Bump required Python version to 3.9	2023-10-19 16:21:32 -07:00
Nick Sweeting	6619e8005e	Update pip.yml github action python version	2023-10-19 16:20:43 -07:00
Nick Sweeting	a61f544547	Update SECURITY.md	2023-10-19 16:18:32 -07:00
Nick Sweeting	edde40898b	fix markdown underlining everything as link	2023-10-19 12:58:38 -07:00
Nick Sweeting	a8ce927937	fix markdown formatting	2023-10-19 12:57:05 -07:00
Nick Sweeting	0e3475a0eb	Create SECURITY.md policy	2023-10-18 18:40:58 -07:00
Nick Sweeting	4f655fc4a1	Replace chown of entire data directory when ownership mismatch detected	2023-10-18 18:24:43 -07:00
Nick Sweeting	e9c85aa812	Update README.md	2023-10-18 17:53:49 -07:00
Nick Sweeting	6f4a7e4e68	Add ADMIN_USERNAME and ADMIN_PASSWORD to example options	2023-10-18 11:53:13 -07:00
Nick Sweeting	2a1104c76a	Merge pull request #1245 from benmuth/add-user-env-vars	2023-10-18 11:50:58 -07:00
Nick Sweeting	d286dca925	better to log before doing a thing than after	2023-10-18 11:47:55 -07:00
Ben Muthalaly	9e6a87114b	Fix formatting, logging, logic issues	2023-10-18 12:07:36 -05:00
Ben Muthalaly	521ea70e0c	Add check for existing user, change varable names	2023-10-18 03:07:54 -05:00
Ben Muthalaly	44a94157be	Remove logging of configured username and password	2023-10-15 23:36:47 -05:00
Ben Muthalaly	11d473e536	Add config options to add admin user on first run	2023-10-14 00:38:04 -05:00
Nick Sweeting	d7b883b049	fix broken link	2023-10-12 00:22:47 -07:00
Nick Sweeting	dcef217e5e	Merge pull request #1242 from benmuth/fix-titles-with-empty-tag	2023-10-09 21:39:26 -07:00
Ben Muthalaly	77917e9b55	Fix HTML title parsing bugs. This slightly modifies the HTML_TITLE_REGEX to fix two parsing errors. The first occurred when title tags were empty (e.g. "<title></title>") which was parsed as "</title". The second occurred when titles were a single character (e.g. "<title>A</title>") which was not matched by the regex, and so would fall back to link.base_url. Now when tags are empty, it falls back to link.base_url, and single character titles are parsed correctly. The way the regex works now is still a bit wonky for some edge cases. I couldn't find any cases of incorrect behavior, but it still might be worth reworking more completely for robustness.	2023-10-09 02:00:01 -05:00
Nick Sweeting	4950cee3b6	Merge pull request #1229 from jamesob/dev	2023-09-18 12:19:00 -07:00
James O'Beirne	b28b3b7e67	README: update outdated links Most frustratingly, the outdated docker-compose link prompts users to download an older version of the docker-compose.yml file, which packages a broken YouTube retrieval method.	2023-09-18 10:37:35 -04:00
Nick Sweeting	5c1a14e4f2	ignore errors while getting system user name	2023-09-14 03:39:44 -07:00
Nick Sweeting	ffe2968e4f	improve some comments	2023-09-14 02:41:27 -07:00
Nick Sweeting	f809efce4d	Merge pull request #996 from barthalion/dev	2023-09-03 21:40:49 -07:00
Nick Sweeting	aaca74f6a8	only start parsing json after the first open brace	2023-09-03 21:40:12 -07:00
Nick Sweeting	cd9f228b2f	Merge pull request #1214 from DanielBatteryStapler/DanielBatteryStapler-patch-1	2023-09-03 21:25:12 -07:00
Nick Sweeting	16d278fbdb	Merge pull request #1168 from mAAdhaTTah/add-readwise-reader	2023-09-03 21:24:49 -07:00
Nick Sweeting	110a22ee32	Merge branch 'dev' into DanielBatteryStapler-patch-1	2023-08-31 15:20:46 -07:00
Nick Sweeting	73a5f74d38	update default YOUTUBEDL_ARGS to fix subs and filesize	2023-08-31 15:17:45 -07:00
Nick Sweeting	86366d5640	Update logging_util.py to fix generator subscripting error	2023-08-31 15:12:43 -07:00
Nick Sweeting	a837f870af	Merge pull request #1221 from spresse1/update-singlefile	2023-08-29 17:04:04 -07:00
spresse1	c8597a7fa1	Update singlefile to the latest version and switch it to single-file-cli. Unfortunately, this requires a rewrite of NPM dependency files.	2023-08-29 20:28:48 +02:00
Nick Sweeting	62fb56354b	Merge pull request #1219 from spresse1/chrome-cleanup	2023-08-28 20:02:22 -07:00
spresse1	603ce7ec10	After a timeout, chrome will leave behind a SingletonLock, which prevents future instances of chrome from starting. When an extractor fails due to a timeout, remove this file.	2023-08-28 17:27:03 +02:00
Nick Sweeting	0b6064b7dd	Update docker_entrypoint.sh to use /bin/bash	2023-08-22 16:35:43 -07:00
root	23f086aa40	add LDAP support	2023-08-17 19:51:02 -05:00
Nick Sweeting	00ecf57b0f	Merge pull request #1211 from DanielBatteryStapler/DanielBatteryStapler-patch-1	2023-08-15 21:34:53 -07:00
DanielBatteryStapler	94dacc49c7	Fix archive_org icon "exists"	2023-08-15 23:49:54 -04:00
Nick Sweeting	68e936e7c2	Merge pull request #1186 from ArchiveBox/dependabot/npm_and_yarn/word-wrap-1.2.4	2023-08-13 16:43:09 -07:00
Nick Sweeting	a7d7644dca	Merge pull request #1205 from overhacked/fix_url_regex_hyphen	2023-08-09 14:20:07 -07:00
Ross Williams	c039ef05b3	Fix hyphen placement in util.URL_REGEX Incorrect hyphen placement in `URL_REGEX` was allowing it to match more characters than intended. In a regex character class, a literal hyphen can only appear as the first character in the class, or it will be interpreted as the delimiter of a range of characters. The issue fixed here caused the range of characters from `[$-_]` be treated as valid URL characters, instead of the intended set of three characters `[-_$]`. The incorrect range interpretation inadvertantly included most ASCII punctuation, most importantly the angle brackets, square brackets, and single quote that the expression uses to mark the end of a match. This causes the expression to match a URL that has a "hostname" portion beginning with one of the intended "stop parsing" characters. For example: ``` https://<b>www</b>.example.com/ # MATCHES but should not https://[for example] # MATCHES but should not scheme='https://' # MATCHES, including final quote, but should not ``` Some test cases have been added to the `URL_REGEX` assert in archivebox.parsers to cover this possibility.	2023-08-08 15:24:16 -04:00
Nick Sweeting	b773041952	Merge pull request #1199 from overhacked/chrome_version_detection_fix	2023-08-01 10:14:18 -07:00

1 2 3 4 5 ...

3107 commits