delta/urlgrabber.git - github.com: rpm-software-management/urlgrabber.git

	Commit message (Collapse)	Author	Age	Files	Lines
*	set loglevel as opt and add set_loglevel to urlopen and urlreadHEAD master	mbussolotto	2022-11-28	1	-1/+18
\|
*	setup.py: Update url	John Vandenberg	2022-09-02	1	-1/+1
\| \| \|	http://urlgrabber.baseurl.org/ doesnt link to the current SCM
*	Avoid using system proxy when proxy set to _none_	Pablo Suárez Hernández	2022-09-02	1	-1/+1
\|
*	Fix the find_proxy bit different way	Victor Zhestkov	2022-09-02	1	-7/+7
\|
*	Fix wrong logic for find_proxy method	Pablo Suárez Hernández	2022-09-02	1	-4/+6
\|
*	Drop six usage	Pablo Suárez Hernández	2022-09-02	8	-33/+20
\|
*	Convert dict_keys to list to not crash when delegate is enabled	Pablo Suárez Hernández	2022-04-19	1	-1/+1
\|
*	Use binary mode when reopening files	Pablo Suárez Hernández	2022-01-29	1	-1/+1
\|
*	Respect ssl_verify_host set to False	Marek Brychta	2021-07-26	1	-0/+2
\|
*	Fix for TextMeter as progress_option	Jochen Breuer	2020-04-02	1	-2/+2
\| \| \| \| \| \| \| \| \|	Type errors would prevent TextMeter from being used as a progess_option. This commit fixes this with a type cast and an initialization of a variable as int. Fixes #24
*	Decode bytes to a string for ftplib.parse150() (RhBug:1734527)	Lukáš Hrázký	2020-01-27	1	-1/+9
\| \| \| \| \| \| \|	ftplib.parse150() expects a string, we need to decode the bytes before passing them to the function in Python 3. https://bugzilla.redhat.com/show_bug.cgi?id=1734527
*	Enable copr builds and add packit config	Dominika Hodovska	2019-11-22	2	-0/+434
\|
*	Release 4.1.0urlgrabber-4-1-0	Neal Gompa	2019-10-08	2	-1/+12
\|
*	setuptools: Update Development Status to "Production/Stable"	Neal Gompa	2019-10-08	1	-1/+1
\| \| \| \| \| \|	This module has been in use with YUM for well over a decade, and the API has not changed much in the last few years. At this point, most people would consider it production-grade.
*	Revise setup.py to remove need for extra setup-time dependencies	Neal Gompa	2019-10-06	2	-72/+70
\| \| \| \| \| \|	This rework removes the cyclic dependency on the urlgrabber code to install urlgrabber, while still supporting the propagation of setuptools properties into the module.
*	Fix issue with URLGRABBER_DEBUG on urlgrabber script	Pablo Suárez Hernández	2019-10-01	1	-4/+4
\|
*	Fix issue when URLGRABBER_DEBUG is not an integer on Python3	Pablo Suárez Hernández	2019-09-30	1	-2/+2
\|
*	Fix for usage of _levelNames from logging module	Jochen Breuer	2019-09-06	2	-2/+8
\| \| \| \|	With Python3 the internal dict has been renamed to _levelToName.
*	Support HTTP CONNECT with reget. BZ 1585596	Michal Domonkos	2019-08-24	1	-1/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Currently, we would reset the file upon seeing "200" in the response header. This is, however, easily fooled by the HTTP CONNECT method used to access an SSL server on behalf of a proxy (a common setting would be a company intranet behind a proxy server where an internal system is consuming Red Hat CDN repos with yum). The reason is that, in this protocol, there are two subsequent headers sent, the first of which is: "HTTP/1.1 200 Connection established". Therefore, we need to explicitly check for "200 OK". More details: https://tools.ietf.org/html/rfc7231#section-4.3.6 Kudos to Masahiro Matsuya for suggesting this patch! Note: As an alternative solution, it seems that setting the CURLOPT_SUPPRESS_CONNECT_HEADERS option on the curl handle would also do the trick (but that would require more scrutiny to ensure that nothing else breaks): https://curl.haxx.se/libcurl/c/CURLOPT_SUPPRESS_CONNECT_HEADERS.html
*	test: handle unknown file content in test_retry_no_cache	Michal Domonkos	2019-05-21	1	-1/+6
\|
*	urlgrabber-ext-down: convert url into bytes	Michal Domonkos	2019-05-21	2	-3/+3
\| \| \| \| \| \| \| \| \| \| \|	We need to convert the parsed url back into bytes before passing it to the PyCurlFileObject constructor (since _set_opts() expects self.scheme, constructed from the url, to be a bytes object). This caused the unit test "bypassing proxy cache on failure" to fail (together with a bug in the test itself which is also being fixed here). Closes #14.
*	Revert "Simplify mirror conversion to utf8"	Michal Domonkos	2019-05-20	1	-1/+6
\| \| \| \| \| \| \| \| \| \| \| \| \|	This reverts commit be8ee10e35319e80200d4ff384434d46fe7783d9. A list of dicts (as opposed to strings) is valid input as well; see the module-level doc string for details (section 2 under CUSTOMIZATION). In fact, the nested estimate() function in MirrorGroup.__init__() accounts for that, too. This fixes a traceback in YUM which does pass such a dict list. Closes #10.
*	urlgrabber-ext-down: another python 3 compat	Pavel Raiskup	2019-05-12	1	-1/+8
\| \| \| \| \| \| \|	Expect that _readlines() returns array of bytes objects in Python 3 environments. Fixes rhbz #1707657 and #1688173
*	Fix the confused license header in urlgrabber/__init__.py file	Neal Gompa	2019-02-27	1	-11/+13
\| \| \| \| \| \| \| \| \| \|	urlgrabber was relicensed to LGPLv2+ a long time ago, and this file seemed to have only a partially updated license header for this. It references the GNU Library General Public License, so clearly the intent was the header was supposed to be fully updated for it, so let's just do that by switching to the header used in the other source files.
*	Release 4.0.0urlgrabber-4-0-0	Neal Gompa	2019-02-25	2	-2/+11
\|
*	Define setup_requires in setup.py and add six to install_requires	Neal Gompa	2019-02-25	1	-1/+2
\|
*	Raise an obvious error message when "urlgrabber-ext-down" is not installed	Matthew Prahl	2019-02-25	1	-0/+5
\|
*	Add curl_obj option. BZ 1204825	Michal Domonkos	2019-02-25	1	-1/+23
\|
*	Fix setup.py to use setuptools and have correct metadata	Neal Gompa	2019-02-25	1	-3/+4
\|
*	makefile: detect modern Python 2 and 3 releases	Neal Gompa	2019-02-25	1	-7/+7
\|
*	Merge pull request #9 from keszybz/py3k	Neal Gompa (ニール・ゴンパ)	2019-02-25	21	-20880/+963
\|\ \| \| \| \|	Python3 compatibility
\| *	tests: do not exit with "success" on error	Zbigniew Jędrzejewski-Szmek	2019-02-24	1	-2/+2
\| \| \| \| \| \| \| \|	2 is the standard code for "command line usage error".
\| *	Drop some unnecessary continuation backslashes	Zbigniew Jędrzejewski-Szmek	2019-02-24	3	-21/+19
\| \| \| \| \| \| \| \|	When the expression is already in parentheses, the backslash has no effect.
\| *	Replace some type() with specific class names	Zbigniew Jędrzejewski-Szmek	2019-02-24	4	-6/+6
\| \| \| \| \| \| \| \| \| \|	We know what the types of basic types are, let's just put that directly in the code. It seems more idiomatic and slightly more efficient to do things this way.
\| *	Apply 'methodattrs' fixer	Zbigniew Jędrzejewski-Szmek	2019-02-24	1	-1/+1
\| \| \| \| \| \| \| \|	Just a cleanup.
\| *	Apply 'asserts' fixer	Zbigniew Jędrzejewski-Szmek	2019-02-24	3	-86/+86
\| \| \| \| \| \| \| \|	This just updates unittest method names.
\| *	test_grabber: enable the post test	Zbigniew Jędrzejewski-Szmek	2019-02-24	1	-1/+0
\| \|
\| *	test_grabber: define try..except block in test more narrowly	Zbigniew Jędrzejewski-Szmek	2019-02-24	1	-19/+22
\| \| \| \| \| \| \| \| \| \|	Bare except: is never nice. Also, let's put the try..except block around only the urlgrab() call.
\| *	py3: avoid "unbound variable" issue	Zbigniew Jędrzejewski-Szmek	2019-02-24	1	-4/+8
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Under python3, the variable defined in "except ... as ..." is only valid until the end of the block. In this case it would undefine the variable that was defined above, not reassign it as under python2, leading to the following tb: Traceback (most recent call last): File "test/test_mirror.py", line 379, in test_retry_no_cache urlgrabber.grabber.parallel_wait() File "test/../urlgrabber/grabber.py", line 2374, in parallel_wait perform() File "test/../urlgrabber/grabber.py", line 2313, in perform if ug_err is None: UnboundLocalError: local variable 'ug_err' referenced before assignment
\| *	Do encoding/decoding in the subprocess calls	Zbigniew Jędrzejewski-Szmek	2019-02-24	1	-6/+11
\| \| \| \| \| \| \| \| \| \|	s.endswith('x') works properly with bytes. s[-1] == 'x' doesn't, because s[-1] returns an integer. But .endswith() is clearer anyway, so that's OK.
\| *	Switch test URLs to a private server	Zbigniew Jędrzejewski-Szmek	2019-02-24	1	-1/+2
\| \| \| \| \| \| \| \| \| \|	The URLs under http://urlgrabber.baseurl.org/ are all broken. Let's switch until they are fixed.
\| *	Simplify quoter function declaration	Zbigniew Jędrzejewski-Szmek	2019-02-24	1	-7/+6
\| \| \| \| \| \| \| \| \| \|	There isn't much point in defining a nested function which doesn't use its closure for anything. Also doing it just once is generally quicker.
\| *	Allow overriding the path to urlgrabber-ext-down with an env var	Zbigniew Jędrzejewski-Szmek	2019-02-24	1	-1/+3
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Otherwise, the tests try to call the installed executable, which might be stale or not exist at all. I now run the tests with: URLGRABBER_EXT_DOWN=scripts/urlgrabber-ext-down PYTHONPATH=. python2 test/runtests.py URLGRABBER_EXT_DOWN=scripts/urlgrabber-ext-down PYTHONPATH=. python3 test/runtests.py
\| *	Replace number type enumeration by generic abc check	Zbigniew Jędrzejewski-Szmek	2019-02-24	1	-1/+2
\| \| \| \| \| \| \| \|	numbers existed already in 2.6.
\| *	Make the 'broker' mirror really broken	Zbigniew Jędrzejewski-Szmek	2019-02-24	5	-20004/+1
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	I guess those files are not supposed to be there, because otherwise the test don't get the exception they expect. The test is updated to expect 404, not 403, because the file is simply missing. It would be possible to change the server configuration to return 403, but this doesn't seem particularly important for this test. So let's just ask for a file that doesn't exist and expect 404.
\| *	test_mirror: do not use a fixed port for the internal test server	Zbigniew Jędrzejewski-Szmek	2019-02-24	1	-6/+14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The tests were fairly consistently failing with: error: [Errno 98] Address already in use Actually the port is a well-known port that could be used be some other program. So let's simplify things by opening a random port. It'd be nice to use the socket as a context manager, but unfortunately python2 does not support that.
\| *	Use a failing URL in two tests that are supposed to fail	Zbigniew Jędrzejewski-Szmek	2019-02-24	1	-2/+2
\| \| \| \| \| \| \| \| \| \| \| \|	IIUC, those tests called urlgrab or urlread with a valid address or a file name and expected it to fail. Let's call it with a URL that return 404 instead. Now the tests pass.
\| *	Add explicit encode/decode calls	Zbigniew Jędrzejewski-Szmek	2019-02-24	5	-20/+41
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is the patch that has the most potential for trouble in the whole series, because it affects python2 behaviour directly. io.BytesIO is the same as StringIO under python2, so this should have no effect on python2. Under python3 it is necessary to allow reading bytes from a byte data source. Under python2, encoding of an already encoding string is allowed, and actually works fine (is idempotent) for ASCII strings. So the effect of the .encode() calls under python2 should be limited. Under python3, they are of course necessary and can only be done once. So if there are errors here, they should show up when running under python3 pretty easily.
\| *	Preserve type in URLParser.quote()	Zbigniew Jędrzejewski-Szmek	2019-02-24	1	-2/+4
\| \|
\| *	Add a wrapper function around urlunquote() that decodes automatically	Zbigniew Jędrzejewski-Szmek	2019-02-24	1	-4/+9
\| \|