Trim excess leading path separators#6644

Merged

sigmavirus24 merged 1 commit intopsf:mainfrom

sigmavirus24:bug/6643

Feb 23, 2024

Contributor

sigmavirus24 commented Feb 22, 2024

A URL with excess leading / (path-separator)s would cause urllib3 to attempt to reparse the request-uri as a full URI with a host and port. This bypasses that logic in ConnectionPool.urlopen by replacing these leading /s with just a single /.

Closes #6643

sigmavirus24 assigned nateprewitt and sethmlarson

sigmavirus24 commented

View reviewed changes

tests/test_adapters.py Outdated Show resolved Hide resolved

sigmavirus24 commented

View reviewed changes

tests/test_adapters.py Outdated Show resolved Hide resolved

sigmavirus24 commented

View reviewed changes

tests/test_adapters.py Outdated Show resolved Hide resolved

sigmavirus24 commented

View reviewed changes

tests/test_adapters.py Outdated Show resolved Hide resolved

sigmavirus24 force-pushed the bug/6643 branch from 2cd2815 to 57e8c3b Compare

February 22, 2024 12:14

sigmavirus24 commented

View reviewed changes

src/requests/adapters.py Outdated

                           using_socks_proxy = proxy_scheme.startswith("socks")
-                      url = request.path_url
+                      url = re.sub("^/+", "/", request.path_url)

Contributor Author

sigmavirus24 Feb 22, 2024

As mentioned on urllib3/urllib3#3352 this could also be

url = f"/{request.path_url.lstrip('/')}"

I could benchmark these but I don't particularly care what the implementation is. I just threw this together to show that it can be fixed

Member

nateprewitt Feb 22, 2024

It looks like the f-string (Python 3.9-3.12 tested) is ~4x faster but we're talking on the scale of nanoseconds so it's basically moot. I'd vote the f-string for readability, but don't have a strong opinion.

Contributor Author

sigmavirus24 Feb 22, 2024

Yeah, I'm also happy to shove this into a branch too like

if path.startswith('//'):

To make it clearer that we only care about the separator being repeated. What I want is clarity in the reader as to why we're doing this. My old school brain things the regexp is clearer and the f-string looks sus but that's just my opinion and I'm not holding it closely

Member

nateprewitt Feb 22, 2024

Yeah, branching seems fine to me too.

Contributor Author

sigmavirus24 Feb 23, 2024

Did both (f string and branch)

nateprewitt approved these changes

View reviewed changes

nateprewitt added this to the 2.32.0 milestone


          Trim excess leading path separators

60389df

A URL with excess leading / (path-separator)s would cause urllib3 to
attempt to reparse the request-uri as a full URI with a host and port.
This bypasses that logic in ConnectionPool.urlopen by replacing these
leading /s with just a single /.

Closes psf#6643

sigmavirus24 force-pushed the bug/6643 branch from 57e8c3b to 60389df Compare

February 23, 2024 00:37

sigmavirus24 merged commit 3587a5f into psf:main

sigmavirus24 deleted the bug/6643 branch

February 23, 2024 00:49

nateprewitt mentioned this pull request

Merged

kristianelliott80 mentioned this pull request

breaking aws s3 usage with requests 2.32.0 #6711

Open

MeggyCal mentioned this pull request

test_httpretty_should_handle_paths_starting_with_two_slashes needs update gabrielfalcao/HTTPretty#457

Open

This was referenced Jun 3, 2024

Bump requests from 2.31.0 to 2.32.2 MozillaFoundation/foundation.mozilla.org#12389

Closed

Bump requests from 2.31.0 to 2.32.0 MozillaFoundation/foundation.mozilla.org#12364

Closed

martin-pil mentioned this pull request

Multiple path separators causes bad requests #6784

Open

github-actions bot locked as resolved and limited conversation to collaborators

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet