How can I percent-encode URL parameters in Python?

Asked 8/11, 2009 at 2:43 Answered 4/8, 2022 at 11:1

Solved python url encoding urllib urlencode

416

If I do

url = "http://example.com?p=" + urllib.quote(query)

It doesn't encode / to %2F (breaks OAuth normalization)
It doesn't handle Unicode (it throws an exception)

Is there a better library?

Dallon answered 8/11, 2009 at 2:43 Comment(4)

These are not URL parameters, FYI. You should clarify. – Superego 7/9, 2018 at 18:53

What is the language-agnostic canonical Stack Overflow question? (That is, only covering the encoding, not how it is achieved.) – Wives 27/11, 2022 at 21:54

@JamieMarshall what should they be called then if not URL parameters? – Donation 13/10, 2023 at 16:32

@BenCreasy- attributes. The specification for URL describes parameters as a separate part of the URL (not involving the query string). reference here I can't tell you how much time I've lost trying to authenticate an API because they were asking for parameters and I was sending query attributes. – Superego 2/11, 2023 at 19:12

536

From the Python 3 documentation:

urllib.parse.quote(string, safe='/', encoding=None, errors=None)

Replace special characters in string using the %xx escape. Letters, digits, and the characters '_.-~' are never quoted. By default, this function is intended for quoting the path section of a URL. The optional safe parameter specifies additional ASCII characters that should not be quoted — its default value is '/'.

That means passing '' for safe will solve your first issue:

>>> import urllib.parse
>>> urllib.parse.quote('/test')
'/test'
>>> urllib.parse.quote('/test', safe='')
'%2Ftest'

(The function quote was moved from urllib to urllib.parse in Python 3.)

By the way, have a look at urlencode.

About the second issue, there was a bug report about it and it was fixed in Python 3.

For Python 2, you can work around it by encoding as UTF-8 like this:

>>> query = urllib.quote(u"Müller".encode('utf8'))
>>> print urllib.unquote(query).decode('utf8')
Müller

Bent answered 8/11, 2009 at 2:52 Comment(8)

Thanks you, both worked great. urlencode just calls quoteplus many times in a loop, which isn't the correct normalization for my task (oauth). – Dallon 8/11, 2009 at 9:14

the spec: rfc 2396 defines these as reserved reserved = ";" | "/" | "?" | ":" | "@" | "&" | "=" | "+" | "$" | "," Which is what urllib.quote is dealing with. – Kindhearted 23/9, 2015 at 17:42

urllib.parse.quote docs – Greenwich 16/12, 2016 at 10:50

Also, in the case of encoding a search query, you maybe better off using quote_plus: docs.python.org/3/library/… 1. It encodes slashes by default 2. It also encodes spaces – Nyssa 30/5, 2018 at 9:50

six.moves.urllib.parse.quote(u"Müller".encode('utf8')) for Python 2 and 3. – Ephraim 10/12, 2018 at 21:0

if you wanna retain the colon from http: , do urllib.parse.quote('http://example.com/some path/').replace('%3A', ':') – Electronics 9/5, 2019 at 7:27

@Electronics Just use urllib.parse.quote(url, safe=':/'). Even better, encode some path, then join strings. This is Python, not PHP. – Tank 23/12, 2021 at 9:26

safe="" is missing in Python 3 answer! – Istanbul 4/6, 2022 at 14:48

210

In Python 3, urllib.quote has been moved to urllib.parse.quote, and it does handle Unicode by default.

>>> from urllib.parse import quote
>>> quote('/test')
'/test'
>>> quote('/test', safe='')
'%2Ftest'
>>> quote('/El Niño/')
'/El%20Ni%C3%B1o/'

Irradiate answered 29/11, 2012 at 11:52 Comment(3)

The name quote is rather vague as a global. It might be nicer to use something like urlencode: from urllib.parse import quote as urlencode. – Jadajadd 5/3, 2019 at 16:35

Note that there is a function named urlencode in urllib.parse already that does something completely different, so you'd be better off picking another name or risk seriously confusing future readers of your code. – Pervious 2/4, 2020 at 2:41

(style suggestion: @Jadajadd i agree that quote is "rather vague". rather than rename the variable/object to something else you can leave the name fully qualified as urllib.parse.quote. leaving it fully qualified does two things: takes a little extra time typing and saves time reading and maintaining the code. ) – Abhorrent 24/1, 2023 at 14:7

I think module requests is much better. It's based on urllib3.

You can try this:

>>> from requests.utils import quote
>>> quote('/test')
'/test'
>>> quote('/test', safe='')
'%2Ftest'

_{My answer is similar to Paolo's answer.}

Barrybarrymore answered 14/7, 2015 at 8:30 Comment(3)

requests.utils.quote is link to python quote. See request sources. – Profit 5/8, 2015 at 14:11

requests.utils.quote is a thin compatibility wrapper to urllib.quote for python 2 and urllib.parse.quote for python 3 – Kindhearted 23/9, 2015 at 17:30

without reading the comments, this is creating confusion... – Istanbul 4/6, 2022 at 14:46

If you're using Django, you can use urlquote:

>>> from django.utils.http import urlquote
>>> urlquote(u"Müller")
u'M%C3%BCller'

Note that changes to Python mean that this is now a legacy wrapper. From the Django 2.1 source code for django.utils.http:

A legacy compatibility wrapper to Python's urllib.parse.quote() function.
(was used for unicode handling on Python 2)

Phi answered 27/10, 2015 at 19:40 Comment(1)

it's deprecated from Django 3.0+ – Apiece 27/11, 2021 at 12:13

It is better to use urlencode here. There isn't much difference for a single parameter, but, IMHO, it makes the code clearer. (It looks confusing to see a function quote_plus! - especially those coming from other languages.)

In [21]: query='lskdfj/sdfkjdf/ksdfj skfj'

In [22]: val=34

In [23]: from urllib.parse import urlencode

In [24]: encoded = urlencode(dict(p=query,val=val))

In [25]: print(f"http://example.com?{encoded}")
http://example.com?p=lskdfj%2Fsdfkjdf%2Fksdfj+skfj&val=34

Documentation

urlencode
quote_plus

Guardrail answered 29/11, 2018 at 15:46 Comment(0)

An alternative method using furl:

import furl

url = "https://httpbin.org/get?hello,world"
print(url)
url = furl.furl(url).url
print(url)

Output:

https://httpbin.org/get?hello,world
https://httpbin.org/get?hello%2Cworld

Consistent answered 4/8, 2022 at 11:1 Comment(0)

Hot tags

Godot Unity Godot Help Programming Godot 4.X GUI GDScript 3D 2D Physics CSharp Godot 3.X VR XR Projects C++

Documentation

Recommended topics

Hot tags