How to read a CSV file from a URL with Python?

P

9

108

When I curl an API endpoint such as http://example.com/passkey=wedsmdjsjmdd:

curl 'http://example.com/passkey=wedsmdjsjmdd'

I get the employee data back in CSV format, like:

"Steve","421","0","421","2","","","","","","","","","421","0","421","2"

How can I parse this using Python?

I tried:

import csv 
cr = csv.reader(open('http://example.com/passkey=wedsmdjsjmdd',"rb"))
for row in cr:
    print row

but it didn't work and I got this error:

http://example.com/passkey=wedsmdjsjmdd No such file or directory:

Thanks!

Portion answered 29/4, 2013 at 16:36 Comment(6)

Can you access that domain directly? – Vesta 29/4, 2013 at 16:37

You need to open the URL and read it in as one big text string (see urllib/requests). Then I assume you can initialize the csv reader with a string instead of a file object, but I don't know; I've always used it with an open file handle. – Ressler 29/4, 2013 at 16:39

@brbcoding, yes. I can get csv file when I put the link on the browser. – Portion 29/4, 2013 at 16:42

@JoranBeasley, I think that your method is correct, maybe I need something like this http://processing.org/reference/loadStrings_.html but using python – Portion 29/4, 2013 at 16:43

FYI: the read_csv function in the pandas library (pandas.pydata.org) accepts URLs. See pandas.pydata.org/pandas-docs/stable/generated/… – Phonation 29/4, 2013 at 17:35

Duplicate of How do I read and write CSV files with Python? and Get webpage contents with Python?. See What if a question is an exact duplicate of the conjunction of two other questions – Marmite 11/1, 2017 at 7:50

A

91

You need to replace open with urllib.urlopen or urllib2.urlopen.

e.g.

import csv
import urllib2

url = 'http://winterolympicsmedals.com/medals.csv'
response = urllib2.urlopen(url)
cr = csv.reader(response)

for row in cr:
    print row

This would print each row of the following data as a list of strings:

Year,City,Sport,Discipline,NOC,Event,Event gender,Medal
1924,Chamonix,Skating,Figure skating,AUT,individual,M,Silver
1924,Chamonix,Skating,Figure skating,AUT,individual,W,Gold
...

The original question is tagged "python-2.x", but for a Python 3 implementation (which requires only minor changes) see below.
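A minimal Python 3 sketch of the same idea, assuming the server returns UTF-8 encoded CSV. In Python 3, `urllib2.urlopen` became `urllib.request.urlopen`, and `csv.reader` expects text rather than bytes, so the response is wrapped in a text decoder first. The helper names `rows_from_binary_stream` and `read_csv_from_url` are made up for this illustration.

```python
import csv
import io
import urllib.request


def rows_from_binary_stream(stream, encoding="utf-8"):
    """Yield parsed CSV rows from a binary file-like object.

    The encoding is an assumption; adjust it if the server says otherwise.
    """
    # csv.reader needs str lines, so decode the byte stream on the fly.
    # newline="" lets the csv module handle line endings itself.
    text = io.TextIOWrapper(stream, encoding=encoding, newline="")
    yield from csv.reader(text)


def read_csv_from_url(url, encoding="utf-8"):
    """Fetch a URL and return its CSV content as a list of rows."""
    with urllib.request.urlopen(url) as response:
        # Build the full list before the response is closed.
        return list(rows_from_binary_stream(response, encoding))


# Offline demonstration with an in-memory stream standing in for a response.
sample = io.BytesIO(b'"Steve","421","0"\n')
for row in rows_from_binary_stream(sample):
    print(row)
```

Calling `read_csv_from_url(url)` with the URL from the example above should give the same rows as the Python 2 loop, each as a list of strings.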

Anstice answered 29/4, 2013 at 16:42 Comment(14)

can you pass that to csv_reader ? I guess so ... its pretty "file-like", but I've never done it or even thought to do that – Ressler 29/4, 2013 at 16:45

lol I dunno that I was right I was just asking ... hadn't ever seen that done before – Ressler 29/4, 2013 at 16:47

I just assumed that it worked, to be honest. Which is crazy, as I have used this hundreds of times. :D – Anstice 29/4, 2013 at 16:50

I think urllib2.urlopen returns a file-like object, so you can probably just remove the .read(), and pass response to the csv.reader. – Gainful 29/4, 2013 at 16:50

It does, but at least for me I don't get the expected output. I think it's a formatting issue. – Anstice 29/4, 2013 at 16:51

when I try to output the result print cr I get this <_csv.reader object at 0x8e3db54> – Portion 29/4, 2013 at 16:54

@Portion that means it is working... That shows you where the object is in memory. Looks like it only reads a line at a time, so maybe cr.next() inside a loop is what you are looking for. (haven't used csv reader myself...) – Vesta 29/4, 2013 at 16:55

Like @Vesta said. I updated my example demonstrating how to display the result. – Anstice 29/4, 2013 at 16:57

I got this output: ['<addinfourl at 163944620 whose fp = <socket._fileobject object at 0x9beca6c>>'] – Portion 29/4, 2013 at 16:58

no I wasn't but when I did, I got an output but empty ` ['<pre> Method Not Allowed</pre> '] [' '] ['</body>'] ['</html>'] ************************************ ` – Portion 29/4, 2013 at 17:4

You did not include the address you are trying to download the data from. It looks like your web server won't allow the request. Try the csv I included in my example. And as an alternative to urllib2 you could try requests as well docs.python-requests.org/en/latest – Anstice 29/4, 2013 at 17:4

First of all, thanks a lot for posting a live example! That is very helpful. I tried adding .csv and got this error: ` response = urllib2.urlopen(NewUrlCall+'.csv',"rb").read() File "/usr/lib/python2.6/urllib2.py", line 124, in urlopen return _opener.open(url, data, timeout) File "/usr/lib/python2.6/urllib2.py", line 395, in open response = meth(req, response) File "/usr/lib/python2.6/urllib2.py", line 508, in http_response http_error_default raise HTTPError(req.get_full_url(), code, msg, hdrs, fp) urllib2.HTTPError: HTTP Error 405: Method Not Allowed ` – Portion 29/4, 2013 at 17:12

let us continue this discussion in chat – Portion 29/4, 2013 at 17:12

please check the chat for more info, – Portion 29/4, 2013 at 17:20

B

148

With pandas, it is very simple to read a CSV file directly from a URL:

import pandas as pd
data = pd.read_csv('https://example.com/passkey=wedsmdjsjmdd')

This reads your data into a tabular DataFrame, which is easy to process.
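If the file might be too large to load in one go, `read_csv` also accepts a `chunksize` argument and then returns an iterator of DataFrames instead of a single one. A rough sketch, assuming the data has no header row (as in the sample line from the question); the function name `count_rows_in_chunks` and the chunk size are made up for this illustration.

```python
import io

import pandas as pd


def count_rows_in_chunks(source, chunk_rows=1000):
    """Count CSV rows while holding at most chunk_rows rows in memory.

    source may be a URL, a path, or a file-like object.
    header=None is an assumption: the first line is treated as data.
    """
    total = 0
    for chunk in pd.read_csv(source, header=None, chunksize=chunk_rows):
        # Each chunk is an ordinary DataFrame; process it here.
        total += len(chunk)
    return total


# Offline demonstration with an in-memory file standing in for the URL.
sample = io.StringIO('"Steve","421","0"\n"Anna","10","1"\n"Raj","7","3"\n')
print(count_rows_in_chunks(sample, chunk_rows=2))
```

Passing the URL string instead of the `StringIO` object should work the same way, since `read_csv` takes either.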

Berseem answered 24/2, 2016 at 9:41 Comment(5)

This is one of the simplest approaches I have come across so far! – Inamorato 17/5, 2018 at 6:21

So long as your CSV file fits into memory, this is okay. – Lop 20/4, 2019 at 22:23

Didn't work for me, maybe I ran out of memory. pandas.errors.ParserError: Error tokenizing data. C error: Expected 1 fields in line 33, saw 2 – Radius 13/4, 2020 at 15:13

Is there any way to use this with a retry? Many times I get a 500 error, and when I call read_csv again it works. This happens a lot when I am reading from Google Sheets. – Krys 12/8, 2020 at 1:50

This answer worked. The other with csv.reader() always gave me a _csv.Error: iterator should return strings, not bytes (did you open the file in text mode?). – Maziar 8/6, 2023 at 23:15
