"OverflowError: Python int too large to convert to C long" on windows but not mac

A

5

64

I am running the exact same code on both windows and mac, with python 3.5 64 bit.

On windows, it looks like this:

>>> import numpy as np
>>> preds = np.zeros((1, 3), dtype=int)
>>> p = [6802256107, 5017549029, 3745804973]
>>> preds[0] = p
Traceback (most recent call last):
  File "<pyshell#13>", line 1, in <module>
    preds[0] = p
OverflowError: Python int too large to convert to C long

However, this code works fine on my mac. Could anyone help explain why or give a solution for the code on windows? Thanks so much!

Aland answered 11/7, 2016 at 18:51 Comment(7)

You're sure both are 64 bit? can you test on linux? – Inundate 11/7, 2016 at 18:53

Even if both systems are on 64-bit Python, are they both on 64-bit NumPy? – Rickettsia 11/7, 2016 at 18:53

Another stackoverflow question explains 'why'. On Windows long is 32bit and on Unux-like long is 64bit. Please see the question #385002 – Whomever 11/7, 2016 at 18:57

Use dtype='int64' or dtype=np.int64. The int type uses a C long, which is always 32-bit on Windows. – Selfassured 11/7, 2016 at 19:34

to Tim: Yes, both are 64bit. I do not have a linux machine, sorry. to user2357112: Yes, both are 64bit python and numpy. to VladimirM: Thanks! I think that question answers mine! to eryksun: Thanks! It works! – Aland 11/7, 2016 at 21:39

How would you do this without numpy? – Trichromatic 10/7, 2019 at 14:1

This is a good solution #15064436 – Contrition 24/9, 2022 at 6:44

H

41

You'll get that error once your numbers are greater than sys.maxsize:

>>> p = [sys.maxsize]
>>> preds[0] = p
>>> p = [sys.maxsize+1]
>>> preds[0] = p
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
OverflowError: Python int too large to convert to C long

You can confirm this by checking:

>>> import sys
>>> sys.maxsize
2147483647

To take numbers with larger precision, don't pass an int type which uses a bounded C integer behind the scenes. Use the default float:

>>> preds = np.zeros((1, 3))

Harleyharli answered 11/7, 2016 at 18:54 Comment(9)

if you do get a number larger than this, how to tackle? – Complex 3/2, 2018 at 17:51

@VeronicaWenqianCheng Don't pass an int dtype, use the default float. – Harleyharli 5/2, 2018 at 11:20

what if it needs to passed as an index which then needs to be int ? – Lay 27/7, 2018 at 0:15

I don't understand your question clearly. The index or the value itself? In the case of the value, use a float. You can easily convert to int in plain Python if you need the value as an int. – Harleyharli 29/7, 2018 at 0:7

MosesKoledoye I think @fireball means what if a non-float is required as an index argument and hence cannot be a float (which you say is required to circumvent this problem)? Should one do int(float(x)) - surely not? – Trichromatic 10/7, 2019 at 14:2

e.g. TypeError: integer argument expected, got float – Trichromatic 10/7, 2019 at 14:4

@Trichromatic Maybe show some code. I can't see why a float is being passed when an int is required. – Harleyharli 11/7, 2019 at 21:18

At least according to the documentation sys.maxsize is the maximum value of py_ssize_t (essentially ssize_t), not long. In particular win64 has a 64-bit size_t, but a 32-bit long. – Meneses 22/12, 2019 at 17:24

With enormous ints, they are very likely to be id's of sorts. Doesn't converting them to floats mean that the digits will be truncated, thus breaking the uniqueness of the ids? – Benedikt 13/7, 2020 at 18:57

D

43

You can use dtype=np.int64 instead of dtype=int

Downer answered 13/10, 2018 at 19:12 Comment(3)

Thanks, I just had to use the unsigned type np.uint64 (to store hashes). – Pneumonic 1/5, 2020 at 13:21

I tried using both np.int64 and np.uint64 to store 109323892912381287389218291378123872293293923929392929289283928 Neither work – Amata 14/7, 2020 at 21:41

If you need to store insanely large numbers exactly then numpy probablly isn't the tool for you. If you can tolerate loss of precision then you can use the float type, alternatively it's possible to have a numpy array of python objects ( #6142353 ) but at that point some would question why you are using a numpy array at all. – Meneses 21/1, 2021 at 14:11

H

41

You'll get that error once your numbers are greater than sys.maxsize:

>>> p = [sys.maxsize]
>>> preds[0] = p
>>> p = [sys.maxsize+1]
>>> preds[0] = p
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
OverflowError: Python int too large to convert to C long

You can confirm this by checking:

>>> import sys
>>> sys.maxsize
2147483647

To take numbers with larger precision, don't pass an int type which uses a bounded C integer behind the scenes. Use the default float:

>>> preds = np.zeros((1, 3))