What is the difference between flatten and ravel functions in numpy?

Asked 8/3, 2015 at 18:49 Answered 22/1, 2019 at 17:11

Solved python numpy multidimensional-array flatten numpy-ndarray

437

import numpy as np
y = np.array(((1,2,3),(4,5,6),(7,8,9)))
OUTPUT:
print(y.flatten())
[1   2   3   4   5   6   7   8   9]
print(y.ravel())
[1   2   3   4   5   6   7   8   9]

Both function return the same list. Then what is the need of two different functions performing same job.

Hensley answered 8/3, 2015 at 18:49 Comment(5)

Ravel usually returns a view into the existing array (sometimes it returns a copy). Flatten returns a new array. – Frontispiece 8/3, 2015 at 18:55

Possible duplicate of What is the difference between flatten and ravel in numpy? – Sewoll 22/8, 2016 at 2:32

Here is a practical demonstration of subtle difference. – Beecham 22/1, 2019 at 17:12

So can someone give an example when it is better to flatten an array and when to ravel it ? – Eyeleen 28/3, 2020 at 19:0

Thank you for asking this, I had the same question. – Hortative 12/10, 2020 at 8:31

541

The current API is that:

flatten always returns a copy.
ravel returns a contiguous view of the original array whenever possible. This isn't visible in the printed output, but if you modify the array returned by ravel, it may modify the entries in the original array. If you modify the entries in an array returned from flatten this will never happen. ravel will often be faster since no memory is copied, but you have to be more careful about modifying the array it returns.
reshape((-1,)) gets a view whenever the strides of the array allow it even if that means you don't always get a contiguous array.

Hamite answered 8/3, 2015 at 19:0 Comment(13)

Any idea why NumPy developers didn't stick to one function with some parameter copy=[True,False]? – Unwell 17/11, 2015 at 17:49

@FranckDernoncourt Great question. I have no idea. The only reason I can think of is wanting to provide an easy analog to a similar matlab command. It doesn't appear to have any precedent in numarray or numeric. – Hamite 17/11, 2015 at 20:52

Backcompat guarantees sometimes cause odd things like this to happen. For example: the numpy developers recently (in 1.10) added a previously implicit guarantee that ravel would return a contiguous array (a property that is very important when writing C extensions), so now the API is a.flatten() to get a copy for sure, a.ravel() to avoid most copies but still guarantee that the array returned is contiguous, and a.reshape((-1,)) to really get a view whenever the strides of the array allow it even if that means you don't always get a contiguous array. – Hamite 17/11, 2015 at 20:59

@IanH: Whats the difference between ravel and reshape then? – Coombs 7/1, 2017 at 15:53

@Coombs IanH explained it: ravelguarantees a contiguous array, and so it is not guaranteed that it returns a view; reshape always returns a view, and so it is not guaranteed that it returns a contiguous array. – Colicroot 10/2, 2017 at 19:49

@iled: Thanks, then what's so important about being contiguous ? why would I want to care about that? – Coombs 10/2, 2017 at 20:24

@Coombs That would be a whole new question. Very briefly, it is much faster to read and write to a contiguous memory space. There are several questions and answers on that here on SO (nice example here), feel free to open a new one if you have any further questions. – Colicroot 10/2, 2017 at 20:40

reshape(-1) is equivalent to reshape((-1,)) – Aesop 6/10, 2017 at 17:37

@diraria , we find that we can also pass order in "ravel" method. If it is 'C' which is default order then view of original array is returned. If is 'F' then it is a copied version of the array i.e. modification in this array is not reflected in actual array. (Not sure if this is a feature or bug :D) – Thermodynamic 3/11, 2018 at 6:2

Update: view of original array is not possible if order is 'F' in "ravel" function.github.com/numpy/numpy/issues/12318 – Thermodynamic 3/11, 2018 at 7:21

Why is it called ravel? What is the idea behind the name? – Apple 26/1, 2019 at 6:49

As these discussions reveal, Numpy is great, but not perfect. – Florence 30/5, 2021 at 15:20

"Unravel" means "investigate and solve or explain (something complicated or puzzling)". Maybe they really wanted the opposite of that, keeping people puzzled just like in this thread. :) – Apostasy 25/1 at 0:29

As explained here a key difference is that:

flatten is a method of an ndarray object and hence can only be called for true numpy arrays.
ravel is a library-level function and hence can be called on any object that can successfully be parsed.

For example ravel will work on a list of ndarrays, while flatten is not available for that type of object.

@IanH also points out important differences with memory handling in his answer.

Venita answered 26/11, 2016 at 2:45 Comment(2)

thx for that info about the ravel() working on lists of ndarray's – Markel 28/10, 2018 at 2:39

Not only lists of arrays but also lists of lists :) – Quarto 18/6, 2020 at 9:51

Here is the correct namespace for the functions:

Both functions return flattened 1D arrays pointing to the new memory structures.

import numpy
a = numpy.array([[1,2],[3,4]])

r = numpy.ravel(a)
f = numpy.ndarray.flatten(a)  

print(id(a))
print(id(r))
print(id(f))

print(r)
print(f)

print("\nbase r:", r.base)
print("\nbase f:", f.base)

---returns---
140541099429760
140541099471056
140541099473216

[1 2 3 4]
[1 2 3 4]

base r: [[1 2]
 [3 4]]

base f: None

In the upper example:

the memory locations of the results are different,
the results look the same
flatten would return a copy
ravel would return a view.

How we check if something is a copy? Using the .base attribute of the ndarray. If it's a view, the base will be the original array; if it is a copy, the base will be None.

Check if a2 is copy of a1

import numpy
a1 = numpy.array([[1,2],[3,4]])
a2 = a1.copy()
id(a2.base), id(a1.base)

Out:

(140735713795296, 140735713795296)

Beecham answered 22/1, 2019 at 17:11 Comment(2)

id(a1.base) should be the same as id(a2.base) – Beecham 29/1, 2021 at 12:55

a1.base and a2.base are both None, which is why the id of the base can/will be the same. But id(a1) and id(a2) are different due to copying. The bases will be different if a2 is a slice of a1 , in which case "a1.base is None" but "a2.base is a1" – Caelum 25/11, 2022 at 15:16

Recommended topics

Hot tags