J

40

395

I have some code like:

good = [x for x in mylist if x in goodvals]
bad = [x for x in mylist if x not in goodvals]

The goal is to split up the contents of mylist into two other lists, based on whether or not they meet a condition.

How can I do this more elegantly? Can I avoid doing two separate iterations over mylist? Can I improve performance by doing so?

Jempty answered 4/6, 2009 at 7:37 Comment(4)

landed here looking for a way to have a condition in the set builder statement, your question answered my question :) – Worcester 21/6, 2012 at 13:27

split is an unfortunate description of this operation, since it already has a specific meaning with respect to Python strings. I think divide is a more precise (or at least less overloaded in the context of Python iterables) word to describe this operation. I landed here looking for a list equivalent of str.split(), to split the list into an ordered collection of consecutive sub-lists. E.g. split([1,2,3,4,5,3,6], 3) -> ([1,2],[4,5],[6]), as opposed to dividing a list's elements by category. – Appomattox 17/12, 2015 at 16:24

Discussion of the same topic on python-list. – Meliorism 10/10, 2016 at 19:13

IMAGE_TYPES should be a set instead of a tuple: IMAGE_TYPES = set('.jpg','.jpeg','.gif','.bmp','.png'). n(1) instead of n(o/2), with practically no difference in readability. – Deferment 5/3, 2017 at 17:58

C

154

good = [x for x in mylist if x in goodvals]
bad  = [x for x in mylist if x not in goodvals]

is there a more elegant way to do this?

That code is perfectly readable, and extremely clear!

# files looks like: [ ('file1.jpg', 33L, '.jpg'), ('file2.avi', 999L, '.avi'), ... ]
IMAGE_TYPES = ('.jpg','.jpeg','.gif','.bmp','.png')
images = [f for f in files if f[2].lower() in IMAGE_TYPES]
anims  = [f for f in files if f[2].lower() not in IMAGE_TYPES]

Again, this is fine!

There might be slight performance improvements using sets, but it's a trivial difference, and I find the list comprehension far easier to read, and you don't have to worry about the order being messed up, duplicates being removed as so on.

In fact, I may go another step "backward", and just use a simple for loop:

images, anims = [], []

for f in files:
    if f.lower() in IMAGE_TYPES:
        images.append(f)
    else:
        anims.append(f)

The a list-comprehension or using set() is fine until you need to add some other check or another bit of logic - say you want to remove all 0-byte jpeg's, you just add something like..

if f[1] == 0:
    continue

Calan answered 4/6, 2009 at 13:28 Comment(13)

Isn't there a list comprehension way without having to loop through the list twice? – Bernita 21/7, 2012 at 15:42

@Bernita no sensible way that I can think of. Why'd you ask? – Calan 21/7, 2012 at 21:49

The problem is that this violates the DRY principle. It'd be nice if there was a better way to do this. – Microdot 9/5, 2013 at 18:3

Once the appetite for functional programming (Haskell), or functional style (LINQ) is raised, we start to smell Python for its age - [x for x in blah if ...] - verbose, lambda is clumsy and limited... It feels like driving the coolest car from 1995 today. Not the same as back then. – Borderland 24/5, 2015 at 13:52

That simple for-loop should be a built-in function, encouraging people to run 2 list-comprehensions in an obvious task for no reason other than python lacking a function for this is a terrible idea imho. – Maypole 14/6, 2015 at 17:10

@TomaszGandor FTR, Haskell is older than Python (and actually influenced its design). I think the syntax for list comprehension and lambdas was deliberately kept a bit on the verbose side, perhaps to discourage over-using them. Which is indeed a bit of a risk... as much as I like Haskell, I can see why many people find Python generally more readable. – Zoomorphism 30/9, 2015 at 20:4

A more elegant, but not performant way to do this is to actually use groupby, because that's exactly what you trying to do. Only you don't want to save group key. – Netty 16/2, 2016 at 9:43

the simple for loop is the best way to do this... a single loop, very clear and readable – Plexor 21/4, 2016 at 10:4

The simple for loop is fastest. See the speed test here. – Deferment 25/3, 2019 at 16:5

What's the point of only looping once? Before answering stop and think about this: Is it faster to two 100 things twice or two things a 100 times? – Lashawnda 29/4, 2019 at 5:26

@vidstige: This question does say "list", but perhaps the iterable we want to use is a generator and can't be looped over twice. And on the subject of performance, doing 2 things 100 times is the same speed as doing 100 things 2 times, sure, but that's not what happens when you use a loop in a programming language. You're doing the things in the body, plus you're spending time doing instructions to manipulate the loop counters and iterable framework on every iteration. The fewer times you iterate, the faster; that's why loop unrolling is a thing. – Unvarnished 3/10, 2019 at 21:41

(This said, it seems highly unlikely the overhead of looping will make a practical difference in 99.9% of Python use cases. Readability should definitely be the concern here.) – Unvarnished 3/10, 2019 at 21:42

The DRY principle shouldn't be treated like a commandment in a fundamentalist religion. In many cases it is better to repeat oneself with a clear pattern using a common idiom in the language, rather than introducing some obscure hack, or a custom function, either or which may likely make things more difficult for readers of your code, and it certainly wastes time when you could accept the pragmatic and idiomatic solution and get on with the next task. – Pesce 5/5, 2021 at 10:44

H

328

Iterate manually, using the condition to select a list to which each element will be appended:

good, bad = [], []
for x in mylist:
    (bad, good)[x in goodvals].append(x)

Hilton answered 27/8, 2012 at 0:51 Comment(16)

That is incredibly ingenious! It took me a while to understand what was happening though. I'd like to know if others think this can be considered readable code or not. – Scute 11/4, 2013 at 11:11

@Scute I'd probably use a dict with boolean keys instead to make it more readable. Implicit bool to int conversions are confusing. – Microdot 9/5, 2013 at 18:4

good.append(x) if x in goodvals else bad.append(x) is more readable. – Tephra 30/5, 2013 at 23:49

nice. or: results = {True:[], False:[]} and for x in mylist: results[x in goodvals].append(x). Basically this is a simple version of the partition function mentioned here by DSM. – Thereof 16/11, 2013 at 22:39

@Tephra Especially since you can make it a one-liner with the for-cycle, and if you wanted to append something more complicated than x, you can make it into one append only: for x in mylist: (good if isgood(x) else bad).append(x) – Hideout 13/2, 2014 at 13:37

one may create the (bad, good) tuple outside the loop first, to avoid creating the tuple for every iteration of the loop. Well, the impact may be trivial if the mylist is short. – Furry 12/7, 2015 at 2:42

@MLister, in that case you should probably include the attribute lookup (bad.append, good.append) – Hilton 12/7, 2015 at 7:36

I just did this and was hoping for a cleaner way, but after reading the other answers I think this probably is the most compact/ cleanest. – Polyhedron 15/1, 2016 at 0:23

A slightly shorter variation: (good if x in goodvals else bad).append(x) – Tamaru 2/10, 2017 at 10:39

What a great answer. While it's debatable whether it's good code, I think it shows true appreciation of the flexibility that Python offers (though I might argue this is borderline abuse). – Bonfire 4/10, 2018 at 13:9

This is a great answer and I'm glad I read it, but I'm also glad it's not the accepted answer. – Shaneka 5/6, 2019 at 11:8

While this might be cute, I wouldn't ever want to see something like this in production code. Nor in non-production code for that matter. It being the most upvoted answer to this question is a bit concerning. – Spicy 14/2, 2020 at 10:16

Oneliner version (be kind with your futur you, please do not use it) : e,o = map(list, (filter(None, _) for _ in zip(*((i,None) if i % 2 == 0 else (None,i) for i in range(10))))) – Swanskin 20/1, 2022 at 11:23

I wouldn't allow this in my codebase. – Alpestrine 24/2, 2023 at 17:43

OMG, What the... code! – Cavitation 14/6, 2023 at 9:19

Low quality answer because nothing is explained. Beginners can not learn from it. Please improve your answer. – Nitro 16/4 at 7:46

C

154

good = [x for x in mylist if x in goodvals]
bad  = [x for x in mylist if x not in goodvals]