LINQ: RemoveAll and get elements removed

F

4

13

Which is the easiest way to remove items that match some condition from a list and then, get those items.

I can think in a few ways, I don't know which is the best one:

var subList = list.Where(x => x.Condition);
list.RemoveAll(x => x.Condition);

or

var subList = list.Where(x => x.Condition);
list.RemoveAll(x => subList.Contains(x));

Is any of this one of the best ways? If it is, which one? If it's not, how should I do it?

Forsythia answered 16/4, 2012 at 17:5 Comment(0)

S

6

I would go with the first option for readability purposes, with the note that you should materialize the list first, or you'll lose the very items you're trying to select on the next line:

var sublist = list.Where(x => x.Condition).ToArray();
list.RemoveAll(x => x.Condition);

The second example is O(n^2) for no reason and the last is perfectly fine, but less readable.

Edit: now that I reread your last example, note that as it's written right now will take out every other item. You're missing the condition check and the remove line should actually be list.RemoveAt(i--); because the i+1th element becomes the ith element after the removal, and when you increment i you're skipping over it.

Sevier answered 16/4, 2012 at 17:9 Comment(9)

It's actually O(n^3), but I'm assuming the lack of materialization just slipped your mind ;) – Sevier 16/4, 2012 at 17:10

Would the items (as I wrote it) be removed from subList with the second instruction? :O – Forsythia 16/4, 2012 at 17:11

Er you never remove from sublist, nor do you intend to if I read it correctly. – Sevier 16/4, 2012 at 17:13

Your edit is right, I'll remove the third options because its really wrong and make it right would be very unreadable – Forsythia 16/4, 2012 at 17:14

No, no.. its not the idea to remove them from subList. I think I misunderstood the materialization. Whats the difference? – Forsythia 16/4, 2012 at 17:15

When you run a linq query over a collection, you don't get back an array, you get back an object that when you iterate over it you run your actual selection. So take your first example. subList will be just an object, you remove the items from the main array, and then when you do foreach(var item in subList) you get back nothing because the condition will always return false. – Sevier 16/4, 2012 at 17:17

So what you do instead is read the items out of your linq query and put them in an array immediately. You can do it manually or using .ToArray() or .ToList() (or a few others depending on how you want your data to look). – Sevier 16/4, 2012 at 17:18

"you should materialize the list first, or you'll lose the very items you're trying to select" Maybe true if you're using, say, a lazy-loading recordset from an ORM, but in the general case, I don't think this is correct. Check this code. iterate true or false, you'll get 2,2.1 every time. No ToArray() needed. – Anticipation 21/1, 2017 at 18:51

I'll be honest with you, this is a few good years old and looking at it I don't understand the need for the first line at all. sublist isn't used by RemoveAll at all. – Sevier 23/1, 2017 at 15:44

M

8

I like to use a functional programming approach (only make new things, don't modify existing things). One advantage of ToLookup is that you can handle more than a two-way split of the items.

ILookup<bool, Customer> lookup = list.ToLookup(x => x.Condition);
List<Customer> sublist = lookup[true].ToList();
list = lookup[false].ToList();

Or if you need to modify the original instance...

list.Clear();
list.AddRange(lookup[false]);

Microgram answered 16/4, 2012 at 17:54 Comment(3)

I think it is much complex (and almost without knowledge I think its not really performance). Does this have any advantage? – Forsythia 16/4, 2012 at 18:36

Condition is evaluated exactly once per item. List instance is not modified, which can be a big advantage if that list instance is shared among threads. – Microgram 16/4, 2012 at 18:47

This is quite a genius idea. – Overcrop 24/4, 2018 at 15:48

S

6

I would go with the first option for readability purposes, with the note that you should materialize the list first, or you'll lose the very items you're trying to select on the next line:

var sublist = list.Where(x => x.Condition).ToArray();
list.RemoveAll(x => x.Condition);