How to get all sub-elements of an element tree with Python ElementTree?

M

5

42

I want to find a way to get all the sub-elements of an element tree like the way ElementTree.getchildren() does, since getchildren() is deprecated since Python version 2.7.
I don't want to use it anymore, though I can still use it currently.

Mcmillen answered 2/5, 2012 at 6:43 Comment(0)

M

29

All sub-elements (descendants) of elem:

all_descendants = list(elem.iter())

A more complete example:

>>> import xml.etree.ElementTree as ET
>>> a = ET.Element('a')
>>> b = ET.SubElement(a, 'b')
>>> c = ET.SubElement(a, 'c')
>>> d = ET.SubElement(a, 'd')
>>> e = ET.SubElement(b, 'e')
>>> f = ET.SubElement(d, 'f')
>>> g = ET.SubElement(d, 'g')
>>> [elem.tag for elem in a.iter()]
['a', 'b', 'e', 'c', 'd', 'f', 'g']

To exclude the root itself:

>>> [elem.tag for elem in a.iter() if elem is not a]
['b', 'e', 'c', 'd', 'f', 'g']

Meteoric answered 2/5, 2012 at 6:56 Comment(5)

Sorry Eli, but maybe I didn't make myself understood, I just want to get all the sub-elements, not also the root. i.e. the root is unwanted here. but I think your method also contains the root object, right? – Mcmillen 2/5, 2012 at 7:4

But what if there are more than one sturct with tag 'a' nested in element 'a' and I want to get all sub-elements of all 'a' structs? – Mcmillen 2/5, 2012 at 7:17

The element objects are iterable also without using the iter(). The element behaves also like a list; so, you can also index the subelements. – Ancier 2/5, 2012 at 7:20

@pepr: yes, but that only gives you the element's immediate children, not all descendants – Meteoric 2/5, 2012 at 7:23

@Eli Bendersky: I see. But the getchildren() also returns only the immediate children. The old equivalent of the new list(elem.iter()) is the list(elem.getiterator()). It depends what Steven really wants. – Ancier 2/5, 2012 at 11:24

F

19

in the pydoc it is mentioned to use list() method over the node to get child elements.
list(elem)

Ferdelance answered 22/8, 2018 at 8:47 Comment(1)

You can also do things like for child in elem and last_child = elem[-1]. – Samsun 1/7, 2023 at 22:43

A

8

If you want to get all elements 'a', you can use:

a_lst = list(elem.iter('a'))

If the elem is also 'a', it will be included.

Ancier answered 2/5, 2012 at 11:31 Comment(0)

A

3

None of the existing answers will find all children. This solution uses BeautifulSoup instead of ETree, but will find all children, instead of just top-level:

from bs4 import BeautifulSoup    

with open(filename) as f:
    soup = BeautifulSoup(f, 'xml')

results = soup.find_all('element_name')

Armorial answered 2/3, 2017 at 10:13 Comment(0)

C

3

Maybe this does not correspond to OP actual question but in a greater sense I would suggest that if someone want to get all elements named with a certain name e.g. 'object' can use (an alternative approach to @Turtles Are Cute which to me at least seems more natural):

objs = tree.findall('object')

which also returns a list.

Crate answered 24/8, 2018 at 9:27 Comment(0)

Recommended topics

Hot tags