lxml - McMap

4

Solved

How to tell lxml.etree.tostring(element) not to write namespaces in python?

I have a huge xml file (1 Gig). I want to move some of the elements (entrys) to another file with the same header and specifications. Let's say the original file contains this entry with tag <t...

python namespaces lxml tostring elementtree

Bogey asked 9/8, 2011 at 23:20

2

Parse SGML with Open Arbitrary Tags in Python 3

I am trying to parse a file such as: http://www.sec.gov/Archives/edgar/data/1409896/000118143112051484/0001181431-12-051484.hdr.sgml I am using Python 3 and have been unable to find a solution wit...

python xml python-3.x lxml sgml

Norenenorfleet asked 20/9, 2012 at 2:39

2

Solved

Should I use .text or .content when parsing a Requests response?

I occasionally use res.content or res.text to parse a response from Requests. In the use cases I have had, it didn't seem to matter which option I used. What is the main difference in parsing HTML...

python python-requests lxml

Adigranth asked 20/10, 2016 at 19:52

2

Solved

How do I match contents of an element in XPath (lxml)?

I want to parse HTML with lxml using XPath expressions. My problem is matching for the contents of a tag: For example given the <a href="http://something">Example</a> element...

python xpath lxml predicate

Stutsman asked 14/4, 2010 at 13:35

5

get errors when import lxml.etree to python

i have installed an lxml on my mac, when i type in python like this localhost:lxml-3.0.1 apple$ python Python 2.7.3 (v2.7.3:70274d53c1dd, Apr 9 2012, 20:52:43) [GCC 4.2.1 (Apple Inc. build 5666)...

python python-2.7 lxml

Collegiate asked 13/11, 2012 at 5:28

2

Solved

How to get an XPath from selenium webelement or from lxml?

I am using selenium and I need to find the XPaths of some selenium web elements. For example: import selenium.webdriver driver = selenium.webdriver.Firefox() element = driver.find_element_by_xpa...

python selenium xpath html-parsing lxml

Brillatsavarin asked 25/6, 2014 at 14:43

6

Failing to install lxml using pip

I am attempting to use pip to install lxml. I have Windows 11 and Python version python-3.10.2-amd64. I am using Visual Studio Code (VSC) as well. I realized I needed lxml from this error message i...

python lxml python-wheel

Ascribe asked 17/2, 2022 at 4:57

2

Solved

difference between lxml and html5lib in the context of beautifulsoup

Is there a difference between the capabiities of lxml and html5lib parsers in the context of beautifulsoup? I am trying to learn to use BS4 and using the following code construct -- ret = requests...

python beautifulsoup lxml html5lib

Saraband asked 3/9, 2013 at 0:44

22

bs4.FeatureNotFound: Couldn't find a tree builder with the features you requested: lxml. Do you need to install a parser library?

... soup = BeautifulSoup(html, "lxml") File "/Library/Python/2.7/site-packages/bs4/__init__.py", line 152, in __init__ % ",".join(features)) bs4.FeatureNotFound: Could...

python python-2.7 beautifulsoup lxml

Sutra asked 25/6, 2014 at 0:12

25

Solved

Cannot install Lxml on Mac OS X 10.9

I want to install Lxml so I can then install Scrapy. When I updated my Mac today it wouldn't let me reinstall lxml, I get the following error: In file included from src/lxml/lxml.etree.c:314: /priv...

python xcode macos scrapy lxml

Smelt asked 23/10, 2013 at 17:7

11

Solved

How to Pretty Print HTML to a file, with indentation

I am using lxml.html to generate some HTML. I want to pretty print (with indentation) my final result into an html file. How do I do that? This is what I have tried and got till now import lxml.htm...

python html lxml pretty-print

Swane asked 27/5, 2011 at 9:9

5

Solved

Python3, lxml and "Symbol not found: _lzma_auto_decoder" on Mac OS X 10.9

I have installed python 3 using homebrew and afterwards installed pip3 and lxml. The following line from lxml import entree leads to the following error: $ python3 Python 3.3.5 (v3....

python macos python-3.x lxml homebrew

Detector asked 10/5, 2014 at 20:56

6

ImportError: cannot import name 'etree' on Python 3.6

I am getting error while running "from lxml import tree" on python3.6 >>> import lxml >>> from lxml import etree Traceback (most recent call last): File "<stdin>", line 1, ...

python-3.x ubuntu-14.04 lxml xml.etree

Inseparable asked 21/11, 2018 at 6:50

12

Solved

How to install lxml on Ubuntu

I'm having difficulty installing lxml with easy_install on Ubuntu 11. When I type $ easy_install lxml I get: Searching for lxml Reading http://pypi.python.org/simple/lxml/ Reading http://codespea...

python ubuntu lxml libxml2 easy-install

Eldwun asked 28/6, 2011 at 10:8

6

Solved

Problems installing lxml on M1 mac

So, I'm having the classic trouble install lxml. Initially I was just pip installing, but when I tried to free up memory using Element.clear() I was getting the following error: Python(58695,0x1001...

python pip lxml apple-m1 libxml2

Lydia asked 4/4, 2022 at 12:32

3

Solved

builtins.TypeError: must be str, not bytes

I've converted my scripts from Python 2.7 to 3.2, and I have a bug. # -*- coding: utf-8 -*- import time from datetime import date from lxml import etree from collections import OrderedDict # Creat...

python python-3.x xml lxml

Allix asked 1/4, 2011 at 11:37

4

Solved

Changing the default indentation of etree.tostring in lxml

I have an XML document which I'm pretty-printing using lxml.etree.tostring print etree.tostring(doc, pretty_print=True) The default level of indentation is 2 spaces, and I'd like to change this ...

python lxml pretty-print

Arcuation asked 6/8, 2009 at 13:47

2

Solved

Python lxml - get index of tag's text

I have an xml-file with a format similar to docx, i.e.: <w:r> <w:rPr> <w:sz w:val="36"/> <w:szCs w:val="36"/> </w:rPr> <w:t>BIG_TEXT<...

python xml lxml

Valdavaldas asked 10/9, 2012 at 14:40

2

Solved

Parse HTML/XML and find locations of elements in original document

Is there a way to get the original location of an element in a document, ie. the start and end character index, when parsing html/xml in Python? I've looked through the lxml documentation and coul...

python xml-parsing html-parsing lxml

Entwistle asked 24/11, 2011 at 14:21

6

How to obtain all the links in a domain using Python?

I want to use Python to obtain all the links in a domain given the 'root' URL (in a list). Suppose given a URL http://www.example.com this should return all the links on this page of the same domai...

python web-scraping beautifulsoup lxml

Honeyman asked 6/1, 2016 at 10:59

2

Scrapy startproject command failed on import etree, Mac M1. The error message is: "symbol not found in flat namespace"

I am trying to start a project using the Scrapy library, for a small webscraping project, but it fails on the import etree module. The exact error on the traceback is: from .. import etree ImportEr...

python scrapy lxml apple-m1 xml.etree

Melee asked 26/1, 2022 at 11:17

5

Installing LXML, facing a "legacy-install-failure" error

Trying to install lxml on Python 311. Faced with this error. PS C:\Users\chharlie\Desktop\code> pip install lxml Collecting lxml Using cached lxml-4.9.1.tar.gz (3.4 MB) Preparing metadata (set...

python lxml python-wheel yfinance pandas-datareader

Reld asked 6/11, 2022 at 1:38

8

Solved

Error while installing lxml through pip: Microsoft Visual C++ 14.0 is required

I am on a windows 10 machine and recently moved from python 2.7 to 3.5. When trying to install lxml through pip, it stops and throws this error message- building 'lxml.etree' extension error: ...

python visual-c++pip lxml

Syrinx asked 15/8, 2016 at 4:59

2

Solved

Extract Coordinates from KML BatchGeo File with Python

I've uploaded some addresses to BatchGeo and downloaded the resulting KML file from which I want to extract the coordinates. I managed to prettify the jumbled text file online here, but I don't kno...

python lxml geocoding kml pykml

Tomahawk asked 4/12, 2012 at 21:19

1

lxml.etree to iterparse requests response (with stream=True)

I have a SOAP client in Python receiving a response, which, in one element of the SOAP envelope's body receives a large stream of data (gzipped file, several GBs, machine's main memory not necessar...

python python-requests lxml

Besant asked 22/7, 2016 at 13:46

lxml Questions

Recommended topics

Hot tags