Search for text nodes in Nokogiri

Asked 10/9, 2014 at 7:2 Answered 10/9, 2014 at 9:37

I searched for text nodes on a document fragment. This works, as the following snippet shows:

doc = Nokogiri::HTML::DocumentFragment.parse("<p>foo</p>") 
doc.xpath('.//text()')
=> [#<Nokogiri::XML::Text:0x3fe56c8c02a8 "foo">]

However, my root node may not have a tag like <p>; it could be a simple string like "foo". In that case the query returns nothing.

doc = Nokogiri::HTML::DocumentFragment.parse("foo") 
doc.xpath('.//text()')
=> []

Changing the query to doc.xpath('text()') solves the problem.

Is there a single query that combines both behaviours?

Enhanced answered 10/9, 2014 at 7:2 Comment(6)

maybe "//p[text()='foo']" ? – Squeegee 10/9, 2014 at 7:3

foo is not a tag based on your example. – Conation 10/9, 2014 at 7:4

One remark: the string is not fixed, and neither is the outer tag. I just want to find all text nodes in all children. – Enhanced 10/9, 2014 at 7:5

you want to find text nodes except foo? – Conation 10/9, 2014 at 7:6

I want to find all text nodes, regardless of its name. – Enhanced 10/9, 2014 at 7:8

A workaround is to wrap it in an additional element: parse("<root>#{original_xml}</root>"). The non-working example (parse("foo")) is not valid XML, afaik (DocumentFragment). – Palgrave 10/9, 2014 at 7:13

I have not used Nokogiri, but in standard XPath, you should be able to just use the union operator:

doc.xpath('.//text() | text()')

Servility answered 10/9, 2014 at 9:37 Comment(0)
