I have not found any documentation nor tutorial for that. Does anything like that exist? <hr> <pre class="prettyprint"><code>doc.xpath('//table/tbody[@id="threadbits_forum_251"]/tr') </code></pre> The code above will get me any <code>table</code>, anywhere, that has a <code>tbody</code> child with the attribute <code>id</code> equal to "threadbits_forum_251". But why does it start with double <code>//</code>? Why there is <code>/tr</code> at the end? See "Ruby Nokogiri Parsing HTML table II" for more details. <hr> Can anybody tell me how to extract <code>href</code>, <code>id</code>, <code>alt</code>, <code>src</code>, etc., using Nokogiri? <pre class="prettyprint"><code>td[3]/div[1]/a/text()' <--- extracts text </code></pre> How can I extract other things?

Seems you need to read a XPath Tutorial Your <code>//table/tbody[@id="threadbits_forum_251"]/tr</code> expression means: <ul> <li> <code>//</code> - Anywhere in your XML document</li> <li> <code>table/tbody</code> - take a table element with a tbody child</li> <li> <code>[@id="threadbits_forum_251"]</code> - where id attribute are equals to "threadbits_forum_251"</li> <li> <code>tr</code> - and take its <code>tr</code> elements</li> </ul> So, basically, you need to know: <ul> <li>attributes begins with <code>@</code> </li> <li>conditions go inside <code>[]</code> brackets</li> </ul> If I correcly understood that API, you can go with <code>doc.xpath("td[3]/div[1]/a")["href"]</code>, or <code>td[3]/div[1]/a/@href</code> if there is just one <code><a></code> element.

How do I use XPath in Nokogiri?

Tags:

ruby

xpath

nokogiri

I have not found any documentation nor tutorial for that. Does anything like that exist?

Click to copy

doc.xpath('//table/tbody[@id="threadbits_forum_251"]/tr')

The code above will get me any table, anywhere, that has a tbody child with the attribute id equal to "threadbits_forum_251". But why does it start with double //? Why there is /tr at the end? See "Ruby Nokogiri Parsing HTML table II" for more details.

Can anybody tell me how to extract href, id, alt, src, etc., using Nokogiri?

Click to copy

td[3]/div[1]/a/text()' <--- extracts text

How can I extract other things?

840

asked Jan 17 '10 11:01

Radek

Video Answer

2 Answers

Seems you need to read a XPath Tutorial

Your //table/tbody[@id="threadbits_forum_251"]/tr expression means:

// - Anywhere in your XML document
table/tbody - take a table element with a tbody child
[@id="threadbits_forum_251"] - where id attribute are equals to "threadbits_forum_251"
tr - and take its tr elements

So, basically, you need to know:

attributes begins with @
conditions go inside [] brackets

If I correcly understood that API, you can go with doc.xpath("td[3]/div[1]/a")["href"], or td[3]/div[1]/a/@href if there is just one <a> element.

167

answered Oct 12 '22 09:10

Rubens Farias

Your XPath is correct and you seem to have answered your own question's first part (almost):

Click to copy

doc.xpath('//table/tbody[@id="threadbits_forum_251"]/tr')

"the code above will get me any ~~table~~ table's tr, anywhere, that has a tbody child with the attribute id equal to threadbits_forum_251"

// means the following element can appear anywhere in the document.

/tr at the end means, get the tr node of the matching element.

You dont need to extract each attribute one by one. Just get the entire node containing all four attributes in Nokogiri, and get the attributes using:

Click to copy

theNode['href'] theNode['src']

Where theNode is your Nokogiri Node object.

Edit:

Sorry I haven't used these libraries, but I think the XPath evaluation and parsing is being done by Mechanize. So here's how you would get the entire element and its attributes in one go.

Click to copy

doc.xpath("td[3]/div[1]/a").each do |anchor|     puts anchor['href']     puts anchor['src']     ... end

answered Oct 12 '22 09:10

Anurag

Related questions
                            
                                In Ruby on Rails, are '#encoding: utf-8' and 'config.encoding = "utf-8"' different?
                            
                                `binding.pry` for javascript console?
                            
                                How to convert 1 to "first", 2 to "second", and so on, in Ruby?
                            
                                How do I find where a constant is defined in Ruby?
                            
                                What is a worker in ruby/rails?
                            
                                POST json to rails server
                            
                                Why are there frozen constants everywhere?
                            
                                rails console - display active record results in a table
                            
                                Two controllers for one shared view in Ruby on Rails
                            
                                Run a bundler-deployed Ruby app outside of its own directory?
                            
                                Why there is no double-render when using before_action?
                            
                                How to raise an ActiveRecord::Rollback exception and return a value together?
                            
                                Rails Fixtures not loading with rspec
                            
                                How to alias a class method in rails model?
                            
                                Constructor overriding
                            
                                How can I call an older version of a gem from the commandline?
                            
                                Is it possible to compile Ruby to byte code as with Python?
                            
                                Ruby on Rails - How to print log messages in color
                            
                                Rspec run all tests except a specific folder
                            
                                What a safe and easy way to delete a dir in Ruby?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How do I use XPath in Nokogiri?

Tags:

ruby

xpath

nokogiri

Radek

People also ask

Video Answer

2 Answers

Rubens Farias

Anurag

Recent Activity

Donate For Us