<p>I'm reading Scrapy/XPath tutorials but this does not seem trivial and I can't find an example that would explain it.</p> <p>Given a markup like this how would you select the <code><span></code> element?</p> <p></p> <div class="snippet" data-lang="js" data-hide="false"> <div class="snippet-code"> <pre class="prettyprint snippet-code-html lang-html prettyprint-override"><code><div id=”...”> <div> <div> <div> <div> <div> <div> <div> <span></code></pre> </div> </div> <p>If we generalize the problem it would be:</p> <ul> <li>skip n divs in the div with id="..."</li> <li>skip m divs in the div</li> <li>...</li> <li>select the span element in the div</li> </ul>

<p>Assuming indentation denotes containment in your example, the following XPath will select the <code>span</code> element for you:</p> <pre class="prettyprint"><code>//div[@id='...']/div[3]/div[2]/div/div/span </code></pre> <p>Of course, if there are no other <code>span</code> elements beneath the id'ed <code>div</code>, you could jump right to it:</p> <pre class="prettyprint"><code>//div[@id='...']//span </code></pre> <p>Or if there are no other <code>span</code> elements in the entire document:</p> <pre class="prettyprint"><code>//span </code></pre>

Select deeply nested element

Tags:

html

xml

xpath

scrapy

I'm reading Scrapy/XPath tutorials but this does not seem trivial and I can't find an example that would explain it.

Given a markup like this how would you select the <span> element?

<div id=”...”>
	<div>
	<div>
	<div>
		<div>
		<div>
			<div>
				<div>
					<span>

If we generalize the problem it would be:

skip n divs in the div with id="..."
skip m divs in the div
...
select the span element in the div

662

asked Aug 04 '15 02:08

grigy

1 Answers

Assuming indentation denotes containment in your example, the following XPath will select the span element for you:

//div[@id='...']/div[3]/div[2]/div/div/span

Of course, if there are no other span elements beneath the id'ed div, you could jump right to it:

//div[@id='...']//span

Or if there are no other span elements in the entire document:

//span

117

answered Sep 16 '22 14:09

kjhughes

Related questions
                            
                                Controlling audio speed of a mp3 file
                            
                                Remove top of page space from Bootstrap
                            
                                Truncate string in Rails: "..." showing up on strings at length
                            
                                Full-screen Canvas is low res [duplicate]
                            
                                Place icon inside submit button using Bootstrap 3
                            
                                CSS Transparent buttons with borders
                            
                                Serving a static HTML page containing an image using Node JS / Express
                            
                                JQuery "Chosen" dropdown cut when appearing
                            
                                SVG shape transparency with solid color as a background
                            
                                Put HTML head in another file
                            
                                How can I pass the current element to a Javascript function in a Knockout.js binding?
                            
                                How to put span in Jade?
                            
                                Do scrapers need to be written for every site they target?
                            
                                How to force position absolute with 100% width to fit into parent div with padding?
                            
                                Why doesn't translateX work as expected for fixed elements on IE9, IE10, and IE11?
                            
                                change focus to the next input text with angularjs
                            
                                How to insert a HTML character into this Html.ActionLink?
                            
                                How to have tabs of matching height with bootstrap?
                            
                                Number only input box with range restriction
                            
                                Change color of the text in contenteditable div

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With