Puphpeteer - Get text and href-attribute from link

Tags:

php

I am using "@nesk/puphpeteer": "^2.0.0" (GitHub repo) and want to get the text and the href attribute from a link.

I tried the following:

<?php

require_once '../vendor/autoload.php';

use Nesk\Puphpeteer\Puppeteer;
use Nesk\Rialto\Data\JsFunction;

$debug = true;

$puppeteer = new Puppeteer([
    'read_timeout' => 100,
    'debug' => $debug,
]);
$browser = $puppeteer->launch([
    'headless' => !$debug,
    'ignoreHTTPSErrors' => true,
]);

$page = $browser->newPage();
$page->goto('http://example.python-scraping.com/');

//get text and link
$links = $page->querySelectorXPath('//*[@id="results"]/table/tbody/tr/td/div/a', JsFunction::createWithParameters(['node'])
    ->body('return node.textContent;'));

// iterate over links and print each link and its text

// get single text
$singleText = $page->querySelectorXPath('//*[@id="pagination"]/a', JsFunction::createWithParameters(['node'])
    ->body('return node.textContent;'));


$browser->close();

When I run the above script I get the nodes from the page, but I cannot access their attributes or their text.

Any suggestions on how to do this?

I appreciate your replies!

asked Aug 09 '21 19:08

Carol.Kar

1 Answer

querySelectorXPath returns an array of ElementHandle objects. Also, querySelectorXPath does not accept a callback function as a second argument.

First, get all the matching nodes as ElementHandle objects:

$links = $page->querySelectorXPath('//*[@id="results"]/table/tbody/tr/td/div/a');

Then loop over the links and call evaluate on each handle to read the node's text or attributes:

foreach ($links as $link) {
    // Text of the node
    $text = $link->evaluate(JsFunction::createWithParameters(['node'])
        ->body('return node.innerText;'));

    // href of the node (kept in $href so the loop variable $link is not overwritten)
    $href = $link->evaluate(JsFunction::createWithParameters(['node'])
        ->body('return node.href;'));
}
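
Putting the two steps together, a complete script might look roughly like the sketch below. It assumes the same URL and XPath expressions as in the question, and that the pagination XPath matches at least one link. The variable names ($getText, $getHref, $pagination) and the try/finally wrapper are my own choices for illustration, not something PuPHPeteer requires.

```php
<?php

require_once '../vendor/autoload.php';

use Nesk\Puphpeteer\Puppeteer;
use Nesk\Rialto\Data\JsFunction;

$puppeteer = new Puppeteer();
$browser = $puppeteer->launch();

try {
    $page = $browser->newPage();
    $page->goto('http://example.python-scraping.com/');

    // JS functions run in the browser; each receives the DOM node behind the handle.
    $getText = JsFunction::createWithParameters(['node'])->body('return node.textContent;');
    $getHref = JsFunction::createWithParameters(['node'])->body('return node.href;');

    // Step 1: fetch the handles only (no callback argument).
    $links = $page->querySelectorXPath('//*[@id="results"]/table/tbody/tr/td/div/a');

    // Step 2: evaluate each handle to read its text and href.
    foreach ($links as $link) {
        $text = trim($link->evaluate($getText));
        $href = $link->evaluate($getHref);
        echo $text . ' => ' . $href . PHP_EOL;
    }

    // A single element is still returned inside an array, so take the first match.
    $pagination = $page->querySelectorXPath('//*[@id="pagination"]/a');
    if (!empty($pagination)) {
        echo trim($pagination[0]->evaluate($getText)) . PHP_EOL;
    }
} finally {
    // Close the browser even if one of the calls above throws.
    $browser->close();
}
```

Note that node.href gives the resolved absolute URL; if you want the raw attribute value as written in the HTML, return node.getAttribute('href') in the function body instead.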

answered Oct 12 '22 04:10

Zeeshan Anjum
