 

How can I capture all network requests and full response data when loading a page in Chrome?

Using Puppeteer, I'd like to load a URL in Chrome and capture the following information:

  • request URL
  • request headers
  • request post data
  • response headers text (including duplicate headers like set-cookie)
  • transferred response size (i.e. compressed size)
  • full response body

Capturing the full response body is what causes problems for me.

Things I've tried:

  • Getting response content with response.buffer() - this does not work if there are redirects at any point, since buffers are wiped on navigation
  • Intercepting requests and using getResponseBodyForInterception - this means I can no longer access the encodedLength, and I also had problems getting the correct request and response headers in some cases (see the protocol sketch after this list)
  • Using a local proxy - this works, but it slowed down page load times significantly (and also changed some behavior, e.g. for certificate errors)
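
For reference, the non-interception DevTools-protocol route looks roughly like this. It is only a minimal sketch (the requests map and the example.com URL are illustrative placeholders): encodedDataLength arrives with Network.loadingFinished, but Network.getResponseBody can still fail once a body has been evicted, e.g. after a navigation, which is the same buffer problem as above.

const puppeteer = require('puppeteer');

(async () => {
  const browser = await puppeteer.launch();
  const page = await browser.newPage();

  // talk to the DevTools protocol directly instead of intercepting requests
  const client = await page.target().createCDPSession();
  await client.send('Network.enable');

  const requests = new Map(); // requestId -> collected data

  client.on('Network.requestWillBeSent', event => {
    requests.set(event.requestId, {
      url: event.request.url,
      requestHeaders: event.request.headers,
      postData: event.request.postData,
    });
  });

  client.on('Network.responseReceived', event => {
    const entry = requests.get(event.requestId);
    if (entry) {
      // raw headers text preserves duplicate headers such as set-cookie
      entry.responseHeadersText = event.response.headersText;
    }
  });

  client.on('Network.loadingFinished', async event => {
    const entry = requests.get(event.requestId);
    if (!entry) return;
    entry.encodedDataLength = event.encodedDataLength; // transferred (compressed) size
    try {
      const { body, base64Encoded } = await client.send('Network.getResponseBody', {
        requestId: event.requestId,
      });
      entry.body = base64Encoded ? Buffer.from(body, 'base64') : body;
    } catch (err) {
      // the body may already have been evicted, e.g. after a navigation
    }
  });

  await page.goto('https://example.com/', { waitUntil: 'networkidle0' });
  console.log([...requests.values()]);
  await browser.close();
})();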

Ideally the solution should only have a minor performance impact and have no functional differences from loading a page normally. I would also like to avoid forking Chrome.

Asked Oct 24 '18 by Matt Zeunert



2 Answers

You can enable request interception with page.setRequestInterception(), and then, inside page.on('request'), use the request-promise-native module as a middleman to gather the response data before continuing the request with request.continue() in Puppeteer.

Here's a full working example:

'use strict';

const puppeteer = require('puppeteer');
const request_client = require('request-promise-native');

(async () => {
  const browser = await puppeteer.launch();
  const page = await browser.newPage();
  const result = [];

  await page.setRequestInterception(true);

  page.on('request', request => {
    // replay the request ourselves, forwarding the original method,
    // headers and post data so the server sees the same request
    request_client({
      uri: request.url(),
      method: request.method(),
      headers: request.headers(),
      body: request.postData(),
      gzip: true, // decompress gzipped responses
      resolveWithFullResponse: true,
    }).then(response => {
      const request_url = request.url();
      const request_headers = request.headers();
      const request_post_data = request.postData();
      const response_headers = response.headers;
      const response_size = response_headers['content-length'];
      const response_body = response.body;

      result.push({
        request_url,
        request_headers,
        request_post_data,
        response_headers,
        response_size,
        response_body,
      });

      console.log(result);
      request.continue();
    }).catch(error => {
      console.error(error);
      request.abort();
    });
  });

  await page.goto('https://example.com/', {
    waitUntil: 'networkidle0',
  });

  await browser.close();
})();
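
Note that this approach fetches every resource twice: once via request-promise-native and once by the browser after request.continue(). Forwarding the method, headers and post data (as in the snippet above) keeps the replayed request close to the original, but non-idempotent requests such as POSTs will still hit the server twice, and timing and ordering can differ from a normal page load.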
Answered Nov 07 '22 by Grant Miller

Puppeteer-only solution

This can be done with Puppeteer alone. The problem you describe, that response.buffer is cleared on navigation, can be circumvented by processing the requests one after another.

How it works

The code below uses page.setRequestInterception to intercept all requests. While one request is being processed or waited for, any new requests are put into a queue. Because there are no parallel requests, response.buffer() can then be used without the risk that another request asynchronously wipes the buffer. As soon as the current request/response has been handled, the next request is processed.

Code

const puppeteer = require('puppeteer');

(async () => {
    const browser = await puppeteer.launch();
    const [page] = await browser.pages();

    const results = []; // collects all results

    let paused = false;
    let pausedRequests = [];

    const nextRequest = () => { // continue the next request or "unpause"
        if (pausedRequests.length === 0) {
            paused = false;
        } else {
            // continue first request in "queue"
            (pausedRequests.shift())(); // calls the request.continue function
        }
    };

    await page.setRequestInterception(true);
    page.on('request', request => {
        if (paused) {
            pausedRequests.push(() => request.continue());
        } else {
            paused = true; // pause, as we are processing a request now
            request.continue();
        }
    });

    page.on('requestfinished', async (request) => {
        const response = await request.response();

        const responseHeaders = response.headers();
        let responseBody;
        if (request.redirectChain().length === 0) {
            // body can only be accessed for non-redirect responses
            responseBody = await response.buffer();
        }

        const information = {
            url: request.url(),
            requestHeaders: request.headers(),
            requestPostData: request.postData(),
            responseHeaders: responseHeaders,
            responseSize: responseHeaders['content-length'],
            responseBody,
        };
        results.push(information);

        nextRequest(); // continue with next request
    });
    page.on('requestfailed', (request) => {
        // handle failed request
        nextRequest();
    });

    await page.goto('...', { waitUntil: 'networkidle0' });
    console.log(results);

    await browser.close();
})();
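
A note on the trade-off: because requests are handled strictly one after another, only one request is in flight at a time. The buffer can no longer be wiped mid-read, but page load time grows with the number of requests, so this approach favors complete capture over loading speed.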
Answered Nov 07 '22 by Thomas Dondorf