Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

JSDOM in nodeJS: How do I get back the manipulated html?

Tags:

I am trying to manipulate a remote HTML and return it manipulated. I decided to use JSDOM but cannot figure out how to get the manipulated HTML back. Any ideas?

  jsdom.env({         url: "http://www.cnn.com",         scripts: ["http://code.jquery.com/jquery.js"],         done: function (err, window) {             var $ = window.$;             console.log("HN Links");             var src = $(".ghciTopStoryImage1 img").attr('src','http://lorempixel.com/396/220/');             var headline = $(".blkbigheader span").html('header');             var description = $(".blkbigheader").parent().find("p a:eq(0)").html('text');             // not working....             content =$(window.document).html();          }     }); 
like image 314
Guy Avatar asked Aug 05 '15 09:08

Guy


2 Answers

From the jsdom readme:

var JSDOM = require("jsdom").JSDOM;  var jsdom = new JSDOM("<!DOCTYPE html>hello");  jsdom.serialize() === "<!DOCTYPE html><html><head></head><body>hello</body></html>"; doc.documentElement.outerHTML === "<html><head></head><body>hello</body></html>"; 

Adapting your above example would then just be content = jsdom.serialize().

like image 104
Domenic Avatar answered Oct 04 '22 00:10

Domenic


sometimes jQuery isn't the answer

content = window.document.documentElement.outerHTML; 
like image 28
Jaromanda X Avatar answered Oct 04 '22 00:10

Jaromanda X