I am trying to manipulate a remote HTML and return it manipulated. I decided to use JSDOM but cannot figure out how to get the manipulated HTML back. Any ideas?
jsdom.env({ url: "http://www.cnn.com", scripts: ["http://code.jquery.com/jquery.js"], done: function (err, window) { var $ = window.$; console.log("HN Links"); var src = $(".ghciTopStoryImage1 img").attr('src','http://lorempixel.com/396/220/'); var headline = $(".blkbigheader span").html('header'); var description = $(".blkbigheader").parent().find("p a:eq(0)").html('text'); // not working.... content =$(window.document).html(); } });
From the jsdom readme:
var JSDOM = require("jsdom").JSDOM; var jsdom = new JSDOM("<!DOCTYPE html>hello"); jsdom.serialize() === "<!DOCTYPE html><html><head></head><body>hello</body></html>"; doc.documentElement.outerHTML === "<html><head></head><body>hello</body></html>";
Adapting your above example would then just be content = jsdom.serialize()
.
sometimes jQuery isn't the answer
content = window.document.documentElement.outerHTML;
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With