This question was asked once already, but the API changed I guess and the answers are no valid anymore.
URL url = new URL("http://www.example.com");
StringWebResponse response = new StringWebResponse("<html><head><title>Test</title></head><body></body></html>", url);
HtmlPage page = HTMLParser.parseHtml(response, new TopLevelWindow("top", new WebClient()));
System.out.println(page.getTitleText());
Can't be done because TopLevelWindow is protected and stuff like extending/implementing the window because of that is ridiculous :)
Anybody has an idea how to do that ? It seems to me weird that it can't be done easily.
This code works in GroovyConsole
@Grapes(
@Grab(group='net.sourceforge.htmlunit', module='htmlunit', version='2.8')
)
import com.gargoylesoftware.htmlunit.*
import com.gargoylesoftware.htmlunit.html.*
URL url = new URL("http://www.example.com");
StringWebResponse response = new StringWebResponse("<html><head><title>Test</title></head><body></body></html>", url);
WebClient client = new WebClient()
HtmlPage page = HTMLParser.parseHtml(response, client.getCurrentWindow());
System.out.println(page.getTitleText());
Using HTMLUnit 2.40, Grooveek's code won't compile, you get "Cannot make a static reference to the non-static method parseHtml(WebResponse, WebWindow) from the type HTMLParser". But there is now a class HtmlUnitNekoHtmlParser implementing the HTMLParser interface, so the following code works:
StringWebResponse response = new StringWebResponse(
"<html><head><title>Test</title></head><body></body></html>",
new URL("http://www.example.com"));
HtmlPage page = new HtmlUnitNekoHtmlParser().parseHtml(
response, new WebClient().getCurrentWindow());
There is some sample code in the FAQ https://htmlunit.sourceforge.io/faq.html#HowToParseHtmlString
e.g.
final String htmlCode = "<html>"
+ " <head>"
+ " <title>Title</title>"
+ " </head>"
+ " <body>"
+ " content..."
+ " </body>"
+ "</html> ";
try (WebClient webClient = new WebClient(browserVersion)) {
final HtmlPage page = webClient.loadHtmlCodeIntoCurrentWindow(htmlCode);
// work with the html page
}
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With