Can Html Agility Pack be used to parse an html string fragment? Such As: <pre class="prettyprint"><code>var fragment = "Some code "; </code></pre> Then extract all <code></code> tags? All the examples I seen so far have been loading like html documents.

If it's html then yes. <pre class="prettyprint"><code>string str = "Some code"; // not sure if needed string html = string.Format("<html><head></head><body>{0}</body></html>", str); HtmlDocument doc = new HtmlDocument(); doc.LoadHtml(html); // look xpath tutorials for how to select elements // select 1st element HtmlNode bNode = doc.DocumentNode.SelectSingleNode("b[1]"); string boldText = bNode.InnerText; </code></pre>

Can I use Html Agility Pack To Parse HTML Fragment?

Tags:

c#

.net

html-agility-pack

Can Html Agility Pack be used to parse an html string fragment?

Such As:

var fragment = "<b>Some code </b>";

Then extract all  tags? All the examples I seen so far have been loading like html documents.

660

asked Mar 29 '10 05:03

chobo2

2 Answers

If it's html then yes.

string str = "<b>Some code</b>";
// not sure if needed
string html = string.Format("<html><head></head><body>{0}</body></html>", str);
HtmlDocument doc = new HtmlDocument();
doc.LoadHtml(html);

// look xpath tutorials for how to select elements
// select 1st <b> element
HtmlNode bNode = doc.DocumentNode.SelectSingleNode("b[1]");
string boldText = bNode.InnerText;

160

answered Oct 20 '22 08:10

Mike Koder

I dont think this is really the best use of HtmlAgilityPack.

Normally I see people trying to parse large amounts of html using regular expressions and I point them towards HtmlAgilityPack but in this case I think it would be better to use a regex.

Roy Osherove has a blog post describing how you can strip out all the html from a snippet:

http://weblogs.asp.net/rosherove/archive/2003/05/13/6963.aspx

Even if you did get the correct xpath with Mika Kolari's sample this would only work for a snippet with a tag in it and would break if the code changed.

answered Oct 20 '22 08:10

rtpHarry

Related questions
                            
                                How can I add programmability to my application
                            
                                What does the ? mean after a type? [duplicate]
                            
                                Searching Hierarchical List
                            
                                Multiple ways to define C# Enums with [Flags] attribute?
                            
                                Prevent other developers using base methods within a class
                            
                                WPF equivalent to Silverlight "RootVisual"
                            
                                Why does Interlocked.CompareExchange<T> only support reference types?
                            
                                JSON.NET to C# objects
                            
                                List all topics from a CHM file
                            
                                What's the best way to convert non-generic collection to a generic collection?
                            
                                C# N way merge for external sort
                            
                                Are there any open source C#-based non-blocking, event-based web server like Tornado? [closed]
                            
                                C#: How to prevent two instances of an application from doing the same thing at the same time?
                            
                                In a C# solution, Where do you declare solution-scope enums?
                            
                                How should comments for interface and class methods be different
                            
                                How to encode a DateTime in a QueryString and read it in the asp:QueryStringParameter
                            
                                How to get the list of forms in VS2008 C# project?
                            
                                How can I create my own form designer?
                            
                                Exception handling for events
                            
                                How to diagnose cause, fix, or work around Adobe ActiveX / COM related error 0x80004005 progmatically?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With