How to use ScrapySharp to parse elements in an html document?

Tags:

html

c#

html-agility-pack

web-scraping

scrapysharp

Here's the project's official "Documentation":

https://bitbucket.org/rflechner/scrapysharp/wiki/Home

No matter what I try, I can't find the CssSelect() method that the library is supposed to add to make querying things easier. Here's what I've tried:

using ScrapySharp.Core;
using ScrapySharp.Html.Parsing;
using HtmlAgilityPack;

HtmlWeb web = new HtmlWeb();
HtmlDocument doc = web.Load("http://www.stackoverflow.com");

var page = doc.DocumentNode.SelectSingleNode("//body");
page.CssSel???

Exactly how do I use this library? In the documentation it isn't clear what type html is.

asked Mar 31 '13 01:03

sergserg

1 Answer

Add

using ScrapySharp.Extensions;

It looks like you're missing that. CssSelect is an extension method defined in that namespace, so importing it should make CssSelect available on your nodes.

In case an example helps, here's a method I use in a project:

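private string GetPdfUrl(HtmlDocument document, string baseUrl)
{
    // Single() needs "using System.Linq;" and throws unless exactly one node matches.
    var href = document.DocumentNode
        .CssSelect(".table-of-content .head-row td.download a.text-pdf")
        .Single().Attributes["href"].Value;
    return new Uri(new Uri(baseUrl), href).ToString(); // resolve relative links against baseUrl
}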
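
To tie this back to the snippet in the question, here is a minimal, self-contained sketch of the same idea. It assumes the ScrapySharp and HtmlAgilityPack packages are referenced. The class name CssSelectSketch, the helper ExtractNavLinks, the sample markup and the .nav class are all made up for illustration. It parses HTML from a string instead of calling web.Load, so it doesn't depend on a live site. CssSelect is called on an HtmlNode (doc.DocumentNode here) and returns the matching nodes as a sequence.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using HtmlAgilityPack;
using ScrapySharp.Extensions; // brings the CssSelect extension methods into scope

public static class CssSelectSketch
{
    // Hypothetical helper: collects the href of every <a> nested inside
    // an element with the (made-up) class "nav".
    public static List<string> ExtractNavLinks(string html)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html); // parse from a string, so no network access is needed

        return doc.DocumentNode
            .CssSelect(".nav a") // descendant selector, evaluated by ScrapySharp
            .Select(a => a.GetAttributeValue("href", string.Empty))
            .ToList();
    }

    public static void Main()
    {
        // Sample markup invented for this sketch.
        const string html =
            "<html><body><ul class='nav'>" +
            "<li><a href='/questions'>Questions</a></li>" +
            "<li><a href='/tags'>Tags</a></li>" +
            "</ul><a href='/other'>Other</a></body></html>";

        foreach (var href in ExtractNavLinks(html))
            Console.WriteLine(href);
    }
}
```

The same call should work on the page variable from the question, since SelectSingleNode also returns an HtmlNode. For example, page.CssSelect("a") would enumerate the links under the body element.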

answered Sep 30 '22 17:09

Ben Allred
