Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Read specific div from HttpResponse

I am sending 1 httpWebRequest and reading the response. I am getting full page in the response. I want to get 1 div which is names ad Rate from the response. So how can I match that pattern?

My code is like:

    HttpWebRequest WebReq = (HttpWebRequest)WebRequest.Create("http://www.domain.com/");
    HttpWebResponse WebResp = (HttpWebResponse)WebReq.GetResponse();
    Stream response = WebResp.GetResponseStream();
    StreamReader data = new StreamReader(response);
    string result = data.ReadToEnd();

I am getting response like:

<HTML><BODY><div id="rate">Todays rate 55 Rs.</div></BODY></HTML>

I want to read data of div rate. i.e. I should get Content "Todays rate 55 Rs."

So how can I make regex for this???

like image 706
Kishan Gajjar Avatar asked Feb 22 '11 07:02

Kishan Gajjar


2 Answers

The HTML Agility Pack can load and parse the file for you, no need for messy streams and responses:

HtmlWeb web = new HtmlWeb();
HtmlDocument doc = web.Load("http://jsbin.com/owobe3");
HtmlNode rateNode = doc.DocumentNode.SelectSingleNode("//div[@id='rate']");
string rate = rateNode.InnerText;
like image 183
Kobi Avatar answered Sep 26 '22 14:09

Kobi


You should read the entire response and then use something like the Html Agility Pack to parse the response and extract the bits you want in an xpath-like syntax:

HtmlDocument doc = new HtmlDocument();
doc.LoadHtml(result);
var output = doc.DocumentNode.SelectSingleNode("//div[@id='rate']").InnerHtml;

Dont use regular expressions!

like image 43
Justin Avatar answered Sep 26 '22 14:09

Justin