Go Parse HTML table

Tags:

I have a table in html that I would like to parse. Something like the one in the following http://sprunge.us/IJUC However, I'm not sure of a good way to parse out the information. I've seen a couple of html parsers, but those seem to require that everything has a special tag for you to parse it like info to grab; however, the majority of my info is within <td></td>

Does anyone have a suggestion for parsing this information out?

898

asked Oct 14 '12 14:10

Joe P.

1 Answers

Shameless plug: My goquery library. It's the jQuery syntax brought to Go (requires Go's experimental html package, see instructions in the README of the library).

So you can do things like that (assuming your HTML document is loaded in doc, a *goquery.Document):

doc.Find("td").Each(func (i int, s *goquery.Selection) {
  fmt.Printf("Content of cell %d: %s\n", i, s.Text())
})

Edit: Change doc.Root.Find to doc.Find in the example since a goquery Document is now a Selection too (new in v0.2/master branch)

answered Sep 18 '22 13:09

mna

Related questions
                            
                                Question regarding Visible=false and display:none;
                            
                                What is the point of input without name in HTML5?
                            
                                Passing parameters on event listeners with loops
                            
                                How to give file name at base 64 images
                            
                                reading innerHTML of HTML form with VALUE attribute (& its value) of INPUT tags
                            
                                How to prevent an textfield losing focus using jQuery and JavaScript?
                            
                                How do you find out if an HTML element has a certain class with plain Javascript?
                            
                                Select all inputs, labels, selects etc within THIS - each loop
                            
                                Why does select have a slightly larger height than input[type=text]?
                            
                                How to cancel an image load after a period of time?
                            
                                How can I stop Firefox from caching the contents of a textarea on localhost?
                            
                                load external html file to div using jquery
                            
                                Best place to insert JavaScript within a HTML document [duplicate]
                            
                                How do I extend selection to word boundary using JavaScript, once only?
                            
                                Losing column widths when printing HTML table
                            
                                Internet Explorer 8 won't modify HTML5 tags in print stylesheet
                            
                                Create new HTML5 video element throught JavaScript
                            
                                How to create a box-shadow that covers the entire page?
                            
                                HTML5 Canvas size and resolution
                            
                                Can a user edit the page source, manipulate hidden field values and then post the form with those values?

Go Parse HTML table

Tags:

html

go

web-scraping

Joe P.

People also ask

1 Answers

mna

Recent Activity

Donate For Us