Extract links from a web page using Go lang

2 Answers

If you know jQuery, you'll love GoQuery.

Honestly, it's the easiest, most powerful HTML utility I've found in Go, and it's based off of the html package in the go.net repository. (Okay, so it's higher-level than just a parser as it doesn't expose raw HTML tokens and the like, but if you want to actually get anything done with an HTML document, this package will help.)

answered Sep 23 '22 11:09

Matt

Go's standard package for HTML parsing is still a work in progress and is not part of the current release. A third party package you might try though is go-html-transform. It is being actively maintained.

answered Sep 19 '22 11:09

Sonia

Related questions
                            
                                Parsing the html meta tag with jsoup library
                            
                                Need python lxml syntax help for parsing html
                            
                                How to read HTML as XML?
                            
                                Beautiful Soup 4: Remove comment tag and its content
                            
                                Importing bs4 in Python 3.5
                            
                                HTML parsing in perl
                            
                                How to fix this AttributeError?
                            
                                Using HTMLParser in Python 3.2
                            
                                What does HTML Parsing mean? [closed]
                            
                                Unclosed / misnested HTML tags extend past their parent
                            
                                Building an HTML Diff/Patch Algorithm
                            
                                Extending CSS selectors in BeautifulSoup
                            
                                How to extract a JSON object that was defined in a HTML page javascript block using Python?
                            
                                BeautifulSoup HTML table parsing
                            
                                How to extract separate text nodes with Jsoup?
                            
                                Get (text) in XPath
                            
                                Using DOMDocument, is it possible to get all elements that exists within a certain DOM?
                            
                                extracting element and insert a space
                            
                                JSoup.connect throws 403 error while apache.httpclient is able to fetch the content
                            
                                How can I use regular expression to grab an 'img' tag?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Extract links from a web page using Go lang

Tags:

html-parsing

go

Jifeng Zhang

People also ask

2 Answers

Matt

Sonia

Recent Activity

Donate For Us