What is the best practice to parse html in swift?

1 Answers

There are several nice libraries of HTML Parsing using Swift and Objective-C like the followings:

hpple
NDHpple
Kanna( old Swift-HTML-Parser)
Fuzi
SwiftSoup
Ji

Take a look in the following examples in the four libraries posted above, mainly parsed using XPath 2.0:

hpple:

let data = NSData(contentsOfFile: path) let doc = TFHpple(htmlData: data)  if let elements = doc.searchWithXPathQuery("//a/@href[ends-with(.,'.txt')]") as? [TFHppleElement] {    for element in elements {        println(element.content)    } }

NDHpple:

let data = NSData(contentsOfFile: path)! let html = NSString(data: data, encoding: NSUTF8StringEncoding)! let doc = NDHpple(HTMLData: html) if let elements = doc.searchWithXPathQuery("//a/@href[ends-with(.,'.txt')]") {    for element in elements {      println(element.children?.first?.content)    } }

Kanna (Xpath and CSS Selectors):

let html = "<html><head></head><body><ul><li><input type='image' name='input1' value='string1value' class='abc' /></li><li><input type='image' name='input2' value='string2value' class='def' /></li></ul><span class='spantext'><b>Hello World 1</b></span><span class='spantext'><b>Hello World 2</b></span><a href='example.com'>example(English)</a><a href='example.co.jp'>example(JP)</a></body>"  if let doc = Kanna.HTML(html: html, encoding: NSUTF8StringEncoding) {    var bodyNode   = doc.body     if let inputNodes = bodyNode?.xpath("//a/@href[ends-with(.,'.txt')]") {       for node in inputNodes {          println(node.contents)       }    } }

Fuzi (Xpath and CSS Selectors):

let html = "<html><head></head><body><ul><li><input type='image' name='input1' value='string1value' class='abc' /></li><li><input type='image' name='input2' value='string2value' class='def' /></li></ul><span class='spantext'><b>Hello World 1</b></span><span class='spantext'><b>Hello World 2</b></span><a href='example.com'>example(English)</a><a href='example.co.jp'>example(JP)</a></body>"  do {   // if encoding is omitted, it defaults to NSUTF8StringEncoding   let doc = try HTMLDocument(string: html, encoding: NSUTF8StringEncoding)    // XPath queries   for anchor in doc.xpath("//a/@href[ends-with(.,'.txt')]") {     print(anchor.stringValue)   }  } catch let error {     print(error) }

The ends-with function is part of Xpath 2.0.

SwiftSoup (CSS Selectors):

do{     let doc: Document = try SwiftSoup.parse("...")     let links: Elements = try doc.select("a[href]") // a with href     let pngs: Elements = try doc.select("img[src$=.png]")      // img with src ending .png     let masthead: Element? = try doc.select("div.masthead").first()      // div with class=masthead     let resultLinks: Elements? = try doc.select("h3.r > a") // direct a after h3 } catch Exception.Error(let type, let message){     print(message) } catch {    print("error") }

Ji (XPath):

let jiDoc = Ji(htmlURL: URL(string: "http://www.apple.com/support")!) let titleNode = jiDoc?.xPath("//head/title")?.first print("title: \(titleNode?.content)") // title: Optional("Official Apple Support")

I hope this helps you.

167

answered Sep 26 '22 00:09

Victor Sigler

Related questions
                            
                                XPath to select a table row that has a cell containing specified text
                            
                                Rendering HTML elements to <canvas>
                            
                                JavaScript 'onclick' event 'return' keyword functionality
                            
                                CSS override rules and specificity
                            
                                HTML + CSS: Ordered List without the Period?
                            
                                How do I remove outline on link click?
                            
                                HTML 'td' width and height
                            
                                ng-click on parent clicks through children
                            
                                how to get files from <input type='file' .../> (Indirect) with javascript
                            
                                Inline SVG vs SVG File Performance
                            
                                diff to html (diff2html) program [closed]
                            
                                Activate CSS3 animation when the content scrolls into view
                            
                                jquery .slideToggle() horizontal alternative?
                            
                                How to play CSS3 transitions in a loop?
                            
                                Wrap link <a> around <div>
                            
                                Make a <td> span the entire row in a table
                            
                                How can I convert an HTML element to a canvas element?
                            
                                Setting button text via JavaScript [duplicate]
                            
                                how can I display tooltip or item information on mouse over?
                            
                                Get the value of input text when enter key pressed

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What is the best practice to parse html in swift?

Tags:

html

parsing

swift

amazingbasil

People also ask

1 Answers

Victor Sigler

Recent Activity

Donate For Us