Best way to fetch a varying HTML tag

3 Answers

The answer is: don't use regular expressions.

Seriously. Use an SGML parser, or an XML parser if you happen to know the input is valid XML (which is almost never the case). Roll your own with regular expressions and you will absolutely screw up and waste tons of time trying to get it right. Just use what's already available.
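To make the advice concrete, here is a minimal sketch using Python's standard-library `html.parser`, which tolerates non-XML HTML. The class name `TagCollector` and the helper `find_tags` are made up for this illustration, and the sample markup is an assumption, not something taken from the question.

```python
from html.parser import HTMLParser


class TagCollector(HTMLParser):
    """Collect the attributes of every start tag with a given name."""

    def __init__(self, wanted):
        super().__init__()
        self.wanted = wanted  # tag name to look for, in lowercase
        self.found = []       # one dict of attributes per matching tag

    def handle_starttag(self, tag, attrs):
        # The parser lowercases tag and attribute names and strips the
        # quotes from attribute values, so quoting style does not matter here.
        if tag == self.wanted:
            self.found.append(dict(attrs))


def find_tags(html, wanted):
    """Return a list of attribute dicts for each <wanted ...> tag in html."""
    parser = TagCollector(wanted)
    parser.feed(html)
    parser.close()
    return parser.found


# Hypothetical sample input: same tag, different case, quoting and attributes.
sample = '<span class="x" id=a>hi</span><SPAN id=\'b\'>there</SPAN>'
print(find_tags(sample, "span"))
```

The parser does the tokenizing, so the calling code only ever sees normalized tag names and attribute values instead of raw text.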

101

answered Nov 08 '22 01:11

Brad Wilson

Actually, you should probably use some sort of HTML parser that lets you inspect each node (and therefore its attributes) in the DOM of the page. I haven't used any of these for a while, so I don't know the pros and cons, but here's a list: http://java-source.net/open-source/html-parsers

answered Nov 08 '22 02:11

martinatime

Those differences are not significant according to the XHTML standard.

In other words, the variants are exactly the same thing.

Also, replacing the double quotes with single quotes would give the same result.

The typical way of 'normalizing' an XML document is to parse it using some API that treats the document as its Infoset representation. Both DOM and SAX style APIs work that way.

If you want to parse them by hand (or with a regex) you have to replicate all of that in your own code and, in my opinion, that's not practical.
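As a rough illustration of that normalization, the sketch below parses two surface variants with Python's `xml.etree.ElementTree` and compares the results. It assumes the fragments are well-formed XML (as XHTML should be); the helper name `equivalent` and the sample markup, including the swapped attribute order, are assumptions made for this example.

```python
import xml.etree.ElementTree as ET


def equivalent(markup_a, markup_b):
    """Return True if two single-element fragments parse to the same
    tag name, attribute set and text, whatever their surface syntax."""
    a = ET.fromstring(markup_a)
    b = ET.fromstring(markup_b)
    # attrib is a plain dict, so attribute order and quote style
    # have already disappeared by the time we compare.
    return (
        a.tag == b.tag
        and a.attrib == b.attrib
        and (a.text or "") == (b.text or "")
    )


# Hypothetical variants: different quotes, attribute order and spacing.
first = '<a href="x.html" class="nav">Home</a>'
second = "<a class='nav'   href='x.html' >Home</a>"
print(equivalent(first, second))
```

This only compares one element at a time and will raise `ParseError` on markup that is not well-formed, which is exactly the case where a lenient HTML parser is the better tool.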

answered Nov 08 '22 03:11

Sergio Acosta

Tags: language-agnostic, html, regex

pek
