Java: Extract all links with a certain word in them with JSoup?

Might be an unclear question so here's the code and explanation:

    Document doc = Jsoup.parse(exampleHtmlData);

    Elements certainLinks = doc.select("a[href=google.com/example/]");

The String exampleHtmlData contains the HTML source of a certain site. This site has a lot of links that point to Google. A few examples:

http://google.com/example/hello 
http://google.com/example/certaindir/anotherdir/something
http://google.com/anotherexample

I want to use doc.select to extract all the links whose href contains google.com/example/. How do I do this with JSoup?

asked Jun 10 '12 20:06

ZimZim

1 Answer

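You can refer to the jsoup selector syntax. The selector [attr=value] in your code only matches when the whole attribute value equals the given text, while [attr*=value] matches any element whose attribute value contains it: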

    Document doc = Jsoup.parse(exampleHtmlData);
    Elements certainLinks = doc.select("a[href*=google.com/example/]");
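For anyone who wants to try this quickly, here is a minimal sketch that runs the same selector against the three example URLs from the question. The class name LinkFilterSketch, the method name exampleLinks, and the sample HTML are made up for illustration. It assumes jsoup is on the classpath, in a version recent enough to have Elements.eachAttr (1.10.2 or later, as far as I know).

```java
import java.util.List;

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.select.Elements;

// Hypothetical helper class, not part of jsoup.
class LinkFilterSketch {

    // Returns the href value of every <a> whose href contains "google.com/example/".
    static List<String> exampleLinks(String html) {
        Document doc = Jsoup.parse(html);
        // [href*=...] is a substring match on the attribute value.
        Elements certainLinks = doc.select("a[href*=google.com/example/]");
        return certainLinks.eachAttr("href");
    }

    public static void main(String[] args) {
        // Sample markup built from the URLs in the question (an assumption about the page).
        String exampleHtmlData =
              "<a href=\"http://google.com/example/hello\">one</a>"
            + "<a href=\"http://google.com/example/certaindir/anotherdir/something\">two</a>"
            + "<a href=\"http://google.com/anotherexample\">three</a>";

        for (String href : exampleLinks(exampleHtmlData)) {
            System.out.println(href);
        }
    }
}
```

With that sample markup, the first two links should be printed and http://google.com/anotherexample should be skipped, since its href does not contain google.com/example/.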

answered Oct 23 '22 10:10

Akhi


Tags: java, html, parsing, jsoup
