Extract main domain name from a given url

Tags:

I used the following to extract the domain from a url: (They are test cases)

String regex = "^(ww[a-zA-Z0-9-]{0,}\\.)";
ArrayList<String> cases = new ArrayList<String>();
cases.add("www.google.com");
cases.add("ww.socialrating.it");
cases.add("www-01.hopperspot.com");
cases.add("wwwsupernatural-brasil.blogspot.com");
cases.add("xtop10.net");
cases.add("zoyanailpolish.blogspot.com");

for (String t : cases) {  
    String res = t.replaceAll(regex, "");  
}

I can get the following results:

google.com
hopperspot.com
socialrating.it
blogspot.com
xtop10.net
zoyanailpolish.blogspot.com

The first four cases are good. The last one is not good. What I want is: blogspot.com for the last one, but it gives zoyanailpolish.blogspot.com. What am I doing wrong?

799

asked Aug 27 '11 20:08

chnet

1 Answers

Using Guava library, we can easily get domain name:

InternetDomainName.from(tld).topPrivateDomain()

Refer API link for more details

https://google.github.io/guava/releases/14.0/api/docs/

http://docs.guava-libraries.googlecode.com/git/javadoc/com/google/common/net/InternetDomainName.html

175

answered Oct 13 '22 13:10

Satya

Related questions
                            
                                Is it possible to block/deny a cast conversion in Java?
                            
                                Can a transient field in a class be obtained using reflection
                            
                                Placing component on Glass Pane
                            
                                Does the main method belong to any class?
                            
                                Representing float values in Java
                            
                                Are Java Beans as data storage classes bad design?
                            
                                Hibernate, single table inheritance and using field from superclass as discriminator column
                            
                                Good learner book for a 12 year old? [closed]
                            
                                Best way to get integer part of the string "600sp"?
                            
                                Java Get Default UI Colors
                            
                                Error when defining inner classes in a Test class in JUnit
                            
                                How to set transparent background of JDialog
                            
                                Get page content from Apache Commons HTTP Request
                            
                                How to close the window in AWT?
                            
                                How to use java.util.Arrays
                            
                                Cloning an Integer
                            
                                Why toBinaryString is not an instance method in Integer class?
                            
                                Get contact name given a phone number in Android
                            
                                Regex to remove comments in XML file in Eclipse java
                            
                                org.hibernate.AnnotationException: Collection has neither generic type or OneToMany.targetEntity()

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Extract main domain name from a given url

Tags:

java

regex

url

domain-name

chnet

People also ask

1 Answers

Satya

Recent Activity

Donate For Us