Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

r google search result count retrieve [closed]

Tags:

r

Searching google with keywords "health hospital" returns about 1,150,000,000 results. How can this count be obtained programmatically in R?

I have seen this link where they try to solve it using Java. How can this be done in R? An example code snippet would be appreciated.

Thanks.

like image 591
user6633625673888 Avatar asked May 12 '15 18:05

user6633625673888


1 Answers

Modifying just one line of the code found on theBioBucket blog post: Get No. of Google Search Hits with R and XML:

GoogleHits <- function(input)
   {
    require(XML)
    require(RCurl)
    url <- paste("https://www.google.com/search?q=",
                 input, sep = "") # modified line      
    CAINFO = paste(system.file(package="RCurl"), "/CurlSSL/ca-bundle.crt", sep = "")
    script <- getURL(url, followlocation = TRUE, cainfo = CAINFO)
    doc <- htmlParse(script)
    res <- xpathSApply(doc, '//*/div[@id="resultStats"]', xmlValue)
    cat(paste("\nYour Search URL:\n", url, "\n", sep = ""))
    cat("\nNo. of Hits:\n") # get rid of cat text if not wanted
    return(as.integer(gsub("[^0-9]", "", res)))
   }

# Example:
no.hits <- GoogleHits("health%20hospital")
#Your Search URL:
#https://www.google.com/search?q=health%20hospital
#
#No. of Hits:
no.hits
#[1] 1170000000

I changed the url assignment from

url <- paste("https://www.google.com/search?q=\"", input, "\"", sep = "")

to

url <- paste("https://www.google.com/search?q=", input, sep = "")
like image 168
Jota Avatar answered Oct 21 '22 04:10

Jota