Searching google with keywords "health hospital" returns about 1,150,000,000 results. How can this count be obtained programmatically in R?
I have seen this link where they try to solve it using Java. How can this be done in R? An example code snippet would be appreciated.
Thanks.
Modifying just one line of the code found on theBioBucket blog post: Get No. of Google Search Hits with R and XML:
GoogleHits <- function(input)
{
require(XML)
require(RCurl)
url <- paste("https://www.google.com/search?q=",
input, sep = "") # modified line
CAINFO = paste(system.file(package="RCurl"), "/CurlSSL/ca-bundle.crt", sep = "")
script <- getURL(url, followlocation = TRUE, cainfo = CAINFO)
doc <- htmlParse(script)
res <- xpathSApply(doc, '//*/div[@id="resultStats"]', xmlValue)
cat(paste("\nYour Search URL:\n", url, "\n", sep = ""))
cat("\nNo. of Hits:\n") # get rid of cat text if not wanted
return(as.integer(gsub("[^0-9]", "", res)))
}
# Example:
no.hits <- GoogleHits("health%20hospital")
#Your Search URL:
#https://www.google.com/search?q=health%20hospital
#
#No. of Hits:
no.hits
#[1] 1170000000
I changed the url
assignment from
url <- paste("https://www.google.com/search?q=\"", input, "\"", sep = "")
to
url <- paste("https://www.google.com/search?q=", input, sep = "")
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With