 

Search in html source with GOOGLE? [closed]

I have several websites, and I can't remember where I wrote some lines of code. Since my pages are indexed by Google, I would like to know whether Google offers a way to search within the HTML source code/mark-up itself, instead of only within the visible, rendered part of a page.

Thanks

Entretoize asked Nov 24 '10 10:11





1 Answer

I've come across the following resources on my travels (some already mentioned above):

HTML Mark-up-focused search engines

  • Nerdydata

I'd also like to throw in the following:

Huge, website crawl data archives

  • Common Crawl - 'years of free web page data to help change the world' (250TB+)

How can we analyze this crawl data?

For an idea of how to begin analyzing some of this massive data, take a look at Big Data/MapReduce-type framework(s).
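To make the map/reduce idea concrete, here is a minimal single-machine sketch in plain Python. It is not tied to any particular framework: the function names, the sample pages and the snippet being searched for are all made up for illustration. The map step emits a (host, 1) pair for every page whose raw HTML contains the snippet, and the reduce step sums those pairs per host.

```python
from collections import defaultdict
from urllib.parse import urlparse


def map_page(url, html, snippet):
    """Map step: emit (host, 1) when the raw HTML contains the snippet."""
    if snippet in html:
        yield urlparse(url).netloc, 1


def reduce_counts(pairs):
    """Reduce step: sum the emitted counts per host."""
    totals = defaultdict(int)
    for host, count in pairs:
        totals[host] += count
    return dict(totals)


def count_matches(pages, snippet):
    """Run the map step over every page, then reduce the emitted pairs."""
    pairs = (
        pair
        for url, html in pages
        for pair in map_page(url, html, snippet)
    )
    return reduce_counts(pairs)


# Hypothetical sample data standing in for crawled pages.
PAGES = [
    ("http://example.com/a", '<div id="promo-banner">Hi</div>'),
    ("http://example.com/b", "<p>Nothing here</p>"),
    ("http://example.org/", '<span id="promo-banner"></span>'),
]

if __name__ == "__main__":
    print(count_matches(PAGES, 'id="promo-banner"'))
```

A real framework would spread the map step across many machines and shuffle the pairs to the reducers for you, but the shape of the two functions stays roughly the same.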

Google lists some ideas on using Apache's Spark project to analyze Common Crawl's dump(s). To understand the file format(s) used by Common Crawl, refer to the following:

  • So you’re ready to get started [with Common Crawl]
  • Navigating the WARC file format [by Common Crawl]
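As a rough illustration of what searching the raw mark-up in a WARC file could look like, here is a small sketch using only the Python standard library. It assumes an uncompressed WARC stream and relies on each record's Content-Length header to find the payload; the helper names (`iter_warc_records`, `find_in_source`, `make_record`) and the sample records are hypothetical. Real Common Crawl files are gzip-compressed, so they would need to be opened with something like `gzip.open` first, and a dedicated WARC library would be more robust than this hand-rolled parser.

```python
import io


def iter_warc_records(stream):
    """Yield (headers, payload) for each record in an uncompressed WARC stream."""
    while True:
        line = stream.readline()
        if not line:
            return  # end of stream
        if not line.strip():
            continue  # blank lines separate consecutive records
        if not line.startswith(b"WARC/"):
            raise ValueError("expected a WARC version line, got %r" % line)
        headers = {}
        while True:
            line = stream.readline()
            if line in (b"\r\n", b"\n", b""):
                break  # an empty line ends the header block
            name, _, value = line.decode("utf-8", "replace").partition(":")
            headers[name.strip().lower()] = value.strip()
        length = int(headers.get("content-length", 0))
        yield headers, stream.read(length)


def find_in_source(stream, snippet):
    """Return target URIs of response records whose raw payload contains snippet.

    The payload of a response record includes the HTTP headers as well as
    the HTML body, so a match may come from either.
    """
    needle = snippet.encode("utf-8")
    return [
        headers.get("warc-target-uri")
        for headers, payload in iter_warc_records(stream)
        if headers.get("warc-type") == "response" and needle in payload
    ]


def make_record(uri, html):
    """Build a tiny, hypothetical response record for demonstration only."""
    body = ("HTTP/1.1 200 OK\r\nContent-Type: text/html\r\n\r\n" + html).encode("utf-8")
    head = (
        "WARC/1.0\r\n"
        "WARC-Type: response\r\n"
        f"WARC-Target-URI: {uri}\r\n"
        f"Content-Length: {len(body)}\r\n"
        "\r\n"
    ).encode("utf-8")
    return head + body + b"\r\n\r\n"


if __name__ == "__main__":
    sample = make_record("http://example.com/a", '<div class="old-widget">x</div>')
    sample += make_record("http://example.com/b", "<p>hi</p>")
    print(find_in_source(io.BytesIO(sample), 'class="old-widget"'))
```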

The article, Accessing-Common-Crawl-Dataset-on-S3, outlines accessing Common Crawl's 250TB+ dump(s) in a low-cost manner without transferring that data load outside of Amazon's AWS/S3 network. Of course, that assumes you are going to use some combination of AWS/EC2/S3 etc. to analyze the crawl data.

Finally, Patrick Durusau maintains some interesting Common-Crawl-usage-related blog pages.

Personally, I find this subject intriguing. I suggest we get this crawl data while it's HOT! ;-)

Big Rich answered Sep 23 '22 05:09
