
Facebook's Lint is crawling root URL rather than the given URL

There are previous posts about this on the web, but none of the proposed solutions work for my scenario.

I have child pages that each have their own unique og tags, and a parent root-domain page with its own og tags. All of them should be crawled by Facebook so people can see the descriptions/images when the pages are posted.

However, when I enter a child URL in the debugger's box, the results plainly show that Facebook crawled the parent page instead. Even the "see exactly what our scraper sees" page confirms this, because it shows only the parent template.

Here is a link to Facebook's scraper: https://developers.facebook.com/tools/debug

Suppose the parent is www.me.com and one child is www.me.com/path/path. Here are my og tags in the root domain HTML template:

<head> ...

    <meta name="keywords" content="...">

    <meta name="description" content="..." />

    <meta property="fb:app_id" content="..."/>
    <meta property="og:site_name" content="..." />
    <meta property="og:type" content="website" />
    <meta property="og:title" content="..." />
    <meta property="og:image" content="..." />
    <meta property="og:description" content="..." />
    <meta property="og:url" content="http://www.me.com" />

Here are the og tags in the child html template:

<head> ...

    <meta name="keywords" content="...">

    <meta name="description" content="..." />

    <meta property="fb:app_id" content="..."/>
    <meta property="og:site_name" content="..." />
    <meta property="og:type" content="article" />
    <meta property="og:title" content="..." />
    <meta property="og:image" content="..." />
    <meta property="og:description" content="..." />
    <meta property="og:url" content="http://www.me.com/path/path" />

This is very frustrating because I've tried removing and adding many of the tags in different combinations, and every single time the www.me.com parent gets crawled while the child is ignored!

I tried to mimic the tag order of other sites where this feature works, like cnn.com, but Facebook still refuses to crawl my child page.

Any help? Thank you.

Lucas Ou-Yang asked Jul 22 '13 01:07



1 Answer

Check that you're not

  • Rejecting the crawler's attempt to reach the child page
  • Redirecting the child page to the parent page when the crawler tries to load it
  • Including an og:url meta tag on the child page pointing to the homepage
  • Including a canonical link tag (<link rel="canonical">) in the child page pointing to the parent

Those are by far the most likely reasons for this. Though you've said in your question that your og:url tags are correct, in my experience that's the most likely problem.
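If it helps, here is a rough sketch of how you might check those points yourself rather than relying on the debugger output. It assumes Python's standard library only; the names `fetch_as_crawler`, `find_problems` and `ShareTagParser` are made up for this example, and the User-Agent string is the one Facebook's crawler is generally known to send. Treat it as a starting point, not a definitive test of what Facebook does.

```python
import urllib.request
from html.parser import HTMLParser
from urllib.parse import urlsplit

# User-Agent commonly sent by Facebook's crawler (assumed here for illustration).
CRAWLER_UA = "facebookexternalhit/1.1 (+http://www.facebook.com/externalhit_uatext.php)"


class ShareTagParser(HTMLParser):
    """Collects the og:url and rel=canonical values from an HTML document."""

    def __init__(self):
        super().__init__()
        self.og_url = None
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "meta" and attrs.get("property") == "og:url":
            self.og_url = attrs.get("content")
        elif tag == "link" and (attrs.get("rel") or "").lower() == "canonical":
            self.canonical = attrs.get("href")


def _normalize(url):
    # Compare host and path only, ignoring scheme and a trailing slash.
    parts = urlsplit(url)
    return (parts.netloc.lower(), parts.path.rstrip("/"))


def find_problems(page_url, html):
    """Return a list of tags in `html` that point somewhere other than `page_url`."""
    parser = ShareTagParser()
    parser.feed(html)
    problems = []
    for name, value in (("og:url", parser.og_url), ("canonical", parser.canonical)):
        if value is not None and _normalize(value) != _normalize(page_url):
            problems.append(f"{name} points to {value}, not {page_url}")
    return problems


def fetch_as_crawler(url):
    """Fetch `url` with the crawler's User-Agent; return (final_url, html).

    urlopen follows redirects, so a final_url that differs from `url` means
    the server redirected the request. A rejected request (e.g. 403) raises
    urllib.error.HTTPError.
    """
    request = urllib.request.Request(url, headers={"User-Agent": CRAWLER_UA})
    with urllib.request.urlopen(request, timeout=10) as response:
        html = response.read().decode("utf-8", errors="replace")
        return response.geturl(), html
```

You would call `fetch_as_crawler("http://www.me.com/path/path")`, compare the returned final URL with the one you requested to spot a redirect, and then pass the HTML to `find_problems` to see whether og:url or the canonical link points back at the parent.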

Igy answered Sep 30 '22 11:09
