Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Site not valid - but it is

Tags:

url

facebook

dns

So, I'm building a website called "dagbok.nu", which is swedish for "diary now" :)

Anyway, when creating the Facebook application, it claims that the site URL is invalid as well as the app domain. For site url, I used "http://dagbok.nu" and for site domain, I used "dagbok.nu". Please don't reply (as I've seen others do on similar issues) that I should type the site url with the scheme and the domain without - that's exactly what I'm doing.

Right, so according to another question here, one could trouble shoot this functionality using FB's own URL scraper, so I did just that:

http://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Fdagbok.nu

And the reply: Error Parsing URL: Error parsing input URL, no data was scraped Right, so now I can assume that the reason for it being considered invalid is because of FB not being able to scrape the URL. But why?

According to this question, one of the reasons seems to be that FB has deemed the URL insecure or "spammy". I've acquired this domain from a previous owner so this wasn't all that impossible. But when doing the same thing as Matthew in that post - i.e. trying to post in my timeline using the domain "http://dagbok.nu", I didn't get any information. The status box expanded as if to include a thumbnail and information about the link, but it only contained a "(No title)" text and nothing more. Screengrab

So now I don't know what to do. I've tried to check the DIG and NS records from multiple servers around the web, and everyone seems to resolve it correctly, and I've had friends double check the URL from the states as well. I can't understand what's wrong and I have no idea how to ask someone at FB how to resolve this. Does anyone here have a good advice for this? Thanks in advance! :)

EDIT When changing the domain to another domain that points to the exact same web server and document_root, it works! So this is definitely a problem with the domain "dagbok.nu" and not with the code on that page.

EDIT When using the debug function above - I see no activity in the server log what so ever. Facebook doesn't even contact the server. When using the alternate url - the one from the last edit, it pops up in the logs as it should.

EDIT I filed a bug report with Facebook, And their first response was that they were going to follow up. Now, a month later, I got an email that said "We are prioritizing bugs based on impact to the developer community. As this bug report has not received much attention from other developers, we are closing it so as to better focus on the top issues", and then they told me to go here to stackoverflow to try to solve my issue - but the issue is WITH THEM, and of course no one else have reported that my site doesn't work, it affects only me, and I haven't opened it yet due to this bug!

EDIT I wanted to file a new bug report, but I can't even that now, since they are blocking bug reports with this URL as well!

I had to edit the URL - here is the new bug report

like image 246
Sandman Avatar asked Apr 17 '12 11:04

Sandman


People also ask

What does it mean if the URL is not valid?

An invalid URL is a web address that didn't work as expected. This link is faulty, and on its response, your browser will show you an error message. Even the users of WordPress may also face this not loading issue. You may have encountered many error notices while browsing.

Why is URL not working?

This could be because your server is down, or there is an issue with the DNS routing to your domain. Make sure that your domain is resolving correctly and try again. Before we crawled the pages of your site, we tried to check your robots. txt file to ensure we didn't crawl any pages that you had roboted out.

What is a valid site URL?

A URL is a valid URL if at least one of the following conditions holds: The URL is a valid URI reference [RFC3986]. The URL is a valid IRI reference and it has no query component. [RFC3987] The URL is a valid IRI reference and its query component contains no unescaped non-ASCII characters.


4 Answers

When Facebook tries to scrap your site for information, they send a call to your server with specific user agent called "facebookexternalhit"...

Facebook needs to scrape your page to know how to display it around the site.

Facebook scrapes your page every 24 hours to ensure the properties are up to date. The page is also scraped when an admin for the Open Graph page clicks the Like button and when the URL is entered into the Facebook URL Linter. Facebook observes cache headers on your URLs - it will look at "Expires" and "Cache-Control" in order of preference. However, even if you specify a longer time, Facebook will scrape your page every 24 hours.

The user agent of the scraper is: "facebookexternalhit/1.1(+http://www.facebook.com/externalhit_uatext.php)"

  1. Make sure it is not blocked by your server firewall
  2. Look in your server log if it even tried to access your site
  3. If you think this is a firewall issue look at this link
like image 107
Roni Avatar answered Nov 15 '22 09:11

Roni


Your problem appears to be with your character encoding string. Your Apache server is currently sending the unsupported string latin1. You've defined your meta:content-type as iso-8859-1. See the w3c validator

From what I've seen, the Facebook parser will stop immediately if it encounters either an unrecognized character encoding string or a mismatch in character encoding strings between your header and meta tags.

The problem could be originating from either your httpd.conf or php.ini files. Change these to match your meta and restart Apache. Since the problem seems to be domain-specific, I'd check httpd.conf first.

like image 32
cpilko Avatar answered Nov 15 '22 09:11

cpilko


Could your domain be blacklisted? Could you try messaging your url to someone, and see if Facebook gives you a "This message contains blocked content..." error?

For example:

Message Failed box

like image 3
Alex L Avatar answered Nov 15 '22 10:11

Alex L


If you don't provide certain minimum Facebook markup on your page, it will respond with "Error Parsing URL: Error parsing input URL, no data was scraped." I only looked at the homepage, but it appears that dagbok.nu contains no Facebook markup. I'm not sure what things must be present at minimum, but in my implementation, I assume the fb:app_id meta tag and the JavaScript SDK script must be there. You may want to take a look at http://developers.facebook.com/docs/guides/web/#plugins , particularly the Authentication section.

I discovered your question because I had this same error today for an unknown reason. I found that it was caused because the content of my og:image meta tag used an incorrect URL to the image I was trying to use. So as you add Facebook markup to your page, make sure your values are correct or you may continue to receive this message.

like image 2
NickdeClaw Avatar answered Nov 15 '22 08:11

NickdeClaw