PHP has a simple command to get meta tags of a webpage (get_meta_tags), but this only works for meta tags with name attributes. However, Open Graph Protocol is becoming more and more popular these days. What is the easiest way to get the values of opg from a webpage. For example: <pre class="prettyprint"><code><meta property="og:url" content=""> <meta property="og:title" content=""> <meta property="og:description" content=""> <meta property="og:type" content=""> </code></pre> The basic way I see is to get the page via cURL and parse it with regex. Any idea?

How about: <pre class="prettyprint"><code>preg_match_all('~<\s*meta\s+property="(og:[^"]+)"\s+content="([^"]*)~i', $str, $matches); </code></pre> So, yes, grab the page with any way you can and parse with regex

How to get Open Graph Protocol of a webpage by php?

Tags:

html

regex

php

PHP has a simple command to get meta tags of a webpage (get_meta_tags), but this only works for meta tags with name attributes. However, Open Graph Protocol is becoming more and more popular these days. What is the easiest way to get the values of opg from a webpage. For example:

<meta property="og:url" content=""> 
<meta property="og:title" content=""> 
<meta property="og:description" content=""> 
<meta property="og:type" content="">

The basic way I see is to get the page via cURL and parse it with regex. Any idea?

596

asked Sep 17 '11 12:09

Googlebot

3 Answers

Really simple and well done:

Using https://github.com/scottmac/opengraph

$graph = OpenGraph::fetch('http://www.avessotv.com.br/bastidores-pantene-institute-experience-pg.html'); print_r($graph);

Will return

OpenGraph Object

(     [_values:OpenGraph:private] => Array         (             [type] => article             [video] => http://www.avessotv.com.br/player/flowplayer/flowplayer-3.2.7.swf?config=%7B%27clip%27%3A%7B%27url%27%3A%27http%3A%2F%2Fwww.avessotv.com.br%2Fmedia%2Fprogramas%2Fpantene.flv%27%7D%7D             [image] => /wp-content/thumbnails/9025.jpg             [site_name] => Programa Avesso - Bastidores             [title] => Bastidores Ã¢Â€ÂœPantene Institute ExperienceÃ¢Â€Â P&G             [url] => http://www.avessotv.com.br/bastidores-pantene-institute-experience-pg.html             [description] => Confira os bastidores do Pantene Institute Experience, da Procter &#038; Gamble. www.pantene.com.br Mais imagens:         )      [_position:OpenGraph:private] => 0 )

answered Sep 16 '22 13:09

Guilherme Viebig

When parsing data from HTML, you really shouldn't use regex. Take a look at the DOMXPath Query function.

Now, the actual code could be :

[EDIT] A better query for XPath was given by Stefan Gehrig, so the code can be shortened to :

libxml_use_internal_errors(true); // Yeah if you are so worried about using @ with warnings
$doc = new DomDocument();
$doc->loadHTML($html);
$xpath = new DOMXPath($doc);
$query = '//*/meta[starts-with(@property, \'og:\')]';
$metas = $xpath->query($query);
$rmetas = array();
foreach ($metas as $meta) {
    $property = $meta->getAttribute('property');
    $content = $meta->getAttribute('content');
    $rmetas[$property] = $content;
}
var_dump($rmetas);

Instead of :

$doc = new DomDocument();
@$doc->loadHTML($html);
$xpath = new DOMXPath($doc);
$query = '//*/meta';
$metas = $xpath->query($query);
$rmetas = array();
foreach ($metas as $meta) {
    $property = $meta->getAttribute('property');
    $content = $meta->getAttribute('content');
    if(!empty($property) && preg_match('#^og:#', $property)) {
        $rmetas[$property] = $content;
    }
}
var_dump($rmetas);

answered Sep 20 '22 13:09

Tom

How about:

preg_match_all('~<\s*meta\s+property="(og:[^"]+)"\s+content="([^"]*)~i', $str, $matches);

So, yes, grab the page with any way you can and parse with regex

answered Sep 20 '22 13:09

zerkms

Related questions
                            
                                Running PHP code/scripts on the command line
                            
                                How to detect if $_POST is set?
                            
                                shell_exec() returning null on "ls"
                            
                                Get Current URL in Magento and show something
                            
                                cURL file uploads not working anymore after upgrade from PHP 5.5 to 5.6
                            
                                Laravel 5.4: how to delete a file stored in storage/app
                            
                                How to get SSL certificate info with CURL in PHP?
                            
                                Convert associative array into indexed
                            
                                How to change exception message of Exception object?
                            
                                How to test if Ci successfully inserted data
                            
                                Use cURL with SNI (Server Name Indication)
                            
                                Convert MIME type to file Extension PHP
                            
                                PHP Adding 15 minutes to Time value
                            
                                Undefined Method in Request::all()
                            
                                possible characters base64 url safe function
                            
                                Add PHP variable inside echo statement as href link address?
                            
                                Add extra meta for orders in Woocommerce
                            
                                How to modify WooCommerce cart, checkout pages (main theme portion)
                            
                                How to prefix a positive number with plus sign in PHP
                            
                                Process very big csv file without timeout and memory error

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With