I'm trying the following:
$url = 'https://www.tripadvisor.es/Hotels-g187514-Madrid-Hotels.html';
$ta_html = file_get_html($url);
var_dump($ta_html);
It returns false. The same code works and correctly fetches the HTML for:
$url = 'https://www.tripadvisor.es/Hotels-g294316-Lima_Lima_Region-Hotels.html#ACCOM_OVERVIEW';
My first thought was that there was a redirect, but I checked the headers with curl: it returns 200 OK, and the headers looked the same in both cases. What can be happening, and how can it be solved?
This seems to be a duplicate of this question: Simple HTML DOM returning false, which is also unanswered.
Web scraping can be done by targeting selected DOM elements of a page and then processing or storing the text they contain. In PHP, there is a library that parses the whole page and looks up the required elements within the DOM: the Simple HTML DOM Parser.
A DOM parser turns XML or HTML source code from a string into a DOM document. It is tree-based: before any data can be accessed, it loads the entire document into a DOM object in memory, and you then traverse that tree to read or update nodes.
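As a minimal sketch of this tree-based approach, here is an example using PHP's built-in DOMDocument and DOMXPath (not the Simple HTML DOM library), so it runs without extra dependencies; the markup and the `hotel` class are made up for illustration:

```php
<?php
// Parse an HTML string into a DOM tree, then query nodes from it.
$html = '<html><body>'
      . '<div class="hotel">Hotel Ritz</div>'
      . '<div class="hotel">Hotel Palace</div>'
      . '</body></html>';

$doc = new DOMDocument();
$doc->loadHTML($html); // builds the whole tree in memory

// Traverse the tree with an XPath query instead of walking it by hand.
$xpath = new DOMXPath($doc);
foreach ($xpath->query('//div[@class="hotel"]') as $node) {
    echo $node->textContent, "\n";
}
// Output:
// Hotel Ritz
// Hotel Palace
```

Simple HTML DOM offers the same idea with a CSS-selector-style API (`find('div.hotel')`) instead of XPath.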
It looks like the HTML DOM parser is failing because the HTML file is larger than the library's maximum file size. When you call file_get_html(), it does a file size check against its MAX_FILE_SIZE
constant. So before calling any HTML DOM parser methods, increase the maximum file size used by the library:
define('MAX_FILE_SIZE', 1200000); // or larger if needed; the default is 600000
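Note that the define() has to run before simple_html_dom.php is included. Recent versions of the library guard the constant roughly as shown in the sketch below (the guard line stands in for what the include does), so an earlier definition wins:

```php
<?php
// Your define() must come BEFORE including simple_html_dom.php.
define('MAX_FILE_SIZE', 1200000);

// What including the library effectively does (simplified stand-in):
// it only sets the default when the constant is not already defined.
defined('MAX_FILE_SIZE') || define('MAX_FILE_SIZE', 600000);

echo MAX_FILE_SIZE, "\n"; // prints 1200000, not the 600000 default
```

If you define the constant after the include, the library's default of 600000 is already in place and your larger value is silently ignored.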
Also, as you found out, you can work around the file size check entirely by fetching the page yourself and loading it as a string, since load() does no size check:
$str = file_get_contents($url);
$html = new simple_html_dom();
$html->load($str);