Chrome's headless mode is already available for Linux: https://chromium.googlesource.com/chromium/src/+/lkgr/headless/README.md It works only with Canary right now, but it is coming officially in Chrome 57. Is there any chance of running Google Chrome on AWS Lambda?


Chrome --headless for AWS Lambda?

1 Answer

Yes; it's possible.

Compiling a non-debug build of Headless Chrome yields a binary that's ~125 MB, and just under 44 MB when gzipped. This means it fits within the limits for a function's deployment package: 250 MB uncompressed and 50 MB compressed.
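To check a build against those two numbers before uploading, something like the following sketch may help. The function name `fits_lambda_limits` is made up for illustration, gzip is used as a stand-in for the zip compression Lambda actually sees, and the limits are treated as MiB, which is an assumption.

```shell
# Limits quoted in the answer: 250 MB uncompressed, 50 MB compressed.
# Treating "MB" as MiB is an assumption.
MAX_UNCOMPRESSED=$((250 * 1024 * 1024))
MAX_COMPRESSED=$((50 * 1024 * 1024))

# fits_lambda_limits FILE (hypothetical helper):
# prints both sizes in bytes and succeeds only if FILE is within both limits.
fits_lambda_limits() {
  raw=$(wc -c < "$1" | tr -d ' ')
  packed=$(gzip -c "$1" | wc -c | tr -d ' ')
  echo "uncompressed=${raw} gzipped=${packed}"
  [ "$raw" -le "$MAX_UNCOMPRESSED" ] && [ "$packed" -le "$MAX_COMPRESSED" ]
}
```

For example, `fits_lambda_limits out/Headless/headless_shell || echo "too big for Lambda"` once the build described below has finished.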

What's (currently) required is to force Chrome to compile without using shared memory at /dev/shm. There's a thread on the topic on the headless-dev Google group here.

Here are steps I've used to build a binary of headless Chrome that will work on AWS Lambda. They're based on this and this.

1. Create a new EC2 instance using the community AMI named amzn-ami-hvm-2016.03.3.x86_64-gp2 (us-west-2 ami-7172b611).
2. Pick an instance type with at least 16 GB of memory. Compiling takes about 4-5 hours on a t2.xlarge, 2-3 hours on a t2.2xlarge, or about 45 minutes on a c4.4xlarge.
3. Give yourself a root volume of at least 30 GB (40 GB if you want to compile a debug build, which you won't be able to upload to Lambda because it's too big).
4. SSH into the new instance and run:

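# The redirect must run as root, so pipe through "sudo tee" instead of "sudo printf ... >>".
printf 'LANG=en_US.utf-8\nLC_ALL=en_US.utf-8\n' | sudo tee -a /etc/environment > /dev/null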
sudo yum install -y git redhat-lsb python bzip2 tar pkgconfig atk-devel alsa-lib-devel bison binutils brlapi-devel bluez-libs-devel bzip2-devel cairo-devel cups-devel dbus-devel dbus-glib-devel expat-devel fontconfig-devel freetype-devel gcc-c++ GConf2-devel glib2-devel glibc.i686 gperf glib2-devel gtk2-devel gtk3-devel java-1.*.0-openjdk-devel libatomic libcap-devel libffi-devel libgcc.i686 libgnome-keyring-devel libjpeg-devel libstdc++.i686 libX11-devel libXScrnSaver-devel libXtst-devel libxkbcommon-x11-devel ncurses-compat-libs nspr-devel nss-devel pam-devel pango-devel pciutils-devel pulseaudio-libs-devel zlib.i686 httpd mod_ssl php php-cli python-psutil wdiff --enablerepo=epel
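Because the package list is long, it can be useful to see afterwards which names did not end up installed. The sketch below is one way to do that; `report_missing` and `is_installed` are hypothetical helper names, and the check relies on `rpm -q`, which will not match the wildcard entry (`java-1.*.0-openjdk-devel`) from the list.

```shell
# report_missing CHECKER PKG... (hypothetical helper):
# prints "missing: PKG" for every PKG where "CHECKER PKG" fails.
report_missing() {
  checker=$1
  shift
  for pkg in "$@"; do
    "$checker" "$pkg" >/dev/null 2>&1 || echo "missing: $pkg"
  done
}

# On the build instance, ask the RPM database whether a package is installed.
is_installed() {
  rpm -q "$1"
}

# Example (run on the EC2 instance):
#   report_missing is_installed git bison gperf gtk3-devel nss-devel
```

The checker is passed in as an argument so the same loop can be pointed at a different package manager if needed.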

Yum will complain that some packages don't exist. I haven't looked into them, and it didn't stop me from building headless_shell, so ignore whiny little Yum and move on. Next:

git clone https://chromium.googlesource.com/chromium/tools/depot_tools.git
# Single quotes keep $PATH and $HOME unexpanded until the profile is sourced.
echo 'export PATH=$PATH:$HOME/depot_tools' >> ~/.bash_profile
source ~/.bash_profile
mkdir Chromium && cd Chromium
fetch --no-history chromium
cd src
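If `fetch` reports "command not found", depot_tools is most likely not on the PATH yet. A minimal sketch for checking this up front follows; `missing_tools` is a hypothetical helper, not part of depot_tools.

```shell
# missing_tools TOOL... (hypothetical helper):
# prints each TOOL that cannot be found on the current PATH.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || echo "$tool"
  done
}

# Example: the three commands used in this answer.
#   missing_tools fetch gn ninja
```

An empty result means all of the listed commands resolve; anything printed points at a PATH problem to fix before starting the long checkout.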

At this point we need to make a very small change to the Chrome code. By default on Linux, Chrome assumes there to be a tmpfs at /dev/shm. There is no tmpfs available to a Lambda function. :-(

The file we have to change is src/base/files/file_util_posix.cc. Modify GetShmemTempDir() so that it always returns the OS's temp dir (/tmp). A simple way to do this is to remove the entire #if defined(OS_LINUX) block in the GetShmemTempDir() function. A less drastic change is to hardcode use_dev_shm to false:

bool GetShmemTempDir(bool executable, FilePath* path) {
#if defined(OS_LINUX)
  bool use_dev_shm = true;
  if (executable) {
    static const bool s_dev_shm_executable = DetermineDevShmExecutable();
    use_dev_shm = s_dev_shm_executable;
  }

  // cuz lambda
  use_dev_shm = false; // <-- add this. Yes it's pretty hack-y

  if (use_dev_shm) {
    *path = FilePath("/dev/shm");
    return true;
  }
#endif
  return GetTempDir(path);
}
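Since the build takes hours, it may be worth confirming the edit is really in the file before compiling. The sketch below only detects the "hardcode to false" variant shown above, not the variant where the whole #if block is removed; `has_shm_override` is a made-up name.

```shell
# has_shm_override FILE (hypothetical helper):
# succeeds if FILE contains the added "use_dev_shm = false;" line.
has_shm_override() {
  grep -q 'use_dev_shm = false;' "$1"
}

# Example, from the src directory:
#   has_shm_override base/files/file_util_posix.cc || echo "patch missing"
```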

With that change, it's time to compile. Picking things back up in the src directory, set some compile arguments and then (the last command) start the build process.

mkdir -p out/Headless
echo 'import("//build/args/headless.gn")' > out/Headless/args.gn
echo 'is_debug = false' >> out/Headless/args.gn
echo 'symbol_level = 0' >> out/Headless/args.gn
echo 'is_component_build = false' >> out/Headless/args.gn
echo 'remove_webcore_debug_symbols = true' >> out/Headless/args.gn
echo 'enable_nacl = false' >> out/Headless/args.gn
gn gen out/Headless
ninja -C out/Headless headless_shell
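The series of echo commands can also be written as a single here-document, which makes the resulting args.gn easier to read at a glance. This is just an alternative sketch with the same six settings; the function name `write_headless_args` is an assumption for illustration.

```shell
# write_headless_args DIR (hypothetical helper):
# creates DIR and writes args.gn with the same settings as the echo commands.
write_headless_args() {
  mkdir -p "$1" || return 1
  cat > "$1/args.gn" <<'EOF'
import("//build/args/headless.gn")
is_debug = false
symbol_level = 0
is_component_build = false
remove_webcore_debug_symbols = true
enable_nacl = false
EOF
}

# Example, from the src directory:
#   write_headless_args out/Headless && gn gen out/Headless
```

The quoted 'EOF' delimiter keeps the shell from expanding anything inside the file body.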

Finally we make a tarball of the relevant file(s) we'll need to run in Lambda.

mkdir out/headless-chrome && cd out
cp Headless/headless_shell Headless/libosmesa.so headless-chrome/
tar -zcvf chrome-headless-lambda-linux-x64.tar.gz headless-chrome/

Within Lambda, run headless_shell with the remote debugger interface enabled by executing:

/path/to/headless_shell --disable-gpu --no-sandbox --remote-debugging-port=9222 --user-data-dir=/tmp/user-data --single-process --data-path=/tmp/data-path --homedir=/tmp --disk-cache-dir=/tmp/cache-dir

Since /tmp is the only writeable place in a Lambda function, there are a bunch of flags just telling Chrome where to dump its data. They're not strictly necessary, but they keep Chrome happy. Note also that it's been mentioned that with the --disable-gpu flag, we don't need libosmesa.so, the omission of which would shave about 4 MB off our package zip.
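As a rough sketch, the launch command can be wrapped so that the /tmp directories exist before Chrome starts and so that the caller can tell when the debugger is listening. The helper names are hypothetical, the binary path is the same placeholder as above, and the readiness check assumes curl is available in the environment, which may not be the case inside Lambda.

```shell
# Placeholder path, as in the answer; adjust to where the package is unpacked.
HEADLESS_SHELL=${HEADLESS_SHELL:-/path/to/headless_shell}
DEBUG_PORT=${DEBUG_PORT:-9222}

# headless_flags: prints the flags from the answer, one per line.
headless_flags() {
  cat <<EOF
--disable-gpu
--no-sandbox
--remote-debugging-port=${DEBUG_PORT}
--user-data-dir=/tmp/user-data
--single-process
--data-path=/tmp/data-path
--homedir=/tmp
--disk-cache-dir=/tmp/cache-dir
EOF
}

# start_headless: creates the /tmp directories, then launches in the background.
start_headless() {
  mkdir -p /tmp/user-data /tmp/data-path /tmp/cache-dir || return 1
  # Word splitting is intentional here: none of the flags contain spaces.
  "$HEADLESS_SHELL" $(headless_flags) &
}

# debugger_ready: succeeds once the DevTools HTTP endpoint answers.
# Assumes curl is installed.
debugger_ready() {
  curl -s "http://127.0.0.1:${DEBUG_PORT}/json/version" >/dev/null
}
```

A caller would run `start_headless` and then poll `debugger_ready` in a short loop before connecting a remote debugging client.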

I've started this project with the aim of making it easier to get started. It comes with a pre-built headless Chrome binary which you can get here.

answered Oct 08 '22 15:10

Marco Lüthy


Tags:

google-chrome

amazon-web-services

aws-lambda

chromium

Sergey Babochkin
