Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Issue in executing puppeteer in headful mode in Docker

I am new to puppeteer and docker. I am facing an issue while setting up puppeteer in headful mode in docker container.

Puppeteer version: 1.6.2 Platform / OS version: Docker node:8-slim Node.js version: node 8

DockerFile -

FROM node:8-slim
RUN apt-get update && apt-get install --no-install-recommends -y ca-certificates curl fontconfig fonts-liberation gconf-service git libappindicator1 libasound2 libatk1.0-0 libc6 libcairo2 libcups2 libdbus-1-3 libexpat1 libfontconfig1 libgcc1 libgconf-2-4 libgdk-pixbuf2.0-0 libglib2.0-0 libgtk-3-0 libnspr4 libnss3 libpango-1.0-0 libpangocairo-1.0-0 libstdc++6 libx11-6 libx11-xcb1 libxcb1 libxcomposite1 libxcursor1 libxdamage1 libxext6 libxfixes3 libxi6 libxrandr2 libxrender1 libxss1 libxtst6 locales lsb-release unzip wget xdg-utils

RUN apt-get update && apt-get install -y wget --no-install-recommends && wget -q -O - https://dl-ssl.google.com/linux/linux_signing_key.pub | apt-key add - && sh -c 'echo "deb [arch=amd64] http://dl.google.com/linux/chrome/deb/ stable main" >> /etc/apt/sources.list.d/google.list' && apt-get update && apt-get install -y google-chrome-unstable fonts-ipafont-gothic fonts-wqy-zenhei fonts-thai-tlwg fonts-kacst ttf-freefont --no-install-recommends && rm -rf /var/lib/apt/lists/* \
    && apt-get purge --auto-remove -y curl && rm -rf /src/*.deb

ADD https://github.com/Yelp/dumb-init/releases/download/v1.2.0/dumb-init_1.2.0_amd64 /usr/local/bin/dumb-init

RUN chmod +x /usr/local/bin/dumb-init

RUN groupadd -r pptruser && useradd -r -g pptruser -G audio,video pptruser && mkdir -p /home/pptruser/Downloads && chown -R pptruser:pptruser /home/pptruser && chown -R pptruser:pptruser .

USER pptruser
EXPOSE 4081/tcp
CMD [ "npm", "start" ]

Application code:

const browser = await puppeteer.launch({
    headless: false,
    args:['--no-sandbox']
});

I am using puppeteer to take screenshot of the url provided. It executes successfully in headless true mode. But, when same script executed with headless false, it gives errors as:

  1. Error: Failed to launch chrome! [0803/070106.562113:ERROR:nacl_helper_linux.cc(310)] NaCl helper process running without a sandbox!

  2. [11104:11104:0816/105455.434188:FATAL:zygote_host_impl_linux.cc(123)] No usable sandbox! Update your kernel or see https://chromium.googlesource.com/chromium/src/+/master/docs/linux_suid_sandbox_development.md for more information on developing with the SUID sandbox. If you want to live dangerously and need an immediate workaround, you can try using --no-sandbox

Could you please help with this issue. Or could you share a sample application code which uses puppeteer for screenshot in docker container in headful mode. Reason for using headful mode: better performance and some sites don't allow

like image 303
jsanjayce Avatar asked Aug 03 '18 07:08

jsanjayce


1 Answers

Check the following Dockerfile, read the comments. It should run properly.

# First, we need to make sure all dependencies are there. If you are using docker, then the important dependencies are already present on most node images. 
FROM node:8

# To run Headful mode, you will need to have a display, which is not present in a server. 
# To avoid this, we will use Xvfb, and create a fake display, so the chrome will think there is a display and run properly. 
# So we just need to install Xvfb and Puppeteer related dependencies.
RUN apt-get update && apt-get install -yq gconf-service libasound2 libatk1.0-0 libc6 libcairo2 libcups2 libdbus-1-3 libexpat1 libfontconfig1 libgcc1 libgconf-2-4 libgdk-pixbuf2.0-0 libglib2.0-0 libgtk-3-0 libnspr4 libpango-1.0-0 libpangocairo-1.0-0 libstdc++6 libx11-6 libx11-xcb1 libxcb1 libxcomposite1 libxcursor1 libxdamage1 libxext6 libxfixes3 libxi6 libxrandr2 libxrender1 libxss1 libxtst6 ca-certificates fonts-liberation libappindicator1 libnss3 lsb-release xdg-utils wget x11vnc x11-xkb-utils xfonts-100dpi xfonts-75dpi xfonts-scalable xfonts-cyrillic x11-apps xvfb

# I am going to ignore dumb-init or such for now since it will add complexities to understand what is actually needed here. 

# Assuming we are working on /app folder, cd into /app
WORKDIR /app

# Copy package.json into app folder
COPY package.json /app

# Install dependencies
RUN npm install 

COPY . /app

# Start server on port 3000
EXPOSE 3000

# I'll also assume you are going to use root user, 
# and your script has `--no-sandbox` and `--disable-setuid-sandbox` arguments.
# We run a fake display and run our script.
# Start script on Xvfb
CMD xvfb-run --server-args="-screen 0 1024x768x24" npm start

Then I run it with following command,

# build the app
sudo docker build -t app . 

# start the container, expose to network and remove after running
sudo docker run --network=host -it app

Sample result from our script that I used above, it can be any script,

Step 8/8 : CMD xvfb-run --server-args="-screen 0 1024x768x24" npm start
 ---> Using cache
 ---> b1319f0e68ef
Successfully built b1319f0e68ef
Successfully tagged scraper:latest

> [email protected] start /app
> node index.js

Example app listening on port 3000!
> Opening browser
> Navigating url
> Typing text
> Wait for results
> Extracted data
> Cleaning up instances
like image 162
Md. Abu Taher Avatar answered Sep 22 '22 10:09

Md. Abu Taher