Converting Images to Black and White for Image Recognition in R

I'm trying to gain some experience with automatic text recognition, and I'm using the tesseract package to perform OCR on some images (i.e. some screenshots I took).

To improve my program's recognition of the prices in the image below, I preprocessed the image with the magick package, increasing its contrast by adjusting the brightness and saturation parameters.

However, I think the performance could be further increased by converting to a black and white image.

How can this be efficiently achieved in R?

Original image

Image after my preprocessing

asked Jan 31 '18 22:01

Francesco Dal Pont

1 Answer

You can convert the colorspace with magick::image_quantize:

library(magick)
#> Linking to ImageMagick 6.9.9.25
#> Enabled features: cairo, fontconfig, freetype, fftw, lcms, pango, rsvg, webp
#> Disabled features: ghostscript, x11

i <- image_read('https://i.stack.imgur.com/nn9k0.png')

i

i %>% image_quantize(colorspace = 'gray')

Depending on your desired image structure, you could also use image_convert to do the same thing:

i %>% image_convert(colorspace = 'gray')
# or
i %>% image_convert(type = 'Grayscale')

or to convert to true black and white (not grayscale),

i %>% image_convert(type = 'Bilevel')

which in this case returns an image with salt and pepper noise, which may or may not be useful.
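If the noise from the 'Bilevel' conversion is a problem, one alternative is to convert to grayscale first and then apply an explicit threshold. The following is only a minimal sketch: the helper name to_black_white and the threshold values are assumptions chosen for illustration, not something from the question, and ImageMagick's built-in "logo:" image stands in for the screenshot so the example runs offline.

```r
library(magick)

# Hypothetical helper (not part of magick): grayscale first, then a hard
# threshold so each pixel ends up either black or white.
to_black_white <- function(img, threshold = "50%") {
  img <- image_convert(img, colorspace = "gray")
  # Pixels brighter than the threshold are pushed to white ...
  img <- image_threshold(img, type = "white", threshold = threshold)
  # ... and pixels darker than the threshold are pushed to black.
  image_threshold(img, type = "black", threshold = threshold)
}

# The built-in "logo:" image is a stand-in for the real screenshot.
img <- image_read("logo:")
bw <- to_black_white(img, threshold = "60%")
bw
```

The threshold usually needs tuning per image: too low and thin strokes of the digits may vanish, too high and the background may bleed into the text. The result can then be passed on to the OCR step, for example with magick::image_ocr().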

Note, however, that while this might be good practice for OCR, it would be a lot simpler to get this data by web scraping (e.g. with rvest), should that be permissible; presumably the same permission issues apply to grabbing these images. Better still, if it contains the information you need, use the appropriate Ryanair API.

answered Sep 20 '22 14:09

alistaire

Tags: r, image-processing, contrast, tesseract, text-recognition
