I'm working on a project to measure and visualize image similarity. The images in my dataset come from photographs of images in books, some of which have very high or low exposure rates. For example, the images below come from two different books; the one on the top is an over-exposed reprint of the one on the bottom, wherein the exposure looks good: <img src="https://i.stack.imgur.com/E8lBj.jpg" alt="enter image description here"> <img src="https://i.stack.imgur.com/y2Kk4.jpg" alt="enter image description here"> I'd like to normalize each image's exposure in Python. I thought I could do so with the following naive approach, which attempts to center each pixel value between 0 and 255: <pre class="prettyprint"><code>from scipy.ndimage import imread import sys def normalize(img): ''' Normalize the exposure of an image. @args: {numpy.ndarray} img: an array of image pixels with shape: (height, width) @returns: {numpy.ndarray} an image with shape of `img` wherein all values are normalized such that the min=0 and max=255 ''' _min = img.min() _max = img.max() return img - _min * 255 / (_max - _min) img = imread(sys.argv[1]) normalized = normalize(img) </code></pre> Only after running this did I realize that this normalization will only help images whose lightest value is less than 255 or whose darkest value is greater than 0. Is there a straightforward way to normalize the exposure of an image such as the top image above? I'd be grateful for any thoughts others can offer on this question.

Histogram equalisation works surprisingly well for this kind of thing. It's usually better for photographic images, but it's helpful even on line art, as long as there are some non-black/white pixels. It works well for colour images too: split the bands up, equalize each one separately, and recombine. I tried on your sample image: <img src="https://i.stack.imgur.com/pbGU1.jpg" alt="after hist equal"> Using libvips: <pre class="prettyprint"><code>$ vips hist_equal sample.jpg x.jpg </code></pre> Or from Python with pyvips: <pre class="prettyprint"><code>x = pyvips.Image.new_from_file("sample.jpg") x = x.hist_equal() x.write_to_file("x.jpg") </code></pre>

Python: Normalize image exposure

Tags:

python

image

image-processing

I'm working on a project to measure and visualize image similarity. The images in my dataset come from photographs of images in books, some of which have very high or low exposure rates. For example, the images below come from two different books; the one on the top is an over-exposed reprint of the one on the bottom, wherein the exposure looks good:

enter image description here

I'd like to normalize each image's exposure in Python. I thought I could do so with the following naive approach, which attempts to center each pixel value between 0 and 255:

from scipy.ndimage import imread
import sys

def normalize(img):
  '''
  Normalize the exposure of an image.
  @args:
    {numpy.ndarray} img: an array of image pixels with shape:
      (height, width)
  @returns:
    {numpy.ndarray} an image with shape of `img` wherein
      all values are normalized such that the min=0 and max=255
  '''
  _min = img.min()
  _max = img.max()
  return img - _min * 255 / (_max - _min)

img = imread(sys.argv[1])
normalized = normalize(img)

Only after running this did I realize that this normalization will only help images whose lightest value is less than 255 or whose darkest value is greater than 0.

Is there a straightforward way to normalize the exposure of an image such as the top image above? I'd be grateful for any thoughts others can offer on this question.

781

asked Mar 29 '18 00:03

duhaime

3 Answers

Histogram equalisation works surprisingly well for this kind of thing. It's usually better for photographic images, but it's helpful even on line art, as long as there are some non-black/white pixels.

It works well for colour images too: split the bands up, equalize each one separately, and recombine.

I tried on your sample image:

after hist equal

Using libvips:

$ vips hist_equal sample.jpg x.jpg

Or from Python with pyvips:

x = pyvips.Image.new_from_file("sample.jpg")
x = x.hist_equal()
x.write_to_file("x.jpg")

125

answered Oct 31 '22 01:10

jcupitt

It's very hard to say if it will work for you without seeing a larger sample of your images, but you may find an "auto-gamma" useful. There is one built into ImageMagick and the description - so that you can calculate it yourself - is:

Automagically adjust gamma level of image.

This calculates the mean values of an image, then applies a calculated -gamma adjustment so that the mean color in the image will get a value of 50%.

This means that any solid 'gray' image becomes 50% gray.

This works well for real-life images with little or no extreme dark and light areas, but tend to fail for images with large amounts of bright sky or dark shadows. It also does not work well for diagrams or cartoon like images.

You can try it out yourself on the command line very simply before you go and spend a lot of time coding something that may not work:

convert Tribunal.jpg -auto-gamma result.png

enter image description here

You can do -auto-level as per your own code beforehand, and a thousand other things too:

convert Tribunal.jpg -auto-level -auto-gamma result.png

answered Oct 31 '22 00:10

Mark Setchell

I ended up using a numpy implementation of the histogram normalization method @user894763 pointed out. Just save the below as normalize.py then you can call:

python normalize.py cats.jpg

Script:

import numpy as np
from scipy.misc import imsave
from scipy.ndimage import imread
import sys

def get_histogram(img):
  '''
  calculate the normalized histogram of an image
  '''
  height, width = img.shape
  hist = [0.0] * 256
  for i in range(height):
    for j in range(width):
      hist[img[i, j]]+=1
  return np.array(hist)/(height*width)

def get_cumulative_sums(hist):
  '''
  find the cumulative sum of a numpy array
  '''
  return [sum(hist[:i+1]) for i in range(len(hist))]

def normalize_histogram(img):
  # calculate the image histogram
  hist = get_histogram(img)
  # get the cumulative distribution function
  cdf = np.array(get_cumulative_sums(hist))
  # determine the normalization values for each unit of the cdf
  sk = np.uint8(255 * cdf)
  # normalize the normalization values
  height, width = img.shape
  Y = np.zeros_like(img)
  for i in range(0, height):
    for j in range(0, width):
      Y[i, j] = sk[img[i, j]]
  # optionally, get the new histogram for comparison
  new_hist = get_histogram(Y)
  # return the transformed image
  return Y

img = imread(sys.argv[1])
normalized = normalize_histogram(img)
imsave(sys.argv[1] + '-normalized.jpg', normalized)

Output:

enter image description here

answered Oct 30 '22 23:10

duhaime

Related questions
                            
                                How to keep column MultiIndex values when merging pandas DataFrames
                            
                                Use os.listdir to show directories only [duplicate]
                            
                                Matplotlib reads jpg into int8 and png into normalized float
                            
                                Using a colormap for matplotlib line plots
                            
                                PYQT - nesting widgets and layouts in multiple levels
                            
                                How to remove the multiindex from GroupBy.apply()?
                            
                                How can I parse a host:port pair in Python
                            
                                Suptitle alignment issues in Matplotlib
                            
                                gsutil no longer works?
                            
                                What's the inferred name of variables in argparse in conflicting cases
                            
                                How to set the timeout of 'driver.get' for python selenium 3.8.0?
                            
                                Seaborn heatmap, custom tick values
                            
                                Round to nearest 1000 in pandas
                            
                                Pandas, how to combine multiple columns into an array column
                            
                                Django '/' only homepage url error
                            
                                Making numpy arrays JSON serializable
                            
                                opposite of df.diff() in pandas
                            
                                What does x in range(...) == y mean in Python 3? [duplicate]
                            
                                Django's template tag inside javascript
                            
                                Unit test pyspark code using python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python: Normalize image exposure

Tags:

python

image

image-processing

duhaime

People also ask

3 Answers

jcupitt

Mark Setchell

duhaime

Recent Activity

Donate For Us