I'm trying to find a good interval of colors for color masking in order to extract skin from images. I have a database with images and masks to extract skin from those images. here's an example of a sample : <img src="https://i.stack.imgur.com/MVsoTm.png" alt="sample image"> I'm applying the mask for each image in order to get something like this : <img src="https://i.stack.imgur.com/jEdOfm.jpg" alt="masked sample result"> I'm getting all the pixels from all the masked images and removing the black pixels in order to keep only the pixels containing the skin. Using this method I'm able to gather different pixels containing different shades of color of different skins from different people. This is the code I'm using for this : <pre class="prettyprint lang-py prettyprint-override"><code>for i, (img_color, img_mask) in enumerate ( zip(COLORED_IMAGES, MASKS) ) : # masking img_masked = cv2.bitwise_and(img_color, img_mask) # transforming into pixels array img_masked_pixels = img_masked.reshape(len(img_masked) * len(img_masked[0]), len(img_masked[0][0])) # merging all pixels from all samples if i == 0: all_pixels = img_masked_pixels else: all_pixels = np.concatenate((all_pixels, img_masked_pixels), axis = 0) # removing black all_pixels = all_pixels[ ~ (all_pixels == 0).all(axis = 1) ] # sorting pixels all_pixels = np.sort(all_pixels) # reshape into 1 NB_PIXELSx1 image in order to create histogram all_pixels = all_pixels.reshape(len(all_pixels), 1, 3) # creating image NB_PIXELSx1 image containing all skin colors from dataset samples all_pixels = cv2.cvtColor(all_pixels, cv2.COLOR_BGR2YCR_CB) </code></pre> After extracting all shades of color from different skins, I'm creating a histogram that allows me to see which colors are more common. The code is too long for the creation of the histogram, but this is the result : <img src="https://i.stack.imgur.com/nR10Q.png" alt="enter image description here"> Then, I use the turning point for each color space graph and chose a distance for that color space, say 20. The interval for that color space is gotten by doing [ turning point - 20, turning point +20 ] <img src="https://i.stack.imgur.com/CyS49.png" alt="enter image description here"> So let's say that we got the following : R : <ul> <li>turning point : 142</li> <li>distance : 61</li> <li>interval : [81, 203]</li> </ul> G : <ul> <li>turning point : 155</li> <li>distance : 10</li> <li>interval : [145, 165]</li> </ul> B : <ul> <li>turning point : 109</li> <li>distance : 14</li> <li>interval : [95, 123]</li> </ul> I would use these intervals in order to create masks of the colored image from the dataset in order to extract the skin (left: my intervals mask, right: ground truth mask): <img src="https://i.stack.imgur.com/90wD6.png" alt="enter image description here"> The extracted masks using my intervals are compared with the dataset preexistent masks and the accuracy is calculated in order to see how effective and good the intervals that I got are : <pre class="prettyprint"><code>precision_moy = 0 accuracy_moy = 0 for i, (image, img) in enumerate ( zip(COLORED, GROUND_TRUTH) ) : Min = np.array([81, 145, 95], np.uint8) Max = np.array([203, 165, 123], np.uint8) mask = cv2.inRange (image, Min, Max) TP = 0 # True Positive TN = 0 # True Negative FP = 0 # False Positive FN = 0 # False Negative for i in range(mask.shape[0]) : for j in range(mask.shape[1]) : if mask[i,j] == 255 and img[i,j,0] == 255: TP = TP + 1 if mask[i,j] == 0 and img[i,j,0] == 0: TN = TN+1 if mask[i,j] == 255 and img[i,j,0] == 0: FP = FP+1 if mask[i,j] == 0 and img[i,j,0] == 255: FN = FN+1 precision = TP/(TP+FP) accuracy = (TP+TN)/(TP+TN+FP+FN) precision_moy = precision_moy + precision accuracy_moy = accuracy_moy + accuracy precision_moy = precision_moy / len(COLORED) accuracy_moy = accuracy_moy / len(COLORED) </code></pre> I keep on changing the intervals, testing and calculating the accuracy, in order to find the best possible interval for each color space. This change is done by multiplying the distance by a number between 0 and 2. For example : OLD R : <ul> <li>turning point : 142</li> <li>distance : 61</li> <li>interval : [81, 203]</li> </ul> NEW DISTANCE = OLD DISTANCE * 0.7 = 61 * 0.7 = 43 NEW R: <ul> <li>turning point : 142</li> <li>distance : 43</li> <li>interval : [99, 185]</li> </ul> <ul> <li>To get a higher interval I would multiply by a number in ]1, 2]</li> <li>To get a smaller interval I would multiply by a number in ]0, 1[</li> </ul> Now, to my question: I would like to find the best possible interval for each color space using an optimization method instead of manually and randomly changing the intervals. What optimization method should I use and how would I use it ? Thank you for taking the time. Your help is appreciated.

I would suggest using genetic optimization which can be easily implemented for as simple problem as yours. Since the problem is relatively "small" it should not take much longer to find optimal solution compared to some local optimization method like Hillclimb suggested by @Leander. Genetic algorithm is a metaheuristic search so it is not guaranteed to find the optimal solution but it should get you very close. In fact for a such small problem the chance that you will find the global optimum is very high. As a start I would recommend taking a look at DEAP so you don't have to implement anything yourself (https://deap.readthedocs.io/en/master/). It contains very good implementations of many genetic algorithm variations and there are tutorials with nice examples. With a bit of effort you should be able to compose a simple optimization algorithm in a day or two. Genetic algorithm will from now on be denoted as <code>GA</code> for simplicity Some tips where to start: <ul> <li>I suggest you start with the simplest variation<code>eaSimple</code> in DEAP. When this will not be satisfactory you can always move to something little more sophisticated but I think that won't be necessary.</li> <li>your <code>Individual</code> in GA will have 6 components -> [blue_low, blue_high, green_low, green_high, red_low, red_high] this will also address the problem of assymetric interval as mentioned by @Leander in the comments</li> <li> <code>mutations</code> will be done by randomly altering elements of the individual</li> <li>for <code>fittness</code> function you can use your accuracy as you are computing it now</li> </ul> That is essentially all you need to build GA for your problem. This example here https://deap.readthedocs.io/en/master/examples/ga_onemax.html should get you up and running. You just need to define your own individuals, operators and fitness evaluation function as I mentioned in previous steps A final note on the use of any general optimization method. As I understand this is a discrete problem in 6 dimensions since you have 6 components: blue_low, blue_high, green_low, green_high, red_low, red_high and each one of them has only 255 possible values. This will prevent use of most optimization methods since they require the problem to be continuous.

One basic approach which converges quickly but may not yield the global optimum is Hillclimbing. Hillclimbing is a form of local search which can be used in this case. Hillclimbing works by going from one state or solution to the next depending on the score or performance of the state. If no better state can be found that state is returned as solution. There are multiple ways of implementing Hillclimbing, in your case I would do something like this: The State: In your case an item containing the Min and Max numpy arrays and the accuracy or f-measure of the mask created with these arrays applied on the image as score property. <blockquote> For now I suggest you only take symmetrical ranges to massively reduce the search space. </blockquote> Starting State You can create a starting state at random, taking a random interval for each channel (Red, Green, Blue). This is especially useful if you run this algorithm multiple times. Determine the maximum and minimum for each interval based on your histograms. Iteration Process (this is where the searching is done) You want to create an infinite loop in which you create successor states for the current state. Increasing or decreasing the interval of each channel with say <code>10</code> of the current state, and then every combination of those new intervals can be a successor state. Another way could be to switch channel each iteration. So in the first iteration you create a successor state that has the Red channel of the current state decreased with 10, and a successor state that has the Red channel of the current state increased with 10. The second iteration you change the Green channel, the third iteration the Blue channel, etc. You then create a mask based on each successor state and apply them onto the image, therefore determining the performance of each successor state. Select the best performing successor state and take it as current state if its performance is better. Repeat this process until the best successor state is performing worse than the current state, then you know you have hit a local optimum. Return this state as solution. Problems As highlighted in above line, this algorithm will find the local optimum for the starting state. This is because of greediness of this algorithm. You therefore may want to restart this algorithm on different starting locations, allowing more of the search space to be explored, increasing the chance the global maximum is found. If you have multiple threads you may run multiple instances in parallel and then finally returning the best state out of the results from each instance. Hillclimbing is not the best optimization algorithm, but it is very fast and easy to implement.

How to use an optimization algorithm to find the best possible parameter

Tags:

python

optimization

mask

I'm trying to find a good interval of colors for color masking in order to extract skin from images.

I have a database with images and masks to extract skin from those images. here's an example of a sample :

sample image

I'm applying the mask for each image in order to get something like this :

masked sample result

I'm getting all the pixels from all the masked images and removing the black pixels in order to keep only the pixels containing the skin. Using this method I'm able to gather different pixels containing different shades of color of different skins from different people.

This is the code I'm using for this :

for i, (img_color, img_mask) in enumerate ( zip(COLORED_IMAGES, MASKS) ) :

    # masking
    img_masked = cv2.bitwise_and(img_color, img_mask)
    
    # transforming into pixels array
    img_masked_pixels = img_masked.reshape(len(img_masked) * len(img_masked[0]), len(img_masked[0][0]))
 
    # merging all pixels from all samples
    if i == 0:
        all_pixels = img_masked_pixels
    else:
        all_pixels = np.concatenate((all_pixels, img_masked_pixels), axis = 0)

# removing black
all_pixels = all_pixels[ ~ (all_pixels == 0).all(axis = 1) ]

# sorting pixels
all_pixels = np.sort(all_pixels)

# reshape into 1 NB_PIXELSx1 image in order to create histogram
all_pixels = all_pixels.reshape(len(all_pixels), 1, 3)

# creating image NB_PIXELSx1 image containing all skin colors from dataset samples
all_pixels = cv2.cvtColor(all_pixels, cv2.COLOR_BGR2YCR_CB)

After extracting all shades of color from different skins, I'm creating a histogram that allows me to see which colors are more common. The code is too long for the creation of the histogram, but this is the result :

enter image description here

Then, I use the turning point for each color space graph and chose a distance for that color space, say 20. The interval for that color space is gotten by doing [ turning point - 20, turning point +20 ]

enter image description here

So let's say that we got the following :

R :

turning point : 142
distance : 61
interval : [81, 203]

G :

turning point : 155
distance : 10
interval : [145, 165]

B :

turning point : 109
distance : 14
interval : [95, 123]

I would use these intervals in order to create masks of the colored image from the dataset in order to extract the skin (left: my intervals mask, right: ground truth mask):

enter image description here

The extracted masks using my intervals are compared with the dataset preexistent masks and the accuracy is calculated in order to see how effective and good the intervals that I got are :

precision_moy = 0
accuracy_moy = 0

for i, (image, img) in enumerate ( zip(COLORED, GROUND_TRUTH) ) :
    Min = np.array([81, 145, 95], np.uint8)
    Max = np.array([203, 165, 123], np.uint8)

    mask = cv2.inRange (image, Min, Max)

    TP = 0 # True Positive
    TN = 0 # True Negative
    FP = 0 # False Positive
    FN = 0 # False Negative

    for i in range(mask.shape[0]) :
        for j in range(mask.shape[1]) :
            if mask[i,j] == 255 and img[i,j,0] == 255:
                TP = TP + 1
            if mask[i,j] == 0 and img[i,j,0] == 0:
                TN = TN+1
            if mask[i,j] == 255 and img[i,j,0] == 0:
                FP = FP+1
            if mask[i,j] == 0 and img[i,j,0] == 255:
                FN = FN+1

    precision = TP/(TP+FP)
    accuracy = (TP+TN)/(TP+TN+FP+FN)
    
    precision_moy = precision_moy + precision
    accuracy_moy = accuracy_moy + accuracy

precision_moy = precision_moy / len(COLORED)
accuracy_moy = accuracy_moy / len(COLORED)

I keep on changing the intervals, testing and calculating the accuracy, in order to find the best possible interval for each color space. This change is done by multiplying the distance by a number between 0 and 2. For example :

OLD R :

turning point : 142
distance : 61
interval : [81, 203]

NEW DISTANCE = OLD DISTANCE * 0.7 = 61 * 0.7 = 43

NEW R:

turning point : 142
distance : 43
interval : [99, 185]

To get a higher interval I would multiply by a number in ]1, 2]
To get a smaller interval I would multiply by a number in ]0, 1[

Now, to my question:

I would like to find the best possible interval for each color space using an optimization method instead of manually and randomly changing the intervals. What optimization method should I use and how would I use it ?

Thank you for taking the time. Your help is appreciated.

795

asked Aug 17 '20 20:08

Mohamed Benkedadra

2 Answers

I would suggest using genetic optimization which can be easily implemented for as simple problem as yours. Since the problem is relatively "small" it should not take much longer to find optimal solution compared to some local optimization method like Hillclimb suggested by @Leander. Genetic algorithm is a metaheuristic search so it is not guaranteed to find the optimal solution but it should get you very close. In fact for a such small problem the chance that you will find the global optimum is very high.

As a start I would recommend taking a look at DEAP so you don't have to implement anything yourself (https://deap.readthedocs.io/en/master/). It contains very good implementations of many genetic algorithm variations and there are tutorials with nice examples. With a bit of effort you should be able to compose a simple optimization algorithm in a day or two.

Genetic algorithm will from now on be denoted as GA for simplicity

Some tips where to start:

I suggest you start with the simplest variationeaSimple in DEAP. When this will not be satisfactory you can always move to something little more sophisticated but I think that won't be necessary.
your Individual in GA will have 6 components -> [blue_low, blue_high, green_low, green_high, red_low, red_high] this will also address the problem of assymetric interval as mentioned by @Leander in the comments
mutations will be done by randomly altering elements of the individual
for fittness function you can use your accuracy as you are computing it now

That is essentially all you need to build GA for your problem. This example here https://deap.readthedocs.io/en/master/examples/ga_onemax.html should get you up and running. You just need to define your own individuals, operators and fitness evaluation function as I mentioned in previous steps

A final note on the use of any general optimization method. As I understand this is a discrete problem in 6 dimensions since you have 6 components: blue_low, blue_high, green_low, green_high, red_low, red_high and each one of them has only 255 possible values. This will prevent use of most optimization methods since they require the problem to be continuous.

114

answered Oct 10 '22 10:10

Majo

One basic approach which converges quickly but may not yield the global optimum is Hillclimbing.

Hillclimbing is a form of local search which can be used in this case.
Hillclimbing works by going from one state or solution to the next depending on the score or performance of the state. If no better state can be found that state is returned as solution.

There are multiple ways of implementing Hillclimbing, in your case I would do something like this:

The State: In your case an item containing the Min and Max numpy arrays and the accuracy or f-measure of the mask created with these arrays applied on the image as score property.

For now I suggest you only take symmetrical ranges to massively reduce the search space.

Starting State
You can create a starting state at random, taking a random interval for each channel (Red, Green, Blue). This is especially useful if you run this algorithm multiple times. Determine the maximum and minimum for each interval based on your histograms.

Iteration Process (this is where the searching is done)
You want to create an infinite loop in which you create successor states for the current state. Increasing or decreasing the interval of each channel with say 10 of the current state, and then every combination of those new intervals can be a successor state.
Another way could be to switch channel each iteration. So in the first iteration you create a successor state that has the Red channel of the current state decreased with 10, and a successor state that has the Red channel of the current state increased with 10. The second iteration you change the Green channel, the third iteration the Blue channel, etc.

You then create a mask based on each successor state and apply them onto the image, therefore determining the performance of each successor state.
Select the best performing successor state and take it as current state if its performance is better.

Repeat this process until the best successor state is performing worse than the current state, then you know you have hit a local optimum. Return this state as solution.

Problems
As highlighted in above line, this algorithm will find the local optimum for the starting state. This is because of greediness of this algorithm.
You therefore may want to restart this algorithm on different starting locations, allowing more of the search space to be explored, increasing the chance the global maximum is found.
If you have multiple threads you may run multiple instances in parallel and then finally returning the best state out of the results from each instance.

Hillclimbing is not the best optimization algorithm, but it is very fast and easy to implement.

answered Oct 10 '22 10:10

Leander

Related questions
                            
                                Python typing: typed dictionary or defaultdict extending classes
                            
                                How to avoid poor performance of pandas mean() with datetime columns
                            
                                How to use deep learning models for time-series forecasting?
                            
                                Include minimum pip version in setup.py
                            
                                How to make conda-build work correctly and find the setup.py?
                            
                                Animation of tangent line of a 3D curve
                            
                                os.link() vs. os.rename() vs. os.replace() for writing atomic write files. What is the best approach?
                            
                                Reasons for differences in memory consumption and performances of np.zeros and np.full
                            
                                Find Fraction using LP
                            
                                Training stability of Wasserstein GANs
                            
                                Detecting insertion/removal of USB input devices on Windows 10
                            
                                TensorFlow 2.0 C++ - Load pre-trained model
                            
                                how to increase resolution of text in scanned images in python?
                            
                                matplotlib figure won't show when Python is run from VS Code integrated terminal
                            
                                ImportError: cannot import name 'Feature' from 'setuptools [closed]
                            
                                how to add a different model form to modelformset_factory
                            
                                tensorflow_hub throwing this error: 'SentencepieceOp' when loading the link
                            
                                Why multiprocess python grpc server do not work?
                            
                                ValueError: Expect x to be a 1-D sorted array_like.I am trying to plot smooth curve but couldn't
                            
                                Calling a function with unknown number of parameters Python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to use an optimization algorithm to find the best possible parameter

Tags:

python

optimization

mask

Mohamed Benkedadra

People also ask

2 Answers

Majo

Leander

Recent Activity

Donate For Us