For a list of numbers ranging from x to y that may contain NaN, how can I normalise between 0 and 1, ignoring the NaN values (they stay as NaN)?

Typically I would use MinMaxScaler (ref page) from sklearn.preprocessing, but this cannot handle NaN and recommends imputing the values based on mean or median etc.; it doesn't offer the option to ignore all the NaN values.
The formula for normalizing data to the 0-1 range is: subtract the minimum value of the dataset from the value, then divide by the difference between the maximum and minimum values of the dataset, i.e. x_scaled = (x - min) / (max - min).

When you normalize data that sits on different scales, all values are transformed onto the same scale/range, here between 0 and 1: the lowest value in the data becomes 0, the highest value becomes 1, and every other value falls somewhere in between.
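A minimal sketch of that formula in NumPy (an illustrative array with no NaN handling yet):

import numpy as np

x = np.array([3.0, 4.0, 6.0, 5.0])
# (x - min) / (max - min) maps the smallest value to 0 and the largest to 1
x_scaled = (x - x.min()) / (x.max() - x.min())
print(x_scaled)  # [0.         0.33333333 1.         0.66666667]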
If Google brought you here (like me) and you want to normalize columns to mean 0 and standard deviation 1 using the estimator API, you can use sklearn.preprocessing.StandardScaler. It can handle NaNs (tested on sklearn 0.20.2; I remember it didn't work on some older versions).
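A minimal sketch of that, assuming a sklearn version where the scaler ignores NaN when fitting and passes it through on transform (the answer above reports this works from 0.20.2):

import numpy as np
from sklearn.preprocessing import StandardScaler

X = np.array([[1.0], [2.0], [np.nan], [4.0]])
# NaN rows are skipped when computing the mean/std and stay NaN in the output
X_scaled = StandardScaler().fit_transform(X)
print(X_scaled)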
Use np.nanmax and np.nanmin instead of np.max and np.min; they ignore the NaN entries, and the rest of the min-max formula works unchanged (the NaN values stay NaN).
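For example, a short sketch of that on a plain NumPy array (a hypothetical values array standing in for the OP's list):

import numpy as np

values = np.array([3.0, np.nan, 5.0, 6.0, 4.0])
# np.nanmin/np.nanmax skip NaN when computing the range,
# and NaN entries remain NaN after the element-wise arithmetic
scaled = (values - np.nanmin(values)) / (np.nanmax(values) - np.nanmin(values))
print(scaled)  # [0.         nan 0.66666667 1.         0.33333333]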
Consider the pd.Series s:

import numpy as np
import pandas as pd

s = pd.Series(np.random.choice([3, 4, 5, 6, np.nan], 100))
s.hist()
Option 1
Min Max Scaling
# pandas' min()/max() skip NaN by default, and NaN entries stay NaN after the arithmetic
new = s.sub(s.min()).div(s.max() - s.min())
new.hist()
NOT WHAT OP ASKED FOR
I put these in because I wanted to
Option 2
sigmoid
sigmoid = lambda x: 1 / (1 + np.exp(-x))
new = sigmoid(s.sub(s.mean()))
new.hist()
Option 3
tanh (hyperbolic tangent)
new = np.tanh(s.sub(s.mean())).add(1).div(2)
new.hist()
Here's a different approach, and one that I believe answers the OP correctly. The only difference is that it works on a DataFrame instead of a list; you can easily put your list in a DataFrame, as done below. The other options didn't work for me because I needed to store the MinMaxScaler in order to reverse the transform after a prediction was made. So instead of passing the entire column to the MinMaxScaler, you can filter out the NaNs for both the target and the input.
Solution Example

import pandas as pd
import numpy as np
from sklearn.preprocessing import MinMaxScaler

scaler = MinMaxScaler(feature_range=(0, 1))
d = pd.DataFrame({'A': [0, 1, 2, 3, np.nan, 3, 2]})

# Fit and transform only the non-NaN rows; the NaN row is left untouched
null_index = d['A'].isnull()
d.loc[~null_index, ['A']] = scaler.fit_transform(d.loc[~null_index, ['A']])
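Because the fitted scaler is kept, reversing the scaling later works the same way; a minimal sketch, assuming d still holds the scaled values and the same null_index mask:

# invert the scaling on the non-NaN rows using the stored scaler
d.loc[~null_index, ['A']] = scaler.inverse_transform(d.loc[~null_index, ['A']])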