Convert probability vector into target vector in python?

Question

I am doing logistic regression on iris dataset from sklearn, I know the math and try to implement it. At the final step, I get a prediction vector, this prediction vector represents the probability of that data point being to class 1 or class 2 (binary classification).

Now I want to turn this prediction vector into target vector. Say if probability is greater than 50%, that corresponding data point will belong to class 1, otherwise class 2. Use 0 to represent class 1, 1 for class 2.

I know there is a for loop version of it, just looping through the whole vector. But when the size get large, for loop is very expensive, so I want to do it more efficiently, like numpy's matrix operation, it is faster than doing matrix operation in for loop.

Any suggestion on the faster method?

Shi XiuFeng · Accepted Answer

import numpy as np

a = np.matrix('0.1 0.82')
print(a)

a[a > 0.5] = 1
a[a <= 0.5] = 0
print(a)

Output:

[[ 0.1   0.82]]
[[ 0.  1.]]

Update:

import numpy as np

a = np.matrix('0.1 0.82')
print(a)

a = np.where(a > 0.5, 1, 0)
print(a)

Jeff · Answer

A more general solution to a 2D array which has many vectors with many classes:

import numpy as np
a = np.array( [ [.5, .3, .2], 
                [.1, .2, .7], 
                [ 1,  0,  0] ] )

idx = np.argmax(a, axis=-1)
a = np.zeros( a.shape )
a[ np.arange(a.shape[0]), idx] = 1

print(a)

Output:

[[1. 0. 0.]
 [0. 0. 1.]
 [1. 0. 0.]]

Convert probability vector into target vector in python?

Tags:

python

machine-learning

numpy

jiancheng wu

2 Answers

Output:

Update:

Shi XiuFeng

Jeff

Recent Activity

Donate For Us

Convert probability vector into target vector in python?

Tags:

python

machine-learning

numpy

jiancheng wu

2 Answers

Output:

Update:

Shi XiuFeng

Jeff

Related questions

Recent Activity

Donate For Us