i have a 1000 * 1000 numpy array with 1 million values which was created as follows : <pre class="prettyprint"><code>>>import numpy as np >>data = np.loadtxt('space_data.txt') >> print (data) >>[[ 13. 15. 15. ..., 15. 15. 16.] [ 14. 13. 14. ..., 13. 15. 16.] [ 16. 13. 13. ..., 13. 15. 17.] ..., [ 14. 15. 14. ..., 14. 14. 13.] [ 15. 15. 16. ..., 16. 15. 14.] [ 14. 13. 16. ..., 16. 16. 16.]] </code></pre> I have another numpy array which which has 2 columns as follows: <pre class="prettyprint"><code>>> print(key) >>[[ 10., S], [ 11., S], [ 12., S], [ 13., M], [ 14., L], [ 15., S], [ 16., S], ..., [ 92., XL], [ 93., M], [ 94., XL], [ 95., S]] </code></pre> What i would basically want is to replace each element of of the data array with corresponding element in the second column of the key array like this.. <pre class="prettyprint"><code>>> print(data) >>[[ M S S ..., S S S] [ L M L ..., M S S] [ S M M ..., M S XL] ..., [ L S L ..., L L M] [ S S S ..., S S L] [ L M S ..., S S S]] </code></pre>

In Python dicts are a natural choice for mapping from keys to values. NumPy has no direct equivalent of a dict. But it does have arrays which can do fast integer indexing. For example, <pre class="prettyprint"><code>In [153]: keyarray = np.array(['S','M','L','XL']) In [158]: data = np.array([[0,2,1], [1,3,2]]) In [159]: keyarray[data] Out[159]: array([['S', 'L', 'M'], ['M', 'XL', 'L']], dtype='|S2') </code></pre> So if we could massage your <code>key</code> array into one that looked like this: <pre class="prettyprint"><code>In [161]: keyarray Out[161]: array(['', '', '', '', '', '', '', '', '', '', 'S', 'S', 'S', 'M', 'L', 'S', 'S', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', 'XL', 'M', 'XL', 'S'], dtype='|S32') </code></pre> So that 10 maps to 'S' in the sense that <code>keyarray[10]</code> equals <code>S</code>, and so forth: <pre class="prettyprint"><code>In [162]: keyarray[10] Out[162]: 'S' </code></pre> then we could produce the desired result with <code>keyarray[data]</code>. <hr> <pre class="prettyprint"><code>import numpy as np data = np.array( [[ 13., 15., 15., 15., 15., 16.], [ 14., 13., 14., 13., 15., 16.], [ 16., 13., 13., 13., 15., 17.], [ 14., 15., 14., 14., 14., 13.], [ 15., 15 , 16., 16., 15., 14.], [ 14., 13., 16., 16., 16., 16.]]) key = np.array([[ 10., 'S'], [ 11., 'S'], [ 12., 'S'], [ 13., 'M'], [ 14., 'L'], [ 15., 'S'], [ 16., 'S'], [ 17., 'XL'], [ 92., 'XL'], [ 93., 'M'], [ 94., 'XL'], [ 95., 'S']]) idx = np.array(key[:,0], dtype=float).astype(int) n = idx.max()+1 keyarray = np.empty(n, dtype=key[:,1].dtype) keyarray[:] = '' keyarray[idx] = key[:,1] data = data.astype('int') print(keyarray[data]) </code></pre> yields <pre class="prettyprint"><code>[['M' 'S' 'S' 'S' 'S' 'S'] ['L' 'M' 'L' 'M' 'S' 'S'] ['S' 'M' 'M' 'M' 'S' 'XL'] ['L' 'S' 'L' 'L' 'L' 'M'] ['S' 'S' 'S' 'S' 'S' 'L'] ['L' 'M' 'S' 'S' 'S' 'S']] </code></pre> Note that <code>data = data.astype('int')</code> is assuming that the floats in <code>data</code> can be uniquely mapped to <code>int</code>s. That appears to be the case with your data, but it is not true for arbitrary floats. For example, <code>astype('int')</code> maps both 1.0 and 1.5 map to 1. <pre class="prettyprint"><code>In [167]: np.array([1.0, 1.5]).astype('int') Out[167]: array([1, 1]) </code></pre>

Replace values of a numpy array by values from another numpy array

Tags:

python

numpy

i have a 1000 * 1000 numpy array with 1 million values which was created as follows :

>>import numpy as np
>>data = np.loadtxt('space_data.txt')
>> print (data)
>>[[ 13.  15.  15. ...,  15.  15.  16.]
   [ 14.  13.  14. ...,  13.  15.  16.]
   [ 16.  13.  13. ...,  13.  15.  17.]
   ..., 
   [ 14.   15.  14. ...,  14.  14.  13.]
   [ 15.   15.  16. ...,  16.  15.  14.]
   [ 14.   13.  16. ...,  16.  16.  16.]]

I have another numpy array which which has 2 columns as follows:

>> print(key)
>>[[ 10.,   S],
   [ 11.,   S],
   [ 12.,   S],
   [ 13.,   M],
   [ 14.,   L],
   [ 15.,   S],
   [ 16.,   S],
   ...,
   [ 92.,   XL],
   [ 93.,   M],
   [ 94.,   XL],
   [ 95.,   S]]

What i would basically want is to replace each element of of the data array with corresponding element in the second column of the key array like this..

>> print(data)
>>[[ M  S  S ...,  S  S  S]
   [ L   M  L ...,  M  S  S]
   [ S   M  M ...,  M  S  XL]
   ..., 
   [ L   S  L ...,  L  L  M]
   [ S   S  S ...,  S  S  L]
   [ L   M  S ...,  S  S  S]]

984

asked Mar 28 '15 18:03

Amistad

3 Answers

In Python dicts are a natural choice for mapping from keys to values. NumPy has no direct equivalent of a dict. But it does have arrays which can do fast integer indexing. For example,

In [153]: keyarray = np.array(['S','M','L','XL'])

In [158]: data = np.array([[0,2,1], [1,3,2]])

In [159]: keyarray[data]
Out[159]: 
array([['S', 'L', 'M'],
       ['M', 'XL', 'L']], 
      dtype='|S2')

So if we could massage your key array into one that looked like this:

In [161]: keyarray
Out[161]: 
array(['', '', '', '', '', '', '', '', '', '', 'S', 'S', 'S', 'M', 'L',
       'S', 'S', '', '', '', '', '', '', '', '', '', '', '', '', '', '',
       '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '',
       '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '',
       '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '', '',
       '', '', '', '', '', '', '', '', '', '', 'XL', 'M', 'XL', 'S'], 
      dtype='|S32')

So that 10 maps to 'S' in the sense that keyarray[10] equals S, and so forth:

In [162]: keyarray[10]
Out[162]: 'S'

then we could produce the desired result with keyarray[data].

import numpy as np

data = np.array( [[ 13.,   15.,  15.,  15.,  15.,  16.],
                  [ 14.,   13.,  14.,  13.,  15.,  16.],
                  [ 16.,   13.,  13.,  13.,  15.,  17.],
                  [ 14.,   15.,  14.,  14.,  14.,  13.],
                  [ 15.,   15 ,  16.,  16.,  15.,  14.],
                  [ 14.,   13.,  16.,  16.,  16.,  16.]])

key = np.array([[ 10., 'S'],
                [ 11., 'S'],
                [ 12., 'S'],
                [ 13., 'M'],
                [ 14., 'L'],
                [ 15., 'S'],
                [ 16., 'S'],
                [ 17., 'XL'],
                [ 92., 'XL'],
                [ 93., 'M'],
                [ 94., 'XL'],
                [ 95., 'S']])

idx = np.array(key[:,0], dtype=float).astype(int)
n = idx.max()+1
keyarray = np.empty(n, dtype=key[:,1].dtype)
keyarray[:] = ''
keyarray[idx] = key[:,1]

data = data.astype('int')
print(keyarray[data])

yields

[['M' 'S' 'S' 'S' 'S' 'S']
 ['L' 'M' 'L' 'M' 'S' 'S']
 ['S' 'M' 'M' 'M' 'S' 'XL']
 ['L' 'S' 'L' 'L' 'L' 'M']
 ['S' 'S' 'S' 'S' 'S' 'L']
 ['L' 'M' 'S' 'S' 'S' 'S']]

Note that data = data.astype('int') is assuming that the floats in data can be uniquely mapped to ints. That appears to be the case with your data, but it is not true for arbitrary floats. For example, astype('int') maps both 1.0 and 1.5 map to 1.

In [167]: np.array([1.0, 1.5]).astype('int')
Out[167]: array([1, 1])

174

answered Oct 10 '22 13:10

unutbu

An un-vectorized linear approach will be to use a dictionary here:

dct = dict(keys)
# new array is required if dtype is different or it it cannot be casted
new_array = np.empty(data.shape, dtype=str)
for index in np.arange(data.size):
    index = np.unravel_index(index, data.shape)
    new_array[index] = dct[data[index]]

answered Oct 10 '22 11:10

Ashwini Chaudhary

import numpy as np

data = np.array([[ 13.,  15.,  15.],
   [ 14.,  13.,  14. ],
   [ 16.,  13.,  13. ]])

key = [[ 10.,   'S'],
   [ 11.,   'S'],
   [ 12.,   'S'],
   [ 13.,   'M'],
   [ 14.,   'L'],
   [ 15.,   'S'],
   [ 16.,   'S']]

data2 = np.zeros(data.shape, dtype=str)

for k in key:
    data2[data == k[0]] = k[1]

answered Oct 10 '22 12:10

Julien Spronck

Related questions
                            
                                Python Invalid format string [duplicate]
                            
                                Python psycopg2 copy_from() to load data throws error for null integer values: DataError: invalid input syntax for integer: ""
                            
                                Test doesn't raise ValidationError on Django model field
                            
                                Simplest way to get the first n elements of an iterator
                            
                                django rest-framework : can't get static files
                            
                                TypeError: object() takes no parameters
                            
                                Python threading self._stop() 'Event' object is not callable
                            
                                10 ,most frequent words in a string Python
                            
                                How to select all children text but excluding a tag with Scapy's XPath?
                            
                                What does ''except Exception as e'' mean in python? [closed]
                            
                                Filter values of dictionary [duplicate]
                            
                                Global variable is not defined - Python
                            
                                from pymongo.objectid import ObjectId ImportError: No module named objectid
                            
                                Python3 adds extra byte when printing hex values
                            
                                Prepending to list python
                            
                                Multilevel JSON diff in python
                            
                                sqlalchemy mysql connections not closing on flask api
                            
                                How to simulate from an (arbitrary) continuous probability distribution? [duplicate]
                            
                                Alternative to redis.keys(...)
                            
                                How do you "echo" quotes using python's os.system()?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With