Binning of data along one axis in numpy

Tags:

I have a large two dimensional array arr which I would like to bin over the second axis using numpy. Because np.histogram flattens the array I'm currently using a for loop:

import numpy as np

arr = np.random.randn(100, 100)

nbins = 10
binned = np.empty((arr.shape[0], nbins))

for i in range(arr.shape[0]):
    binned[i,:] = np.histogram(arr[i,:], bins=nbins)[0]

I feel like there should be a more direct and more efficient way to do that within numpy but I failed to find one.

835

asked Oct 13 '16 10:10

obachtos

2 Answers

You could use np.apply_along_axis:

x = np.array([range(20), range(1, 21), range(2, 22)])

nbins = 2
>>> np.apply_along_axis(lambda a: np.histogram(a, bins=nbins)[0], 1, x)
array([[10, 10],
       [10, 10],
       [10, 10]])

The main advantage (if any) is that it's slightly shorter, but I wouldn't expect much of a performance gain. It's possibly marginally more efficient in the assembly of the per-row results.

107

answered Oct 21 '22 00:10

Ami Tavory

I was a bit confused by the lambda in Ami's solution so I expanded it out to show what it's doing:

def hist_1d(a):
    return np.histogram(a, bins=bins)[0]

counts = np.apply_along_axis(hist_1d, axis=1, arr=x)

answered Oct 21 '22 01:10

ThomasNicholas

Related questions
                            
                                TemplateDoesNotExist at / base.html
                            
                                matplotlib on pycharm with remote ssh intepreter
                            
                                Memory consumption of NumPy function for standard deviation
                            
                                python mock and libraries that are not installed
                            
                                Does multiprocessing.pool.imap has a variant (like starmap) that allows for multiple arguments?
                            
                                Can you fix the false negative rate in a classifier in scikit learn
                            
                                How do I download Anaconda packages without "installing" them?
                            
                                Compiling & installing C executable using python's setuptools/setup.py?
                            
                                How are variables names stored and mapped internally?
                            
                                import m2m relation in django-import-export
                            
                                How do I fix a dimension error in TensorFlow?
                            
                                Idioms in python: closure vs functor vs object
                            
                                What pylint options can be specified in inline comments?
                            
                                How can I create an argparse mutually exclusive group with multiple positional parameters?
                            
                                How do you count cars in OpenCV with Python?
                            
                                How does Apache spark handle python multithread issues?
                            
                                Syntaxnet / Parsey McParseface python API
                            
                                What is the proper way of testing throttling in DRF?
                            
                                Python Profiling: What does "method 'poll' of 'select.poll' objects"?
                            
                                TensorFlow freeze_graph.py: The name 'save/Const:0' refers to a Tensor which does not exist

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Binning of data along one axis in numpy

Tags:

python

numpy

histogram

binning

obachtos

People also ask

2 Answers

Ami Tavory

ThomasNicholas

Recent Activity

Donate For Us