python convolution with different dimension

Tags:

I'm trying to implement convolutional neural network in Python.
However, when I use signal.convolve or np.convolve, it can not do convolution on X, Y(X is 3d, Y is 2d). X are training minibatches. Y are filters. I don't want to do for loop for every training vector like:

Click to copy

for i in xrange(X.shape[2]):
    result = signal.convolve(X[:,:,i], Y, 'valid')
    ....

So, is there any function I can use to do convolution efficiently?

522

asked Aug 30 '16 07:08

Vito

1 Answers

Scipy implements standard N-dimensional convolutions, so that the matrix to be convolved and the kernel are both N-dimensional.

A quick fix would be to add an extra dimension to Y so that Y is 3-Dimensional:

Click to copy

result = signal.convolve(X, Y[..., None], 'valid')

I'm assuming here that the last axis corresponds to the image index as in your example [width, height, image_idx] (or [height, width, image_idx]). If it is the other way around and the images are indexed in the first axis (as it is more common in C-ordering arrays) you should replace Y[..., None] with Y[None, ...].

The line Y[..., None] will add an extra axis to Y, making it 3-dimensional [kernel_width, kernel_height, 1] and thus, converting it to a valid 3-Dimensional convolution kernel.

NOTE: This assumes that all your input mini-batches have the same width x height, which is standard in CNN's.

EDIT: Some timings as @Divakar suggested.

The testing framework is setup as follows:

Click to copy

def test(S, N, K):
    """ S: image size, N: num images, K: kernel size"""
    a = np.random.randn(S, S, N)
    b = np.random.randn(K, K)
    valid = [slice(K//2, -K//2+1), slice(K//2, -K//2+1)]

    %timeit signal.convolve(a, b[..., None], 'valid')
    %timeit signal.fftconvolve(a, b[..., None], 'valid')
    %timeit ndimage.convolve(a, b[..., None])[valid]

Find bellow tests for different configurations:

Varying image size S:

Click to copy

>>> test(100, 50, 11) # 100x100 images
1 loop, best of 3: 909 ms per loop
10 loops, best of 3: 116 ms per loop
10 loops, best of 3: 54.9 ms per loop

>>> test(1000, 50, 11) # 1000x1000 images
1 loop, best of 3: 1min 51s per loop
1 loop, best of 3: 16.5 s per loop
1 loop, best of 3: 5.66 s per loop

Varying number of images N:

Click to copy

>>> test(100, 5, 11) # 5 images
10 loops, best of 3: 90.7 ms per loop
10 loops, best of 3: 26.7 ms per loop
100 loops, best of 3: 5.7 ms per loop

>>> test(100, 500, 11) # 500 images
1 loop, best of 3: 9.75 s per loop
1 loop, best of 3: 888 ms per loop
1 loop, best of 3: 727 ms per loop

Varying kernel size K:

Click to copy

>>> test(100, 50, 5) # 5x5 kernels
1 loop, best of 3: 217 ms per loop
10 loops, best of 3: 100 ms per loop
100 loops, best of 3: 11.4 ms per loop

>>> test(100, 50, 31) # 31x31 kernels
1 loop, best of 3: 4.39 s per loop
1 loop, best of 3: 220 ms per loop
1 loop, best of 3: 560 ms per loop

So, in short, ndimage.convolve is always faster, except when the kernel size is very large (as K = 31 in the last test).

197

answered Sep 20 '22 20:09

Imanol Luengo

Related questions
                            
                                What is the unit of height variable in "barh" of matplotlib?
                            
                                Python/Pandas - creating new variable based on several variables and if/elif/else function
                            
                                Making 1 milion requests with aiohttp/asyncio - literally
                            
                                Write Custom Python-Based Gradient Function for an Operation? (without C++ Implementation)
                            
                                python smallest range from multiple lists
                            
                                Best way to take mean/sum of block matrix in numpy? [duplicate]
                            
                                Fit t distribution using scipy with predetermined mean and std(loc & scale)?
                            
                                Capture interactive Python shell output along with input
                            
                                Pandas set_Value with DatetimeIndex [Python]
                            
                                Spark: equivelant of zipwithindex in dataframe
                            
                                Inherit namedtuple from a base class in python
                            
                                Using itertools for arbitrary number of nested loops of different ranges with dependencies?
                            
                                How to print the console to a text file AFTER the program finishes (Python)?
                            
                                How to sort numpy array by absolute value of a column?
                            
                                Interpolate sleep() and print() in the same line inside a for loop using python 3 [duplicate]
                            
                                using sdl2 in Kivy instead of pygame
                            
                                Convert numbered pinyin to pinyin with tone marks
                            
                                Django foreign key relation in template
                            
                                Python shutil copytree: use ignore function to keep specific files types
                            
                                Update Counter collection in python with string, not letter

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

python convolution with different dimension

Tags:

python

numpy

scipy

deep-learning

Vito

People also ask

1 Answers

Imanol Luengo

Recent Activity

Donate For Us