Convolution along one axis only

Asked 8/3, 2011 at 5:31 Answered 10/4 at 9:20

numpy signal-processing scipy linear-algebra convolution

I have two 2-D arrays with the same first axis dimensions. In python, I would like to convolve the two matrices along the second axis only. I would like to get C below without computing the convolution along the first axis as well.

import numpy as np
import scipy.signal as sg

M, N, P = 4, 10, 20
A = np.random.randn(M, N)
B = np.random.randn(M, P)

C = sg.convolve(A, B, 'full')[(2*M-1)/2]

Is there a fast way?

Himes answered 8/3, 2011 at 5:31 Comment(0)

You can use np.apply_along_axis to apply np.convolve along the desired axis. Here is an example of applying a boxcar filter to a 2d array:

import numpy as np

a = np.arange(10)
a = np.vstack((a,a)).T

filt = np.ones(3)

np.apply_along_axis(lambda m: np.convolve(m, filt, mode='full'), axis=0, arr=a)

This is an easy way to generalize many functions that don't have an axis argument.

Faradmeter answered 6/3, 2014 at 5:43 Comment(3)

does this have any speedup vs a loop along each row? – Agni 5/4, 2014 at 19:32

@Agni Nope, it doesn't. See this answer... – Clap 6/8, 2016 at 0:57

np.apply_along_axis really can't be used here, since the OP wants it to iterate over two arrays. The way to do that is to simply use a loop, as described here. – Clap 6/8, 2016 at 1:50

With ndimage.convolve1d, you can specify the axis...

Disinherit answered 10/3, 2011 at 14:24 Comment(6)

Thanks for pointing that out. The 'weights' argument, however, needs to be 1-D. It is 2-D in my case. – Himes 10/3, 2011 at 19:4

@Paul: what is the context? What are the weights, B? – Disinherit 10/3, 2011 at 19:39

Each row in A is being filtered by the corresponding row in B. I could implement it like that, just thought there might be a faster way. A is on the order of 10s of gigabytes in size and I use overlap-add. – Himes 11/3, 2011 at 5:50

lfilter has a similar issue that your can only specify a 1-d filter. I think this is just not a common use case. – Himes 12/3, 2011 at 20:35

@Himes - your comment that Each row in A is filtered by the corresponding row in B makes me think you want a result of shape (M,N+P-1) whose contents are [[sg.convolve(A[0,:],B[0,:],'full'],[sg.convolve(A[1,:],B[1,:],'full'],...,[sg.convolve(A[M-1,:],B[M-1,:],'full']]. But your example does something different. Could you explain the desired output again? – Vitrification 2/4, 2011 at 20:45

This answer still isn't answering the question, as Paul says in his comments. – Clap 6/8, 2016 at 0:55

np.apply_along_axis won't really help you, because you're trying to iterate over two arrays. Effectively, you'd have to use a loop, as described here.

Now, loops are fine if your arrays are small, but if N and P are large, then you probably want to use FFT to convolve instead.

However, you need to appropriately zero pad your arrays first, so that your "full" convolution has the expected shape:

M, N, P = 4, 10, 20
A = np.random.randn(M, N)
B = np.random.randn(M, P)

A_ = np.zeros((M, N+P-1), dtype=A.dtype)
A_[:, :N] = A
B_ = np.zeros((M, N+P-1), dtype=B.dtype)
B_[:, :P] = B

A_fft = np.fft.fft(A_, axis=1)
B_fft = np.fft.fft(B_, axis=1)
C_fft = A_fft * B_fft

C = np.real(np.fft.ifft(C_fft))

# Test
C_test = np.zeros((M, N+P-1))
for i in range(M):
    C_test[i, :] = np.convolve(A[i, :], B[i, :], 'full')

assert np.allclose(C, C_test)

Clap answered 6/8, 2016 at 1:24 Comment(0)

for 2D arrays, the function scipy.signal.convolve2d is faster and scipy.signal.fftconvolve can be even faster (depending on the dimensions of the arrays):

Here the same code with N = 100000

import time    
import numpy as np
import scipy.signal as sg

M, N, P = 10, 100000, 20
A = np.random.randn(M, N)
B = np.random.randn(M, P)

T1 = time.time()
C = sg.convolve(A, B, 'full')
print(time.time()-T1)

T1 = time.time()
C_2d = sg.convolve2d(A, B, 'full')
print(time.time()-T1)

T1 = time.time()
C_fft = sg.fftconvolve(A, B, 'full')
print(time.time()-T1)

>>> 12.3
>>> 2.1
>>> 0.6

Answers are all the same with slight differences due to different computation methods used (e.g., fft vs direct multiplication, but i don't know what exaclty convolve2d uses):

print(np.max(np.abs(C - C_2d)))
>>>7.81597009336e-14

print(np.max(np.abs(C - C_fft)))
>>>1.84741111298e-13

Hypnotherapy answered 8/11, 2016 at 11:29 Comment(0)

Late answer, but worth posting for reference. Quoting from comments of the OP:

Each row in A is being filtered by the corresponding row in B. I could implement it like that, just thought there might be a faster way.

A is on the order of 10s of gigabytes in size and I use overlap-add.

Naive / Straightforward Approach

import numpy as np
import scipy.signal as sg

M, N, P = 4, 10, 20
A = np.random.randn(M, N) # (4, 10)
B = np.random.randn(M, P) # (4, 20)

C = np.vstack([sg.convolve(a, b, 'full') for a, b in zip(A, B)])

>>> C.shape
(4, 29)

Each row in A is convolved with each respective row in B, essentially convolving M 1D arrays/vectors.

No Loop + CUDA Supported Version

It is possible to replicate this operation by using PyTorch's F.conv1d. We have to imagine A as a 4-channel, 1D signal of length 10. We wish to convolve each channel in A with a specific kernel of length 20. This is a special case called a depthwise convolution, often used in deep learning.

Note that torch's conv is implemented as cross-correlation, so we need to flip B in advance to do actual convolution.

import torch
import torch.nn.functional as F

@torch.no_grad()
def torch_conv(A, B):
    M, N, P = A.shape[0], A.shape[1], B.shape[1]
    C = F.conv1d(A, B[:, None, :], bias=None, stride=1, groups=M, padding=N+(P-1)//2)
    return C.numpy()

# Convert A and B to torch tensors + flip B

X = torch.from_numpy(A) # (4, 10)
W = torch.from_numpy(np.fliplr(B).copy()) # (4, 20)

# Do grouped conv and get np array
Y = torch_conv(X, W)

>>> Y.shape
(4, 29)

>>> np.allclose(C, Y)
True

Advantages of using a depthwise convolution with torch:

No loops!
The above solution can also run on CUDA/GPU, which can really speed things up if A and B are very large matrices. (From OP's comment, this seems to be the case: A is 10GB in size.)

Disadvantages:

Overhead of converting from array to tensor (should be negligible)
Need to flip B once before the operation

Capwell answered 30/11, 2022 at 16:52 Comment(0)

Both scipy.signal.oaconvolve() and scipy.signal.fftconvolve() provide the axes argument, which enables applying convolution along the given axes (or, in your case, axis) only.

Both functions behave rather similar to scipy.signal.convolve() (in fact, with the right settings, convolve() internally calls fftconvolve()). In particular, both functions provide the same mode argument as convolve() for controlling the treatment of the signal boundaries.

Serbocroatian answered 10/4 at 9:20 Comment(0)

Naive / Straightforward Approach

No Loop + CUDA Supported Version

Recommended topics

Hot tags