In [129]: R = np.arange(9).reshape(3,3)
...: XYZ = np.arange(3*600*600).reshape(3, 600, 600)
`tensordot` has two styles of `axes`, the scalar and the tuple:
In [130]: result = np.tensordot(R, XYZ, axes=1)
In [131]: result.shape
Out[131]: (3, 600, 600)
The `einsum` equivalent is:
In [132]: res1 = np.einsum('ij,jkl->ikl',R, XYZ)
In [133]: res1.shape
Out[133]: (3, 600, 600)
In [134]: np.allclose(result, res1)
Out[134]: True
and the tuple axes equivalent:
In [135]: res2 = np.tensordot(R, XYZ, axes=(1,0))
In [136]: res2.shape
Out[136]: (3, 600, 600)
In [137]: np.allclose(result, res2)
Out[137]: True
I think the tuple axes style was the original, with the integer axes added on top of that. Some time ago I worked out how the integer cases were translated into the tuple inputs, but I've forgotten the details.
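As a sanity check of that translation (my understanding of the rule, not verified against the NumPy source): an integer `axes=N` appears to contract the last N axes of the first argument with the first N axes of the second, in order. On a small example:

```python
import numpy as np

a = np.random.rand(2, 3, 4)
b = np.random.rand(3, 4, 5)

# integer form: contract the last 2 axes of a with the first 2 axes of b
int_form = np.tensordot(a, b, axes=2)

# equivalent tuple form, spelling those axes out explicitly
tup_form = np.tensordot(a, b, axes=([1, 2], [0, 1]))

print(int_form.shape)                    # (2, 5)
print(np.allclose(int_form, tup_form))   # True
```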
Anyway, the sum-of-products dimension is the last axis of `R` and the first axis of `XYZ`.
Another way is to multiply element-wise with broadcasting, and then sum:
In [139]: res4=(R[:,:,None,None]*XYZ[None,:,:,:]).sum(axis=1)
In [140]: np.allclose(result, res4)
Out[140]: True
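One caveat with this approach (a general property of broadcasting, easy to check on a smaller stand-in array): it materializes the full outer product before summing, so the intermediate is larger than either input, which `tensordot` avoids:

```python
import numpy as np

R = np.arange(9).reshape(3, 3)
XYZ = np.arange(3 * 4 * 4).reshape(3, 4, 4)  # small stand-in for the (3, 600, 600) array

# the broadcasted product has shape (3, 3, 4, 4) before the sum collapses axis 1
prod = R[:, :, None, None] * XYZ[None, :, :, :]
print(prod.shape)                                         # (3, 3, 4, 4)

res = prod.sum(axis=1)
print(np.allclose(res, np.tensordot(R, XYZ, axes=1)))     # True
```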
By compressing the last dimensions of `XYZ` into one, we get the conventional `dot` product, where the sum-of-products dimension is the last axis of the first argument and the second-to-last of the second:
In [141]: res5 = (R@XYZ.reshape(3,-1)).reshape(3,600,600)
In [142]: np.allclose(result, res5)
Out[142]: True
In [143]: res6 = (R.dot(XYZ.reshape(3,-1))).reshape(3,600,600)
In [144]: np.allclose(result, res6)
Out[144]: True
I believe `tensordot` is effectively doing [143].
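A rough sketch of what I mean (my reading of the strategy, not the actual implementation, and simplified to a single contracted axis): move the contracted axis to the end of the first operand and the front of the second, collapse both to 2-D, and call `dot`:

```python
import numpy as np

def tensordot_sketch(a, b, a_axis, b_axis):
    """Contract one axis of a with one axis of b via transpose + reshape + dot.

    A single-axis sketch; the real np.tensordot handles multiple
    contracted axes along the same lines.
    """
    # move the contracted axes into position: end of a, front of b
    a2 = np.moveaxis(a, a_axis, -1)
    b2 = np.moveaxis(b, b_axis, 0)
    out_shape = a2.shape[:-1] + b2.shape[1:]
    # collapse to 2-D and use an ordinary matrix product
    return a2.reshape(-1, a2.shape[-1]).dot(b2.reshape(b2.shape[0], -1)).reshape(out_shape)

R = np.arange(9).reshape(3, 3)
XYZ = np.arange(3 * 600 * 600).reshape(3, 600, 600)
print(np.allclose(tensordot_sketch(R, XYZ, 1, 0), np.tensordot(R, XYZ, axes=1)))  # True
```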