为什么 MEX 代码比 matlab 代码慢得多？

Question

首先，请不要将此问题标记为重复，因为一旦详细研究就会清楚。

我正在尝试实现正交匹配追踪算法。为此，我需要找到大小为 144*14596 和 144*1 的两个矩阵的点积，如下所示

        clc,clear;

        load('E');
        load('R');
        load('P');


        sparse=zeros(14596,2209);

            dictionary=tem2;

            atoms=zeros(size(dictionary,1),size(dictionary,2));
            coefs=zeros(size(dictionary,2),1);
        tic
            %Normalize the dictionary 
            for index=1:size(dictionary,2)
               dictionary(:,index)=dictionary(:,index)./norm(dictionary(:,index)); 
            end
            D=dictionary;

      /*  NOTE: I tried for ii=1:5 to check the difference in computational time*/

         for ii=1:2209

            r=tem4(:,ii);
            dictionary=D;
            index=[];
            count=0;
            t=5;
            while(t>1e-15 && count~=144)
    /***************Problem lies here**************/
        % inner_product=dictionary'*r; %Dot Product (Should be slow but is fast)
        inner_product=dotProduct(dictionary',r); %(Should be fast but is very slow)
/****************************************************/
                [m,ind]=max(abs(inner_product));

                index=[index ind];
                atoms(:,ind)=dictionary(:,ind); %Select atom which has maximum inner product
                dictionary(:,ind)=0;
                at=atoms(:,index);
                x=(at'*at)\(at'*r);
                coefs(index)=x;
                r=r-at*x;
                t=norm(r);
                count=count+1;
            end
                sparse(:,ii)=coefs;

         end

        sig=D*sparse;
        final=uint8((repmat((((max(tem4))-min(tem4))./((max(sig)-min(sig)))),size(tem4,1),1).*(sig-repmat(min(sig),size(tem4,1),1)))+repmat(min(tem4),size(tem4,1),1));  

        toc

但我面临的问题是，在 MATLAB 中使用以下代码找出点积需要花费大量时间（如探查器报告中所示）。

inner_product=dictionary'*r;

为了减少计算时间，我写了如下所示的MEX代码来找出点积：

/***********************************************************************
 *Program to create a MEX-file to find the dot product of matrices     *
 *Created by: Navdeep Singh                                            * 
 *@Copyright Reserved                                                  * 
 ***********************************************************************/

#include "mex.h"

void dot_prod(double *m1,double *m2, double *t,size_t M,size_t N, size_t M2,size_t N2 )
{   
    int i,j,k;
    double s;

    for(i=0;i<M;i++)
    {   for(k=0;k<N2;k++)
        {   s=0;
            for(j=0;j<N;j++)
            {   s=s+*((m1+i)+(M*j))*(*(m2+(j+M2*k)));
            }
            *((t+i)+(M*k))=s;
        }
    }
}  

void mexFunction(int nlhs,mxArray *plhs[],int nrhs, const mxArray *prhs[])
{   double *mat1,*mat2,*out;
    size_t rows_mat1,cols_mat1,rows_mat2,cols_mat2;
    mat1=mxGetPr(prhs[0]);
    mat2=mxGetPr(prhs[1]);
    rows_mat1=mxGetM(prhs[0]);
    cols_mat1=mxGetN(prhs[0]);
    rows_mat2=mxGetM(prhs[1]);
    cols_mat2=mxGetN(prhs[1]);
    plhs[0]=mxCreateDoubleMatrix(rows_mat1,cols_mat2,mxREAL);
    out=mxGetPr(plhs[0]);
    dot_prod(mat1,mat2,out,rows_mat1,cols_mat1,rows_mat2,cols_mat2);

}

但令我惊讶的是，我发现 MEX 解决方案比 MATLAB 中使用的解决方案慢得多，这违背了 MEX 的最终目的。为了知道原因，我在互联网上搜索了很多，发现了一些有趣的事实，例如：

Matlab: Does calling the same mex function repeatedly from a loop incur too much overhead?

Matlab mex-file with mexCallMATLAB is almost 300 times slower than the corresponding m-file

这些 links 表明开销不应该太多，如果有的话，它总是用于第一次调用，因为加载符号表等需要时间。 -- 但与此相反，我发现我的代码中产生了很多开销。

我还发现参数的大小并不重要，尽管参数的数量会影响计算时间，但它又是最小的。 link 之一还建议释放动态分配的内存（除了由 matlab 本身分配的内存），但我也没有任何此类分配。

所以请告诉我背后的原因是什么

为什么 MEX 占用大量时间？

有什么解决办法吗？

非常感谢您的帮助。

可以在这里找到各种文件：

Answer 1

Matlab 具有高度优化的代码来计算矩阵的点积，

您刚刚编写了一个嵌套的 for 循环来计算点积，因此您可以将此 Mex 代码与 matlab 中的“类似的嵌套 for 循环”进行比较，然后决定 MEX 代码更快还是 matlab，

其实matlab并没有使用嵌套for循环来计算矩阵的点积，

来自 MATLAB 文档：

MEX-Files have several applications:

calling large pre-existing c/c++ and FORTRAN programs from MATLAB without rewriting them as MATLAB functions

Replacing performance-critical routines with c/c++ implementations

MEX files are not appropriate for all applications. MATLAB is a high-productivity environment whose specialty is eliminating time-consuming, low-level programming in compiled languages like C or C++. In general, do your programming in MATLAB. Do not use MEX files unless your application requires it.

EXAMPLE1

为什么 MEX 代码比 matlab 代码慢得多？

Why is MEX code much slower than the matlab code?

matlab

computation

mex

visual-studio-2012