C++编译器性能比较

作者佚名技术来源程序设计浏览发布时间 2012-06-30

现在市面上，主流的C/C++编译器包括M$的CL、gcc、Intel的icl、PGI的pgcc及Codegear的bcc（原来属于Borland公司）。Windows上使用最多的自然是cl，而在更广阔的平台上，gcc则是C/C++编译器的首选。但要提到能力优化，排名就未必与它们的市场占有率一致了。

今天一时兴起，便做了一个各编译器数值性能的比较。测试的代码是一个求积分的程序，来源于intel编译器的例子程序，修改了一个头文件，以便每个编译器都能编译。

#include <stdio.h> #include <stdlib.h> #include <time.h> #include <math.h> // Function to be integrated // Define and prototype it here // | sin(x) | #define INTEG_FUNC(x) fabs(sin(x)) // Prototype timing function double dclock(void); int main(void) { // Loop counters and number of interior points unsigned int i, j, N; // Stepsize, independent variable x, and accumulated sum double step, x_i, sum; // Timing variables for evaluation double start, finish, duration, clock_t; // Start integral from double interval_begin = 0.0; // Complete integral at double interval_end = 2.0 * 3.141592653589793238; // Start timing for the entire application start = clock(); printf(" \n"); printf(" Number of | Computed Integral | \n"); printf(" Interior Points | | \n"); for (j=2;j<27;j++) { printf("------------------------------------- \n"); // Compute the number of (internal rectangles + 1) N = 1 << j; // Compute stepsize for N-1 internal rectangles step = (interval_end - interval_begin) / N; // Approx. 1/2 area in first rectangle: f(x0) * [step/2] sum = INTEG_FUNC(interval_begin) * step / 2.0; // Apply midpoint rule: // Given length = f(x), compute the area of the // rectangle of width step // Sum areas of internal rectangle: f(xi + step) * step for (i=1;i<N;i++) { x_i = i * step; sum += INTEG_FUNC(x_i) * step; } // Approx. 1/2 area in last rectangle: f(xN) * [step/2] sum += INTEG_FUNC(interval_end) * step / 2.0; printf(" %10d | %14e | \n", N, sum); } finish = clock(); duration = (finish - start); printf(" \n"); printf(" Application Clocks = %10e \n", duration); printf(" \n"); return 0; }

当然，这个代码来自于intel，当然非常适合intel的编译器。以下的测试在Intel Core 2 Duo上进行。

gcc （GCC TDM-2 for MinGW） 4.3.0 VC 9.0 （cl 15.00.21022.08） Intel （icl 10.1） PGI （pgcc 7.16） CodeGear （bcc32 6.10）

禁止优化

-O0 /Od -Od -O0 -Od

17161 14461 12441 10514 13400

17133 14430 11687 9956 12917

17155 14476 11871 10099 13026

编译选项 -O2

13011 7737 4540 9348 12636

16571 7706 4185 9148 13026

16573 7706 4042 9183 13057

针对平台的优化

-march=core2 -O2 /arch：SSE2 /O2 -QxT -tp core2 -O2 无

16060 7710 1938 9578

测试的结果说明，在数值计算方法，intel的编译器是非常利害的，特别是针对某CPU的优化，能提高很多性能。GCC表现却有些让人失望。在禁止优化到-O2级优化的对比中，可以看出intel与m$的编译器的优化效果是非常明显的，而其它编译器优化后的提高非常有限。如果给个排名，那么将是 icl>cl>pgcc>bcc>gcc.

另外，在一台P4 1.5G的机器，linux环境下，测试得到

gcc icc pgCC

-O2 -O2 -O2

凌众科技专业提供服务器租用、服务器托管、企业邮局、虚拟主机等服务，公司网站：http://www.lingzhong.cn 为了给广大客户了解更多的技术信息，本技术文章收集来源于网络,凌众科技尊重文章作者的版权，如果有涉及你的版权有必要删除你的文章，请和我们联系。以上信息与文章正文是不可分割的一部分,如果您要转载本文章,请保留以上信息，谢谢!

1 2 下一页

分享到：更多

你可能对下面的文章感兴趣

上一篇: 在BCB中使用VCL控件数组1下一篇: 在BCB中使用VCL控件数组2

关于C++编译器性能比较的所有评论

随机推荐