C++ High-Performance Coding

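I. Run mode

When exploring high-performance coding, always pay attention to the compiler options: at different optimization levels, the same question can have completely opposite answers!

If you are running inside Visual Studio, it is enough to distinguish between Debug and Release.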
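Since the numbers depend so heavily on the build, it can help to have every timing log state which build produced it. The following is a minimal sketch, assuming the usual convention that Release configurations define NDEBUG (Visual Studio's default project templates do this); build_mode and print_build_banner are hypothetical helper names, not part of the original article.

```cpp
#include <cassert>
#include <cstdio>

// Hypothetical helper: reports the build flavour based on NDEBUG.
// Assumption: the Release configuration defines NDEBUG, the Debug one does not.
inline const char* build_mode() {
#ifdef NDEBUG
    return "release";
#else
    return "debug";
#endif
}

// Prints a header line so a timing log records the build it came from.
inline void print_build_banner() {
    // assert() is itself compiled out when NDEBUG is defined,
    // so this check only runs in debug builds.
    assert(build_mode() != nullptr);
    std::printf("[build: %s]\n", build_mode());
}
```

Calling print_build_banner() at the top of main() before any measurement keeps debug and release figures from being mixed up later. Note that NDEBUG only reflects the project convention; it does not say which optimization flags were actually passed to the compiler.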

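II. Cache friendliness

1. Accessing two-dimensional arrays

When traversing a two-dimensional array, it is best not to jump around in memory.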
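To make "jumping around in memory" concrete, here is a small sketch that computes how far apart two consecutive accesses are for each loop order. It assumes the same Node layout as the article's benchmark (an array of 30000 ints per row); byte_distance, stride_inner_j and stride_inner_i are hypothetical helper names introduced only for illustration.

```cpp
#include <cassert>
#include <cstddef>

constexpr std::size_t N = 30000;  // ints per row, as in the article's benchmark

struct Node {
    int a[N];
};

// Distance in bytes between two addresses inside the same array.
inline std::ptrdiff_t byte_distance(const void* from, const void* to) {
    assert(from != nullptr && to != nullptr);
    return static_cast<const char*>(to) - static_cast<const char*>(from);
}

// Step taken when the inner loop advances j: next int in the same row.
inline std::ptrdiff_t stride_inner_j(const Node* p) {
    return byte_distance(&p[0].a[0], &p[0].a[1]);
}

// Step taken when the inner loop advances i: same column, next row.
inline std::ptrdiff_t stride_inner_i(const Node* p) {
    return byte_distance(&p[0].a[0], &p[1].a[0]);
}
```

With a 4-byte int, stride_inner_j is 4 bytes while stride_inner_i is 120000 bytes. Assuming a 64-byte cache line, the first order uses each fetched line for 16 consecutive ints, whereas the second lands on a different line at every step.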

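#include <stdio.h>
#include <stdlib.h>  /* malloc */
#include <time.h>

#define N 30000
#define M 1000

typedef struct {
    int a[N];
} Node;

/* Print the clock ticks elapsed since the previous checkpoint, then reset it. */
#define OUTCLOCK \
    printf("%ld ", (long)(clock() - theClock)); \
    theClock = clock();

int main()
{
    clock_t theClock = clock();
    Node *p = (Node *)malloc(sizeof(Node) * M);
    OUTCLOCK  /* time spent in malloc */

    /* j outer, i inner: consecutive writes are sizeof(Node) bytes apart */
    for (int j = 0; j < N; j++)
        for (int i = 0; i < M; i++)
            p[i].a[j] = i * j + 1;
    OUTCLOCK

    /* i outer, j inner: consecutive writes are adjacent in memory */
    for (int i = 0; i < M; i++)
        for (int j = 0; j < N; j++)
            p[i].a[j] = i * j + 1;
    OUTCLOCK

    return 0;
}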

Output in debug mode (clock() ticks): 17 81 68

Output in release mode (clock() ticks): 0 45 51

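Analysis:

Generally speaking, the traversal for (int i = 0; i < M; i++) for (int j = 0; j < N; j++) is faster, because it is cache-friendly: the inner loop walks adjacent elements. The debug-mode result (68 vs. 81) agrees with this.

In release mode, however, the order for (int j = 0; j < N; j++) for (int i = 0; i < M; i++) evidently triggered a compiler optimization, and after optimization it came out slightly faster (45 vs. 51) than for (int i = 0; i < M; i++) for (int j = 0; j < N; j++).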
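Results this close are easy to misread, so here is a hedged sketch of a slightly more defensive way to run the same comparison. It is not the article's benchmark: it uses std::chrono::steady_clock instead of clock(), a flat std::vector<int> instead of the Node array, and a checksum so that an optimizing compiler cannot discard the stores as unused. All function names (fill_row_major, fill_col_major, time_ms, checksum) are hypothetical.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <vector>

// Fills a rows x cols matrix stored row by row, walking memory contiguously.
inline void fill_row_major(std::vector<int>& m, std::size_t rows, std::size_t cols) {
    assert(m.size() == rows * cols);
    for (std::size_t i = 0; i < rows; ++i)
        for (std::size_t j = 0; j < cols; ++j)
            m[i * cols + j] = static_cast<int>(i * j + 1);
}

// Produces the same contents, but the inner loop jumps `cols` ints per step.
inline void fill_col_major(std::vector<int>& m, std::size_t rows, std::size_t cols) {
    assert(m.size() == rows * cols);
    for (std::size_t j = 0; j < cols; ++j)
        for (std::size_t i = 0; i < rows; ++i)
            m[i * cols + j] = static_cast<int>(i * j + 1);
}

// Runs f once and returns the elapsed wall-clock time in milliseconds.
template <class F>
double time_ms(F&& f) {
    const auto start = std::chrono::steady_clock::now();
    f();
    const auto stop = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(stop - start).count();
}

// Reads every element back so the written values are actually used.
inline long long checksum(const std::vector<int>& m) {
    long long sum = 0;
    for (int v : m) sum += v;
    return sum;
}
```

A typical use would be time_ms([&] { fill_row_major(m, 1000, 30000); }) followed by printing checksum(m), repeated several times for each order. Because std::vector<int> zero-initializes its elements, the memory has already been touched before the first timed loop, which removes one difference between the first and second measurement in the malloc-based version.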

2. Bulk memory copies

For bulk memory copies, use memcpy instead of element-by-element assignment.

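#include <string.h>  /* memcpy */

/* Node, N, M and OUTCLOCK are as defined in the previous listing. */
int main()
{
    clock_t theClock = clock();
    Node *p = (Node *)malloc(sizeof(Node) * M);
    int *p2 = (int *)malloc(sizeof(int) * N * M);
    OUTCLOCK

    /* element-by-element assignment */
    for (int i = 0; i < M; i++)
        for (int j = 0; j < N; j++)
            p2[i * N + j] = p[i].a[j];
    OUTCLOCK

    /* one bulk copy of the same bytes */
    memcpy(p2, p, sizeof(int) * N * M);
    OUTCLOCK

    return 0;
}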

Output:

0 2811 276
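Replacing the nested loop with a single memcpy is only valid because the Node array is one contiguous run of ints. The sketch below spells out that assumption with static_assert and checks that both approaches produce identical bytes; copy_by_loop and copy_by_memcpy are hypothetical names, and Node mirrors the article's definition.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <type_traits>

constexpr std::size_t N = 30000;  // ints per Node, as in the article

struct Node {
    int a[N];
};

// memcpy over the whole array matches the loop only under these conditions.
static_assert(std::is_trivially_copyable<Node>::value, "Node must be trivially copyable");
static_assert(sizeof(Node) == sizeof(int) * N, "Node must not contain padding");

// Element-by-element copy, equivalent to the article's nested loop.
inline void copy_by_loop(int* dst, const Node* src, std::size_t count) {
    assert(dst != nullptr && src != nullptr);
    for (std::size_t i = 0; i < count; ++i)
        for (std::size_t j = 0; j < N; ++j)
            dst[i * N + j] = src[i].a[j];
}

// Single bulk copy of the same count * N ints.
inline void copy_by_memcpy(int* dst, const Node* src, std::size_t count) {
    assert(dst != nullptr && src != nullptr);
    std::memcpy(dst, src, sizeof(Node) * count);
}
```

Two caveats when reading the timings above. The article does not say which build produced them, and in an optimized build the compiler may narrow the gap considerably. Also, the loop is the first code to touch both freshly malloc'ed buffers, so part of its time may be the operating system mapping pages rather than copying; initializing both buffers before the first checkpoint would separate the two effects.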
