Skip to content

Improve compiler optimization#9

Open
vmarkovtsev wants to merge 1 commit into
michaelchughes:masterfrom
vmarkovtsev:patch-1
Open

Improve compiler optimization#9
vmarkovtsev wants to merge 1 commit into
michaelchughes:masterfrom
vmarkovtsev:patch-1

Conversation

@vmarkovtsev
Copy link
Copy Markdown

-march-native enables more aggressive vectorization and code generation for exactly your CPU

-fopenmp enables parallel matrix multiplication in Eigen (it makes 2x speedup using 12 cores on 100000x256 and K=1024)

`-march-native` enables more aggressive vectorization and code generation for exactly your CPU

`-fopenmp` enables parallel matrix multiplication in Eigen (it makes 2x speedup using 12 cores on 100000x256 and K=1024)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant