flat assembler
Message board for the users of flat assembler.
![]() |
Author |
|
Tomasz Grysztar
A really interesting article about AVX-512 performance (with experimental approach) posted today:
Gathering Intel on Intel AVX-512 Transitions. |
|||
![]() |
|
revolution
I think the outcome there is if you need top performance then execute the wide AVX instructions in batches, and not intersperse them amongst other code.
|
|||
![]() |
|
Tomasz Grysztar
Another amazing article: AVX-512 Mask Registers, Again.
|
|||
![]() |
|
revolution
Tomasz Grysztar wrote: Another amazing article: AVX-512 Mask Registers, Again. What happens if I mix FPU instructions with AVX-512 instructions in a single stream? Does the OS have to do a context save/restore each time the new instruction type is executed? So not only do we get a forced down-clock, we also get a forced context change. Yuck. |
|||
![]() |
|
revolution
Furs wrote: That's not what I understood. It's about the physical register space being shared. Which is quite large (128 registers) due to renaming. |
|||
![]() |
|
< Last Thread | Next Thread > |
Forum Rules:
|
Copyright © 1999-2020, Tomasz Grysztar. Also on GitHub, YouTube, Twitter.
Website powered by rwasa.