flat assembler
Message board for the users of flat assembler.
Index
> Main > Intel plans doubling 16 general purpose registers to 32 Goto page Previous 1, 2, 3, 4 |
Author |
|
tthsqe 07 Aug 2023, 04:41
revolution, you have a habit of giving this same generic canned-like response every time the topic of code timing comes up. Memory, cache, data access, optimize the high level algorithm, ..., small code may be better than huge code that trashes the code cache, ..., micro benchmarks are useless, test your whole program, ..., etc . I have heard all of this multiple times before and am well aware of where the hotspots are and how to find them. Sometimes the bottlenecks really do fall under the rubric of in-register throughput computation.
Now, I was simply musing (and revolution is not the intended audience) that at some point intel's nominal add/mul latency ratio went from 3/5 to 4/4, i.e. that addition became relatively "slower". Since Anger Fog goes through the trouble of assigning values here and intel even posts numbers on its intrinsics guide, they must be worth something. And, before revolution jumps back in tells me that I will know nothing untill I time my whole app, I would just like to mention that the only real success I've had applying these latency numbers is in the case of these trivial micro loops. ha. |
|||
07 Aug 2023, 04:41 |
|
revolution 07 Aug 2023, 05:54
I'm glad the message is starting to get through.
I hate to see people waste time on the wrong and give up on assembly out of frustration when their code is no faster than a good C compiler. |
|||
07 Aug 2023, 05:54 |
|
bitRAKE 07 Aug 2023, 13:12
I was born a C compiler then I had a growth spurt around Pascal.
tthsqe wrote: why didn't I hear anyone complaining that add and mul now have the same latency? If no bottlenecks exist in fetch/decode/retire then instruction binning into execution ports drives throughput. (Assuming no dependencies.) _________________ ¯\(°_o)/¯ “languages are not safe - uses can be” Bjarne Stroustrup |
|||
07 Aug 2023, 13:12 |
|
Goto page Previous 1, 2, 3, 4 < Last Thread | Next Thread > |
Forum Rules:
|
Copyright © 1999-2024, Tomasz Grysztar. Also on GitHub, YouTube.
Website powered by rwasa.