flat assembler
Message board for the users of flat assembler.

Add stream buffer to another
revolution 01 May 2021, 10:49
Overclick wrote:
I'm looking for FPU instructions that do multiple add operations in the same clock cycle.
Assuming you are running this on a system produced in the last 10 or 15 years, and given that the operation is very simple (it's just an add), I think that a cache management strategy would give you all the speed-up that is available.

As long as the operation remains simple, once you have it streaming through the cache then it almost won't matter which instructions you use because the CPU will outpace the external memory bus by a good margin.

Compare option 1:

loop:
load data
operate on data (each operation has to wait for data to arrive)
store data (the data bus has to switch from read mode to write mode each time)
loop a billion times

Option 2:

initiate cache streaming of multiple data (say 1000 items)
loop:
initiate cache streaming of multiple data (say 1000 items) again (second buffer)
operate on first buffer data
initiate writing first buffer data to memory
switch buffers
loop a million times

Notice for option 2 that there is less waiting, and there are many fewer bus direction switches.
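
To make the comparison concrete, here is a minimal C sketch of the two options. It is not code from this thread: the names add_simple, add_streamed, prefetch_block and CHUNK are hypothetical, the 64-byte cache line is an assumption, and __builtin_prefetch is a GCC/Clang builtin rather than standard C. The prefetch is only a hint, and the write-back of finished blocks is left to the hardware here, so treat it as something to benchmark rather than a guaranteed win.

```c
#include <assert.h>
#include <stddef.h>

/* Items per block; hypothetical value standing in for "say 1000 items". */
#define CHUNK 1024

/* Assumed: 64-byte cache lines, i.e. 16 floats per line. */
#define FLOATS_PER_LINE 16

/* Option 1: load, add, store, one item at a time. */
void add_simple(float *dst, const float *src, size_t n)
{
    assert(dst != NULL && src != NULL);
    for (size_t i = 0; i < n; i++)
        dst[i] += src[i];
}

/* Ask the CPU to start pulling a block into cache (a hint, never a fault). */
static void prefetch_block(const float *p, size_t count)
{
    for (size_t i = 0; i < count; i += FLOATS_PER_LINE)
        __builtin_prefetch(p + i);
}

/* Option 2: request the next block while operating on the current one. */
void add_streamed(float *dst, const float *src, size_t n)
{
    assert(dst != NULL && src != NULL);

    /* Initiate streaming of the first block before the loop starts. */
    size_t first = n < CHUNK ? n : CHUNK;
    prefetch_block(src, first);
    prefetch_block(dst, first);

    for (size_t base = 0; base < n; base += CHUNK) {
        size_t count = (n - base < CHUNK) ? n - base : CHUNK;
        size_t next = base + count;

        /* Initiate streaming of the second buffer (the next block). */
        if (next < n) {
            size_t next_count = (n - next < CHUNK) ? n - next : CHUNK;
            prefetch_block(src + next, next_count);
            prefetch_block(dst + next, next_count);
        }

        /* Operate on the block that should already be in cache. */
        for (size_t i = 0; i < count; i++)
            dst[base + i] += src[base + i];
    }
}
```

Both functions produce the same result, so they can be swapped in a benchmark over a buffer much larger than the cache. Modern hardware prefetchers already detect simple linear streams, so the measured difference depends on the CPU and on the value chosen for CHUNK.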

If you don't try it then you won't see how much faster things can really be. :P
Overclick 01 May 2021, 11:24
I'll try, but it seems better to switch my buffers to FLOAT anyway, to avoid an additional conversion step in the Windows audio engine. I'm working on my own engine's strategy at the moment, then I'll see where and which type of data works better. Thanks for the advice.
Overclick 01 May 2021, 11:30
A nice site that describes FPU instructions:
https://www.officedaytime.com/simd512e
Copyright © 1999-2024, Tomasz Grysztar. Also on GitHub, YouTube.
