flat assembler
Message board for the users of flat assembler.

Index > Main > SSE4 Programming reference is released

Author
Thread Post new topic Reply to topic
revolution
When all else fails, read the source


Joined: 24 Aug 2004
Posts: 17476
Location: In your JS exploiting you and your system
revolution
Post 06 Jun 2007, 23:21
View user's profile Send private message Visit poster's website Reply with quote
mattst88



Joined: 12 May 2006
Posts: 260
Location: South Carolina
mattst88
I've also added SSE4.1 instructions to my Instruction Reference. SSE4.2 instructions will be added very shortly.

http://mattst88.no-ip.com:8181/programming/asmref/
Post 06 Jun 2007, 23:57
View user's profile Send private message Visit poster's website Reply with quote
rugxulo



Joined: 09 Aug 2005
Posts: 2341
Location: Usono (aka, USA)
rugxulo
The latest YASM, 0.6.1, has support for SSE 4.1 and 4.2, if that helps any (for comparisons, etc).
Post 07 Jun 2007, 04:10
View user's profile Send private message Visit poster's website Reply with quote
Garthower



Joined: 21 Apr 2006
Posts: 158
Location: Ukraine
Garthower
Yes, a good set of commands. On idea, they should be useful. There is DivX codecs (version 6.6.1), wrotten by means of SSE4, under tests a gain of speed in comparison with the same realization, but by means of SSE2, about 70 %. It would be necessary to try test most the code, rewrite one on SSE4. I hope, that shortly there will be version FASM with their support. Though it's need to think still, whether have the given commands a urgency because the processors supporting them, it is NOT ENOUGH.
Post 07 Jun 2007, 08:59
View user's profile Send private message Visit poster's website MSN Messenger ICQ Number Reply with quote
r22



Joined: 27 Dec 2004
Posts: 805
r22
If CRC32 can (and it hopefully will) outperform an optimized crc32 procedure then it'll be a great boost to security and integrity checks as a whole. -- In the next 3-5 years when SSE4 becomes as common as SSE/2 is now -- but I'd hope we'd have a better architecture all together by then -- Very Happy

DotProduct is a step closer to giving CPU's the same capabilities as GPU's. Just need more SIMD ALU's.

Finally being able to multiply packed Dwords and Qwords is great, although it's kind of late by a decade.
Post 11 Jun 2007, 05:05
View user's profile Send private message AIM Address Yahoo Messenger Reply with quote
Display posts from previous:
Post new topic Reply to topic

Jump to:  


< Last Thread | Next Thread >
Forum Rules:
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum
You cannot attach files in this forum
You can download files in this forum


Copyright © 1999-2020, Tomasz Grysztar. Also on YouTube, Twitter.

Website powered by rwasa.