flat assembler
Message board for the users of flat assembler.
![]() |
Author |
|
revolution
|
|||
![]() |
|
Tomasz Grysztar
Of course opcodes are yet confidential?
|
|||
![]() |
|
vid
well... both FASM and YASM already have SSSE3 / SSE4 support
|
|||
![]() |
|
Garthower
Interesting expansions. Commands for work with strings can be very useful. I think, that it's necessary to add support of these commands in FASM.
|
|||
![]() |
|
Tomasz Grysztar
vid: SSSE3 is not SSE4.
|
|||
![]() |
|
vid
so this SSE4 is *not* SSSE3? some real SSE4 this time? wow
|
|||
![]() |
|
smiddy
Hi Tomasz, here ya go: http://www.intel.com/design/processor/manuals/253667.pdf
& http://developer.intel.com/design/processor/manuals/253666.pdf Whoops, a quick look and those instructions from the white paper are not in here, bummer! |
|||
![]() |
|
r22
Packed multiply with 32bit and 64bit is great, something I've personally wanted.
CRC32 opcode sounds like it would be very useful. I'll hold my final judgements until the performance and latency data is available. I can see why dot product is being added, might as well already have it in the architecture when we start seeing CPU/GPU combo processors in a few years. WHAT I STILL WANT: PLB/PLW/PLD/PLDQ packed load byte/word/dword/qword - replacing the addresses stored in the XMM register with the data at those addresses. Parallel random access to memory data would rock. Imagaine being able to do multiple look ups on multiple LUTs at the same time that would be incredibly useful. This combined with packed multiply and packed add for the address would be so cool. Shame I don't design microprocessors :/ |
|||
![]() |
|
Maverick
r22 wrote: Shame I don't design microprocessors :/ You can. Get a FPGA board and learn Verilog. _________________ Greets, Fabio |
|||
![]() |
|
MCD
Maverick wrote:
sure, but not everyone has time/money to do so _________________ MCD - the inevitable return of the Mad Computer Doggy -||__/ .|+-~ .|| || |
|||
![]() |
|
f0dder
Hmm, CRC32 opcode seems pretty silly to me - it's pretty blazing fast already with a LUT implementation. Where would this be useful? Outside of networking, where it really should be moved onto the NIC. Ok, intel mentions iSCSI and RDMA, but ho humm.
|
|||
![]() |
|
comrade
CISC all the way!
|
|||
![]() |
|
< Last Thread | Next Thread > |
Forum Rules:
|
Copyright © 1999-2020, Tomasz Grysztar. Also on GitHub, YouTube, Twitter.
Website powered by rwasa.