flat assembler
Message board for the users of flat assembler.
Index
> Windows > Unicode in the console. |
Author |
|
AsmGuru62 05 Jan 2012, 21:27
Why not using invoke macro?
Because, CALL is a jump to a direct address and you need to use the indirect address for Win API -- which is in Import Table(s). Try CALL [lstrlenW]. Or even better: invoke lstrlenW, msg |
|||
05 Jan 2012, 21:27 |
|
Picnic 05 Jan 2012, 21:59
@wyvern use brackets
call [lstrlenW] call [ExitProcess] call [SetConsoleOutputCP] call [GetStdHandle] call [WriteConsoleW] Here is a quick demo that displays Greek text in console. Code: format pe console include 'include\encoding\win1253.inc' include 'include\win32wx.inc' .data temp dd ? text du ' ',0 ; write some Greek text .code main: invoke WriteConsole,\ <invoke GetStdHandle,STD_OUTPUT_HANDLE>,\ text,\ <invoke lstrlen, text>,\ temp,\ 0 ret .end main Here is a set of unicode routines for the console. Last edited by Picnic on 30 Mar 2015, 20:19; edited 2 times in total |
|||
05 Jan 2012, 21:59 |
|
typedef 05 Jan 2012, 23:48
first of all, calling your functions should be
Code: call [ function ] not Code: call function try this Code: ;=5 ;for UTF-8 FORMAT PE console ENTRY start INCLUDE 'C:\fasm\include\win32a.inc' SECTION '.text' code readable executable start: PUSH msg CALL [lstrlenW] ;get string len, this is crashing (new problem)... PUSH eax ;string len PUSH msg ;message CALL printW ;print in console PUSH NULL CALL [ExitProcess] ;exit ;prints a wide string printW: PUSH ebp MOV ebp, esp SUB esp, 4 PUSH 869 ;code page for greek... tryed with UTF-8(65001) CALL [SetConsoleOutputCP] PUSH STD_OUTPUT_HANDLE CALL [GetStdHandle] LEA edx, [ebp - 4] PUSH NULL PUSH edx ;chars written PUSH dword [ebp + 12] ;message len PUSH dword [ebp + 8] ;message PUSH eax ;stdout handle CALL [WriteConsoleW] add esp, 4 MOV esp, ebp POP ebp RET SECTION '.data' data readable writeable msg du '-B> A>>1I5=85', 00h ;testing with greek string written dd 0 SECTION '.idata' import data readable writeable library kernel, 'KERNEL32.DLL' import kernel,\ lstrlenW, 'lstrlenW',\ SetConsoleOutputCP, 'SetConsoleOutputCP',\ GetStdHandle, 'GetStdHandle',\ WriteConsoleW, 'WriteConsoleW',\ ExitProcess, 'ExitProcess' |
|||
05 Jan 2012, 23:48 |
|
wyvern 06 Jan 2012, 02:04
Oh yeah, thanks for the correction guys, i forgot the "[]" !!, the code was actually made to work with a linker, so i quickly converted it to "pure fasm" before posting here, just a silly thing.
@Picnic This is your code output: @typedef This your code output: The same result for win 7 and XP, with "Lucida Console" font. What font are you using? By the way, i need always the "=5" at the first line of the source when is UTF-8, if not fasm will say "illegal instruction". _________________ Thanks |
|||
06 Jan 2012, 02:04 |
|
typedef 06 Jan 2012, 03:12
Well, here's my Vista. The same code I gave.
|
|||
06 Jan 2012, 03:12 |
|
Picnic 06 Jan 2012, 11:54
wyvern wrote:
I get correct chars with both lucida and raster font. |
|||
06 Jan 2012, 11:54 |
|
MHajduk 06 Jan 2012, 12:29
The clue is use of 'UTF8.inc' include file:
Quote: format pe console |
|||
06 Jan 2012, 12:29 |
|
wyvern 06 Jan 2012, 15:49
Well, something funny: nobody here seems to use the "=5" prefix at the start of the file.... im the only one who cant assemble without it?
@typedef The "-B> A>>1I5=85" string is erroneous, when i pasted mi code here the original greek string was converted to that. @MHajduk Thanks, the UTF-8.inc is equivalent to write manually the unicode code points: Code: text DW 0x03B5, 0x03BB,0x03BB ;, ... etc for every letter Altough.. i dont know exactly why i cant write the chars directly and why i need that conversion, after all the source file is saved in UTF-8... so?... Another funny thing, Lucida Console doesnt support the graphs for Chineese, Arabic, and others. I guess that i need to add a complete-unicode font for the console options, if somebody knows one please let me know. |
|||
06 Jan 2012, 15:49 |
|
MHajduk 06 Jan 2012, 16:03
wyvern wrote: Well, something funny: nobody here seems to use the "=5" prefix at the start of the file.... im the only one who cant assemble without it? wyvern wrote: Another funny thing, Lucida Console doesnt support the graphs for Chineese, Arabic, and others. I guess that i need to add a complete-unicode font for the console options, if somebody knows one please let me know. |
|||
06 Jan 2012, 16:03 |
|
Tomasz Grysztar 06 Jan 2012, 16:17
MHajduk wrote:
As for the editors, some (like Notepad++) allow you to choose "UTF-8 without BOM" or "UTF-8 with BOM" as written format, some (like PSPad) have an option in configuration to include BOM (by default off), and other, like Notepad, always put BOM there without an option to disable this. So if you use Notepad, you need something like "=5". But if some other editor then saves it without BOM, the source will not compile again. So I think putting something like ".utf:" in the first line would be better - it will work both with BOM and without it. |
|||
06 Jan 2012, 16:17 |
|
Tomasz Grysztar 06 Jan 2012, 16:26
wyvern wrote: Altough.. i dont know exactly why i cant write the chars directly and why i need that conversion, after all the source file is saved in UTF-8... so?... If you wanted to use UTF-8 strings in your program, you would need to use the MultiByteToWideChar API with CP_UTF8 in first parameter. Last edited by Tomasz Grysztar on 06 Jan 2012, 16:27; edited 1 time in total |
|||
06 Jan 2012, 16:26 |
|
wyvern 06 Jan 2012, 16:27
@MHajduk Ok, I will check if "Tahoma" is valid font for the console, there are some restrictions according to Microsoft KB...
@Tomasz Yeah, i was using the simple Win-Notepad. And thanks for the ".utf:", is better. By the way, FASM is really cool!, great work. Last edited by wyvern on 06 Jan 2012, 16:28; edited 1 time in total |
|||
06 Jan 2012, 16:27 |
|
MHajduk 06 Jan 2012, 16:27
Tomasz Grysztar wrote: The "=5" probably is there to "neutralize" the BOM, which is seen by fasm as unrecognized label, so by appending "=5" to it you just create the valid definition of constant. Well, I would suggest to choose a text editor which, like PSPad, allows to configure programmer's environment once in a such way that you don't need to remember all the time what's going on with FASM compiler "guts". |
|||
06 Jan 2012, 16:27 |
|
wyvern 06 Jan 2012, 16:31
@MHajduk I dont know PSPad. Normally i use RADasm for large things, you think PSPad is a better or more configurable envirmoment?
|
|||
06 Jan 2012, 16:31 |
|
MHajduk 06 Jan 2012, 16:39
wyvern wrote: @MHajduk I dont know PSPad. Normally i use RADasm for large things, you think PSPad is a better or more configurable envirmoment? I think you may consider use of Fresh if you want to work with really complicated projects. |
|||
06 Jan 2012, 16:39 |
|
f0dder 06 Jan 2012, 19:03
First: if you change the console codepage, it's probably a good idea to be a nice citizen and return it to it's previous codepage when exiting your app - of course that means a utility (rather than an interactive) app will seem to produce gibberish. Something that ought to be documented in a README.TXT (ie. "make sure to set a unicode-able font, and do a `chcp 65001` to set UTF-8 console encoding).
Also, WriteConsoleOutputW() is great for interactive use - but if you want to support stdout redirection (which is often crucial for non-interactive apps), it's no-go. In that case, you want to use standard WriteFile output, but the CP_UTF8 codepage... and of course UTF-8 (rather than UCS-2/UTF-16) data, which means a WideCharToMultiByte() call. _________________ - carpe noctem |
|||
06 Jan 2012, 19:03 |
|
typedef 06 Jan 2012, 21:42
weird, When I use '=5' I get a compiler error.
|
|||
06 Jan 2012, 21:42 |
|
< Last Thread | Next Thread > |
Forum Rules:
|
Copyright © 1999-2025, Tomasz Grysztar. Also on GitHub, YouTube.
Website powered by rwasa.