flat assembler
Message board for the users of flat assembler.
Index
> Projects and Ideas > ANSI & ASCII File Cleaner |
Author |
|
asmfan 05 Aug 2007, 19:49
Wipes out all unneeded (unprintable) characters (spaces, tabulation) at the end of line, leaving printable characters un touched.
USAGE: just drag'n'drop needed file (asm,inc,txt,h etc.) that consists of plain ANSI or ASCII text/code/data on the any exe (UNICODE or ANSI - both work correctly). ANSI version is the best choise for old versions of Windows /poor Unicode support/. EXAMPLE: - say you have file with only spaces inside as input - after processing you'll get zero sized file. - say you have file - " -[bla-bla !!] " as input - after processing you'll get " -[bla-bla !!]" as output. such mechanism has FASM, but i noticed that it works imperfectly (not always correctly - some tabs & spaces are skipped some times by FASM). [ADDED] It's not harmful if you ocassionally put a unicode text file to process - Cleaner just checks for unicode and skips file if true. Checking is done by Byte-order mark. Checks for UTF-8, UTF-16 & UTF-32 (Big & Little Endian). Integration: - you can add it to registry as i did. Just use Regedit. Quote:
Supported following filetypes (also text filetype dynamic check inside) Code: *.h,*.inc,*.asm,*.ini,*.txt,*.hpp,*.c,*.cpp,*.log,*.rc,*.def,*.bat,*.css,*.js,*.xml,*.vbs,*.idl,*.htm,*.html,*.nfo,*.diz and much more that have plain text inside. So go on and clean them out )
_________________ Any offers? Last edited by asmfan on 11 Mar 2009, 17:06; edited 7 times in total |
|||||||||||||||||||||
05 Aug 2007, 19:49 |
|
asmfan 06 Aug 2007, 13:50
New more secure version here.
|
|||
06 Aug 2007, 13:50 |
|
asmfan 06 Aug 2007, 14:09
actually i don't know much unicode files to process (except for modern *.reg files which in UTF-16 Big endian). All sources - c, cpp etc. use ansi encodings as i remember. If you kindly point me what to process by this i'll think of UTF importance.
|
|||
06 Aug 2007, 14:09 |
|
vid 06 Aug 2007, 15:00
for example UTF8 FASM sources?
i doubt there are any existing, i really meant it just as a kind of excercise... |
|||
06 Aug 2007, 15:00 |
|
asmfan 06 Aug 2007, 15:46
No, thanks vid
Until i find at least a file on my computer to optimize such a way i'd better read somethind interesting than programm something i cannot even test on _________________ Any offers? |
|||
06 Aug 2007, 15:46 |
|
vid 06 Aug 2007, 16:46
that is the problem... no one knows unicode, so no one programs with it, and thus we are still living in world full of 8 bit characters
|
|||
06 Aug 2007, 16:46 |
|
vador 07 Aug 2007, 13:24
what are really the advantages of UNICODE other ASCII and ANSI? is the speed increase when using unicode noticeable?
|
|||
07 Aug 2007, 13:24 |
|
vid 07 Aug 2007, 19:25
No, actually Unicode support causes considerable speed decrease.
Advantage is that everyone on world uses same character set, so you don't get couple of "?" or boxes if you try to read something written in other than yours. |
|||
07 Aug 2007, 19:25 |
|
asmfan 08 Aug 2007, 19:37
Updated.
_________________ Any offers? |
|||
08 Aug 2007, 19:37 |
|
asmfan 11 Aug 2007, 13:29
Updated 11 august 2007.
|
|||
11 Aug 2007, 13:29 |
|
asmfan 16 Nov 2007, 07:23
Not an update, but addition to filetypes - htm and html added to list as they hold only plain text, that can be restucted according to Cleaner's algorithm.
If anybody knows addition filetypes that hold plain text - please write there to add them. For few bytes sake;) |
|||
16 Nov 2007, 07:23 |
|
asmfan 10 Dec 2008, 08:50
Improved command line parser
|
|||
10 Dec 2008, 08:50 |
|
asmfan 11 Mar 2009, 17:07
Added support of MS header files (*.h), which contain 0Ch symbol inside ()
|
|||
11 Mar 2009, 17:07 |
|
< Last Thread | Next Thread > |
Forum Rules:
|
Copyright © 1999-2024, Tomasz Grysztar. Also on GitHub, YouTube.
Website powered by rwasa.