flat assembler
Message board for the users of flat assembler.

Text statistical analysis [DONE]

Madis731



Joined: 25 Sep 2003
Posts: 2139
Location: Estonia
Madis731 27 May 2007, 22:41
This program takes text as input and runs a statistical analysis on it. It counts all the one-, two-, ..., N-letter combinations and sorts them lexicographically.
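
To make the idea concrete, here is a minimal Python sketch of the counting step. It is not the ASM implementation from the attachments, only an illustration. The function name count_combinations is made up, and it assumes that words are separated by whitespace and that combinations never span two words.

```python
from collections import Counter

def count_combinations(text, max_len):
    """Count every 1..max_len letter combination inside each word.

    Hypothetical helper for illustration; words are split on whitespace.
    """
    counts = Counter()
    for word in text.split():
        for start in range(len(word)):
            for length in range(1, max_len + 1):
                if start + length > len(word):
                    break  # the combination would run past the end of the word
                counts[word[start:start + length]] += 1
    # Sort lexicographically by the combination itself.
    return sorted(counts.items())

print(count_combinations("abab ba", 2))
# prints [('a', 3), ('ab', 2), ('b', 3), ('ba', 2)]
```

A plain dictionary plus one sort at the end keeps the sketch short; the real program may well use a different data structure.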

Detailed usage:
You can type the input text into the textbox or read it from a file; Alt+I or the "Input" checkbox toggles between the two. Filenames can be relative or absolute.
You can see the output in the other textbox or write it to a file; Alt+O toggles the choice.
Three other checkboxes change the behaviour of the program: Alt+A, Alt+C and Alt+F, standing for custom "A"lfabet (I don't know why "alfa" sounds more natural :P), "C"ase sensitivity, and scanning only from word beginnings or "F"rom every position of the word.
Alt+S starts the program (performance is usually about 1 MB/s). Alt+E exits. "Processing time" shows how long the run took, in milliseconds.
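
The three checkboxes and the timing readout could be modelled roughly like this in Python. This is only a sketch under assumptions: the names analyse, split_words, alphabet, case_sensitive and every_position are invented here, and the guess that a custom alphabet decides which characters count as part of a word is mine, not something the post spells out.

```python
import time
from collections import Counter

def split_words(text, alphabet):
    """Split text into runs of allowed characters (hypothetical helper)."""
    words, current = [], []
    for ch in text:
        allowed = (ch in alphabet) if alphabet is not None else ch.isalpha()
        if allowed:
            current.append(ch)
        elif current:
            words.append("".join(current))
            current = []
    if current:
        words.append("".join(current))
    return words

def analyse(text, max_len, alphabet=None, case_sensitive=False, every_position=True):
    """Count 1..max_len letter combinations with the three options applied."""
    if not case_sensitive:
        text = text.lower()
        if alphabet is not None:
            alphabet = alphabet.lower()
    counts = Counter()
    for word in split_words(text, alphabet):
        # Either scan from every position or only from the word beginning.
        starts = range(len(word)) if every_position else range(1)
        for start in starts:
            for length in range(1, min(max_len, len(word) - start) + 1):
                counts[word[start:start + length]] += 1
    return sorted(counts.items())

# Measure the processing time in milliseconds, like the program's readout.
start_time = time.perf_counter()
result = analyse("Ab ab", 2, every_position=False)
elapsed_ms = (time.perf_counter() - start_time) * 1000.0
print(result)  # prints [('a', 2), ('ab', 2)]
print(f"Processing time: {elapsed_ms:.3f} ms")
```

With case_sensitive=True the same input keeps "A" and "a" apart, and uppercase combinations sort before lowercase ones because the comparison is by code point.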

I don't know if it has a better name in English, but this is what it was called in school: "Täheühendite leidmine" (compound letter search? maybe :P)

Implementation in ASM
64-bit source+executable
32-bit source+executable
(the 64-bit version is only *slightly* better, ~2-3%)
4 MB sample file made from fragments of "Alice" from Project Gutenberg.


Description: The source and executable in English
Download
Filename: YhendidENG.7z
Filesize: 8.29 KB
Downloaded: 790 Time(s)

Description: Source+executable+testdata
Download
Filename: Täheühendid.7z
Filesize: 292.48 KB
Downloaded: 802 Time(s)


_________________
My updated idol :D http://www.agner.org/optimize/


Last edited by Madis731 on 28 May 2007, 13:43; edited 2 times in total
MHajduk



Joined: 30 Mar 2006
Posts: 6115
Location: Poland
MHajduk 28 May 2007, 11:04
Interesting subject. :) Statistical analysis of text is used in cryptology (some old ciphers can be broken that way). :)
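
As one illustration of that, a Caesar (shift) cipher can often be broken just by counting single letters. The Python sketch below assumes lowercase English text in which "e" is the most frequent letter, which holds for the sample sentence but is not guaranteed for short or unusual texts. The function names caesar_shift and guess_shift are made up for this example.

```python
from collections import Counter

def caesar_shift(text, shift):
    """Shift lowercase a-z by `shift` positions; leave other characters alone."""
    out = []
    for ch in text:
        if "a" <= ch <= "z":
            out.append(chr((ord(ch) - ord("a") + shift) % 26 + ord("a")))
        else:
            out.append(ch)
    return "".join(out)

def guess_shift(ciphertext):
    """Guess the shift by assuming the most common letter stands for 'e'."""
    letters = Counter(ch for ch in ciphertext if "a" <= ch <= "z")
    if not letters:
        return 0  # nothing to analyse
    most_common = letters.most_common(1)[0][0]
    return (ord(most_common) - ord("e")) % 26

plain = "meet me here every evening"
cipher = caesar_shift(plain, 3)
shift = guess_shift(cipher)
print(shift)                         # prints 3
print(caesar_shift(cipher, -shift))  # prints meet me here every evening
```

Counting two- and three-letter combinations, as the program in this thread does, gives more evidence than single letters when the text is short.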

Copyright © 1999-2025, Tomasz Grysztar. Also on GitHub, YouTube.
