Thursday, June 17, 2004

Fun With Numbers: Textalyser


Via a Blogcritics post, I learned about the Textalyser, which analyses any chunk of text or any webpage and can spit some fun statistics back at you.

I tried it with the posts on today's page of Half-Bakered (minus the two side-columns of links) and got the following results:

Total word count : 7794
Number of different words : 2534
Complexity factor (Lexical Density) : 32.5%
Readability (Gunning-Fog Index) : (6-easy 20-hard) 6.6
Total number of characters : 58418
Number of characters without spaces : 44679
Average Syllables per Word : 1.58
Sentence count : 807
Average sentence length (words) : 12.51
Max sentence length (words) : 55
(one other thing if you are familiar with organizations which claim to hold \ the truth\ or \ the way\ or claim \ to have the best intentions of the people\ in mind infinitely self righteous crusading and self certain groups you will most often also find those same groups very opposed to anyone who questions their claims)
Min sentence length (words) : 1
( ok)
Readability (Alternative) beta : (100-easy 20-hard, optimal 60-70) 60.3

Frequency and top words :
Word Occurrences Frequency Rank
the 469 6% 1
and 300 3.8% 2
that 134 1.7% 3
was 74 0.9% 4
you 72 0.9% 4
for 72 0.9% 4
this 66 0.8% 5
with 65 0.8% 5
but 55 0.7% 6
have 50 0.6% 7

Word Length :
Word Length (characters) Word count Frequency
3 2088 19.9%
4 1859 17.7%
2 1792 17.1%
5 1124 10.7%
6 851 8.1%
1 821 7.8%
7 684 6.5%
8 476 4.5%
9 367 3.5%
10 200 1.9%
11 97 0.9%
12 44 0.4%
19 35 0.3%
13 23 0.2%
14 14 0.1%
15 3 0%
17 2 0%
16 1 0%



Whee! Numbers, tables, percentages. I'm a happy pig; watch me root.

No comments: