Overview
- Occurrences: 147347
- Words (tokens): 6815
- Occ./Words: 21.62
- Occurrences of function words: 87582
- Function words: 237
- Average occurrence length: 4.11
- Average word length: 6.94
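As a rough illustration of how the overview figures relate to one another, the sketch below computes the same kinds of quantities from raw text. The tokenization rule (case-folded runs of letters and digits), the function name `overview`, and the `function_words` parameter are assumptions made for this example; the tool that produced the report may tokenize differently.

```python
import re
from collections import Counter


def overview(text, function_words=frozenset()):
    """Compute overview statistics for a text (illustrative sketch)."""
    # Assumption: a token is a case-folded run of letters/digits.
    tokens = re.findall(r"[^\W_]+", text.lower())
    counts = Counter(tokens)      # distinct word -> number of occurrences
    occurrences = len(tokens)     # running tokens
    words = len(counts)           # distinct words
    return {
        "occurrences": occurrences,
        "words": words,
        "occ_per_word": occurrences / words if words else 0.0,
        # Occurrences that belong to function words.
        "function_word_occurrences": sum(
            c for w, c in counts.items() if w in function_words
        ),
        # Distinct function words present in the text.
        "function_words": sum(1 for w in counts if w in function_words),
        # Mean length over running tokens (frequent short words weigh more).
        "avg_occurrence_length": (
            sum(len(t) for t in tokens) / occurrences if occurrences else 0.0
        ),
        # Mean length over distinct words (each word counted once).
        "avg_word_length": (
            sum(len(w) for w in counts) / words if words else 0.0
        ),
    }
```

The reported ratio is consistent with this reading: 147347 / 6815 ≈ 21.62. The gap between the two averages (4.11 per occurrence versus 6.94 per distinct word) is what one would expect when short words are the most frequent ones.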
Occurrence distribution by frequency groups
| Frequency | Occurrences | Bar |
| --- | --- | --- |
| 1 | 2738 | ====== |
| 2 | 1904 | ==== |
| 3 | 1590 | === |
| 4 | 1380 | === |
| 5 | 1350 | === |
| 6 | 1374 | === |
| 7 | 1162 | === |
| 8 | 1096 | === |
| 9 | 864 | == |
| 10 | 980 | == |
| 11 | 935 | == |
| 12 | 828 | == |
| 13 | 689 | == |
| 14 | 952 | == |
| 20-15 | 3870 | ======== |
| 50-21 | 11908 | ======================= |
| 100-51 | 10572 | ==================== |
| 200-101 | 17526 | ================================= |
| 500-201 | 18752 | ==================================== |
| 1000-501 | 14920 | ============================= |
| 2000-1001 | 18067 | ================================== |
| 5000-2001 | 9942 | =================== |
| 9759-5001 | 23948 | ============================================== |
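The table above appears to count occurrences, not distinct words, per frequency: every single-frequency row is an exact multiple of its frequency (for example, 1904 occurrences at frequency 2 would correspond to 952 words). A minimal sketch of that aggregation follows, assuming a pre-tokenized input list; the function name `occurrences_by_frequency` is hypothetical, and the grouping into ranges such as 50-21 is left out.

```python
from collections import Counter


def occurrences_by_frequency(tokens):
    """Map each word frequency f to the total occurrences of words seen f times."""
    counts = Counter(tokens)  # word -> frequency
    by_freq = Counter()
    for freq in counts.values():
        # A word that appears `freq` times contributes `freq` occurrences.
        by_freq[freq] += freq
    return dict(by_freq)
```

Dividing each value by its key gives the number of distinct words at that frequency, and summing all values gives back the total number of occurrences.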
Word (token) distribution by frequency
Alphabetical occurrence distribution (by first letter)
| First letter | Occurrences | Bar |
| --- | --- | --- |
| (0-9) | 5706 | ========== |
| A | 18154 | ============================= |
| B | 6195 | ========== |
| C | 4413 | ======= |
| D | 3503 | ====== |
| E | 2295 | ==== |
| F | 5749 | ========== |
| G | 3193 | ====== |
| H | 10590 | ================= |
| I | 6859 | =========== |
| J | 1312 | === |
| K | 1160 | == |
| L | 3378 | ====== |
| M | 5351 | ========= |
| N | 3363 | ====== |
| O | 8473 | ============== |
| P | 3451 | ====== |
| Q | 61 | = |
| R | 1930 | ==== |
| S | 9480 | =============== |
| T | 28446 | ============================================== |
| U | 3075 | ===== |
| V | 553 | = |
| W | 9538 | ================ |
| X | 3 | = |
| Y | 1066 | == |
| Z | 50 | = |
Alphabetical word (token) distribution (by first letter)
| First letter | Words | Bar |
| --- | --- | --- |
| (0-9) | 96 | ====== |
| A | 510 | ============================== |
| B | 412 | ======================== |
| C | 576 | ================================== |
| D | 393 | ======================= |
| E | 303 | ================== |
| F | 346 | ===================== |
| G | 206 | ============ |
| H | 282 | ================= |
| I | 170 | ========== |
| J | 110 | ======= |
| K | 49 | === |
| L | 232 | ============== |
| M | 336 | ==================== |
| N | 133 | ======== |
| O | 169 | ========== |
| P | 448 | ========================== |
| Q | 25 | == |
| R | 347 | ===================== |
| S | 776 | ============================================== |
| T | 359 | ===================== |
| U | 121 | ======== |
| V | 79 | ===== |
| W | 298 | ================== |
| X | 1 | = |
| Y | 18 | == |
| Z | 20 | == |
Occurrence distribution by last letter
| Last letter | Occurrences | Bar |
| --- | --- | --- |
| (0-9) | 5706 | ========== |
| A | 2374 | ==== |
| B | 113 | = |
| C | 26 | = |
| D | 19580 | =============================== |
| E | 28428 | ============================================== |
| F | 6287 | ========== |
| G | 3017 | ===== |
| H | 5727 | ========== |
| I | 996 | == |
| K | 737 | == |
| L | 4598 | ======== |
| M | 4150 | ======= |
| N | 10523 | ================= |
| O | 6791 | =========== |
| P | 733 | == |
| R | 8636 | ============== |
| S | 14450 | ======================= |
| T | 14027 | ======================= |
| U | 1151 | == |
| W | 1279 | === |
| X | 44 | = |
| Y | 7971 | ============= |
| Z | 3 | = |
Word (token) distribution by last letter
| Last letter | Words | Bar |
| --- | --- | --- |
| (0-9) | 96 | ==== |
| A | 124 | ==== |
| B | 27 | = |
| C | 7 | = |
| D | 936 | ============================== |
| E | 761 | ======================== |
| F | 26 | = |
| G | 432 | ============== |
| H | 549 | ================== |
| I | 45 | == |
| K | 68 | === |
| L | 206 | ======= |
| M | 93 | === |
| N | 492 | ================ |
| O | 31 | = |
| P | 41 | == |
| R | 353 | ============ |
| S | 1432 | ============================================== |
| T | 541 | ================== |
| U | 11 | = |
| W | 64 | === |
| X | 10 | = |
| Y | 467 | =============== |
| Z | 3 | = |
Occurrence distribution by length
| Length | Occurrences | Bar |
| --- | --- | --- |
| 1 | 4736 | ====== |
| 2 | 26696 | ================================ |
| 3 | 37905 | ============================================== |
| 4 | 31910 | ====================================== |
| 5 | 15979 | =================== |
| 6 | 9677 | ============ |
| 7 | 8072 | ========== |
| 8 | 5077 | ======= |
| 9 | 3992 | ===== |
| 10 | 1718 | === |
| 11 | 937 | == |
| 12 | 343 | = |
| 13 | 210 | = |
| 14 | 71 | = |
| 15 | 24 | = |
Word (token) distribution by length
| Length | Words | Bar |
| --- | --- | --- |
| 15 | 5 | = |
| 14 | 20 | = |
| 13 | 52 | == |
| 12 | 111 | ===== |
| 11 | 250 | ========== |
| 10 | 467 | ================== |
| 9 | 763 | ============================= |
| 8 | 946 | ==================================== |
| 7 | 1202 | ============================================== |
| 6 | 1111 | ========================================== |
| 5 | 917 | =================================== |
| 4 | 635 | ======================== |
| 3 | 196 | ======== |
| 2 | 127 | ===== |
| 1 | 13 | = |