Overview- Occurrences: 51195
- Words (tokens): 5766
- Occ./Words: 8,88
- Occurrences of function words: 22167
- Function words: 225
- Average occurrence length: 4,65
- Average word length: 6,8
Occurrence distribution by frequency groups
| 1 | 2608 | =============== | | 2 | 1760 | ========== | | 3 | 1482 | ========= | | 4 | 1132 | ======= | | 5 | 1130 | ======= | | 6 | 870 | ===== | | 7 | 1008 | ====== | | 8 | 736 | ===== | | 9 | 900 | ====== | | 10 | 690 | ==== | | 11 | 616 | ==== | | 12 | 432 | === | | 13 | 728 | ===== | | 14 | 504 | === | | 20-15 | 2702 | ================ | | 50-21 | 7967 | ============================================== | | 100-51 | 5519 | ================================ | | 200-101 | 3795 | ====================== | | 500-201 | 3976 | ======================= | | 1000-501 | 3839 | ====================== | | 2000-1001 | 2777 | ================ | | 3365-2001 | 6024 | =================================== |
|
Word (token) distribution by frequency
Alphabetical occurrence distribution (by first letter)
| (0-9) | 4 | = | | A | 5523 | =================================== | | B | 2670 | ================= | | C | 1872 | ============ | | D | 2026 | ============= | | E | 915 | ====== | | F | 2771 | ================== | | G | 1303 | ========= | | H | 3032 | ==================== | | I | 2234 | =============== | | J | 198 | == | | K | 953 | ======= | | L | 1537 | ========== | | M | 1984 | ============= | | N | 983 | ======= | | O | 2807 | ================== | | P | 1903 | ============= | | Q | 119 | = | | R | 1243 | ======== | | S | 5085 | ================================= | | T | 7130 | ============================================== | | U | 541 | ==== | | V | 638 | ===== | | W | 3194 | ===================== | | X | 23 | = | | Y | 503 | ==== | | Z | 4 | = |
|
Alphabetical word (token) distribution (by first letter)
| (0-9) | 4 | = | | A | 317 | =================== | | B | 345 | ===================== | | C | 435 | ========================== | | D | 345 | ===================== | | E | 192 | ============ | | F | 346 | ===================== | | G | 206 | ============= | | H | 234 | ============== | | I | 152 | ========= | | J | 41 | === | | K | 85 | ===== | | L | 247 | =============== | | M | 266 | ================ | | N | 113 | ======= | | O | 96 | ====== | | P | 357 | ===================== | | Q | 30 | == | | R | 294 | ================== | | S | 767 | ============================================== | | T | 320 | =================== | | U | 130 | ======== | | V | 105 | ======= | | W | 285 | ================= | | X | 15 | = | | Y | 35 | === | | Z | 4 | = |
|
Occurrence distribution by last letter
| (0-9) | 2 | = | | (other) | 4 | = | | A | 2119 | =========== | | B | 6 | = | | C | 110 | = | | D | 7290 | =================================== | | E | 9482 | ============================================== | | F | 1901 | ========== | | G | 1343 | ======= | | H | 1825 | ========= | | I | 470 | === | | J | 1 | = | | K | 432 | === | | L | 1518 | ======== | | M | 784 | ==== | | N | 4267 | ===================== | | O | 1408 | ======= | | P | 119 | = | | Q | 1 | = | | R | 3448 | ================= | | S | 7640 | ===================================== | | T | 3305 | ================ | | U | 388 | == | | V | 64 | = | | W | 467 | === | | X | 34 | = | | Y | 2767 | ============== |
|
Word (token) distribution by last letter
| (0-9) | 2 | = | | (other) | 4 | = | | A | 214 | ========= | | B | 5 | = | | C | 17 | = | | D | 1062 | ========================================== | | E | 698 | ============================ | | F | 27 | == | | G | 414 | ================= | | H | 139 | ====== | | I | 61 | === | | J | 1 | = | | K | 61 | === | | L | 204 | ======== | | M | 62 | === | | N | 383 | =============== | | O | 24 | = | | P | 32 | == | | Q | 1 | = | | R | 302 | ============ | | S | 1162 | ============================================== | | T | 460 | ================== | | U | 23 | = | | V | 15 | = | | W | 44 | == | | X | 12 | = | | Y | 337 | ============== |
|
Occurrence distribution by length
| 1 | 1817 | ======== | | 2 | 6190 | ========================= | | 3 | 11196 | ============================================== | | 4 | 8833 | ==================================== | | 5 | 7545 | =============================== | | 6 | 5189 | ===================== | | 7 | 4257 | ================== | | 8 | 2869 | ============ | | 9 | 1722 | ======= | | 10 | 656 | === | | 11 | 486 | == | | 12 | 145 | = | | 13 | 124 | = | | 14 | 121 | = | | 15 | 25 | = | | 16 | 13 | = | | 17 | 3 | = | | 18 | 3 | = | | 25 | 1 | = |
|
Word (token) distribution by length
| 25 | 1 | = | | 18 | 3 | = | | 17 | 3 | = | | 16 | 9 | = | | 15 | 18 | = | | 14 | 49 | === | | 13 | 76 | ==== | | 12 | 87 | ==== | | 11 | 155 | ======= | | 10 | 294 | ============= | | 9 | 493 | ====================== | | 8 | 742 | ================================= | | 7 | 974 | =========================================== | | 6 | 1020 | ============================================== | | 5 | 922 | ========================================= | | 4 | 685 | =============================== | | 3 | 178 | ======== | | 2 | 45 | == | | 1 | 12 | = |
|
|