Overview
- Occurrences: 51195
- Words (tokens): 5766
- Occ./Words: 8.88
- Occurrences of function words: 22167
- Function words: 225
- Average occurrence length: 4.65
- Average word length: 6.8
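The overview figures above can be reproduced along the following lines. This is a minimal sketch, not the tool that generated this report: the tokenization regex, the lowercasing, and the `function_words` set are all assumptions, since the report does not state how words were split or which function-word list was used.

```python
import re
from collections import Counter


def overview(text, function_words=frozenset()):
    """Compute the overview figures for a text.

    Assumptions (not taken from the report): tokens are runs of letters or
    digits, optionally joined by an apostrophe or hyphen, compared
    case-insensitively; `function_words` is a caller-supplied set.
    """
    tokens = re.findall(r"[^\W_]+(?:['-][^\W_]+)*", text.lower())
    counts = Counter(tokens)            # distinct word -> number of occurrences
    occurrences = sum(counts.values())  # running words
    words = len(counts)                 # distinct words
    present = [w for w in counts if w in function_words]
    return {
        "occurrences": occurrences,
        "words": words,
        "occ_per_word": occurrences / words if words else 0.0,
        "function_word_occurrences": sum(counts[w] for w in present),
        "function_words": len(present),
        # Mean length over every occurrence (frequent short words weigh more).
        "avg_occurrence_length": (
            sum(len(w) * c for w, c in counts.items()) / occurrences
            if occurrences else 0.0
        ),
        # Mean length over distinct words only.
        "avg_word_length": (
            sum(len(w) for w in counts) / words if words else 0.0
        ),
    }
```

The two averages differ because the first weights each word by its frequency while the second counts every distinct word once, which is consistent with the report showing a shorter average occurrence length (4.65) than average word length (6.8).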
Occurrence distribution by frequency groups
1 | 2608 | ===============
2 | 1760 | ==========
3 | 1482 | =========
4 | 1132 | =======
5 | 1130 | =======
6 | 870 | =====
7 | 1008 | ======
8 | 736 | =====
9 | 900 | ======
10 | 690 | ====
11 | 616 | ====
12 | 432 | ===
13 | 728 | =====
14 | 504 | ===
20-15 | 2702 | ================
50-21 | 7967 | ==============================================
100-51 | 5519 | ================================
200-101 | 3795 | ======================
500-201 | 3976 | =======================
1000-501 | 3839 | ======================
2000-1001 | 2777 | ================
3365-2001 | 6024 | ===================================
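The grouping above assigns each distinct word to a group by its frequency and adds that frequency to the group's occurrence total (so the group totals sum to the overall occurrence count). A minimal sketch of that binning follows; the `GROUPS` boundaries are copied from the table, while the function name and the separate `overflow` bucket for frequencies above 2000 are illustrative assumptions.

```python
from collections import Counter

# Inclusive (low, high) frequency ranges, taken from the table above.
# Frequencies above 2000 are returned separately as `overflow`.
GROUPS = [(n, n) for n in range(1, 15)] + [
    (15, 20), (21, 50), (51, 100), (101, 200),
    (201, 500), (501, 1000), (1001, 2000),
]


def occurrences_by_frequency_group(counts, groups=GROUPS):
    """Sum occurrences per frequency group.

    `counts` maps each distinct word to its number of occurrences.
    Returns (totals, overflow), where `totals` maps (low, high) -> occurrences.
    """
    totals = {group: 0 for group in groups}
    overflow = 0
    for freq in counts.values():
        for low, high in groups:
            if low <= freq <= high:
                totals[(low, high)] += freq
                break
        else:
            # No group matched: the word is more frequent than the last bound.
            overflow += freq
    return totals, overflow
```

Under this reading, the first row (2608 occurrences in group 1) corresponds to 2608 words that appear exactly once.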
Word (token) distribution by frequency
Alphabetical occurrence distribution (by first letter)
(0-9) | 4 | =
A | 5523 | ===================================
B | 2670 | =================
C | 1872 | ============
D | 2026 | =============
E | 915 | ======
F | 2771 | ==================
G | 1303 | =========
H | 3032 | ====================
I | 2234 | ===============
J | 198 | ==
K | 953 | =======
L | 1537 | ==========
M | 1984 | =============
N | 983 | =======
O | 2807 | ==================
P | 1903 | =============
Q | 119 | =
R | 1243 | ========
S | 5085 | =================================
T | 7130 | ==============================================
U | 541 | ====
V | 638 | =====
W | 3194 | =====================
X | 23 | =
Y | 503 | ====
Z | 4 | =
Alphabetical word (token) distribution (by first letter)
(0-9) | 4 | =
A | 317 | ===================
B | 345 | =====================
C | 435 | ==========================
D | 345 | =====================
E | 192 | ============
F | 346 | =====================
G | 206 | =============
H | 234 | ==============
I | 152 | =========
J | 41 | ===
K | 85 | =====
L | 247 | ===============
M | 266 | ================
N | 113 | =======
O | 96 | ======
P | 357 | =====================
Q | 30 | ==
R | 294 | ==================
S | 767 | ==============================================
T | 320 | ===================
U | 130 | ========
V | 105 | =======
W | 285 | =================
X | 15 | =
Y | 35 | ===
Z | 4 | =
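The letter-based tables come in pairs: one counts every occurrence, the other counts each distinct word once. The sketch below shows one way to compute both from the same word counts; the function name, its parameters, and the bucketing rules for digits and non-letters are assumptions modeled on the `(0-9)` and `(other)` labels in the tables.

```python
from collections import Counter


def letter_distribution(counts, last=False, by_occurrence=True):
    """Distribution of words by first (or last) character.

    `counts` maps each distinct word to its number of occurrences.
    With by_occurrence=True each word contributes its frequency;
    otherwise each distinct word contributes 1.
    """
    dist = Counter()
    for word, freq in counts.items():
        if not word:
            continue
        ch = word[-1] if last else word[0]
        if ch.isdigit():
            key = "(0-9)"
        elif ch.isalpha():
            key = ch.upper()
        else:
            key = "(other)"
        dist[key] += freq if by_occurrence else 1
    return dist
```

For example, a word such as "the" with a high frequency adds heavily to T in the occurrence table but only 1 to T in the word table, which is why T leads the first table while S leads the second.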
Occurrence distribution by last letter
(0-9) | 2 | =
(other) | 4 | =
A | 2119 | ===========
B | 6 | =
C | 110 | =
D | 7290 | ===================================
E | 9482 | ==============================================
F | 1901 | ==========
G | 1343 | =======
H | 1825 | =========
I | 470 | ===
J | 1 | =
K | 432 | ===
L | 1518 | ========
M | 784 | ====
N | 4267 | =====================
O | 1408 | =======
P | 119 | =
Q | 1 | =
R | 3448 | =================
S | 7640 | =====================================
T | 3305 | ================
U | 388 | ==
V | 64 | =
W | 467 | ===
X | 34 | =
Y | 2767 | ==============
Word (token) distribution by last letter
(0-9) | 2 | =
(other) | 4 | =
A | 214 | =========
B | 5 | =
C | 17 | =
D | 1062 | ==========================================
E | 698 | ============================
F | 27 | ==
G | 414 | =================
H | 139 | ======
I | 61 | ===
J | 1 | =
K | 61 | ===
L | 204 | ========
M | 62 | ===
N | 383 | ===============
O | 24 | =
P | 32 | ==
Q | 1 | =
R | 302 | ============
S | 1162 | ==============================================
T | 460 | ==================
U | 23 | =
V | 15 | =
W | 44 | ==
X | 12 | =
Y | 337 | ==============
Occurrence distribution by length
1 | 1817 | ========
2 | 6190 | =========================
3 | 11196 | ==============================================
4 | 8833 | ====================================
5 | 7545 | ===============================
6 | 5189 | =====================
7 | 4257 | ==================
8 | 2869 | ============
9 | 1722 | =======
10 | 656 | ===
11 | 486 | ==
12 | 145 | =
13 | 124 | =
14 | 121 | =
15 | 25 | =
16 | 13 | =
17 | 3 | =
18 | 3 | =
25 | 1 | =
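Rows like the ones above can be generated from word counts by summing occurrences per word length and scaling each bar against the largest total. The following is a rough sketch: the 46-character maximum matches the longest bars in this report, but the rounding rule (one character plus a proportional share of the remaining 45) is an assumption inferred from the bars, not a documented rule, and the function names are hypothetical.

```python
from collections import Counter

BAR_WIDTH = 46  # length of the longest bar in this report


def bar(value, largest, width=BAR_WIDTH):
    """Render a proportional bar; every non-empty row gets at least one '='.

    Assumed rounding: 1 + floor((width - 1) * value / largest).
    """
    if largest <= 0:
        return ""
    return "=" * (1 + (width - 1) * value // largest)


def length_rows(counts):
    """Build 'length | occurrences | bar' rows from word -> frequency counts."""
    by_length = Counter()
    for word, freq in counts.items():
        by_length[len(word)] += freq
    largest = max(by_length.values(), default=0)
    return [
        f"{length} | {total} | {bar(total, largest)}"
        for length, total in sorted(by_length.items())
    ]
```

To get the word (token) variant of the table instead, add 1 per distinct word rather than `freq` when filling `by_length`.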
Word (token) distribution by length
25 | 1 | =
18 | 3 | =
17 | 3 | =
16 | 9 | =
15 | 18 | =
14 | 49 | ===
13 | 76 | ====
12 | 87 | ====
11 | 155 | =======
10 | 294 | =============
9 | 493 | ======================
8 | 742 | =================================
7 | 974 | ===========================================
6 | 1020 | ==============================================
5 | 922 | =========================================
4 | 685 | ===============================
3 | 178 | ========
2 | 45 | ==
1 | 12 | =