Overview- Occurrences: 113334
- Words (tokens): 7444
- Occ./Words: 15,22
- Occurrences of function words: 72468
- Function words: 269
- Average occurrence length: 4,17
- Average word length: 7,4
Occurrence distribution by frequency groups
| 1 | 3342 | ======== | | 2 | 2418 | ====== | | 3 | 1923 | ===== | | 4 | 1544 | ==== | | 5 | 1270 | === | | 6 | 1260 | === | | 7 | 973 | === | | 8 | 1032 | === | | 9 | 900 | === | | 10 | 870 | == | | 11 | 715 | == | | 12 | 804 | == | | 13 | 611 | == | | 14 | 532 | == | | 20-15 | 3175 | ======== | | 50-21 | 8536 | ==================== | | 100-51 | 8503 | ==================== | | 200-101 | 9293 | ===================== | | 500-201 | 15928 | ==================================== | | 1000-501 | 13489 | =============================== | | 2000-1001 | 16094 | ==================================== | | 4674-2001 | 20122 | ============================================== |
|
Word (token) distribution by frequency
Alphabetical occurrence distribution (by first letter)
| (0-9) | 1372 | ==== | | A | 11683 | =========================== | | B | 6293 | =============== | | C | 3390 | ======== | | D | 2878 | ======= | | E | 2127 | ===== | | F | 4635 | =========== | | G | 1846 | ===== | | H | 5578 | ============= | | I | 9159 | ===================== | | J | 312 | = | | K | 615 | == | | L | 2640 | ====== | | M | 6319 | =============== | | N | 3643 | ========= | | O | 6831 | ================ | | P | 2373 | ====== | | Q | 54 | = | | R | 1616 | ==== | | S | 7012 | ================ | | T | 20211 | ============================================== | | U | 1814 | ===== | | V | 622 | == | | W | 9422 | ===================== | | Y | 882 | == | | Z | 7 | = |
|
Alphabetical word (token) distribution (by first letter)
| (0-9) | 70 | ==== | | A | 475 | ========================== | | B | 351 | =================== | | C | 637 | ================================== | | D | 517 | ============================ | | E | 378 | ==================== | | F | 428 | ======================= | | G | 219 | ============ | | H | 271 | =============== | | I | 303 | ================ | | J | 54 | === | | K | 46 | === | | L | 245 | ============= | | M | 314 | ================= | | N | 114 | ======= | | O | 165 | ========= | | P | 531 | ============================ | | Q | 26 | == | | R | 435 | ======================= | | S | 854 | ============================================== | | T | 354 | =================== | | U | 200 | =========== | | V | 118 | ======= | | W | 315 | ================= | | Y | 21 | == | | Z | 3 | = |
|
Occurrence distribution by last letter
| (0-9) | 1372 | === | | A | 1313 | === | | B | 30 | = | | C | 77 | = | | D | 13361 | ============================= | | E | 20886 | ============================================== | | F | 4483 | ========== | | G | 2972 | ======= | | H | 4625 | ========== | | I | 2532 | ====== | | K | 567 | == | | L | 2627 | ====== | | M | 2130 | ===== | | N | 7454 | ================= | | O | 5959 | ============= | | P | 326 | = | | R | 6445 | ============== | | S | 11314 | ========================= | | T | 14104 | =============================== | | U | 1148 | === | | W | 1140 | === | | X | 17 | = | | Y | 8451 | =================== | | Z | 1 | = |
|
Word (token) distribution by last letter
| (0-9) | 70 | === | | A | 22 | = | | B | 11 | = | | C | 22 | = | | D | 1201 | ====================================== | | E | 931 | ============================== | | F | 32 | = | | G | 686 | ====================== | | H | 305 | ========== | | I | 2 | = | | K | 70 | === | | L | 219 | ======= | | M | 65 | === | | N | 447 | ============== | | O | 34 | == | | P | 40 | == | | R | 369 | ============ | | S | 1444 | ============================================== | | T | 723 | ======================= | | U | 4 | = | | W | 58 | == | | X | 7 | = | | Y | 681 | ====================== | | Z | 1 | = |
|
Occurrence distribution by length
| 1 | 4704 | ========= | | 2 | 22164 | ======================================= | | 3 | 25979 | ============================================== | | 4 | 22665 | ======================================== | | 5 | 12076 | ===================== | | 6 | 8489 | =============== | | 7 | 6998 | ============= | | 8 | 3808 | ======= | | 9 | 3242 | ====== | | 10 | 1680 | === | | 11 | 761 | == | | 12 | 396 | = | | 13 | 245 | = | | 14 | 73 | = | | 15 | 33 | = | | 16 | 13 | = | | 17 | 2 | = | | 18 | 3 | = | | 19 | 3 | = |
|
Word (token) distribution by length
| 19 | 3 | = | | 18 | 3 | = | | 17 | 2 | = | | 16 | 10 | = | | 15 | 17 | = | | 14 | 56 | === | | 13 | 101 | ==== | | 12 | 203 | ======== | | 11 | 377 | ============== | | 10 | 634 | ======================= | | 9 | 919 | ================================= | | 8 | 1089 | ======================================== | | 7 | 1255 | ============================================== | | 6 | 1096 | ======================================== | | 5 | 837 | =============================== | | 4 | 587 | ====================== | | 3 | 152 | ====== | | 2 | 89 | ==== | | 1 | 14 | = |
|
|