"Great Expectations", by Charles Dickens Rank Occurances Percentage 1-tuple (of 185167) 1 8143 4.3976518% the 2 7078 3.8224954% and 3 6482 3.5006238% i 4 5079 2.7429294% to 5 4431 2.3929750% of 6 4041 2.1823543% a 7 3025 1.6336604% in 8 2988 1.6136785% that 9 2836 1.5315904% was 10 2669 1.4414015% it 11 2208 1.1924371% he 12 2181 1.1778557% you 13 2093 1.1303310% had 14 2070 1.1179098% my 15 1996 1.0779459% me 16 1858 1.0034185% his 17 1773 0.9575140% as 18 1760 0.9504933% with 19 1639 0.8851469% at 20 1419 0.7663353% on 21 1381 0.7458132% for 22 1349 0.7285315% said 23 1172 0.6329422% her 24 1149 0.6205209% him 25 1084 0.5854175% have 26 1068 0.5767766% but 27 1067 0.5762366% not 28 1033 0.5578748% be 29 887 0.4790270% she 30 881 0.4757867% when 31 809 0.4369029% by 32 797 0.4304223% were 33 791 0.4271819% so 34 784 0.4234016% out 35 781 0.4217814% if 36 761 0.4109804% we 37 747 0.4034196% this 38 734 0.3963989% all 39 689 0.3720965% there 40 681 0.3677761% joe 41 655 0.3537347% is 42 631 0.3407735% no 43 628 0.3391533% what 44 612 0.3305125% up 45 609 0.3288923% been 46 599 0.3234918% would 47 592 0.3197114% mr 48 579 0.3126907% from 49 566 0.3056700% or 50 547 0.2954090% which 51 494 0.2667862% one 52 492 0.2657061% into 53 483 0.2608456% could 54 457 0.2468042% do 55 450 0.2430239% now 56 445 0.2403236% an 57 438 0.2365432% then 58 404 0.2181814% more 59 398 0.2149411% very 60 388 0.2095406% your 61 383 0.2068403% miss 62 382 0.2063003% know 63 371 0.2003597% little 64 368 0.1987395% them 65 366 0.1976594% upon 66 365 0.1971194% down 67 363 0.1960393% they 68 360 0.1944191% time 69 360 0.1944191% come 70 359 0.1938790% again 71 350 0.1890186% should 72 331 0.1787576% who 73 325 0.1755172% some 74 324 0.1749772% looked 75 320 0.1728170% about 76 315 0.1701167% like 77 314 0.1695767% never 78 311 0.1679565% much 79 310 0.1674164% old 80 309 0.1668764% before 81 304 0.1641761% man 82 304 0.1641761% did 83 301 0.1625560% say 84 300 0.1620159% made 85 299 0.1614759% pip 86 293 0.1582355% than 87 291 0.1571554% after 88 290 0.1566154% went 89 287 0.1549952% herbert 90 286 0.1544552% how 91 284 0.1533751% dont 92 274 0.1479745% its 93 274 0.1479745% are 94 273 0.1474345% see 95 272 0.1468944% way 96 269 0.1452743% any 97 266 0.1436541% here 98 266 0.1436541% hand 99 265 0.1431141% us 100 265 0.1431141% being