"Pride and Prejudice", by Jane Austen Rank Occurances Percentage 1-tuple (of 121296) 1 4327 3.5673064% the 2 4135 3.4090160% to 3 3608 2.9745416% of 4 3584 2.9547553% and 5 2228 1.8368289% her 6 2062 1.6999736% i 7 1958 1.6142330% a 8 1866 1.5383854% in 9 1845 1.5210724% was 10 1708 1.4081256% she 11 1578 1.3009497% that 12 1530 1.2613771% it 13 1426 1.1756365% not 14 1358 1.1195753% you 15 1338 1.1030867% he 16 1268 1.0453766% his 17 1236 1.0189949% be 18 1179 0.9720024% as 19 1176 0.9695291% had 20 1057 0.8714220% for 21 1053 0.8681243% with 22 1002 0.8260784% but 23 862 0.7106582% is 24 842 0.6941696% have 25 788 0.6496504% at 26 764 0.6298641% him 27 718 0.5919404% my 28 717 0.5911159% on 29 635 0.5235127% by 30 623 0.5136196% mr 31 623 0.5136196% all 32 601 0.4954821% they 33 597 0.4921844% elizabeth 34 588 0.4847645% so 35 564 0.4649782% were 36 538 0.4435431% which 37 526 0.4336499% could 38 515 0.4245812% been 39 488 0.4023216% no 40 488 0.4023216% from 41 487 0.4014972% very 42 478 0.3940773% what 43 470 0.3874819% would 44 453 0.3734666% your 45 447 0.3685200% this 46 447 0.3685200% me 47 440 0.3627490% their 48 433 0.3569780% them 49 407 0.3355428% will 50 400 0.3297718% said 51 388 0.3198786% such 52 372 0.3066878% when 53 351 0.2893748% there 54 350 0.2885503% an 55 348 0.2869015% if 56 346 0.2852526% do 57 340 0.2803060% are 58 339 0.2794816% darcy 59 327 0.2695884% much 60 326 0.2687640% more 61 317 0.2613441% am 62 307 0.2530999% must 63 299 0.2465044% or 64 295 0.2432067% any 65 284 0.2341380% who 66 284 0.2341380% miss 67 283 0.2333135% bennet 68 282 0.2324891% than 69 277 0.2283670% one 70 272 0.2242448% did 71 264 0.2176494% jane 72 254 0.2094051% we 73 249 0.2052829% mrs 74 247 0.2036341% bingley 75 246 0.2028097% should 76 238 0.1962142% know 77 230 0.1896188% how 78 227 0.1871455% herself 79 226 0.1863211% though 80 225 0.1854966% before 81 221 0.1821989% has 82 220 0.1813745% never 83 216 0.1780768% soon 84 215 0.1772523% only 85 213 
0.1756035% well 86 211 0.1739546% think 87 211 0.1739546% some 88 211 0.1739546% can 89 210 0.1731302% other 90 207 0.1706569% now 91 203 0.1673592% every 92 200 0.1648859% time 93 200 0.1648859% might 94 200 0.1648859% after 95 192 0.1582905% may 96 189 0.1558172% little 97 188 0.1549927% most 98 183 0.1508706% own 99 183 0.1508706% lady 100 183 0.1508706% good
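
Each percentage in the listing appears to be the word's count divided by the total number of 1-tuples (121296), times 100; for example, 4327 / 121296 gives roughly 3.5673064%. The sketch below shows one way such a ranking could be produced. The function name `unigram_table` and the tokenization rule (lowercase the text, treat each run of letters as a word) are assumptions made for illustration; the listing does not say how the text was actually tokenized, so counts from this sketch need not match the figures above.

```python
import re
from collections import Counter


def unigram_table(text, top=100):
    """Return (total, rows), where rows are (rank, count, percentage, word)."""
    # Assumption: lowercase everything and treat each run of letters as a word.
    # The tokenizer behind the listing is unknown; this is only one plausible rule.
    words = re.findall(r"[a-z]+", text.lower())
    total = len(words)
    counts = Counter(words)

    rows = []
    # most_common() sorts by descending count; ties keep first-seen order.
    for rank, (word, count) in enumerate(counts.most_common(top), start=1):
        percentage = 100.0 * count / total
        rows.append((rank, count, percentage, word))
    return total, rows


if __name__ == "__main__":
    sample = "The cat and the dog and the bird"
    total, rows = unigram_table(sample, top=3)
    print(f"Rank Occurrences Percentage 1-tuple (of {total})")
    for rank, count, percentage, word in rows:
        print(f"{rank:>4} {count:>11} {percentage:.7f}% {word}")
```

Words with equal counts (such as "mr" and "all" at 623) still get distinct consecutive ranks in the listing, which this sketch mirrors by numbering rows in sorted order rather than assigning shared ranks.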