Frequency Table for normal English text, based on a 10,000 letter sample:
7.78% |
A |
778 |
12.77% |
E |
1277 |
|
1.41% |
B |
141 |
8.55% |
T |
855 |
|
2.96% |
C |
296 |
8.07% |
O |
807 |
|
4.02% |
D |
402 |
7.78% |
A |
778 |
|
12.77% |
E |
1277 |
6.86% |
N |
686 |
|
1.97% |
F |
197 |
6.67% |
I |
667 |
|
1.74% |
G |
174 |
6.51% |
R |
651 |
|
5.95% |
H |
595 |
6.22% |
S |
622 |
|
6.67% |
I |
667 |
5.95% |
H |
595 |
|
0.51% |
J |
51 |
4.02% |
D |
402 |
|
0.74% |
K |
74 |
3.72% |
L |
372 |
|
3.72% |
L |
372 |
3.08% |
U |
308 |
|
2.88% |
M |
288 |
2.96% |
C |
296 |
|
6.86% |
N |
686 |
2.88% |
M |
288 |
|
8.07% |
O |
807 |
2.23% |
P |
223 |
|
2.23% |
P |
223 |
1.97% |
F |
197 |
|
0.08% |
Q |
8 |
1.96% |
Y |
196 |
|
6.51% |
R |
651 |
1.76% |
W |
176 |
|
6.22% |
S |
622 |
1.74% |
G |
174 |
|
8.55% |
T |
855 |
1.41% |
B |
141 |
|
3.08% |
U |
308 |
1.12% |
V |
112 |
|
1.12% |
V |
112 |
0.74% |
K |
74 |
|
1.76% |
W |
176 |
0.51% |
J |
51 |
|
0.27% |
X |
27 |
0.27% |
X |
27 |
|
1.96% |
Y |
196 |
0.17% |
Z |
17 |
|
0.17% |
Z |
17 |
0.08% |
Q |
8 |
From: Friedman: Elements of Cryptanalysis, pp. 4-5
Proportions of vowels and consonants to the total number of letters:
Vowels A E I O U Y … |
40.33% |
40.33% |
||||
High-frequency consonants H N R S T … |
34.09% |
|||||
Medium-frequency consonants |
59.67% |
|||||
D L C M P F W G B V … |
23.81% |
|||||
Low-frequency consonants J K Q X Z … |
1.77% |
|||||
Total: |
100.00% |
100.00% |
From: Friedman: Elements of Cryptanalysis, pp. 4-5
Relative frequencies of the vowels:
A |
19.50% |
|
E |
32.00% |
|
I |
16.70% |
|
O |
20.20% |
|
U |
8.00% |
|
Y |
3.60% |
|
100.00% |
From: Friedman: Elements of Cryptanalysis, pp. 23
One-letter words:
a, I, O.
From: Smith: Cryptography, pp. 153
Order of frequency of most common doubles:
SS EE TT FF LL MM OO
From: Smith: Cryptography, pp. 153
Most frequent digraphs:
TH |
50 |
AT |
25 |
ST |
20 |
||
ER |
40 |
EN |
25 |
IO |
18 |
||
ON |
39 |
ES |
25 |
LE |
18 |
||
AN |
38 |
OF |
25 |
IS |
17 |
||
RE |
36 |
OR |
25 |
OU |
17 |
||
HE |
33 |
NT |
24 |
AR |
16 |
||
IN |
31 |
EA |
22 |
AS |
16 |
||
ED |
30 |
TI |
22 |
DE |
16 |
||
ND |
30 |
TO |
22 |
RT |
16 |
||
HA |
26 |
IT |
20 |
VE |
16 |
From: Friedman: Elements of Cryptanalysis, pp. 22
Most frequent two-letter words:
of, to, in, it, is, be, as, at, so, we, he, by, or,
on, do, if, me, my, up, an, go, no, us, am
From: Smith: Cryptography, pp. 153
Most Common Reversals:
er re
es se
an na
ti it
on no
in ni
en ne
at ta
te et
or ro
to ot
ar ra
st ts
is si
ed de
of fo
Most frequent trigraphs:
THE |
89 |
TIO |
33 |
EDT |
27 |
||
AND |
54 |
FOR |
33 |
TIS |
25 |
||
THA |
47 |
NDE |
31 |
OFT |
23 |
||
ENT |
39 |
HAS |
28 |
STH |
21 |
||
ION |
36 |
NCE |
27 |
MEN |
20 |
From: Friedman: Elements of Cryptanalysis, pp. 23
Most frequent three-letter words:
the, and, for, are, but, not, you, all, any, can, had, her, was, one,
our, out, day, get, has, him, his, how, man, new, now, old, see, two,
way, who, boy, did, its, let, put, say, she, too, use
From: Smith: Cryptography, pp. 153
Most frequent four-letter words:
that, with, have, this, will, your, from, they, know, want, been, good,
much, some, time, very, when, come, here, just, like, long, make, many,
more, only, over, such, take, than, them, well, were
From: Smith: Cryptography, pp. 153-154
Frequency of Initial and Final Letters:
% |
Letters |
Initial |
% |
Letters |
Initial |
|
9.00% |
A |
9 |
17.00% |
T |
17 |
|
6.00% |
B |
6 |
10.00% |
O |
10 |
|
6.00% |
C |
6 |
9.00% |
A |
9 |
|
5.00% |
D |
5 |
7.00% |
W |
7 |
|
2.00% |
E |
2 |
6.00% |
B |
6 |
|
4.00% |
F |
4 |
6.00% |
C |
6 |
|
2.00% |
G |
2 |
5.00% |
D |
5 |
|
3.00% |
H |
3 |
5.00% |
S |
5 |
|
3.00% |
I |
3 |
4.00% |
F |
4 |
|
1.00% |
J |
1 |
4.00% |
M |
4 |
|
1.00% |
K |
1 |
4.00% |
R |
4 |
|
2.00% |
L |
2 |
3.00% |
H |
3 |
|
4.00% |
M |
4 |
3.00% |
I |
3 |
|
2.00% |
N |
2 |
3.00% |
Y |
3 |
|
10.00% |
O |
10 |
2.00% |
E |
2 |
|
2.00% |
P |
2 |
2.00% |
G |
2 |
|
0.00% |
Q |
2.00% |
L |
2 |
||
4.00% |
R |
4 |
2.00% |
N |
2 |
|
5.00% |
S |
5 |
2.00% |
P |
2 |
|
17.00% |
T |
17 |
2.00% |
U |
2 |
|
2.00% |
U |
2 |
1.00% |
J |
1 |
|
0.00% |
V |
1.00% |
K |
1 |
||
7.00% |
W |
7 |
0.00% |
Q |
||
0.00% |
X |
0.00% |
V |
|||
3.00% |
Y |
3 |
0.00% |
X |
||
0.00% |
Z |
0.00% |
Z |
% |
Letters |
Final |
% |
Letters |
Final |
|
1.00% |
A |
1 |
17.00% |
E |
17 |
|
0.00% |
B |
11.00% |
T |
11 |
||
0.00% |
C |
10.00% |
D |
10 |
||
10.00% |
D |
10 |
9.00% |
N |
9 |
|
17.00% |
E |
17 |
9.00% |
S |
9 |
|
6.00% |
F |
6 |
8.00% |
R |
8 |
|
4.00% |
G |
4 |
8.00% |
Y |
8 |
|
2.00% |
H |
2 |
6.00% |
F |
6 |
|
0.00% |
I |
6.00% |
L |
6 |
||
0.00% |
J |
4.00% |
G |
4 |
||
1.00% |
K |
1 |
4.00% |
O |
4 |
|
6.00% |
L |
6 |
2.00% |
H |
2 |
|
1.00% |
M |
1 |
1.00% |
A |
1 |
|
9.00% |
N |
9 |
1.00% |
K |
1 |
|
4.00% |
O |
4 |
1.00% |
M |
1 |
|
1.00% |
P |
1 |
1.00% |
P |
1 |
|
0.00% |
Q |
1.00% |
U |
1 |
||
8.00% |
R |
8 |
1.00% |
W |
1 |
|
9.00% |
S |
9 |
0.00% |
B |
||
11.00% |
T |
11 |
0.00% |
C |
||
1.00% |
U |
1 |
0.00% |
I |
||
0.00% |
V |
0.00% |
J |
|||
1.00% |
W |
1 |
0.00% |
Q |
||
0.00% |
X |
0.00% |
V |
|||
8.00% |
Y |
8 |
0.00% |
X |
||
0.00% |
Z |
0.00% |
Z |
From: Friedman: Elements of Cryptanalysis, pp. 23
Word Endings:
Two word endings appear regularly: TION, and ING, with
the plural versions also. Another common word ending is
NESS. Worth noting also is LLY.