Frequency Table for normal English text, based on a 10,000 letter sample:

7.78%

A

778

12.77%

E

1277

1.41%

B

141

8.55%

T

855

2.96%

C

296

8.07%

O

807

4.02%

D

402

7.78%

A

778

12.77%

E

1277

6.86%

N

686

1.97%

F

197

6.67%

I

667

1.74%

G

174

6.51%

R

651

5.95%

H

595

6.22%

S

622

6.67%

I

667

5.95%

H

595

0.51%

J

51

4.02%

D

402

0.74%

K

74

3.72%

L

372

3.72%

L

372

3.08%

U

308

2.88%

M

288

2.96%

C

296

6.86%

N

686

2.88%

M

288

8.07%

O

807

2.23%

P

223

2.23%

P

223

1.97%

F

197

0.08%

Q

8

1.96%

Y

196

6.51%

R

651

1.76%

W

176

6.22%

S

622

1.74%

G

174

8.55%

T

855

1.41%

B

141

3.08%

U

308

1.12%

V

112

1.12%

V

112

0.74%

K

74

1.76%

W

176

0.51%

J

51

0.27%

X

27

0.27%

X

27

1.96%

Y

196

0.17%

Z

17

0.17%

Z

17

0.08%

Q

8

From: Friedman: Elements of Cryptanalysis, pp. 4-5

 

 

 

 

Proportions of vowels and consonants to the total number of letters:

Vowels A E I O U Y …

40.33%

40.33%

High-frequency consonants H N R S T …

34.09%

Medium-frequency consonants

59.67%

D L C M P F W G B V …

23.81%

Low-frequency consonants J K Q X Z …

1.77%

Total:

100.00%

100.00%

From: Friedman: Elements of Cryptanalysis, pp. 4-5

 

 

Relative frequencies of the vowels:

A

19.50%

E

32.00%

I

16.70%

O

20.20%

U

8.00%

Y

3.60%

100.00%

From: Friedman: Elements of Cryptanalysis, pp. 23

 

 

One-letter words:

a, I, O.

From: Smith: Cryptography, pp. 153

 

 

Order of frequency of most common doubles:

SS EE TT FF LL MM OO

From: Smith: Cryptography, pp. 153

 

 

Most frequent digraphs:

TH

50

AT

25

ST

20

ER

40

EN

25

IO

18

ON

39

ES

25

LE

18

AN

38

OF

25

IS

17

RE

36

OR

25

OU

17

HE

33

NT

24

AR

16

IN

31

EA

22

AS

16

ED

30

TI

22

DE

16

ND

30

TO

22

RT

16

HA

26

IT

20

VE

16

From: Friedman: Elements of Cryptanalysis, pp. 22

 

 

Most frequent two-letter words:

of, to, in, it, is, be, as, at, so, we, he, by, or,

on, do, if, me, my, up, an, go, no, us, am

From: Smith: Cryptography, pp. 153

 

 

Most Common Reversals:

er re

es se

an na

ti it

on no

in ni

en ne

at ta

te et

or ro

to ot

ar ra

st ts

is si

ed de

of fo

 

 

Most frequent trigraphs:

THE

89

TIO

33

EDT

27

AND

54

FOR

33

TIS

25

THA

47

NDE

31

OFT

23

ENT

39

HAS

28

STH

21

ION

36

NCE

27

MEN

20

From: Friedman: Elements of Cryptanalysis, pp. 23

 

 

Most frequent three-letter words:

the, and, for, are, but, not, you, all, any, can, had, her, was, one,

our, out, day, get, has, him, his, how, man, new, now, old, see, two,

way, who, boy, did, its, let, put, say, she, too, use

From: Smith: Cryptography, pp. 153

 

 

Most frequent four-letter words:

that, with, have, this, will, your, from, they, know, want, been, good,

much, some, time, very, when, come, here, just, like, long, make, many,

more, only, over, such, take, than, them, well, were

From: Smith: Cryptography, pp. 153-154

 

 

Frequency of Initial and Final Letters:

%

Letters

Initial

%

Letters

Initial

9.00%

A

9

17.00%

T

17

6.00%

B

6

10.00%

O

10

6.00%

C

6

9.00%

A

9

5.00%

D

5

7.00%

W

7

2.00%

E

2

6.00%

B

6

4.00%

F

4

6.00%

C

6

2.00%

G

2

5.00%

D

5

3.00%

H

3

5.00%

S

5

3.00%

I

3

4.00%

F

4

1.00%

J

1

4.00%

M

4

1.00%

K

1

4.00%

R

4

2.00%

L

2

3.00%

H

3

4.00%

M

4

3.00%

I

3

2.00%

N

2

3.00%

Y

3

10.00%

O

10

2.00%

E

2

2.00%

P

2

2.00%

G

2

0.00%

Q

2.00%

L

2

4.00%

R

4

2.00%

N

2

5.00%

S

5

2.00%

P

2

17.00%

T

17

2.00%

U

2

2.00%

U

2

1.00%

J

1

0.00%

V

1.00%

K

1

7.00%

W

7

0.00%

Q

0.00%

X

0.00%

V

3.00%

Y

3

0.00%

X

0.00%

Z

0.00%

Z

%

Letters

Final

%

Letters

Final

1.00%

A

1

17.00%

E

17

0.00%

B

11.00%

T

11

0.00%

C

10.00%

D

10

10.00%

D

10

9.00%

N

9

17.00%

E

17

9.00%

S

9

6.00%

F

6

8.00%

R

8

4.00%

G

4

8.00%

Y

8

2.00%

H

2

6.00%

F

6

0.00%

I

6.00%

L

6

0.00%

J

4.00%

G

4

1.00%

K

1

4.00%

O

4

6.00%

L

6

2.00%

H

2

1.00%

M

1

1.00%

A

1

9.00%

N

9

1.00%

K

1

4.00%

O

4

1.00%

M

1

1.00%

P

1

1.00%

P

1

0.00%

Q

1.00%

U

1

8.00%

R

8

1.00%

W

1

9.00%

S

9

0.00%

B

11.00%

T

11

0.00%

C

1.00%

U

1

0.00%

I

0.00%

V

0.00%

J

1.00%

W

1

0.00%

Q

0.00%

X

0.00%

V

8.00%

Y

8

0.00%

X

0.00%

Z

0.00%

Z

From: Friedman: Elements of Cryptanalysis, pp. 23

 

 

Word Endings:

Two word endings appear regularly: TION, and ING, with

the plural versions also. Another common word ending is

NESS. Worth noting also is LLY.

 

 

 


Go Back