Encryption and Decryption in Digital Communications

By Bernard Sklar and Fredric J. Harris
Dec 24, 2020

📄 Contents

␡

17.1 Models, Goals, and Early Cipher Systems
17.2 The Secrecy of a Cipher System
17.3 Practical Security
17.4 Stream Encryption
17.5 Public Key Cryptosystems
17.6 Pretty Good Privacy
17.7 Conclusion
References
Problems
Questions

⎙ Print

< Back Page 2 of 10 Next >

This chapter is from the book 

Digital Communications: Fundamentals and Applications, 3rd Edition

Learn More Buy

17.2 The Secrecy of a Cipher System

17.2.1 Perfect Secrecy

Consider a cipher system with a finite message space {M} = M₀, M₁, . . . , M_{N - 1} and a finite ciphertext space {C} = C₀, C₁,... , C_{U – 1}. For any M_i, the a priori probability that M_i is transmitted is P(M_i). Given that C_j is received, the a posteriori probability that M_i was transmitted is P(M_i |C_j). A cipher system is said to have perfect secrecy if for every message M_i and every ciphertext C_j, the a posteriori probability is equal to the a priori probability:

Thus, for a system with perfect secrecy, a cryptanalyst who intercepts C_j obtains no further information to enable him or her to determine which message was transmitted. A necessary and sufficient condition for perfect secrecy is that for every M_i and C_j,

The schematic in Figure 17.4 illustrates an example of perfect secrecy. In this example, {M} = M₀, M₁, M₂, M₃, {C} = C₀, C₁, C₂, C₃, {K} = K₀, K₁, K₂, K₃, N = U = 4,

Figure 17.4 Example of perfect secrecy.

and P(M_i) = P(C_j) = . The transformation from message to ciphertext is obtained by

where T_{K_j} indicates a transformation under the key, K_j, and x modulo-y is defined as the remainder of dividing x by y. Thus s = 0, 1, 2, 3. A cryptanalyst intercepting one of the ciphertext messages C_s = C₀, C₁, C₂, or C₃ would have no way of determining which of the four keys was used, and therefore whether the correct message is M₀, M₁, M₂, or M₃. A cipher system in which the number of messages, the number of keys, and the number of ciphertext transformations are all equal is said to have perfect secrecy if and only if the following two conditions are met:

There is only one key transforming each message to each ciphertext.
All keys are equally likely.

If these conditions are not met, there would be some message M_i such that for a given C_j, there is no key that can decipher C_j into M_i, implying that P(M_i |C_j) = 0 for some i and j. The cryptanalyst could then eliminate certain plaintext messages from consideration, thereby simplifying the task. Perfect secrecy is a very desirable objective since it means that the cipher system is unconditionally secure. It should be apparent, however, that for systems which transmit a large number of messages, the amount of key that must be distributed for perfect secrecy can result in formidable management problems, making such systems impractical. Since in a system with perfect secrecy, the number of different keys is at least as great as the number of possible messages, if we allow messages of unlimited length, perfect secrecy requires an infinite amount of key.

EXAMPLE 17.1 BREAKING A CIPHER SYSTEM WHEN THE KEY SPACE IS SMALLER THAN THE MESSAGE SPACE

Consider that the 29-character ciphertext

G R O B O K B O D R O R O B Y O C Y P I O C D O B I O K B

was produced by a Caesar cipher (see Section 17.1.4) such that each letter has been shifted by K positions, where 1 ≤ K ≤ 25. Show how a cryptanalyst can break this code.

Solution

Because the number of possible keys (there are 25) is smaller than the number of possible 29-character meaningful messages (there are a myriad), perfect secrecy cannot be achieved. In the original polyalphabetic cipher of Figure 17.3, a plaintext character is replaced by a letter of increasingly higher rank as the row number (K) increases. Hence, in analyzing the ciphertext, we reverse the process by creating rows such that each ciphertext letter is replaced by letters of decreasing rank. The cipher is easily broken by trying all the keys, from 1 to 25, as shown in Figure 17.5, yielding only one key (K = 10) that produces the meaningful message: WHERE ARE THE HEROES OF YESTERYEAR (The spaces have been added.)

Figure 17.5 Example of breaking a cipher system when the key space is smaller than the message space.

EXAMPLE 17.2 PERFECT SECRECY

We can modify the key space of Example 17.1 to create a cipher having perfect secrecy. In this new cipher system each character in the message is encrypted using a randomly selected key value. The key, K, is now given by the sequence k₁, k₂, . . . , k₂₉, where each k_i is a random integer in the range (1, 25) dictating the shift used for the ith character; thus there are a total of (25)²⁹ different key sequences. Then the 29-character ciphertext in Example 17.1 could correspond to any meaningful 29-character message. For example, the ciphertext could correspond to the plaintext (the spaces have been added)

ENGLISH AND FRENCH ARE SPOKEN HERE

derived by the key 2, 4, 8, 16, 6, 18, 20,.... Most of the 29-character possibilities can be ruled out because they are not meaningful messages (this much is known without the ciphertext). Perfect secrecy is achieved because interception of the ciphertext in this system reveals no additional information about the plaintext message.

17.2.2 Entropy and Equivocation

As discussed in Chapter 9, the amount of information in a message is related to the probability of occurrence of the message. Messages with probability of either 0 or 1 contain no information, since we can be very confident concerning our prediction of their occurrence. The more uncertainty there is in predicting the occurrence of a message, the greater is the information content. Hence when each of the messages in a set is equally likely, we can have no confidence in our ability to predict the occurrence of a particular message, and the uncertainty or information content of the message is maximum.

Entropy, H(X), is defined as the average amount of information per message. It can be considered a measure of how much choice is involved in the selection of a message X. It is expressed by the following summation over all possible messages:

When the logarithm is taken to the base 2, as shown, H(X) is the expected number of bits in an optimally encoded message X. This is not quite the measure that a cryptanalyst desires. He will have intercepted some ciphertext and will want to know how confidently he can predict a message (or key) given that this particular ciphertext was sent. Equivocation, defined as the conditional entropy of X given Y, is a more useful measure for the cryptanalyst in attempting to break the cipher and is given by

Equivocation can be thought of as the uncertainty that message X was sent, having received Y. The cryptanalyst would like H(X Y) to approach zero as the amount of intercepted ciphertext, Y, increases.

EXAMPLE 17.3 ENTROPY AND EQUIVOCATION

Consider a sample message set consisting of eight equally likely messages {X} = X₁, X₂,... , X8.

(a) Find the entropy associated with a message from the set {X}.
(b) Given another equally likely message set {Y} = Y₁, Y₂. Consider that the occur-rence of each message Y narrows the possible choices of X in the following way:
- If Y₁ is present: only X₁, X₂, X₃, or X₄ is possible
- If Y₂ is present: only X₅, X₆, X₇, or X₈ is possible

Find the equivocation of message X conditioned on message Y.

Solution

(a)
(b) for four of the X’s and P(X|Y) = 0 for the remaining four X’s. Using Equation (17.6), we obtain

We see that knowledge of Y has reduced the uncertainty of X from 3 bits/message to 2 bits/message.

17.2.3 Rate of a Language and Redundancy

The true rate of a language is defined as the average number of information bits contained in each character and is expressed for messages of length N by

where H(X) is the message entropy, or the number of bits in the optimally encoded message. For large N, estimates of r for written English range between 1.0 and 1.5 bits/character [4]. The absolute rate or maximum entropy of a language is defined as the maximum number of information bits contained in each character assuming that all possible sequences of characters are equally likely. The absolute rate is given by

where L is the number of characters in the language. For the English alphabet r′ = log₂ 26 = 4.7 bits/character. The true rate of English is, of course, much less than its absolute rate since, like most languages, English is highly redundant and structured.

The redundancy of a language is defined in terms of its true rate and absolute rate as

For the English language with r′ = 4.7 bits/character and r = 1.5 bits/character, D = 3.2, and the ratio D/r′ = 0.68 is a measure of the redundancy in the language.

17.2.4 Unicity Distance and Ideal Secrecy

We stated earlier that perfect secrecy requires an infinite amount of key if we allow messages of unlimited length. With a finite key size, the equivocation of the key H(K|C) generally approaches zero, implying that the key can be uniquely determined and the cipher system can be broken. The unicity distance is defined as the smallest amount of ciphertext, N, such that the key equivocation H(K|C) is close to zero. Therefore, the unicity distance is the amount of ciphertext needed to uniquely determine the key and thus break the cipher system. Shannon [5] described an ideal secrecy system as one in which H(K|C) does not approach zero as the amount of ciphertext approaches infinity; that is, no matter how much ciphertext is intercepted, the key cannot be determined. The term “ideal secrecy” describes a system that does not achieve perfect secrecy but is nonetheless unbreakable (unconditionally secure) because it does not reveal enough information to determine the key.

Most cipher systems are too complex to determine the probabilities required to derive the unicity distance. However, it is sometimes possible to approximate unicity distance, as shown by Shannon [5] and Hellman [6]. Following Hellman, assume that each plaintext and ciphertext message comes from a finite alphabet of L symbols.

Thus there are 2^r′ N possible messages of length, N, where r′ is the absolute rate of the language. We can consider the total message space partitioned into two classes, meaningful messages, M₁, and meaningless messages M₂. We then have

where r is the true rate of the language, and where the a priori probabilities of the message classes are

Let us assume that there are 2^H(K) possible keys (size of the key alphabet), where H(K) is the entropy of the key (number of bits in the key). Assume that all keys are equally likely; that is,

The derivation of the unicity distance is based on a random cipher model, which states that for each key K and ciphertext C, the decryption operation D_K(C) yields an independent random variable distributed over all the possible 2^r′N messages (both meaningful and meaningless). Therefore, for a given K and C, the D_K(C) operation can produce any one of the plaintext messages with equal probability.

Given an encryption described by C_i = E_{K_i}(M_i), a false solution F arises whenever encryption under another key K_j could also produce C_i either from the message M_i or from some other message M_j; that is,

A cryptanalyst intercepting C_i would not be able to pick the correct key and hence could not break the cipher system. We are not concerned with the decryption operations that produce meaningless messages because these are easily rejected.

For every correct solution to a particular ciphertext there are 2^H(K) - 1 incorrect keys, each of which has the same probability P(F) of yielding a false solution. Because each meaningful plaintext message is assumed equally likely, the probability of a false solution, is the same as the probability of getting a meaningful message, namely,

where D = r′ - r is the redundancy of the language. The expected number of false solutions is then

Because of the rapid decrease of with increasing N,

is defined as the point where the number of false solutions is sufficiently small so that the cipher can be broken. The resulting unicity distance is therefore

We can see from Equation (17.17) that if H(K) is much larger than DN, there will be a large number of meaningful decryptions, and thus a small likelihood of a cryptanalyst distinguishing which meaningful message is the correct message. In a loose sense, DN represents the number of equations available for solving for the key, and H(K) the number of unknowns. When the number of equations is smaller than the number of unknown key bits, a unique solution is not possible and the system is said to be unbreakable. When the number of equations is larger than the number of unknowns, a unique solution is possible and the system can no longer be characterized as unbreakable (although it may still be computationally secure).

It is the predominance of meaningless decryptions that enables cryptograms to be broken. Equation (17.19) indicates the value of using data compression techniques prior to encryption. Data compression removes redundancy, thereby increasing the unicity distance. Perfect data compression would result in D = 0 and N = ∞ for any key size.

EXAMPLE 17.4 UNICITY DISTANCE

Calculate the unicity distance for a written English encryption system, where the key is given by the sequence k₁, k₂, . . . , k₂₉, where each k_i is a random integer in the range (1, 25) dictating the shift number (Figure 17.3) for the ith character. Assume that each of the possible key sequences is equally likely.

Solution

There are (25)²⁹ possible key sequences, each of which is equally likely. Therefore, using Equations (17.5), (17.8), and (17.19) we have

Key entropy: H (K) = log₂ (25)²⁹ = 135 bits

Absolute rate for English: r′ = log₂ 26 = 4.7 bits/character

Assumed true rate for English: r = 1.5 bits/character

Redundancy: D = r′ r = 3.2 bits/character

In Example 17.2, perfect secrecy was illustrated using the same type of key sequence described here, with a 29-character message. In this example we see that if the available ciphertext is 43 characters long (which implies that some portion of the key sequence must be used twice), a unique solution may be possible. However, there is no indication as to the computational difficulty in finding the solution. Even though we have estimated the theoretical amount of ciphertext required to break the cipher, it might be computationally infeasible to accomplish this.

< Back Page 2 of 10 Next >

🔖 Save To Your Account

InformIT Promotional Mailings & Special Offers

I would like to receive exclusive offers and hear about products from InformIT and its family of brands. I can unsubscribe at any time.

Email Address