Pr[M^{R_n}(1^n) = 1] + ε(n). The algorithm A executes the following algorithm:
1. Sample 1 ≤ J < q uniformly at random.
2. Invoke M on input 1^n.
3. Answer each one of the first J queries of M with a uniformly chosen bit. Denote by x the J-th query and by σ the answer given to it.
4. Let x_i be the i-th query for i > J; answer this query with f_s(x_i) ⊙ r (by querying f_s on x_i).
5. If M outputs 1, then output σ. Otherwise output the complement σ̄.
It is immediate that the choice of x is indeed independent of r. Proving the success probability of A (claimed above) is done by a standard hybrid argument.
For any unpredictable function f_s, Construction 4.1 gives a single-bit pseudo-random function g_{s,r}. Extracting more bits is possible in two (complementary) ways:
1. Taking the inner product of the unpredictable function f_s with a few random vectors, i.e., using the function ḡ_{s,r_1,r_2,...,r_t}(x) = (f_s(x) ⊙ r_1, f_s(x) ⊙ r_2, . . . , f_s(x) ⊙ r_t).
2. Taking the inner product of any polynomial number of (independent) unpredictable functions f_{s_i} with the same random vector, i.e., using the function ĝ_{s_1,s_2,...,s_t,r}(x) = (f_{s_1}(x) ⊙ r, f_{s_2}(x) ⊙ r, . . . , f_{s_t}(x) ⊙ r).
While the first method is more efficient (the function f_s is only computed once), it decreases security more rapidly. More precisely, assume that there is an efficient oracle machine M that distinguishes ḡ_{s,r_1,r_2,...,r_t} from random with advantage ε using q queries; then it is possible to define an oracle machine A as in the proof of Theorem 7 that outputs a guess for f_s(x) ⊙ r which is correct with probability at least 1/2 + ε/(q · 2^t). Therefore it is possible to define a machine D that breaks the unpredictable function f with O(ℓ(n) · (q/ε)^2 · 2^{2t} · q) queries and success probability Ω((ε/q)^2 · 2^{−2t}). However, in case f_s is sufficiently secure and t is not too large (say, t = 20), this method can still be used. For the second method, it is not hard to show a much more moderate reduction in security.
I.e., a reduction by a factor of 1/t^2 (getting a factor of 1/t is possible by using t different strings r_i instead of a single string r). The two methods can naturally be combined to give a reasonably efficient and secure pseudo-random function with a large output.
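For concreteness, the two extraction methods can be sketched as follows. This is a toy illustration: the SHA-256-based f below is only a stand-in for an unpredictable function (not a secure instantiation), and all names are our own.

```python
import hashlib

def ip_bit(u: bytes, r: bytes) -> int:
    # Inner product mod 2 of the bit strings u and r (the GL bit u ⊙ r).
    return bin(int.from_bytes(bytes(a & b for a, b in zip(u, r)), "big")).count("1") % 2

def f(s: bytes, x: bytes, ell: int = 16) -> bytes:
    # Toy stand-in for the unpredictable function f_s (NOT cryptographically sound).
    return hashlib.sha256(s + x).digest()[:ell]

def g_bar(s: bytes, rs: list, x: bytes) -> list:
    # Method 1: evaluate f_s once, take inner products with t vectors r_1..r_t.
    fx = f(s, x)
    return [ip_bit(fx, r) for r in rs]

def g_hat(ss: list, r: bytes, x: bytes) -> list:
    # Method 2: t independent keys s_1..s_t, one shared vector r.
    return [ip_bit(f(s, x), r) for s in ss]
```

Method 1 pays one f-evaluation for t bits, mirroring the efficiency/security trade-off discussed above.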
From Unpredictability to Indistinguishability
5 Weaker Notions
In this section we consider weaker notions of indistinguishability and unpredictability than those of Definitions 3 and 5. We show how to relax either one of these definitions by allowing the adversary a random attack rather than an adaptive attack. As will be described below, such random attacks come up naturally in applications such as identification and encryption. Two senses in which an attack can be random are:
1. A Random Challenge. The adversary is required to compute the value of f_s on a random point. This is formalized by letting V send x_q ∈ U_n to D after the first q − 1 rounds.
2. A Random Sample. The adversary gets the value of f_s on a polynomial number of random inputs instead of adaptively choosing the inputs itself. This is formalized by removing the first q − 1 rounds of the protocol and adding to the common input the values ⟨x_1, f_s(x_1), x_2, f_s(x_2), . . . , x_{q−1}, f_s(x_{q−1})⟩, where each one of the x_i's is an independent instance of U_n.
Remark 8. An alternative to an adaptive attack and a random attack is a static attack. In this case, D has to choose and send x_1, x_2, . . . , x_q at the first round. Such an attack seems less natural in the applications we consider here and we therefore ignore it. For some intuition on the difference between adaptive and static attacks see [21].
The total number of definitions we obtain by considering all combinations (i.e., indistinguishability vs. unpredictability, adaptive samples vs. random samples, and adaptive challenges vs. random challenges) is eight. The observation that no two of these definitions are equivalent (as long as one-way functions exist) easily follows from the separations we sketch below. Furthermore, there are no implications except for the obvious ones:
– Let f_s be a pseudo-random function and define the function g_s(x) = ⟨x, f_s(x)⟩ (x concatenated with f_s(x)). Then g_s is an unpredictable function but is not indistinguishable even against a random sample and a random challenge.
– Let f_s be a pseudo-random function and define the function g_s such that g_s(x) = f_s(x) for every x ≠ 0 and g_s(0) = 0. Then g_s is indistinguishable against an adaptive sample and a random challenge but is not even unpredictable against a random sample and an adaptive challenge.
– Let f_s be a pseudo-random function and define the function g_s such that g_s(x) = f_s(x) for every x ≠ f_s(0) and (unless the rare condition f_s(0) = 0 holds) g_s(f_s(0)) = s. Then g_s is indistinguishable against a random sample and an adaptive challenge but is not even unpredictable against an adaptive sample and a random challenge.
More “natural” examples of functions that are suspected to be secure (indistinguishable) against a random attack but are completely insecure against an adaptive attack come up in the context of Computational Learning Theory (see [5,20] for details). Consider for example the following distribution on functions with parameters k and n. Each function is defined by two, uniformly distributed, disjoint sets A, B ⊂ {1, . . . , n}, each of size k. Given an n-bit input, the
Moni Naor and Omer Reingold
output of the function is the exclusive-or of two values: the parity of the bits indexed by A and the majority of the bits indexed by B. Restating [5] in the terminology of this paper, it is estimated there that distinguishing these functions (for k = log n) from a random function using a random sample and a random challenge requires “profoundly” new ideas. However, the key of such a function (for any k) can easily be recovered using an adaptive attack.
The extreme efficiency of function families that are suspected to be weak pseudo-random functions (i.e., indistinguishable against a random sample and a random challenge) raises the following questions:
1. Can the construction in [20] of a full-fledged pseudo-random function from weak pseudo-random functions be improved?
2. Can weak pseudo-random functions be directly used in private-key encryption and authentication schemes?
We further consider the second question in Section 5.1.

5.1 The Requirements of Private-Key Tasks
Identifying the exact requirements for function families used in any given protocol can imply more efficient implementations of this protocol. We therefore consider in this section the actual requirements for standard private-key schemes. The three most common tasks in private-key cryptography are user identification, message authentication and encryption. Consider the following schemes for the above tasks. A group of parties that share a pseudo-random function f_s may perform:
Authentication. The authentication tag of a message m is defined to be f_s(m). Here the requirement is unpredictability against an adaptive sample and an adaptive challenge (in case we want existential unforgeability against a chosen message attack).
Identification. A member of the group, V, determines if A is also a member by issuing a random challenge r and verifying that the response of A is f_s(r). Assuming that the adversary can perform an active attack (i.e., can participate in executions of the protocol as the verifier), we need unpredictability against an adaptive sample and a random challenge. If the adversary is limited to a passive attack (i.e., can only eavesdrop on previous executions of the protocol), then we only need unpredictability against a random sample and a random challenge.
Encryption. The encryption of a message m is defined to be ⟨r, f_s(r) ⊕ m⟩, where r is a uniformly chosen input. We are using the terminology of [9] for attacks (chosen plaintext, chosen ciphertext in the preprocessing and postprocessing modes) and notions of security (semantic security and non-malleability). Assuming that the adversary is limited to a chosen plaintext attack, we need indistinguishability against a random sample and a random challenge (in case we are interested in semantic security). If the adversary can perform a chosen ciphertext attack in the preprocessing mode, then we need indistinguishability against an adaptive sample and a random challenge to get semantic security. For any implementation
of f this scheme is malleable and hence not secure against a chosen ciphertext attack in the postprocessing mode, i.e., when the adversary queries the function after getting the challenge.
The functions used in all the schemes considered above should be secure against an adaptive sample (when we consider the stronger attack in each case). The following encryption scheme (which can also be used for authentication and identification), proposed in the full version of [9], eliminates this requirement. The encryption of a message m under this scheme is defined to be ⟨r, f(r) ⊕ m, g(r, f(r) ⊕ m)⟩, where r is a uniformly chosen input. To get non-malleable security against a chosen ciphertext attack in the postprocessing mode it is enough for f and g to be indistinguishable against a random sample and an adaptive challenge. The role of g is to “authenticate” the first part of the encryption and make it infeasible for an adversary to generate valid ciphertexts it did not explicitly receive (i.e., the encryption scheme is self-validating). An interesting open question is whether there exists an efficient authentication or encryption scheme which can be based on functions secure against a random sample and a random challenge.

5.2 Improving Efficiency for Weaker Definitions
In this section we give another demonstration that weaker definitions may imply better efficiency. We do so by showing a more efficient variant for one of the constructions of [22] that is sufficient for the standard identification scheme. In [22], two related constructions of pseudo-random functions are presented. The construction that is based on factoring gives a single-bit (or few-bits) pseudo-random function. We show that if we are only interested in unpredictability against an adaptive sample and a random challenge, this construction can be improved. Informally, the construction of pseudo-random functions that are at least as secure as factoring is as follows: Let N be distributed over Blum integers (N = P · Q, where P and Q are primes and P ≡ Q ≡ 3 mod 4) and assume that (under this distribution) it is hard to factor N. Let g be a uniformly distributed quadratic residue in Z*_N, let a = ⟨a_{1,0}, a_{1,1}, a_{2,0}, a_{2,1}, . . . , a_{n,0}, a_{n,1}⟩ be a uniformly distributed sequence of 2n elements in [N] = {1, 2, . . . , N}, and let r be a uniformly distributed bit-string of the same length as N. Then the binary function f_{N,g,a,r} is pseudo-random, where the value of f_{N,g,a,r} on any n-bit input x = x_1 x_2 · · · x_n is defined by:

f_{N,g,a,r}(x) = (g^{∏_{i=1}^{n} a_{i,x_i}} mod N) ⊙ r

Using techniques similar to the proof in [22], it can be shown that if factoring Blum integers is hard then the function f̃_{N,g,a} is unpredictable against an adaptive sample and a random challenge, where the value of f̃_{N,g,a} on any n-bit input x = x_1 x_2 · · · x_n is defined by:

f̃_{N,g,a}(x) = g^{∏_{i=1}^{n} a_{i,x_i}} mod N
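A toy rendering of f̃_{N,g,a} and of the challenge-response identification scheme from Section 5.1 that it supports. The parameters below are tiny and purely illustrative (a real instantiation needs a large Blum integer N), and the helper names are our own.

```python
import secrets

def f_tilde(N, g, a, x_bits):
    # f~_{N,g,a}(x) = g^(prod_i a_{i, x_i}) mod N, with a[i] = (a_{i,0}, a_{i,1}).
    e = 1
    for i, b in enumerate(x_bits):
        e *= a[i][b]
    return pow(g, e, N)

def identify(verifier_key, prover):
    # Challenge-response identification: V issues a random challenge and
    # checks the prover's response against its own evaluation of f~.
    N, g, a = verifier_key
    challenge = [secrets.randbelow(2) for _ in range(len(a))]
    return prover(challenge) == f_tilde(N, g, a, challenge)

# Tiny illustrative parameters: N = 7 * 11 is a Blum integer (7 ≡ 11 ≡ 3 mod 4)
# and g = 4 = 2^2 is a quadratic residue mod N.
KEY = (77, 4, [(2, 3), (5, 6)])
```

Note that only one modular exponentiation is needed per evaluation once the exponent product is formed; this is the efficiency gain over the r-masked pseudo-random version.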
As described in Section 5.1, such a function can be used for the standard challenge-response identification scheme.

5.3 Additional Transformations of Unpredictability to Indistinguishability
In Section 4, we considered the g_{s,r}(x) = f_s(x) ⊙ r construction (Construction 4.1) as a transformation of unpredictable functions to pseudo-random functions. As discussed there, the problem in using a public r in this construction is that it enables the distinguisher to choose inputs for g_{s,r} that directly depend on r. For such an input x, the value g_{s,r}(x) might be distinguishable from random. However, when we consider weaker definitions of unpredictability and indistinguishability where the challenge is random, such a problem does not occur. In this case a rather simple application of the GL-bit gives the following theorem:
Theorem 7. Let F = {F_n}_{n∈N} be an efficient I_n ↦ I_{ℓ(n)} function ensemble. Define G = {G_n}_{n∈N} as in Construction 4.1. It follows that:
1. If F is unpredictable against an adaptive sample and a random challenge, then G is indistinguishable against an adaptive sample and a random challenge.
2. If F is unpredictable against a random sample and a random challenge, then G is indistinguishable against a random sample and a random challenge.
Both (1) and (2) hold even if for each function g_{s,r} ∈ G_n we let r be public.
As discussed in Section 5.1, indistinguishability against an adaptive sample and a random challenge is sufficient for the standard private-key encryption scheme, whereas unpredictability against an adaptive sample and a random challenge is sufficient for the standard challenge-response identification scheme. Therefore, any function that is designed for the identification scheme can be transformed into a private-key encryption scheme (using the methods described in Section 4 for getting a larger output length).
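The standard private-key encryption scheme of Section 5.1 can be sketched as follows. The SHA-256-based f is only a placeholder for the shared function f_s (it is not a secure PRF), and the 16-byte r and the helper names are our own choices.

```python
import os
import hashlib

def f(s: bytes, x: bytes) -> bytes:
    # Toy stand-in for the shared function f_s (NOT a secure PRF).
    return hashlib.sha256(s + x).digest()

def encrypt(s: bytes, m: bytes):
    # <r, f_s(r) XOR m> with a fresh, uniformly chosen r per message.
    r = os.urandom(16)
    pad = f(s, r)[:len(m)]          # here m is at most one 32-byte block
    return r, bytes(a ^ b for a, b in zip(m, pad))

def decrypt(s: bytes, ct):
    r, c = ct
    pad = f(s, r)[:len(c)]
    return bytes(a ^ b for a, b in zip(c, pad))
```

Because r is chosen uniformly by the encryptor, security only requires indistinguishability on random challenges, which is exactly what Theorem 7 delivers.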
6 Conclusion and Further Research
We have considered several notions of unpredictability and their relationship with the corresponding notions of indistinguishability. For three of these notions we have shown that the Goldreich-Levin hard-core bit can simply turn unpredictability into indistinguishability. By this construction efficient implementations of MACs can be used to obtain efficient implementations of pseudo-random functions. An interesting open problem is to prove or disprove the validity of the construction in a fourth setting: Can the GL-bit be used to turn unpredictability against a random sample and an adaptive challenge into indistinguishability against a random sample and an adaptive challenge? The second part of Theorem 7 and the construction in [20] of full-fledged pseudo-random functions from weak pseudo-random functions give a relatively efficient transformation (compared with the transformation obtained
by [12,16,17]) from the weakest notion considered in this paper (i.e., unpredictability against a random sample and a random challenge) to the strongest notion (i.e., indistinguishability against an adaptive sample and an adaptive challenge). An interesting task would be to achieve a more efficient transformation. Section 5.1 considers the exact requirements for function families used in standard private-key schemes. An interesting line for further research discussed there is to design efficient private-key encryption and authentication schemes that only use weak pseudo-random functions. Implementations of such schemes may be very efficient given the extreme efficiency of candidates for weak pseudo-random functions.
Acknowledgments We thank Ran Canetti and Benny Pinkas for motivating the problem. We thank Oded Goldreich for encouragement and many helpful comments. We also thank the reviewers of CRYPTO ’98 and Hugo Krawczyk for many helpful comments.
References

1. M. Bellare, R. Canetti and H. Krawczyk, Keying hash functions for message authentication, Advances in Cryptology - CRYPTO '96, LNCS, vol. 1109, Springer, 1996, pp. 1-15.
2. M. Bellare, A. Desai, E. Jokipii and P. Rogaway, A concrete security treatment of symmetric encryption, Proc. 38th IEEE Symp. on Foundations of Computer Science, 1997, pp. 394-403.
3. M. Bellare, J. Kilian and P. Rogaway, The security of cipher block chaining, Advances in Cryptology - CRYPTO '94, LNCS, vol. 839, Springer-Verlag, 1994, pp. 341-358.
4. M. Bellare and S. Goldwasser, New paradigms for digital signatures and message authentication based on non-interactive zero knowledge proofs, Advances in Cryptology - CRYPTO '89, LNCS, Springer, 1990, pp. 194-211.
5. A. Blum, M. Furst, M. Kearns and R. J. Lipton, Cryptographic primitives based on hard learning problems, in: D. R. Stinson, ed., Advances in Cryptology - CRYPTO '93, LNCS, vol. 773, Springer, 1994, pp. 278-291.
6. M. Blum and S. Micali, How to generate cryptographically strong sequences of pseudo-random bits, SIAM J. Comput., vol. 13, 1984, pp. 850-864.
7. G. Brassard, Modern cryptology, LNCS, vol. 325, Springer, 1988.
8. R. Canetti, J. Garay, G. Itkis, D. Micciancio, M. Naor and B. Pinkas, Multicast security: a taxonomy and efficient authentication, manuscript.
9. D. Dolev, C. Dwork and M. Naor, Non-malleable cryptography, Proc. 23rd Ann. ACM Symp. on Theory of Computing, 1991, pp. 542-552. Full version available at: http://www.wisdom.weizmann.ac.il/~naor.
10. O. Goldreich, Two remarks concerning the Goldwasser-Micali-Rivest signature scheme, Advances in Cryptology - CRYPTO '86, LNCS, vol. 263, 1987, pp. 104-110.
11. O. Goldreich, Foundations of Cryptography (Fragments of a Book), 1995. Electronic publication in the Electronic Colloquium on Computational Complexity: http://www.eccc.uni-trier.de/eccc/info/ECCC-Books/eccc-books.html.
12. O. Goldreich, S. Goldwasser and S. Micali, How to construct random functions, J. of the ACM, vol. 33, 1986, pp. 792-807.
13. O. Goldreich, S. Goldwasser and S. Micali, On the cryptographic applications of random functions, Advances in Cryptology - CRYPTO '84, LNCS, vol. 196, Springer, 1985, pp. 276-288.
14. O. Goldreich and L. Levin, A hard-core predicate for all one-way functions, Proc. 21st Ann. ACM Symp. on Theory of Computing, 1989, pp. 25-32.
15. S. Halevi and H. Krawczyk, MMH: message authentication in software in the Gbit/second rates, Proc. Fast Software Encryption, LNCS, Springer-Verlag, 1997.
16. J. Hastad, R. Impagliazzo, L. A. Levin and M. Luby, Construction of a pseudo-random generator from any one-way function, to appear in SIAM J. Comput. Preliminary versions by Impagliazzo et al. in 21st STOC, 1989, and by Hastad in 22nd STOC, 1990.
17. R. Impagliazzo and M. Luby, One-way functions are essential for complexity based cryptography, Proc. 30th FOCS, 1989, pp. 230-235.
18. M. Luby, Pseudo-randomness and applications, Princeton University Press, 1996.
19. M. Luby and C. Rackoff, How to construct pseudorandom permutations and pseudorandom functions, SIAM J. Comput., vol. 17, 1988, pp. 373-386.
20. M. Naor and O. Reingold, Synthesizers and their application to the parallel construction of pseudo-random functions, Proc. 36th IEEE Symp. on Foundations of Computer Science, 1995, pp. 170-181.
21. M. Naor and O. Reingold, On the construction of pseudo-random permutations: Luby-Rackoff revisited, to appear in J. of Cryptology. Preliminary version in: Proc. 29th Ann. ACM Symp. on Theory of Computing, 1997, pp. 189-199.
22. M. Naor and O. Reingold, Number-theoretic constructions of efficient pseudo-random functions, Proc. 38th FOCS, 1997, pp. 458-467.
23. B. Preneel and P. C. van Oorschot, On the security of two MAC algorithms, Advances in Cryptology - EUROCRYPT '96, LNCS, vol. 1070, 1996, pp. 19-32.
24. R. L. Rivest, Chaffing and winnowing: confidentiality without encryption, MIT Lab for Computer Science, http://theory.lcs.mit.edu/~rivest/chaffing.txt, March 18, 1998. To appear in: RSA CryptoBytes, Summer 1998.
25. P. Rogaway, Bucket hashing and its application to fast message authentication, Advances in Cryptology - CRYPTO '95, LNCS, vol. 963, Springer-Verlag, 1995, pp. 74-85.
26. A. Shamir, On the generation of cryptographically strong pseudo-random number sequences, ACM Trans. Comput. Sys., vol. 1, 1983, pp. 38-44.
27. M. Wegman and L. Carter, New hash functions and their use in authentication and set equality, J. of Computer and System Sciences, vol. 22, 1981, pp. 265-279.
28. A. C. Yao, Theory and applications of trapdoor functions, Proc. 23rd IEEE Symp. on Foundations of Computer Science, 1982, pp. 80-91.
Many-to-One Trapdoor Functions and Their Relation to Public-Key Cryptosystems

Mihir Bellare (1), Shai Halevi (2), Amit Sahai (3), and Salil Vadhan (3)

(1) Dept. of Computer Science & Engineering, University of California at San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA. [email protected], http://www-cse.ucsd.edu/users/mihir
(2) IBM T. J. Watson Research Center, P.O. Box 704, Yorktown Heights, NY 10598, USA. [email protected]
(3) MIT Laboratory for Computer Science, 545 Technology Square, Cambridge, MA 02139, USA. [email protected], [email protected]. URL: http://www-math.mit.edu/~salil
Abstract. The heart of the task of building public key cryptosystems is viewed as that of “making trapdoors;” in fact, public key cryptosystems and trapdoor functions are often discussed as synonymous. How accurate is this view? In this paper we endeavor to get a better understanding of the nature of “trapdoorness” and its relation to public key cryptosystems, by broadening the scope of the investigation: we look at general trapdoor functions; that is, functions that are not necessarily injective (i.e., one-to-one). Our first result is somewhat surprising: we show that non-injective trapdoor functions (with super-polynomial pre-image size) can be constructed from any one-way function (and hence it is unlikely that they suffice for public key encryption). On the other hand, we show that trapdoor functions with polynomial pre-image size are sufficient for public key encryption. Together, these two results indicate that the pre-image size is a fundamental parameter of trapdoor functions. We then turn our attention to the converse, asking what kinds of trapdoor functions can be constructed from public key cryptosystems. We take a first step by showing that in the random-oracle model one can construct injective trapdoor functions from any public key cryptosystem.
1 Introduction
A major dividing line in the realm of cryptographic primitives is that between “one-way” and “trapdoor” primitives. The former effectively means the primitives of private key cryptography, while the latter are typically viewed as tied to public key cryptosystems. Indeed, the understanding is that the problem of building public key cryptosystems is the problem of “making trapdoors.”

H. Krawczyk (Ed.): CRYPTO '98, LNCS 1462, pp. 283-299, 1998. © Springer-Verlag Berlin Heidelberg 1998
Is it really? It is well known that injective (i.e., one-to-one) trapdoor functions suffice for public key cryptography [Ya,GoMi]. We ask: is the converse true as well, or can public key cryptosystems exist under a weaker assumption? We take a closer look at the notion of a trapdoor, in particular from the point of view of how it relates to semantically secure encryption schemes, and discover some curious things. Amongst these are that “trapdoor one-way functions” are not necessarily hard to build, and their relation to public key encryption is more subtle than it might seem.
1.1 Background
The main notions discussed and related in this paper are one-way functions [DiHe], trapdoor (one-way) functions [DiHe], semantically secure encryption schemes [GoMi], and unapproximable trapdoor predicates [GoMi]. Roughly, a “one-way function” means a family of functions where each particular function is easy to compute, but most are hard to invert; trapdoor functions are the same with the additional feature that associated to each particular function is some “trapdoor” information, possession of which permits easy inversion. (See Section 2 for formal definitions.)
In the study of one-way functions, it is well appreciated that the functions need not be injective: careful distinctions are made between “(general) one-way functions”, “injective one-way functions,” and “one-way permutations.” In principle, the distinction applies equally well to trapdoor one-way functions. (In the non-injective case, knowledge of the trapdoor permits recovery of some pre-image of any given range point [DiHe].) However, all attention in the literature has focused on injective trapdoor functions, perhaps out of the sense that this is what is necessary for constructing encryption schemes: the injectivity of the trapdoor function guarantees the unique decryptability of the encryption scheme.
This paper investigates general (i.e., not necessarily injective) trapdoor one-way functions and how they relate to other primitives. Our goal is to understand exactly what kinds of trapdoor one-way functions are necessary and sufficient for building semantically secure public key encryption schemes; in particular, is injectivity actually necessary? Among non-injective trapdoor functions, we make a further distinction based on “the amount of non-injectivity”, measured by pre-image size. A (trapdoor, one-way) function is said to have pre-image size Q(k) (where k is the security parameter) if the number of pre-images of any range point is at most Q(k).
We show that pre-image size is a crucial parameter with regard to building public-key cryptosystems out of a trapdoor function. Rather than directly working with public-key cryptosystems, it will be more convenient to work with a more basic primitive called an unapproximable trapdoor predicate. Unapproximable trapdoor predicates are equivalent to semantically secure public key schemes for encrypting a single bit, and these in turn are equivalent to general semantically secure cryptosystems [GoMi].
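The single-bit-to-many-bit equivalence mentioned above is by simple bit-by-bit composition (the standard hybrid argument shows semantic security is preserved). A schematic sketch: enc_bit/dec_bit stand for an arbitrary one-bit scheme, and the "toy" instance here is a symmetric, insecure placeholder that merely makes the composition runnable.

```python
import os

def encrypt_bits(enc_bit, pk, message_bits):
    # Many-bit encryption from a one-bit scheme: encrypt each bit independently.
    return [enc_bit(pk, b) for b in message_bits]

def decrypt_bits(dec_bit, sk, ciphertexts):
    return [dec_bit(sk, c) for c in ciphertexts]

# Placeholder one-bit "scheme" (symmetric and NOT secure), standing in for a
# real scheme built from an unapproximable trapdoor predicate.
def toy_enc_bit(pk, b):
    r = os.urandom(1)[0] & 1
    return (r, b ^ r ^ pk)

def toy_dec_bit(sk, c):
    r, x = c
    return x ^ r ^ sk
```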
[Figure] Fig. 1. Illustrating our results: Solid lines are standard implications; the dotted line is an implication in the random oracle model. The nodes are: semantically secure public-key cryptosystems, injective trapdoor functions, unapproximable trapdoor predicates, one-way functions, trapdoor functions with poly-bounded pre-image size, and trapdoor functions with super-poly pre-image size; the edges are labeled [Ya], [GoMi], [ImLu], Theorems 1-3, and "trivial".
1.2 Results
We have three main results. They are displayed in Fig. 1 together with known relations. We now discuss them.
One-way functions imply trapdoor functions. Our first result, given in Theorem 1, may seem surprising at first glance: we show that one-way functions imply trapdoor functions. We present a general construction which, given an arbitrary one-way function, yields a trapdoor (non-injective) one-way function. Put in other words, we show that trapdoor functions are not necessarily hard to build; it is the combination of trapdoorness with “structural” properties like injectivity that may be hard to achieve. Thus the “curtain” between one-way and trapdoor primitives is not quite as opaque as it may seem.
What does this mean for public key cryptography? Impagliazzo and Rudich [ImRu] show that it would be very hard, or unlikely, to get a proof that one-way functions (even if injective) imply public key cryptosystems. Hence, our result shows that it is unlikely that any known technique can be used to construct public key encryption schemes from generic, non-injective, trapdoor functions. As one might guess given [ImRu], our construction does not preserve injectivity, so even if the starting one-way function is injective, the resulting trapdoor one-way function is not.
Trapdoor functions with poly pre-image size yield cryptosystems. In light of the above, one might still imagine that injectivity of the trapdoor functions is required to obtain public key encryption. Still, we ask whether the injectivity condition can be relaxed somewhat. Specifically, the trapdoor one-way functions which we construct from one-way functions have super-polynomial pre-image size. This leads us to ask about trapdoor functions with polynomially bounded pre-image size.
Our second result, Theorem 2, shows that trapdoor functions with polynomially bounded pre-image size suffice to construct unapproximable trapdoor predicates, and hence yield public key cryptosystems. This belies the impression that injectivity of the trapdoor function is a necessary feature to directly build a public key cryptosystem from it, and also suggests that the super-polynomial pre-image size in the construction of Theorem 1 is necessary.
From trapdoor predicates to trapdoor functions. We then turn to the other side of the coin and ask what kinds of trapdoor functions must necessarily exist to have a public key cryptosystem. Since unapproximable trapdoor predicates and semantically secure public key cryptosystems are equivalent [GoMi], we consider the question of whether unapproximable trapdoor predicates imply injective trapdoor functions. In fact, whether or not semantically secure public key cryptosystems imply injective trapdoor functions is not only an open question, but seems a hard one. (In particular, a positive answer would imply injective trapdoor functions based on the Diffie-Hellman assumption, a long standing open problem.) In order to get some insight and possible approaches to it, we consider it in a random oracle model (cf. [ImRu,BeRo]). Theorem 3 says that here the answer is affirmative: given an arbitrary secure public key cryptosystem, we present a function that has access to an oracle H, and prove the function is injective, trapdoor, and one-way when H is random.
The construction of Theorem 3 is quite simple, and the natural next question is whether the random oracle H can be replaced by some constructible cryptographic primitive. In the full version of the paper [BHSV], we show that this may be difficult, by showing that a cryptographically strong pseudorandom bit generator [BlMi,Ya], which seems like a natural choice for this construction, does not suffice.
The next step may be to follow the approach initiated by Canetti [Ca]: find an appropriate cryptographic notion which, if satisfied by H, would suffice for the correctness of the construction, and then try to implement H via a small family of functions. However, one should keep in mind that replacement of a random oracle by a suitable constructible function is not always possible [CGH]. Thus, our last result should be interpreted with care.

1.3 Discussion and Implications
Theorems 1 and 2 indicate that pre-image size is a crucial parameter when considering the power of trapdoor functions, particularly with respect to constructing public-key cryptosystems. The significance and interpretation of Theorem 3, however, requires a bit more discussion. At first glance, it may seem that public key cryptosystems “obviously imply” injective trapdoor functions. After all, a public key cryptosystem permits unique decryptability; doesn’t this mean the encryption algorithm is injective? No, because, as per [GoMi], it is a probabilistic algorithm, and thus not a function. To make it a function, you must consider it a function of two arguments, the message and the coins, and then it may no longer be injective, because two
coin sequences could give rise to the same ciphertext for a given message. Moreover, it may no longer have a (full) trapdoor, since it may not be possible to recover the randomness from the ciphertext. (Public key cryptosystems in the Diffie and Hellman sense [DiHe] imply injective trapdoor one-way functions as the authors remark, but that’s because encryption there is deterministic. It is now understood that secure encryption must be probabilistic [GoMi].) Theorem 3 has several corollaries. (Caveat: All in the random oracle model). First, by applying a transformation of [BeRo], it follows that we can construct non-malleable and chosen-ciphertext secure encryption schemes based on the Ajtai-Dwork cryptosystem [AjDw]. Second, combining Theorems 3 and 2, the existence of trapdoor functions with polynomially bounded pre-image size implies the existence of injective trapdoor functions. (With high probability over the choice of oracle. See Remark 5.) Third, if the Decisional Diffie-Hellman problem is hard (this means the El Gamal [ElG] cryptosystem is semantically secure) then there exists an injective trapdoor function. Note that in the random oracle model, it is trivial to construct (almost) injective one-way functions: a random oracle mapping, say, n bits to 3n bits, is itself an injective one-way function except with probability 2−n over the choice of the oracle. However, random oracles do not directly or naturally give rise to trapdoors [ImRu]. Thus, it is interesting to note that our construction in Theorem 3 uses the oracle to “amplify” a trapdoor property: we convert the weak trapdoor property of a cryptosystem (in which one can only recover the message) to a strong one (in which one can recover both the message and the randomness used). 
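The 2^{-n} figure for the injectivity of a random n-to-3n-bit oracle follows from a union bound over pairs of inputs:

```latex
\Pr_{H}\bigl[\exists\, x \neq x' : H(x) = H(x')\bigr]
  \;\le\; \binom{2^n}{2} \cdot 2^{-3n}
  \;<\; 2^{2n-1} \cdot 2^{-3n}
  \;=\; 2^{-(n+1)}
  \;<\; 2^{-n}.
```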
Another interpretation of Theorem 3 is as a demonstration that there exists a model in which semantically secure encryption implies injective trapdoor functions, and hence it may be hard to prove a separation result, in the style of [ImRu], between injective trapdoor functions and probabilistic encryption schemes.
2 Definitions
We present definitions for one-way functions, trapdoor functions, and unapproximable trapdoor predicates.

Preliminaries. If S is any probability distribution then x ← S denotes the operation of selecting an element at random according to S, and [S] is the support of S, namely the set of all points having non-zero probability under S. If S is a set we view it as imbued with the uniform distribution and write x ← S. If A is a probabilistic algorithm or function then A(x, y, …; R) denotes the output of A on inputs x, y, … and coins R, while A(x, y, …) is the probability distribution assigning to each string the probability, over R, that it is output. For deterministic algorithms or functions A, we write z := A(x, y, …) to mean that the output of A(x, y, …) is assigned to z. The notation Pr[ E : R1 ; R2 ; … ; Rk ] refers to the probability of event E after the random processes R1, …, Rk are performed in order. If x and y are strings we write their concatenation as x‖y
or just xy. “Polynomial time” means time polynomial in the security parameter k, PPT stands for “probabilistic polynomial time”, and “efficient” means computable in polynomial time or PPT.

2.1 One-Way and Trapdoor Function Families
We first define families of functions, then say what it means for them to be one-way or trapdoor.

Families of Functions. A family of functions is a collection F = {Fk}k∈N where each Fk is a probability distribution over a set of functions. Each f ∈ [Fk] has an associated domain Dom(f) and range Range(f). We require three properties of the family:
• Can generate: The operation f ← Fk can be efficiently implemented, meaning there is a PPT generation algorithm F-Gen that on input 1^k outputs a “description” of a function f distributed according to Fk. This algorithm might also output some auxiliary information aux associated to this function (this is in order to later model trapdoors).
• Can sample: Dom(f) is efficiently samplable, meaning there is a PPT algorithm F-Smp that given f ∈ [Fk] returns a uniformly distributed element of Dom(f).
• Can evaluate: f is efficiently computable, meaning there is a polynomial-time evaluation algorithm F-Eval that given f ∈ [Fk] and x ∈ Dom(f) returns f(x).
For an element y ∈ Range(f) we denote the set of pre-images of y under f by f⁻¹(y) = { x ∈ Dom(f) : f(x) = y }. We say that F is injective if f is injective (i.e., one-to-one) for every f ∈ [Fk]. If in addition Dom(f) = Range(f) then we say that F is a family of permutations. We measure the amount of “non-injectivity” by looking at the maximum pre-image size. Specifically, we say that F has pre-image size bounded by Q(k) if |f⁻¹(y)| ≤ Q(k) for all f ∈ [Fk], all y ∈ Range(f), and all k ∈ N. We say that F has polynomially bounded pre-image size if there is a polynomial Q(k) which bounds the pre-image size of F.

One-wayness. Let F be a family of functions as above. The inverting probability of an algorithm I(·, ·) with respect to F is a function of the security parameter k, defined as

InvProbF(I, k) = Pr[ x′ ∈ f⁻¹(y) : f ← Fk ; x ← Dom(f) ; y ← f(x) ; x′ ← I(f, y) ].

F is one-way if InvProbF(I, k) is negligible for any PPT algorithm I.

Trapdoorness.
A family of functions is said to be trapdoor if it is possible, while generating an instance f , to simultaneously generate as auxiliary output “trapdoor information” tp, knowledge of which permits inversion of f . Formally, a family of functions F is trapdoor if F -Gen outputs pairs (f, tp) where f is the “description” of a function as in any family of functions and tp is auxiliary
trapdoor information. We require that there exists a probabilistic polynomial-time algorithm F-Inv such that for all k, all (f, tp) ∈ [F-Gen(1^k)], and all points y ∈ Range(f), the algorithm F-Inv(f, tp, y) outputs an element of f⁻¹(y) with probability 1. A family of trapdoor functions is said to be one-way if it is also a family of one-way functions. A good (candidate) example of a trapdoor one-way function family which is non-injective is the Rabin family [Rab]: here each function in Fk is four-to-one. (Traditionally, this function is used as the basis of a public-key cryptosystem by first modifying it to be injective.)

Remark 1. It is well known that one can define one-way functions either in terms of function families (as above) or in terms of a single function, and the two are equivalent. However, for trapdoor functions, one must talk of families. To maintain consistency, we use the family view of one-way functions as well.

2.2 Trapdoor Predicate Families
We define unapproximable trapdoor predicate families [GoMi]. Recall that such a family is equivalent to a semantically secure public-key encryption scheme for a single bit [GoMi]. A predicate in our context means a probabilistic function with domain {0, 1}, meaning a predicate p takes a bit b and flips coins r to generate some output y = p(b; r). In a trapdoor predicate family P = {Pk}k∈N, each Pk is a probability distribution over a set of predicates, meaning each p ∈ [Pk] is a predicate as above. We require:
• Can generate: There is a generation algorithm P-Gen which on input 1^k outputs (p, tp) where p is distributed randomly according to Pk and tp is trapdoor information associated to p. In particular the operation p ← Pk can be efficiently implemented.
• Can evaluate: There is a PPT algorithm P-Eval that given p and b ∈ {0, 1} flips coins to output y distributed according to p(b).
We say P has decryption error δ(k) if there is a PPT algorithm P-Inv which, with knowledge of the trapdoor, fails to decrypt only with this probability; namely,

DecErrP(P-Inv, k) = Pr[ b′ ≠ b : p ← Pk ; b ← {0, 1} ; y ← p(b) ; b′ ← P-Inv(p, tp, y) ]   (1)

is at most δ(k). If we say nothing, it is to be assumed that the decryption error is zero, but sometimes we want to discuss families with non-zero (and even large) decryption error.

Unapproximability. Let P be a family of trapdoor predicates as above. The predicting advantage of an algorithm I(·, ·) with respect to P is a function of the security parameter k, defined as

PredAdvP(I, k) = Pr[ b′ = b : p ← Pk ; b ← {0, 1} ; y ← p(b) ; b′ ← I(p, y) ] − 1/2.
We say that P is unapproximable if PredAdvP (I, k) is negligible for any PPT algorithm I.
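To make the definition concrete, the predicting-advantage experiment can be estimated by Monte Carlo sampling against a toy predicate. Everything below is illustrative and not from the paper: the toy predicate deliberately leaks b, so an adversary that reads the leaked bit attains the maximal advantage 1/2, while a blind guesser attains advantage near 0.

```python
import secrets

def toy_predicate(b: int, r: bytes) -> bytes:
    # Deliberately insecure toy predicate: the "ciphertext" is just r followed by b.
    return r + bytes([b])

def pred_adv(adversary, trials: int = 2000) -> float:
    # Monte Carlo estimate of PredAdv: Pr[b' = b] - 1/2
    wins = 0
    for _ in range(trials):
        b = secrets.randbits(1)
        y = toy_predicate(b, secrets.token_bytes(16))
        wins += (adversary(y) == b)
    return wins / trials - 0.5

reading_adv = pred_adv(lambda y: y[-1])   # reads the leaked last byte
guessing_adv = pred_adv(lambda y: 0)      # ignores y entirely
```

A family is unapproximable precisely when every efficient adversary behaves like the blind guesser here, with advantage negligible in k.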
3 From One-Way Functions to Trapdoor Functions
In this section we establish the following result:

Theorem 1. Suppose there exists a family of one-way functions. Then there exists a family of trapdoor, one-way functions.

This is proved by taking an arbitrary family F of one-way functions and “embedding” a trapdoor to get a family G of trapdoor functions. The rest of this section is devoted to the proof.

3.1 Proof Sketch of Theorem 1
Given a family F = {Fk}k∈N of one-way functions, we show how to construct a family G = {Gk}k∈N of trapdoor one-way functions. Let us first sketch the idea. Given f ∈ Fk we want to construct g which “mimics” f but somehow embeds a trapdoor. The idea is that the trapdoor is a particular point α in the domain of f. Function g will usually just evaluate f, except if it detects that its input contains the trapdoor; in that case it will do something trivial, making g easy to invert given knowledge of the trapdoor. (This will not happen often in normal execution because it is unlikely that a randomly chosen input contains the trapdoor.) But how exactly can g “detect” the trapdoor? The first idea would be to include α in the description of g so that it can check whether its input contains the trapdoor, but then g would no longer be one-way. So instead the description of g will include β = f(α), an image of the trapdoor under the original function f, and g will run f on a candidate trapdoor to see whether the result matches β. (Note that we do not in fact necessarily detect the real trapdoor α; the trivial action is taken whenever some pre-image of β under f is detected. But that turns out to be OK.) In the actual construction, g has three inputs, y, x, v, where v plays the role of the “normal” input to f; x plays the role of the candidate trapdoor; and y is the “trivial” answer returned in case the trapdoor is detected. We now formally specify the construction and sketch a proof that it is correct. A particular function g ∈ [Gk] will be described by a pair (f, β) where f ∈ [Fk] and β ∈ Range(f). It is defined on inputs y, x, v by

    g(y, x, v) = y if f(x) = β, and g(y, x, v) = f(v) otherwise.   (2)

Here x, v ∈ Dom(f), and we draw y from some samplable superset Sf of Range(f). (To be specific, we set Sf to be the set of all strings of length at most p(k), where p(k) is a polynomial that bounds the lengths of all strings in Range(f).) So the domain of g is Dom(g) = Sf × Dom(f) × Dom(f).
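The construction in Equation (2) is simple enough to sketch in code. Here SHA-256 merely stands in for an abstract one-way function f (an assumption for illustration only; the theorem works for any one-way family), and byte strings play the roles of Dom(f) and Sf:

```python
import hashlib

def f(x: bytes) -> bytes:
    # Stand-in one-way function (illustrative; any one-way f works here).
    return hashlib.sha256(x).digest()

def gen_g(alpha: bytes):
    # The trapdoor is alpha; the public description of g contains only beta = f(alpha).
    beta = f(alpha)
    def g(y: bytes, x: bytes, v: bytes) -> bytes:
        # Equation (2): act trivially if x hits a pre-image of beta, else mimic f.
        return y if f(x) == beta else f(v)
    return g, beta

def invert_with_trapdoor(alpha: bytes, z: bytes):
    # (z, alpha, alpha) is a pre-image of z under g, for any z.
    return (z, alpha, alpha)
```

With the trapdoor α, inversion is trivial, since g(z, α, α) = z for every z in Sf.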
We now give an intuitive explanation of why G is one-way and trapdoor. First note that for any z it is the case that (z, α, α) is a preimage of z under g,
so knowing α enables one to invert in a trivial manner; hence G is trapdoor. For one-wayness, notice that if g(y, x, v) = z then either f(v) = z or f(x) = β. Thus, producing an element of g⁻¹(z) requires inverting f at either z or β, both of which are hard by the one-wayness of F. A formal proof that G satisfies the definition of a family of one-way trapdoor functions can be found in the full version of this paper [BHSV].

Remark 2. One can verify that the trapdoor functions g produced in the above construction are regular (i.e., the size of g⁻¹(y) is the same for all y ∈ Range(g)) if the original one-way functions f are regular. Thus, adding regularity as a requirement is not likely to suffice for making public-key cryptosystems.
4 From Trapdoor Functions to Cryptosystems
Theorem 1 coupled with [ImRu] says that it is unlikely that general trapdoor functions will yield semantically secure public-key cryptosystems. However, in our construction of Section 3.1 the resulting trapdoor function was “very non-injective,” in the sense that the pre-image size was exponential in the security parameter. So, we next ask: what is the power of trapdoor function families with polynomially bounded pre-image size? We show a positive result:

Theorem 2. If there exist trapdoor one-way function families with polynomially bounded pre-image size, then there exists a family of unapproximable trapdoor predicates with exponentially small decryption error.

Theorem 2 extends the well-known result of [Ya,GoMi] that injective trapdoor functions yield semantically secure public-key cryptosystems, by showing that the injectivity requirement can be relaxed. Coupled with [ImRu], this also implies that it is unlikely that the analogue of Theorem 1 can be shown for trapdoor functions with polynomially bounded pre-image size.

4.1 Proof of Theorem 2
Let F = {Fk}k∈N be a family of trapdoor one-way functions with pre-image size bounded by a polynomial Q. The construction is in two steps. We first build an unapproximable family of trapdoor predicates P with decryption error 1/2 − 1/poly(k), and then reduce the decryption error by repetition to get the family claimed in the theorem. The first step uses the Goldreich-Levin inner-product construction [GoLe]. This construction says that if f is a one-way function, one can securely encrypt a bit b via (f(x), r, σ) where σ = b ⊕ (x · r), with r a random string, x ∈ Dom(f), and · denoting the inner product mod 2. Now, if f is an injective trapdoor function, then with the trapdoor information one can recover b from f(x), r, and σ by finding x and computing b = σ ⊕ (x · r). If instead f has polynomial-size pre-images, the “correct” x will only be recovered with an inverse polynomial probability. However, we will show that the rest of the time, the success probability is exactly 50%. This gives a noticeable (1/2 + 1/poly(k)) bias towards the right value of b. Now, this slight bias needs to be amplified, which is done by repeating the construction many times in parallel and having the decryptor take the majority of its guesses to the bit in the different coordinates. A full description and proof follow.

We may assume wlog that there is a polynomial l(k) such that Range(f) ⊆ {0, 1}^{l(k)} for all f ∈ [Fk] and all k ∈ N. We now describe how to use the Goldreich-Levin inner-product construction [GoLe] to build P = {Pk}k∈N. We associate to any f ∈ [Fk] a predicate p defined as follows:

Predicate p(b):             // takes input a bit b
  x ← Dom(f)                // choose x at random from the domain of f
  r ← {0, 1}^{l(k)}         // choose a random l(k)-bit string
  σ := b ⊕ (x · r)          // XOR b with the GL bit
  Output (f(x), r, σ)

Here ⊕ denotes XOR (i.e., addition mod 2) and · denotes the inner product mod 2. The generator algorithm for P will choose (f, tp) ← F-Gen(1^k) and then output (p, tp) with p defined as above. Notice that p is computable in PPT if f is. The inversion algorithm P-Inv is given p, the trapdoor tp, and a triple (y, r, σ). It first runs the inversion algorithm F-Inv of F on inputs f, tp, y to obtain x′, and then outputs the bit b′ = σ ⊕ (x′ · r). It is clear that the inversion algorithm is not always successful, but in the next claim we prove that it is successful appreciably more often than random guessing.

Claim. P is an unapproximable trapdoor predicate family, with decryption error at most 1/2 − 1/[2Q(k)].

Proof. We know that F is one-way. Thus, the inner product is a hardcore bit for F [GoLe]. This implies that P is unapproximable. It is left to show that the decryption error of P is as claimed, namely that DecErrP(P-Inv, k) (as defined in Equation (1)) is at most 1/2 − 1/[2Q(k)]. Fix f, tp, b, let x, r be chosen at random as by p(b), let y = f(x), let σ = b ⊕ (x · r), let x′ ← F-Inv(f, tp, y), and let b′ = σ ⊕ (x′ · r). Notice that if x′ = x then b′ = b, but if x′ ≠ x then the random choice of r guarantees that b′ = b with probability at most 1/2. (Because F-Inv, who generates x′, gets no information about r.) The chance that x = x′ is at least 1/Q(k) (because F-Inv gets no information about x other than that f(x) = y), so

DecErrP(P-Inv, k) ≤ (1 − 1/Q(k)) · 1/2 = 1/2 − 1/[2Q(k)],

as desired. □
Now we can iterate the construction q(k) = Θ(k·Q(k)²) times independently and decrypt via a majority vote to reduce the decryption error to e^{−k}. In more detail, our final predicate family P^q = {P^q_k}k∈N is as follows. An instance p^q ∈ [P^q_k] is still described by a function f ∈ [Fk] and defined as p^q(b) = p(b)‖⋯‖p(b), meaning it consists of q(k) repetitions of the original algorithm p on independent
coins. The inversion algorithm P^q-Inv is given the trapdoor tp and a sequence of triples (y1, r1, σ1)‖⋯‖(y_{q(k)}, r_{q(k)}, σ_{q(k)}). For i = 1, …, q(k) it lets b′_i = P-Inv(p, tp, (yi, ri, σi)). It outputs b′, which is 1 if the majority of the values b′_1, …, b′_{q(k)} are 1, and 0 otherwise. Chernoff bounds show that DecErr_{P^q}(P^q-Inv, k) ≤ e^{−k}. Furthermore, standard “hybrid” arguments [GoMi,Ya] show that P^q inherits the unapproximability of P. This concludes the proof of Theorem 2.

Remark 3. Notice that Theorem 2 holds even if the family F only satisfies a very weak trapdoor property: namely, that F-Inv produces an element of f⁻¹(y) with probability at least 1/p(k) for some polynomial p. Essentially the same proof will show that P-Inv can guess b correctly with probability at least 1/2 + 1/[2Q(k)p(k)].
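A minimal end-to-end sketch of this two-step construction, under stated toy assumptions: the function f below drops the low bit of its input, so it is 2-to-1 (Q = 2), and its "trapdoor" inverter simply returns one of the two pre-images. The names and the toy f are illustrative, not the paper's.

```python
import secrets

K = 32  # toy security parameter: bit-length of x and r

def dot(x: int, r: int) -> int:
    # inner product of bit-vectors mod 2 (the Goldreich-Levin bit)
    return bin(x & r).count("1") & 1

def f(x: int) -> int:
    return x >> 1          # 2-to-1: pre-image size Q = 2

def f_inv(y: int) -> int:
    return y << 1          # returns one of the two pre-images

def encrypt_bit(b: int, q: int) -> list:
    # q independent GL encryptions (f(x), r, b XOR <x, r>)
    return [(f(x), r, b ^ dot(x, r))
            for x, r in ((secrets.randbits(K), secrets.randbits(K))
                         for _ in range(q))]

def decrypt_bit(ct: list) -> int:
    # decode each repetition with the (sometimes wrong) pre-image, then majority-vote
    ones = sum(sigma ^ dot(f_inv(y), r) for y, r, sigma in ct)
    return int(2 * ones > len(ct))
```

Each repetition decodes correctly with probability 3/4, i.e. 1/2 + 1/(2Q) with Q = 2, and the majority vote over q repetitions drives the error down exponentially, as the Chernoff bound above indicates.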
5 From Cryptosystems to Trapdoor Functions
In this section we investigate the relation between semantically secure public-key cryptosystems and injective trapdoor functions. It is known that the existence of unapproximable trapdoor predicates is equivalent to the existence of semantically secure public-key encryption [GoMi]. It is also known that injective trapdoor one-way functions can be used to construct unapproximable trapdoor predicates [Ya] (see also [GoLe]). In this section, we ask whether the converse is true:

Question 1. Can unapproximable trapdoor predicates be used to construct injective trapdoor one-way functions?

Note the importance of the injectivity condition in Question 1. We already know that non-injective trapdoor functions can be constructed from trapdoor predicates (whether the latter are injective or not), because trapdoor predicates imply one-way functions [ImLu], which in turn imply trapdoor functions by Theorem 1. We suggest a construction which requires an additional “random looking” function G and prove that the scheme is secure when G is implemented as a random oracle (to which the adversary also has access). Hence, if it is possible to implement, using one-way functions, a function G with “sufficiently strong randomness properties” to maintain the security of this scheme, then Question 1 would have a positive answer (as one-way functions can be constructed from unapproximable trapdoor predicates [ImLu]). The key difference between trapdoor functions and trapdoor predicates is that predicates are probabilistic, in that their evaluation is a probabilistic process. Hence, our construction is essentially a de-randomization process. Suppose we have a family P of unapproximable trapdoor predicates, and we want to construct a family F of injective one-way trapdoor functions from P. A
first approach would be to take an instance p of P and construct an instance f of F as f(b1b2⋯bk ‖ r1 ‖ ⋯ ‖ rk) = p(b1; r1)‖⋯‖p(bk; rk), where k is the security parameter. Standard direct product arguments [Ya] imply that F constructed in this manner is one-way. However, F may fail to be trapdoor; the trapdoor information associated with p only allows one to recover b1, …, bk, but not r1, …, rk. Our approach to fixing this construction is to instead have r1, …, rk determined by applying some “random-looking” function G to b1, …, bk:

f(b1b2⋯bk) = p(b1; r1)‖⋯‖p(bk; rk), where r1‖⋯‖rk = G(b1⋯bk).

Since G must be length-increasing, an obvious choice for G is a pseudo-random generator. A somewhat circular intuitive argument can be made for the security of this construction: if one does not know b1, …, bk, then r1, …, rk “look random,” and if r1, …, rk “look random,” then it should be hard to recover b1, …, bk by the unapproximability of P. In the full version of the paper [BHSV], we show that this argument is in fact false, in that there is a choice of an unapproximable trapdoor predicate P and a pseudorandom generator G for which the resulting scheme is insecure. However, it is still possible that there are choices of functions G that make the above secure. Below we show that the scheme is secure when G is implemented as a truly random function, i.e., a random oracle (to which the adversary also has access). Intuitively, having access to the oracle does not help the adversary recover b1⋯bk for the following reason: the values of the oracle are irrelevant except at b1⋯bk, as they are just random strings that have nothing to do with b1⋯bk or f(b1⋯bk). The adversary’s behavior is independent of the value of the oracle at b1⋯bk unless the adversary queries the oracle at b1⋯bk.
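The derandomized construction can be sketched generically. Below, SHA-256 in counter mode merely stands in for the "random looking" G (an illustrative assumption; as the text notes, an arbitrary pseudorandom generator can actually fail here), and p is any probabilistic predicate supplied as a function of (b, r):

```python
import hashlib

def G(bits: str, nbytes: int) -> bytes:
    # Stand-in for the "random looking" length-increasing function (assumption).
    out, ctr = b"", 0
    while len(out) < nbytes:
        out += hashlib.sha256(ctr.to_bytes(4, "big") + bits.encode()).digest()
        ctr += 1
    return out[:nbytes]

def derandomized_f(p, bits: str, coin_len: int) -> list:
    # f(b1..bk) = p(b1; r1) || ... || p(bk; rk), where r1..rk = G(b1..bk)
    coins = G(bits, len(bits) * coin_len)
    return [p(int(b), coins[i * coin_len:(i + 1) * coin_len])
            for i, b in enumerate(bits)]
```

Because the coins are derived from the input bits themselves, the wrapper is a deterministic function of b1⋯bk, which is exactly what distinguishes a function family from a probabilistic predicate.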
On the other hand, if the adversary queries the oracle at b1 · · · bk , it must already “know” b1 · · · bk . Specifically, if the adversary queries the oracle at b1 · · · bk with non-negligible probability then it can invert f with non-negligible probability without making the oracle call, by outputting the query. We now proceed with a more formal description of the random oracle model and our result. The random oracle model. In any cryptographic scheme which operates in the random oracle model, all parties are given (in addition to their usual resources) the ability to make oracle queries [BeRo]. It is postulated that all oracle queries, independent of the party which makes them, are answered by a single function, denoted O, which is uniformly selected among all possible functions (where the set of possible functions is determined by the security parameter). The definitions of families of functions and predicates are adapted to the random oracle model in a straightforward manner: We associate some fixed polynomial Q with each family of functions or predicates, such that on security parameter k all the algorithms in the above definitions are given oracle access to a function O : {0, 1}∗ → {0, 1}Q(k) . The probabilities in these definitions are then taken over the randomness of these algorithms and also over the choice of O uniformly at random among all such functions.
Theorem 3. If there exists a family of unapproximable trapdoor predicates, then there exists a family of injective trapdoor one-way functions in the random oracle model.

Remark 4. Theorem 3 still holds even if the hypothesis is weakened to only require the existence of a family of unapproximable trapdoor predicates in the random oracle model. To see that this hypothesis is weaker, note that a family of unapproximable trapdoor predicates (in the standard, non-oracle model) remains unapproximable in the random oracle model, as the oracle only provides randomness which the adversary can generate on its own. See Sections 1.2 and 1.3 for a discussion of the interpretation of such a result. We now proceed to the proof.

5.1 Proof of Theorem 3
Let P = {Pk}k∈N be a family of unapproximable trapdoor predicates. Let q(k) be a polynomial upper bound on the number of random bits used by any p ∈ Pk. When used with security parameter k, we view the oracle as a function O : {0, 1}* → {0, 1}^{kq(k)}. We define a family F = {Fk}k∈N of trapdoor functions in the random oracle model as follows: we associate to any p ∈ [Pk] the function f defined on input b1…bk ∈ {0, 1}^k by

f(b1⋯bk) = p(b1; r1)‖⋯‖p(bk; rk), where r1‖⋯‖rk = O(b1⋯bk), ri ∈ {0, 1}^{q(k)}.

The generator F-Gen takes input 1^k, runs (p, tp) ← P-Gen(1^k) and outputs (f, tp) where f is as defined above. It is clear that f can be evaluated in polynomial time using the evaluator P-Eval for p. Notice that f can be inverted given the trapdoor information. Given f, tp, and y1‖⋯‖yk = f(b1…bk), inverter F-Inv computes bi = P-Inv(p, tp, yi) for i = 1, …, k, and outputs b1…bk. Furthermore, f is injective because P has zero decryption error: in this inversion process, P-Inv correctly returns bi, so we correctly recover the full input. It remains to show that F is one-way.

Claim. F is one-way.

We prove this claim by describing several probabilistic experiments, modifying the role of the oracle with each experiment. The first arises from the definition of a family of one-way functions in the random oracle model. Let A be any PPT, let k be any positive integer, and let q = q(k).

Experiment 1.
(1) Choose a random oracle O : {0, 1}* → {0, 1}^{kq(k)}.
(2) Choose p ← Pk.
(3) Select b1, …, bk uniformly and independently from {0, 1}.
(4) Let r1‖⋯‖rk = O(b1⋯bk), where |ri| = q(k) for each i.
(5) Let x = p(b1; r1)‖⋯‖p(bk; rk).
(6) Compute z ← A^O(1^k, p, x).
We need to prove the following:

Claim. For every PPT A, the probability that z = b1⋯bk in Experiment 1 is a negligible function of k.

To prove this claim, we first analyze what happens when the ri's are chosen independently of the oracle, as in the following experiment. Let A be any PPT, let k be any positive integer, and let q = q(k).

Experiment 2.
(1)–(3) As in Experiment 1.
(4) Select r1, …, rk uniformly and independently from {0, 1}^q.
(5)–(6) As in Experiment 1.

Claim. For every PPT A, the probability that z = b1⋯bk in Experiment 2 is a negligible function of k.

This follows from standard direct product arguments [Ya,GNW]; specifically, it is a special case of the uniform-complexity version of the Concatenation Lemma in [GNW, Lemma 10].

Claim. For every PPT A, the probability that O is queried at the point b1⋯bk during the execution of A^O(1^k, p, x) in Step 6 of Experiment 2 is a negligible function of k.

Proof. Suppose the probability that O is queried at the point b1⋯bk were greater than 1/s(k) for infinitely many k, where s is a polynomial. Then we could obtain a PPT A′ that violates the Experiment 2 claim as follows. Let t(k) be a polynomial bound on the running time of A. On input (1^k, p, x), A′ does the following:
(1) Select i uniformly from {1, …, t(k)}.
(2) Simulate A on input (1^k, p, x), with two changes: (a) replace the oracle responses with strings randomly selected on-line, with the condition that multiple queries at the same point give the same answer; (b) halt the simulation at the i-th oracle query and let w be this query.
(3) Output w.
Then A′, when used in Experiment 2, outputs b1⋯bk with probability greater than 1/(s(k)t(k)) for infinitely many k, which contradicts the Experiment 2 claim. □

In order to deduce the Experiment 1 claim from the two claims above, we give an equivalent reformulation of Experiment 1. Let A be any PPT, let k be any positive integer, and let q = q(k).
Experiment 3.
(1)–(3) As in Experiment 1.
(4) Select r1, …, rk uniformly and independently from {0, 1}^q.
(5) Let x = p(b1; r1)‖⋯‖p(bk; rk).
(6) Modify O at location b1⋯bk to have value r1‖⋯‖rk.
(7) Compute z ← A^O(1^k, p, x).

We now argue that Experiment 3 is equivalent to Experiment 1. In Experiment 1, r1, …, rk are uniformly and independently distributed in {0, 1}^q, and after Step 5 of Experiment 1 the only information about the oracle that has been used is that r1‖⋯‖rk = O(b1⋯bk). Thus, the final distributions of all random variables are identical in the two experiments, and it suffices to prove that z = b1⋯bk with negligible probability in Experiment 3 rather than Experiment 1.

Proof. Let E be the event that z = b1⋯bk in Experiment 3. Let F be the event that O is queried at the point b1⋯bk during the execution of A^O(1^k, p, x) in Step 7 of Experiment 3. To show that E occurs with negligible probability, it suffices to argue that both F and E ∧ ¬F occur with negligible probability. First we show that F occurs with negligible probability. Notice that whether or not A^O queries O at b1⋯bk in Experiment 3 will not change if Step 6 is removed. This is because its behavior cannot be affected by the change in O(b1⋯bk) until it has already queried that position of the oracle. If Step 6 is removed from Experiment 3, we obtain Experiment 2. Hence, the probability of F is negligible by the oracle-query claim above. Similarly, the probability that [z = b1⋯bk and A^O never queries the oracle at b1⋯bk] will not change if Step 6 is removed. Thus, the probability of E ∧ ¬F is bounded above by the probability that z = b1⋯bk in Experiment 2, which is negligible by the Experiment 2 claim above. □

Remark 5. If the family of unapproximable trapdoor predicates we start with has negligible decryption error, then the family of trapdoor functions we construct will in general also have negligible decryption error and may fail to be injective with some small probability.
By first reducing the decryption error of the predicate family to exp(−Ω(k³)) as in the proof of Theorem 2, and then using the oracle to derandomize the inversion algorithm, one can produce an injective family that has zero decryption error with probability 1 − 2^{−k} (where the probability is taken just over the choice of the oracle).
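The oracle manipulations used in this proof, answering fresh queries with uniform strings and reprogramming O at a single point, can be sketched with a lazily sampled oracle (an illustrative implementation, not part of the paper):

```python
import secrets

class LazyRandomOracle:
    # Random oracle by lazy sampling: each point gets a fresh uniform value
    # at first query (as in the on-line simulation inside the proof), and
    # program() overwrites one location (Step 6 of Experiment 3).
    def __init__(self, out_bytes: int):
        self.out_bytes = out_bytes
        self.table = {}

    def query(self, x: bytes) -> bytes:
        if x not in self.table:
            self.table[x] = secrets.token_bytes(self.out_bytes)
        return self.table[x]

    def program(self, x: bytes, value: bytes) -> None:
        self.table[x] = value
```

The equivalence of Experiments 1 and 3 is visible here: until the adversary queries x, whether table[x] was fixed in advance or programmed later makes no observable difference.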
Acknowledgments The first author was supported by a 1996 Packard Foundation Fellowship in Science and Engineering, and by NSF CAREER Award CCR-9624439. The third and fourth authors were supported by DOD/NDSEG Graduate Fellowships and partially by DARPA grant DABT-96-C-0018.
The starting point of this research was a question posed to us by Shafi Goldwasser, namely whether trapdoor permutations could be built from the assumptions underlying the Ajtai-Dwork cryptosystem. Thanks to Oded Goldreich and the members of the Crypto 98 program committee for their comments on the paper.
References

[AjDw] M. Ajtai and C. Dwork. A public-key cryptosystem with worst-case/average-case equivalence. Proceedings of the 29th Annual Symposium on the Theory of Computing, ACM, 1997.
[AMM] Adleman, Manders and Miller. On taking roots in finite fields. Proceedings of the 18th Symposium on Foundations of Computer Science, IEEE, 1977.
[BHSV] M. Bellare, S. Halevi, A. Sahai, and S. Vadhan. Many-to-one trapdoor functions and their relation to public-key cryptosystems. Full version of this paper, available via http://www-cse.ucsd.edu/users/mihir.
[BeRo] M. Bellare and P. Rogaway. Random oracles are practical: a paradigm for designing efficient protocols. Proceedings of the First Annual Conference on Computer and Communications Security, ACM, 1993.
[Be] E. Berlekamp. Factoring polynomials over large finite fields. Mathematics of Computation, Vol. 24, 1970, pp. 713–735.
[BlMi] M. Blum and S. Micali. How to generate cryptographically strong sequences of pseudo-random bits. SIAM Journal on Computing, Vol. 13, No. 4, November 1984, pp. 850–864.
[Ca] R. Canetti. Towards realizing random oracles: hash functions that hide all partial information. Advances in Cryptology – Crypto 97 Proceedings, Lecture Notes in Computer Science Vol. 1294, B. Kaliski ed., Springer-Verlag, 1997.
[CGH] R. Canetti, O. Goldreich and S. Halevi. The random oracle model, revisited. Proceedings of the 30th Annual Symposium on the Theory of Computing, ACM, 1998.
[DiHe] W. Diffie and M. Hellman. New directions in cryptography. IEEE Transactions on Information Theory, Vol. IT-22, No. 6, November 1976, pp. 644–654.
[DDN] D. Dolev, C. Dwork, and M. Naor. Non-malleable cryptography. Proceedings of the 23rd Annual Symposium on the Theory of Computing, ACM, 1991.
[ElG] T. El Gamal. A public key cryptosystem and a signature scheme based on discrete logarithms. IEEE Transactions on Information Theory, Vol. 31, 1985, pp. 469–472.
[GoLe] O. Goldreich and L. Levin. A hard predicate for all one-way functions. Proceedings of the 21st Annual Symposium on the Theory of Computing, ACM, 1989.
[GoMi] S. Goldwasser and S. Micali. Probabilistic encryption. Journal of Computer and System Sciences, Vol. 28, April 1984, pp. 270–299.
[GNW] O. Goldreich, N. Nisan, and A. Wigderson. On Yao's XOR Lemma. Electronic Colloquium on Computational Complexity, TR95-050, March 1995. http://www.eccc.uni-trier.de/eccc/
[HILL] J. Håstad, R. Impagliazzo, L. Levin and M. Luby. Construction of a pseudo-random generator from any one-way function. Manuscript. Earlier versions in STOC 89 and STOC 90.
[ImLu] R. Impagliazzo and M. Luby. One-way functions are essential for complexity-based cryptography. Proceedings of the 30th Symposium on Foundations of Computer Science, IEEE, 1989.
[ImRu] R. Impagliazzo and S. Rudich. Limits on the provable consequences of one-way permutations. Proceedings of the 21st Annual Symposium on the Theory of Computing, ACM, 1989.
[NaYu] M. Naor and M. Yung. Public-key cryptosystems provably secure against chosen ciphertext attacks. Proceedings of the 22nd Annual Symposium on the Theory of Computing, ACM, 1990.
[Rab] M. Rabin. Digitalized signatures and public key functions as intractable as factoring. MIT/LCS/TR-212, 1979.
[Ya] A. Yao. Theory and applications of trapdoor functions. Proceedings of the 23rd Symposium on Foundations of Computer Science, IEEE, 1982.
Authentication, Enhanced Security and Error Correcting Codes
Extended Abstract
Yonatan Aumann (Department of Mathematics and Computer Science, Bar Ilan University, Ramat-Gan, Israel, [email protected]) and Michael O. Rabin (DEAS, Harvard University, Cambridge, MA, and Institute of Computer Science, The Hebrew University, Jerusalem, Israel, [email protected])
Abstract. In electronic communications and in access to systems, the issue of authentication of the Sender S of a message M, as well as of the message itself, is of paramount importance. Recently S. Goldwasser has raised the additional issue of Deniable Authentication, where the sender S authenticates the message M to the Receiver's (R) satisfaction, but can later deny his authorship of M even to an Inquisitor INQ who has listened to the exchange between S and R and who gains access to all of the secret information used by S and R. We present two practical schemes for Deniable Authentication of messages M of arbitrary length n. In both schemes the Receiver R is assured with probability greater than 1 − 2^-k, where k is a chosen security parameter, that M originated with the Sender S. Deniability is absolute in the information-theoretic sense. The first scheme requires 2.4kn XOR operations on bits and one public key encoding and decoding of a short message. The second scheme requires the same number of XOR operations and k multiplications mod N, where N is some fixed product of two large primes. A key new feature of our method is the use of a Shannon-style error correction code. Traditional authentication for a long message M starts by hashing M down to a standard word size. We expand M through error correction. The first Deniable Authentication method is provably valid for any encryption scheme with minimal security properties, i.e. this method is generic. The second Deniable Authentication method is provably valid under the usual assumption that factorization is intractable.
Background and New Results
The question of authentication of transmitted messages is of paramount importance. When a Sender S communicates with a receiver R and sends him a message M, it does not suffice for R to authenticate (identify) S in order to know that M has actually originated with S. An Adversary AD can actively tap the line between S and R, and after R has authenticated the sender S, AD can block the Sender's transmission and inject his own message M̄ to R.
H. Krawczyk (Ed.): CRYPTO'98, LNCS 1462, pp. 299–303, 1998. © Springer-Verlag Berlin Heidelberg 1998
There is also an obvious need for Deniable Authentication (DA). In electronic voting schemes DA is a tool for providing freedom from coercion. In negotiations over the Internet it may be desirable for S to be able to make price offers M to R in a manner that prevents R from showing the offer to another party in order to elicit a better offer. Namely, R cannot prove to the third party that S has made the offer contained in M. It should be noticed that the manner in which the Internet and Electronic Commerce are evolving calls for the widespread use of public-key signatures and for public-key based schemes for establishing shared secret keys. The usual approach to creating a Message Authentication Code (MAC) assumes that S and R share a secret key K. The message M is hashed down to a fixed block size b by use of a hash function H(K, M) which folds the key K into the hashing process. The Sender S then sends (M, H(K, M)) to R, who verifies the tag H(K, M). Alternatively, S digitally signs H(M), where H is a known hash function, using a public key signature SgnS(H(M)), and R verifies SgnS(H(M)). There are a number of difficulties associated with this approach. To be efficient we need fast hash functions H and fast digital signatures. When it comes to the construction of MAC schemes that are provably secure (based on an assumption such as intractability of factoring), one has to use particularly compute-intensive hash functions such as the beautiful scheme proposed in [8,3]. As to deniability of authorship of M, it is obvious that a scheme using digital signatures in a straightforward manner also entails strict undeniability, which is the very purpose of digital signatures. As mentioned in the abstract, our schemes are highly efficient, are provably secure, and provide information-theoretic deniability. We shall outline our solutions after discussing previous work and background.
Previous Work.
Because of the significant practical importance of Message Authentication, there is a very extensive literature on MACs. This literature deals with theoretical as well as practical issues of authentication. For long messages, hashing down to a short message is the first step. In the papers that aim at creating MACs for actual systems use, there is strong emphasis on the rate, i.e. speed, of the hashing process. Let us mention here as representative important examples the papers by Wegman and Carter [14], Bellare, Canetti and Krawczyk [1], Halevi and Krawczyk [9], and Krawczyk [10]. These papers, as well as, for example, Schneier's book [12], contain a wealth of references to the literature on authentication. The present practical MACs do not require interaction. The message M, with some authenticating tag, is sent by the Sender to the Receiver, who verifies the tag. The Deniable Authentication schemes presented here do require, after transmission of the message, a small number of additional message rounds. The additional messages are of size at most O(k log n), where n is the length of the message M to be authenticated and k is the security parameter. On the other hand, these schemes do not require pre-shared secret keys for S and R. In this setting interaction seems to be necessary for Deniable Authentication. We feel that the cost of interaction is not onerous.
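As a concrete illustration of the shared-key approach to MACs described above (a tag H(K, M) with the key folded into the hashing), here is a minimal sketch; the choice of HMAC-SHA256 as the keyed hash is ours, not the paper's:

```python
import hashlib
import hmac

def mac_tag(key: bytes, message: bytes) -> bytes:
    """Compute the authentication tag H(K, M): a hash of M with the key K
    folded into the hashing process (HMAC-SHA256 as a concrete choice)."""
    return hmac.new(key, message, hashlib.sha256).digest()

def mac_verify(key: bytes, message: bytes, tag: bytes) -> bool:
    """Receiver recomputes the tag and compares in constant time."""
    return hmac.compare_digest(mac_tag(key, message), tag)
```

The Sender transmits (M, tag) and no interaction is needed, but the scheme presupposes a pre-shared key K, which the schemes of this paper avoid.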
Canetti et al. [2] solve a problem closely related to the Deniable Authentication problem, namely the problem of deniable encryption, in a model where the Inquisitor INQ listens to the transmission between S and R. In their model the Sender is identified, in the sense that the eavesdropper knows that he is listening to a conversation between S and R. The only issue for him is to be able to prove what the contents of that conversation were. The sender S sends an encrypted message E(M) = C to R, where E is a probabilistic encryption function. INQ, who knows C, can then go to S and/or R and interrogate them as to the value of M. [2] provide deniable encryption in the sense that S or R can produce any other message M̄ so that C = E(M̄). If one assumes a secret one-time pad of length |M| = n which is shared by R and S, then the problem is trivial. The challenging problem arises in a setting where only public keys and the corresponding private keys held by the participants are used. The [2] solution provides only polynomially secure deniability, and the Inquisitor INQ is limited to polynomial computing power. If INQ can compel every participant in the protocol to reveal their private keys then deniability collapses. The protocol is compute intensive. In a new paper [4], Dwork et al. address the deniable authentication of messages as an application of concurrent zero knowledge proofs. They require a timing constraint that they call an (α, β)-assumption on the response time of processes. Their solutions directly apply to messages M shorter than the public keys used, and are compute intensive.
New Results. Coming to our solutions, we assume a model in which the Sender S and the Receiver R are connected by an insecure link. The adversaries in the schemes we construct include an Impostor who tries to impersonate S and send to R a message M̄ appearing to originate from S. The Impostor can also be a Person In the Middle (PIM), sitting on the link between S and R, intercepting the traffic between them and injecting messages of his own. In essence, the PIM can employ the Sender S as an oracle in his attempt to fool R. Thus general chosen message attacks should also be protected against. When discussing deniability of authentication, we assume that the communication between S and R is such that listening to the transmission does not identify S. For example, S may use a notebook computer and a modem at a public telephone. We allow an Inquisitor INQ who listens on the line to the exchange between S and R. INQ later comes to S and R and compels them to reveal all the secret data, such as encryption/signature keys, used in the protocol. Even so, INQ cannot prove that the message M was authored by S. It follows that the Receiver R himself cannot prove after the fact to a third party that M was authored by S. Also, the INQ cannot impersonate R to S and elicit from S an authenticated message M to R. This may seem impossible if INQ has the capabilities of a Person In the Middle, but our schemes do have this property as well.
The central tool in our schemes is the use of an error correction code C. Let us assume messages M comprising n bits. We assume that C(M) = y1, y2, . . . , ym has the property that if M ≠ M̄ then the Hamming distance between C(M) and C(M̄) is greater than m/4, i.e. C(M) and C(M̄) differ at more than m/4 indices. For our purposes we choose a code C which is very efficient to encode. We never have a need to decode C(M). Also, in our application S and R need to compute only a fixed number, 2.4k, of (randomly chosen) bits of C(M).
For our first Deniable Authentication scheme we assume a public key encryption function ES for S (who, of course, possesses the corresponding secret decryption function DS). The Sender S sends M to R. They then create a random sequence Y = i1, . . . , ik of k different indices between 1 and m. The bits of C(M) at these indices are computed by S and by R. Sender S then deniably authenticates these bits, as well as Y, to R. Thus Deniable Authentication of the long message M is reduced to Deniable Authentication of a short message.
For our second Deniable Authentication scheme we assume a publicly available Directory containing certain public keys for each potential Sender. The sender S wants to transmit messages M = x1 x2 . . . xn, where each xi is a bit. We again employ the error correction code C, which codes M into C(M) = y1 y2 . . . ym, where m = cn (say m = 5n) and the Hamming distance between any two code words Y1 and Y2 is αm. With m = 5n we ensure α > 1/4. The code C is publicly known and is used by every Sender. The public Directory contains C and a number N = p · q chosen as a product of two large primes, where the factorization of N is not known to R (and possibly not to S either). Every potential sender S randomly chooses a0, a1, g0, . . . , gm in ZN*, computes their squares mod N, and publishes those squares A0, A1, G0, . . . , Gm in the Directory. In the full paper we give a version of our protocol that allows the size of each Sender's Directory entry to be reduced from m + 2 to log2 m + 2. The Sender S sends M to R. To authenticate M as having originated with S, the Receiver R randomly chooses L = d · k (where d > 1 depends only on c, i.e.
on the code C; for c = 5 we have d = 2.4) indices i1, . . . , iL between 1 and m (the size of the error-correction-coded message C(M) = y1 y2 . . . ym). He then computes yi1, . . . , yiL. For the code C that we use, each such computation of a yij requires just n XOR operations on bits, regardless of c. The Receiver R then conducts an L-round interaction with S. Roughly speaking, in round j the Receiver R verifies that yij is the ij-th bit in the code word of a message that S has actually sent him. The precise details and the proof of authentication are given in the full paper. Each round requires four multiplications mod N by the sender and by the receiver. If we want a more compact Directory with just log2 m + 2 words for each Sender, then the above 4 is replaced by log2 m + 3. However, precomputation by the Sender and by the Receiver (in case R will receive many authenticated messages from S) will again reduce the number of multiplications to 4. Note that the total number of multiplications is 2.4 · 4k = 9.6k for each participant, and is independent of the message length n. After this interaction, R knows, with probability of being cheated smaller than 2^-k, that M has originated with S. This is provable on the assumption that factorization of N is intractable.
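The Directory setup for the second scheme (each Sender choosing random units mod N and publishing their squares) can be sketched as follows. This is an illustrative sketch under our own naming, not the paper's implementation; the element count simply follows the list a0, a1, g0, . . . , gm above:

```python
import secrets
from math import gcd

def directory_entry(N, m):
    """Pick random elements of Z_N* (the Sender's secret square roots) and
    return them together with their squares mod N, which are what the
    second scheme publishes in the public Directory."""
    def random_unit():
        while True:
            a = secrets.randbelow(N - 2) + 2
            if gcd(a, N) == 1:       # a must be invertible mod N
                return a
    # a0, a1, g0, ..., gm: m + 3 secret roots in total
    roots = [random_unit() for _ in range(m + 3)]
    squares = [pow(a, 2, N) for a in roots]   # A0, A1, G0, ..., Gm
    return roots, squares
```

With a realistic N (a product of two large secret primes) the published squares reveal nothing useful about the roots unless N can be factored.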
We then prove that, provided that S does not conduct more than a fixed number of message-authentications simultaneously, our message authentication is deniable in the strong information-theoretic sense explained in the Abstract. Under any reasonable timing restrictions on concurrency, such as those in [4], we directly achieve deniability in the unbounded concurrency setting. The use of the intractability of extracting square roots mod N, based on the intractability of factoring N, which lies at the heart of our authentication scheme, was first introduced in [11]. Square roots mod N are used for user authentication and for digital signatures in [6] and in [5]. Zero Knowledge Proofs of languages involving squares mod N and of knowledge of square roots mod N are discussed in [7] and in [13].
References
1. M. Bellare, R. Canetti, and H. Krawczyk. Keying hash functions for message authentication. In Proceedings of Crypto '96, 1996.
2. R. Canetti, C. Dwork, M. Naor, and R. Ostrovsky. Deniable encryption. In Proceedings of Crypto '97, 1997.
3. I. Damgård. Collision free hash functions. In Eurocrypt '87, pages 203–216, 1987.
4. C. Dwork, M. Naor, and A. Sahai. Concurrent zero knowledge. In Proceedings of the 30th STOC, 1998.
5. U. Feige, A. Fiat, and A. Shamir. Zero knowledge proofs of identity. In Proceedings of the 19th STOC, 1987.
6. A. Fiat and A. Shamir. How to prove yourself: Practical solutions to identification and signature problems. In Proceedings of Crypto '86, pages 186–194, 1987.
7. S. Goldwasser, S. Micali, and C. Rackoff. The knowledge complexity of interactive proof systems. SIAM Journal on Computing, 18:186–208, 1989.
8. S. Goldwasser, S. Micali, and R. Rivest. A secure digital signature scheme. SIAM Journal on Computing, 17(2):281–308, 1988.
9. S. Halevi and H. Krawczyk. MMH: Message authentication in software in the Gbit/second rates. In Proceedings of the 4th Workshop on Fast Software Encryption, 1997.
10. H. Krawczyk. LFSR-based hashing and authentication. In Proceedings of Crypto '94, pages 129–139, 1994.
11. M. O. Rabin. Digitized signatures and public key functions as intractable as factorization. MIT Laboratory for Computer Science Technical Report LCS/TR-212, MIT, 1979.
12. B. Schneier. Applied Cryptography: Protocols, Algorithms, and Source Code in C. John Wiley and Sons, 1995.
13. M. Tompa and H. Woll. Random self-reducibility and zero-knowledge interactive proofs of possession of information. In Proceedings of the 28th FOCS, pages 472–482, 1987.
14. M. N. Wegman and J. L. Carter. New hash functions and their use in authentication and set equality. JCSS, 22:265–279, 1981.
An Efficient Discrete Log Pseudo Random Generator
Sarvar Patel and Ganapathy S. Sundaram
Bell Labs, 67 Whippany Rd, Whippany, NJ 07981, USA
{sarvar,ganeshs}@bell-labs.com
Abstract. The exponentiation function in a finite field of order p (a prime number) is believed to be a one-way function. It is well known that O(log log p) bits are simultaneously hard for this function. We consider a special case of this problem, the discrete logarithm with short exponents, which is also believed to be hard to compute. Under this intractability assumption we show that discrete exponentiation modulo a prime p can hide n − ω(log n) bits (n = ⌈log p⌉ and p = 2q + 1, where q is also a prime). We prove simultaneous security by showing that any information about the n − ω(log n) bits can be used to discover the discrete log of g^s mod p, where s has ω(log n) bits. For all practical purposes, the size of s can be a constant c bits. This leads to a very efficient pseudo-random number generator which produces n − c bits per iteration. For example, when n = 1024 bits and c = 128 bits our pseudo-random number generator produces a little less than 900 bits per exponentiation.
1 Introduction
A function f is said to be one-way if it is easy to compute but hard to invert. With appropriate selection of parameters, the discrete exponentiation function over a finite field, g^x mod p, is believed to be a one-way function (where g is a generator of the cyclic group of nonzero elements in the finite field). The intractability of its inverse, the discrete logarithm problem, is the basis of various encryption, signature and key agreement schemes. Apart from finite fields, other finite groups have been considered in the context of discrete exponentiation. One such example is the group of points on an elliptic curve over a finite field. Koblitz and Miller (independently) [15], [17] considered the group law on an elliptic curve to define a public key encryption scheme, suggesting that elliptic curve addition is also a one-way function. Another number-theoretic problem that is considered to be hard is the problem of factoring integers. Examples of functions relying on factoring which are believed to be one-way are the RSA and Rabin functions. Closely related to factoring is the problem of deciding quadratic residuosity modulo a composite integer. A concept which is intimately connected to one-way functions is the notion of hard bits, which was first introduced by Blum and Micali.
H. Krawczyk (Ed.): CRYPTO'98, LNCS 1462, pp. 304–317, 1998. © Springer-Verlag Berlin Heidelberg 1998
Informally, a hard bit B(·) for a one-way function f(·) is a bit which is as hard to compute as it
is to invert f. Blum and Micali showed that the most significant bit is a hard bit for the discrete logarithm problem over a finite field. To be precise, their notion of most significant bit corresponds to the Boolean predicate which is one if the index of the exponent is greater than (p − 1)/2 and zero otherwise. They defined and proved this hard bit and successfully used it to show the importance of hard bits in secure pseudo-random bit generation. Soon after, the hard bits of the RSA and Rabin functions were also discovered by Ben-Or et al. [2], which led to a new secure pseudo-random bit generator. Blum, Blum and Shub [3] used the quadratic residue problem over a composite integer to design yet another secure pseudo-random bit generator. Their work was based on the security of the quadratic residue problem, which was investigated by Goldwasser and Micali [8]. Later Goldreich and Levin [7] proved that all one-way functions have a hard bit. More generally, they were able to show that for any one-way function a logarithmic number of one-bit predicates are simultaneously hard. This led to the work of [9], where they proved how to use any one-way function to build secure pseudo-random bit generators. The uses of pseudo-random bits in cryptography include one-time-pad style encryption and bit commitment schemes, to name a few. All the above generators based on one-bit predicates suffer from the same problem, namely they are too slow: all of them output one bit per modular exponentiation. The concept of simultaneous hardness is the first step in speeding things up. Intuitively, the notion of simultaneous hardness applied to a group of bits associated to a one-way function f states that computing any information whatsoever about the given group of bits from f(x) alone is computationally as hard as inverting the one-way function. Using this notion one can extract a collection of bits per operation, and hence the speed-up.
Long and Wigderson [16] and Peralta [20] showed that log log p bits of the discrete log modulo a prime number p are simultaneously hard. On the other hand, the works of Vazirani and Vazirani [24] and Alexi et al. [1] address the notion of simultaneous hardness of RSA and Rabin bits. Later Kaliski [12] showed individual hardness of bits (in the Blum-Micali sense) of the elliptic curve group addition problem using a novel oracle proof technique applicable to any finite Abelian group. His methods extend to show simultaneous hardness (stated but not proved in the paper) of log n bits, where n is the order of the group. More recently, Håstad, Schrift and Shamir [10] have designed a much more efficient generator which produces n/2 bits per iteration, where n is the number of bits of the modulus. The one-way function they have considered is the discrete exponentiation function modulo a composite integer (to be precise, a Blum integer). Once again the method of generation relies on the proof that n/2 bits of every iteration are simultaneously hard. The use of a composite modulus allows them to relate individual and simultaneous hardness of bits to factoring the modulus. The common thread in all these works is the results of Yao contained in his seminal work [25], which laid the foundations of a complexity-theoretic approach to cryptography and paved the way for a quantification of security in terms of known hard problems.
In this paper we construct a very efficient cryptographic pseudo-random bit generator attached to modular exponentiation in a finite field of cardinality p (where p is a prime number of the form 2q + 1, and q is also prime). This assumption on the structure of the finite field holds for the entire paper. We show that n − ω(log n) bits of every iteration are simultaneously secure. (Here 2^O(log n) is a polynomial value in n, and O(log n) is the order of the number of bits needed to represent a polynomial in n; note that 2^ω(log n) is greater than any polynomial value in n, and ω(log n) is the order of the number of bits needed to represent it.) Hence each iteration produces more bits than any other method discovered so far. In fact, the construction that we present here is maximal, since if we extract more bits then only O(log n) bits would have to be guessed, which can be exhaustively searched in polynomial time (since 2^O(log n) is polynomial in n). The novelty in this work is to relate the security of the random bit generation to the problem of solving the discrete logarithm with short exponents. The motivation for this technique is derived from the above-mentioned work of [10]: although they use a modular exponentiation function modulo a composite, the security of their system is related to factoring the underlying modulus. In a similar but not so obvious sense, we use exponentiation in a finite field for the generation but relate the security to the strength of the discrete log problem (over the same prime modulus) with short exponents. The proofs are simple and rely on known techniques. In this paper an oracle for the i-th bit gives the value of the i-th bit when the binary representation is used for the argument. This is a different representation of the i-th bit from that used by Blum-Micali and Long-Wigderson. The paper is organized as follows: In section 2 we discuss the discrete log problem and in particular the short exponent discrete log problem.
Details of the oracles and hardness of bits are formalized in this section. In section 3 we show that the trailing n − ω(log n) bits are individually hard with respect to the discrete logarithm problem with short exponents. In section 4 we prove simultaneous hardness of the trailing n − ω(log n) bits; once again this is with respect to the discrete log with short exponents problem. In section 5 we discuss the design of the pseudo-random generator and provide the proof of security; we conclude in section 6. In the appendix, we discuss some extensions of this work to other Abelian groups and possible ways to improve the efficiency of the pseudo-random generator.
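The shape of the generator outlined in the abstract can be sketched as follows. The exact state-update rule is specified in section 5; here, deriving the next short exponent from the top c bits of the result is our own placeholder assumption, made only to show the shape of one iteration (exponentiate with a c-bit exponent, output n − c bits):

```python
def prg_iteration(p, g, state, n, c):
    """One iteration of a discrete-log-style generator: y = g^state mod p with
    a short (c-bit) exponent; the low n - c bits of y are output as the
    pseudorandom bits, and the top c bits serve as the next short exponent
    (a placeholder rule for illustration, not the paper's rule)."""
    y = pow(g, state, p)
    output = y & ((1 << (n - c)) - 1)   # n - c pseudorandom bits per iteration
    next_state = y >> (n - c)           # at most c bits
    return next_state, output
```

Even with toy parameters the cost profile is visible: one modular exponentiation with a short exponent yields n − c output bits, versus one bit per exponentiation for the predicate-based generators cited above.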
2 The Discrete Logarithm Problem
We first define the discrete logarithm problem. Let p be a prime and g a generator for Zp*, the multiplicative cyclic group of nonzero elements in the finite field of order p. Then for 1 ≤ x ≤ p − 1 the function which maps x to g^x mod p defines a permutation.
Problem 1. The discrete logarithm problem is to find x given y ∈ Zp* such that g^x ≡ y (mod p).
Let n = ⌈log p⌉ be the length of p in binary; then g^x mod p is computable in Poly(n) time. However, there is no known deterministic or randomized algorithm which can compute the discrete logarithm in a Poly(n) number of steps. The best algorithm to compute the discrete logarithm in a finite field of order p is the index calculus method. Even this is infeasible if p is appropriately large (e.g. 1024 bits), since its complexity is subexponential, though not polynomial, in n. On the other hand, for primes such that p − 1 has only small factors, there are very fast algorithms whose complexity equals the complexity of the discrete log in a field whose cardinality is equal to the largest prime factor of p − 1. This algorithm is due to Pohlig and Hellman [21].
2.1 Discrete Logarithm with Short Exponents
For efficiency purposes the exponent x is sometimes restricted to c bits (e.g. c = 128 or 160 bits), since this requires fewer multiplications. There are square-root time algorithms, due to Shanks [14] and Pollard [22], which find such an x in 2^(c/2) steps. Thus c should be at least 128 bits to provide 64 bits of security; by this we mean an attacker should perform at least 2^64 operations in order to crack the discrete logarithm using these algorithms. At the moment, there is no faster way to discover the discrete logarithm even with x so restricted. In particular, the complexity of index calculus algorithms is a function of the size of the entire group and does not depend on the size of the exponent. We will also restrict x; in particular, we will restrict it to be slightly greater than O(log n) bits, but not to save on multiplications. The size of the exponent will be denoted ω(log n), described in section 1. Hence, even with the square-root attack one needs more than 2^O(log n) steps, i.e. more than a polynomial in n number of steps. The hard problem we consider in this paper is the inverse of this special case of the discrete exponentiation function. In other words:
Problem 2. Let s be an integer which is significantly smaller than p. The DLSE problem is to find s given y ∈ Zp* such that g^s ≡ y (mod p).
The DLSE problem has been studied by [19] in the context of the Diffie-Hellman key agreement scheme. The use of short exponents in the Diffie-Hellman protocol is to speed up the process of exponentiation. Typically the cost of computing g^x is linearly related to the bit length of x; hence real-time computing costs have motivated the use of low-order exponents. Care is necessary to ensure that such optimizations do not lead to security weaknesses. The above-mentioned paper [19] presents a set of attacks and methods to rectify the situation. In particular, their conclusions suggest the use of safe primes.
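The square-root attack mentioned above (Shanks' baby-step giant-step method) is easy to state in code; this sketch finds a c-bit exponent in O(2^(c/2)) time and space:

```python
from math import isqrt

def bsgs_short_exponent(g, y, p, c):
    """Find x < 2**c with g^x ≡ y (mod p) by baby-step giant-step:
    write x = i*m + j with m ≈ 2^(c/2), tabulate the baby steps g^j,
    then walk the giant steps y * g^(-i*m) until a collision."""
    m = isqrt(1 << c) + 1
    baby = {}                      # g^j mod p -> j
    t = 1
    for j in range(m):
        baby.setdefault(t, j)
        t = (t * g) % p
    g_inv_m = pow(g, -m, p)        # g^(-m) mod p (Python 3.8+)
    gamma = y
    for i in range(m):
        if gamma in baby:
            return i * m + baby[gamma]
        gamma = (gamma * g_inv_m) % p
    return None                    # no exponent below 2**c exists
```

Note that the running time depends only on the exponent bound c, not on the size of the group, which is exactly why short exponents must not be made too short.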
Another example of the use of shorter exponents is in the generation of digital signatures. The Digital Signature Standard (DSS) published by NIST [6] is based on the discrete logarithm problem. It is a modification of the ElGamal signature scheme. The ElGamal scheme usually leads to a signature having 2n bits, where n is the number of bits of p (the modulus). For potential applications
a shorter signature is desirable. DSS modifies the ElGamal scheme so that a 160-bit message is signed using a 320-bit signature, but computations are all done modulo a 512-bit prime. The methodology involves restricting all computations to a subgroup of size 2^160. The assumed security of the scheme is based on two different but very related problems. The first of these is the discrete log in the entire group, which uses a 512-bit modulus, where the index calculus algorithm applies. The second is the discrete log problem in the subgroup of the cyclic group of nonzero elements in the finite field. Here Shanks' square-root algorithm reduces the complexity to O(2^80), since the exponent is only 160 bits. Although the DLSE and the subgroup discrete log problems are not equivalent, the square-root time attacks apply to both problems.
2.2 Hardness of Bits
As indicated in the introduction, the notion of hard bits is intimately connected to that of a one-way function. In this paper we define a mild generalization of hard bits.
Definition 3. Let f(x) and f′(s) be one-way functions. Let B : {0, 1}* → {0, 1} be a Boolean predicate. Given f(x) for some x, the predicate B(x) is said to be f′-hard if computing B(x) is as hard as inverting f′(s), i.e. discovering s.
When f and f′ are the same, as are x and s, then we have the usual definition of hard bits. For example, discovering the Blum-Micali bit is as hard as computing the discrete logarithm. But in this paper we allow f and f′ to be different. An example of this phenomenon is discrete exponentiation modulo a composite modulus. Here the discrete logarithm in the ring of integers modulo a composite is a hard function, and so is factoring. So f(x) = g^x mod m, and f′(s) = f′(p, q) = p · q = m. Clearly, there are Boolean predicates B(x) which are f-hard, but there may be other predicates which are f′-hard but not f-hard. That is, computing B(x) is as hard as factoring the modulus m, but maybe not as hard as the discrete log modulo a composite [10]. In this paper we consider a similar situation. We consider the one-way function of discrete exponentiation, but we prove that the n − ω(log n) bits of the exponent are DLSE-simultaneously hard. That is, for us f(x) = g^x mod p and f′(s) = g^s mod p, where s is a short exponent. The best previous result showed simultaneous hardness of n/2 of the bits [10], but our result shows simultaneous hardness for almost all of the n bits. Our results are maximal. In other words, in a pseudo-random generator, if in any iteration we hide only O(log n) or fewer bits, then any attacker can compute the seed of the generator by making a polynomial number of guesses. Hence one cannot further improve on these results regarding the number of bits produced per iteration.
2.3 Binary Representation
The number x can be represented in binary as b_n · 2^(n−1) + b_(n−1) · 2^(n−2) + . . . + b_2 · 2^1 + b_1 · 2^0, where each b_i is either 0 or 1. The i-th bit problem is to discover the value
of b_i of x. The i-th bit is hard if computing it is as difficult as computing the inverse of f′(s). If we had a perfect oracle O_i(g, p, y) which outputs the value of b_i, then the bit is hard if there is a Poly(n)-time algorithm which makes Poly(n) queries to the oracle O_i(g, p, ·) and computes the entire value of s. We know the least significant bit is not hard, because there is a Poly(n)-time algorithm to compute it, namely by computing the Legendre symbol. An imperfect oracle O_i(p, g, ·) is usually defined as an oracle which outputs the correct bit value with probability greater than 1/2 + 1/Poly(n). Some of the most significant bits of x, in fact the O(log n) most significant bits, can be biased, but as we shall see later they do not affect us.
2.4 Blum-Micali Representation
In this paper, we will use the binary representation when we discuss the security of the i-th bit; however, we want to mention another interpretation of the i-th bit. Blum-Micali introduced a particular bit predicate B(x) and showed its hardness: B(x) is 0 if 1 ≤ x ≤ (p − 1)/2, and B(x) is 1 if (p − 1)/2 < x ≤ p − 1. This is sometimes referred to as the most significant bit of x, and it is clearly different from the most significant bit of x under the binary representation. Others [16] have extended the definitions to define the k most significant bits. Often the Blum-Micali representation is used to refer to the most significant bits, while the binary representation is used for the least significant bits. In this paper we will use the binary representation when referring to the i-th bit, unless specified otherwise.
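The two notions of "i-th bit" discussed above can be made concrete; both functions below are direct transcriptions of the definitions in this section:

```python
def ith_bit(x, i):
    """Binary representation: b_i in x = b_n*2^(n-1) + ... + b_1*2^0,
    so i = 1 is the least significant bit."""
    return (x >> (i - 1)) & 1

def blum_micali_bit(x, p):
    """Blum-Micali predicate: 0 if 1 <= x <= (p-1)/2, else 1."""
    return 0 if x <= (p - 1) // 2 else 1
```

For x = 6 = 110 in binary the binary bits read (from the LSB) 0, 1, 1, while the Blum-Micali bit depends only on which half of [1, p − 1] the value falls in.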
3
Individual Hardness of Bits
In this section, we discuss the security of the trailing n − ω(log n) bits, where ω(log n) is as defined earlier. To be precise, we show that except for the least significant bit, all of the n − ω(log n) lower bits are individually hard with respect to the DLSE problem. Based on Definition 3, this amounts to proving that the bits of the discrete logarithm are DLSE-hard. The proof techniques we employ are variations of techniques from [4] and [20]. Let O_i(g, y, p) be a perfect oracle which gives the ith bit (for any i ∈ [2, n − ω(log n)]). Note that i increases from right to left and i = 1 for the least significant bit. Given this oracle, we show that in a polynomial number of steps we can compute the short exponent discrete logarithm. In addition, we prove hardness of individual bits by showing that given an imperfect oracle O_i(g, y, p) with advantage ε in predicting the ith bit (for any i in the prescribed range), we can turn this into an algorithm that computes the discrete logarithm of a short exponent in probabilistic polynomial time by making a polynomial number of queries to this oracle. For the rest of the paper, "lower k bits" will mean the lower k bits excluding the least significant bit, for any k.

Theorem 4. The lower n − ω(log n) bits are individually DLSE-hard.
Sarvar Patel and Ganapathy S. Sundaram
Proof: According to Definition 3, it is enough to show that given O_i(g, y, p) (where g is a generator of the group in question) we can compute log y for all y such that s = log y is a short exponent. In this paper we assume that p − 1 = 2q, where q is an odd integer.

(a) Perfect oracles O_i(g, y, p). We are given g^s and g, and we know in advance that s is small (consisting of ω(log n) bits). Now, computing the least significant bit is always easy, via the Legendre symbol. Hence we compute it and set it to zero. Let i = 2 and suppose we have an oracle for the 2nd bit. If this is a perfect oracle then we discover the second bit. Once this is known, we set it to zero and continue to refer to the new number as g^s. Next we compute the square roots of g^s. The roots are g^{s/2} and g^{s/2 + (p−1)/2}, where we refer to the former as the principal square root. Since the two least significant bits of the exponent of g^s are zero, we know that the principal square root has an exponent with LSB equal to zero (or, equivalently, Legendre symbol one). This allows us to identify the principal square root. Now run the oracle on the principal square root and compute the second least significant bit. This bit is really the third least significant bit of s. Once again, set this bit to zero and repeat the process. Clearly, in poly(n) steps we will have computed s one bit at a time from right to left, given an oracle for the second bit. Now, in general, when we are given the oracle for the ith bit (i > 2), we square g^s i − 2 times. Then the 2nd LSB of s is at the ith position, and we run the oracle to compute this bit; we zero this bit and once again compute square roots. The principal square root corresponds to the root whose exponent has LSB equal to zero. Now the (i + 1)th bit of s can be computed by running the oracle on the principal square root. Continue this process, and in c steps, where c = log s, we will know s.

(b) Imperfect oracles O_i(g, y, p).
Suppose we have an imperfect oracle which succeeds in finding the ith bit on only slightly more than fifty percent of the x ∈ Z_p^*. Then we can concentrate this stochastic advantage and turn the oracle into one which answers any specific instance correctly with arbitrarily high probability. We divide the proof into two parts: (i) the lower 2 ≤ i < n − ω(log n) − O(log n) bits are individually hard; (ii) the middle n − ω(log n) − O(log n) ≤ i ≤ n − ω(log n) bits are individually hard.

(i) Let i = 2 and suppose we have an imperfect oracle for the 2nd bit whose advantage is ε, i.e., the oracle gives the correct answer on more than a 1/2 + ε fraction of the possible inputs (and we do not know which ones). Let {r_j} be a polynomially long sequence of random numbers between 1 and p − 1. We run the oracle on g^{s + r_j}, where the LSB of s is zero. Via the weak law of large numbers [4], a simple count of the majority of 1's and 0's output by the oracle (after neutralizing the effect of the random number) for the second LSB yields this bit with high probability. Now compute the square roots and pick the principal square root as before. Once again repeat the process with a fresh set of random numbers to discover the next bit. In c = log s steps we recover a candidate and verify that g^{candidate} ≡ g^s (mod p). If they are not equal, then the
whole process is repeated. Clearly, in poly(n) steps we will have discovered s one bit at a time from right to left. The details of the proofs are omitted; we refer to [4] or [20] for further details. The only aspect that needs additional mention is the fact that, when we randomize, it is possible that for some r_j, adding it to the exponent makes the exponent exceed p − 1. We refer to this as cycling. Assuming that we pick our random numbers uniformly, we argue that the probability of cycling is negligible because most of the leading bits of the exponent of g^s are zero. Suppose i > 2. Then we square g^s i − 1 times, repeat the above process, and conclude that any oracle which has an advantage will lead to a polynomial time algorithm to compute s. The probability of cycling is still negligible for 2 ≤ i < n − ω(log n) − O(log n), because even in the extreme case when i = n − ω(log n) − O(log n), the chance of cycling is 1/2^{ω(log n)}, or less than one over any polynomial.

(ii) The proof of this step is also similar to the second part of the proof of (i), except that one has to set the initial t bits of s to zero by guessing before starting the randomizing process. Even when i = n − ω(log n) and s has been shifted so that the 2nd least significant bit is in the ith position, the probability of cycling can be bounded by 1/Poly(n) for any polynomial in n. Here t is up to O(log n) bits, and hence the probability of cycling is bounded above by 1/Poly(n); we therefore need to increase the number of queries by an amount corresponding to the drop in advantage due to cycling. Once again the details are omitted for brevity (see [4]) and will be included in an expanded version of this paper.
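The recovery loop of part (a), combined with majority-vote amplification against an imperfect oracle, can be sketched as follows. All parameters are toy values of our choosing, the oracle is simulated by brute force (possible only because p is tiny), and its errors are modeled as independent coin flips, so repeated queries on a fixed instance suffice; the actual proof randomizes each query with g^{r_j} and handles cycling as described above.

```python
import random

random.seed(7)

p, g = 23, 5            # toy prime with p - 1 = 2 * 11, g a generator
s = 13                  # short exponent to be recovered
Y = pow(g, s, p)

def dlog_brute(y):
    x, acc = 0, 1
    while acc != y:
        acc = acc * g % p
        x += 1
    return x

def noisy_2nd_bit(y, eps=0.25):
    """Imperfect oracle: the 2nd bit of log_g(y), correct w.p. 1/2 + eps."""
    b = (dlog_brute(y) >> 1) & 1
    return b if random.random() < 0.5 + eps else 1 - b

def second_bit(y, queries=301):
    # majority vote (weak law of large numbers); error drops exponentially
    votes = sum(noisy_2nd_bit(y) for _ in range(queries))
    return 1 if 2 * votes > queries else 0

def principal_sqrt(y):
    # p ≡ 3 (mod 4): one root is y^((p+1)/4); pick the quadratic residue
    r = pow(y, (p + 1) // 4, p)
    return r if pow(r, (p - 1) // 2, p) == 1 else p - r

bits = [0 if pow(Y, (p - 1) // 2, p) == 1 else 1]   # LSB via Legendre symbol
Y = Y * pow(g, p - 1 - bits[0], p) % p              # zero the LSB
for _ in range(s.bit_length() - 1):
    b = second_bit(Y)
    bits.append(b)
    Y = Y * pow(pow(g, 2 * b, p), p - 2, p) % p     # zero the 2nd bit
    Y = principal_sqrt(Y)                           # shift exponent right

assert sum(b << i for i, b in enumerate(bits)) == s
```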
4
Discrete Logarithm Hides Almost n Bits
In this section we prove the simultaneous hardness of the n − ω(log n) lower bits of the index in modular exponentiation. Intuitively, given a generator g of a finite field of order p and g^x for some x, we show that gaining any information about the trailing n − ω(log n) bits is hard. Here hardness is with respect to the DLSE problem. In other words, for any prime p, given a random generator g and a random element g^x of the finite field, any information on the relevant bits of x can be converted into a poly(n)-time algorithm to solve the DLSE problem. Now, the phrase "gaining any information" is rather vague; this is clarified by the concept of simultaneous security, which is defined below for a generic one-way function.

Definition 5. Let f be a one-way function. A collection of k bits, B^k(x), is said to be simultaneously secure for f if B^k(x) is easy to compute given x, and for every Boolean predicate B, an oracle which computes B(B^k(x)) correctly with probability non-negligibly greater than 1/2, given only f(x), can be used to invert f in Poly(n) time.

In this paper we will be employing a modified notion of simultaneous security, relative to a possibly different one-way function.
Definition 6. Let f and f′ be one-way functions. A k-bit predicate B^k is said to be f′-simultaneously hard if, given f(x), for every non-trivial Boolean predicate B on k bits, an oracle which outputs B(B^k(x)) can be used to invert f′ in polynomial time. If B^k is an f′-hard predicate, then we say the bits of B^k(x) are f′-simultaneously hard.

The above definition, although precise, is not easy to apply when studying simultaneous security. A more workable definition is provided in Definition 7, phrased in the language of the discrete logarithm problem over a prime modulus.

Definition 7. The bits of the exponentiation function g^x mod p at locations j ≤ i ≤ k are DLSE-simultaneously hard if the [j, k] bits of the discrete logarithm of g^x mod p are polynomially indistinguishable from a randomly selected [j, k] bit string for randomly chosen (g, p, g^x mod p). In addition, any polynomial distinguishability will lead to an oracle which solves the DLSE problem in polynomial time.

Once again, directly proving polynomial indistinguishability of a group of bits as above is difficult. But the notion of relative hardness helps alleviate this problem and in fact leads to a test of simultaneous security.

Definition 8. The ith bit, j ≤ i ≤ k, of the function g^x mod p is relatively hard to the right in the interval [j, k] if no polynomial time algorithm can, given a random admissible triplet (g, p, g^x mod p) and in addition the k − i bits of the discrete logarithm of g^x to its right, compute the ith bit of the discrete logarithm of g^x with probability of success greater than 1/2 + 1/poly(n) for any polynomial poly(n), where n = log p.

Based on this definition, we have a test for simultaneous security. The statement of this test is the following fact.

Fact. Definitions 7 and 8 are equivalent.

The proof of this equivalence is implied by the well-known proof of the universality of the next-bit test due to Yao [25].
Now, using this fact and the intractability of the DLSE problem, we show that the trailing n − ω(log n) bits are simultaneously hard.

Theorem 9. The n − ω(log n) trailing bits of g^x mod p are simultaneously hard with respect to the DLSE problem.

Proof: Based on the above fact, it is sufficient to show that every trailing bit of x (given g and g^x) is relatively hard to the right in the interval [2, n − ω(log n)]. Following the definitions and theorem above, we know that in order to show simultaneous security we are allowed to use only a weak oracle: given g^x, to predict the ith bit of x, all the i − 1 trailing bits of the unknown x must also be given to the oracle. Such a weak oracle may not work in general. Assume the theorem is false. Then for some i ∈ [2, n − ω(log n)] there exists an oracle which, when supplied with the trailing i − 1 bits of a generic x, succeeds
in predicting the ith bit of x with advantage ε (where ε is 1/poly(n)). Now pick an element S = g^s where s is a short exponent. We can shift s to the left by squaring S the appropriate number of times. Now all the bits to the right of the ith bit are zero. Since i < n − ω(log n), we can shift s by i − 1 bits to the left without cycling. Recall that by cycling we mean the exponent exceeds p − 1, and hence its remainder modulo p − 1 replaces the exponent. Now the 2nd LSB of s rests on the ith bit, and we can run the oracle repeatedly by multiplying by g^r mod p, where r is a random number between 0 and p − 1. In order to make sure that the probability of cycling is low, we may have to set the t = O(log n) leading bits of s to zero, which we can exhaustively guess, running the algorithm for each guess. Since we will continue to have an advantage ε′ ≥ ε − 1/2^t, we can deduce the bit from the oracle in poly(n) time. We learn the 2nd LSB of s in this manner. We set that bit to zero and take the square root of the number. Of the two roots we should pick the one which is the quadratic residue, because all the lower bits are zero to begin with, and hence the square root should have a zero in the LSB. Now the next bit of s is in the ith position, and we can run the oracle repeatedly to discover this bit, and so on, to recover all the bits of s. At the end of the algorithm we have a candidate, and we can check whether g^{candidate} equals S. If it does, then we stop; otherwise we repeat the algorithm with another guess for the t bits or with different random numbers r. Note that this oracle is very weak, unlike the individual-bit oracle: the oracle here will tell you the ith bit with advantage ε only if you also supply all the i − 1 bits to the right of i. However, we are able to do this because all the bits to the right of the shifted s are known to be zero, since we started with a short exponent.
Now we have shown that for every i such that 2 ≤ i ≤ n − ω(log n) we can use this weak oracle to discover s; thus we have shown the trailing bits to be simultaneously hard, provided the function g^s mod p with s of size ω(log n) is hard to invert.
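The shifting step used in the proof (moving a short exponent left by repeated squaring without cycling) can be illustrated directly; the parameters below are toy values of ours.

```python
p, g = 2579, 2      # toy prime; exponents reduce modulo p - 1 = 2578
s = 11              # short exponent

S = pow(g, s, p)

# Squaring S once doubles the exponent; squaring k times shifts it left by
# k bits, with no cycling as long as (s << k) stays below p - 1.
k = 5
shifted = pow(S, 2 ** k, p)
assert shifted == pow(g, s << k, p)
assert (s << k) < p - 1     # still no wrap-around modulo p - 1
```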
5
Pseudo Random Bit Generator
In this section we provide the details of the new pseudo-random bit generator. In particular, we extend the scheme used by Blum-Micali [4] to extract more bits. This is the same scheme that Long-Wigderson [16] used in their generator, but their output consisted of log n bits per iteration. In our new scheme we produce n − ω(log n) bits per iteration. Recall from Section 2 that the Blum-Micali scheme used a mildly different definition of "bits". We use the same definition of bits as [10], but we do not encounter the difficulties they did in defining the generation scheme, since our exponentiation induces a permutation on Z_p^*.

NEW GENERATOR. Pick a seed x_0 from Z_p^*. Define x_{i+1} = g^{x_i} mod p. At the ith step (i > 0), output the lower n − ω(log n) bits of x_i, except the least significant bit.
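A runnable toy sketch of NEW GENERATOR follows; the parameters are illustrative values of our choosing (a real instantiation uses a prime p of 1024 bits or more), and the least significant bit is dropped because it is computable via the Legendre symbol.

```python
p = 2579                    # toy prime with p - 1 = 2 * 1289, 1289 prime
g = 2                       # a generator of Z_p^* for this p
n = p.bit_length()          # n = 12
w = 4                       # stand-in for omega(log n)

def keystream(x0, steps):
    """x_{i+1} = g^{x_i} mod p; output the lower n - w bits of each x_i,
    dropping the least significant bit."""
    x, out = x0, []
    for _ in range(steps):
        x = pow(g, x, p)
        out.append((x >> 1) & ((1 << (n - w - 1)) - 1))
    return out

stream = keystream(1057, 5)
assert len(stream) == 5
assert all(0 <= b < 1 << (n - w - 1) for b in stream)
```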
5.1
Proof of Security
Suppose A is an ε-distinguisher of the l-long output of our generator (l polynomial in n); then there is an (ε/l)-distinguisher for the output of some ith step. By appropriately running the generator, there is then an (ε/l)-distinguisher for the n − ω(log n) bits output in a single step. According to our definitions in the previous section, due to Yao [25], we can use a distinguisher to create a weak oracle which will tell us the ith bit of s provided we also give it the rightmost i − 1 bits of s. Now we note that we can use this to discover s given g^s mod p, where s has ω(log n) bits: we repeatedly invoke the weak oracle on randomized instances of the form g^s · g^r. Thus we can discover the ith bit in poly(n) time. Using the techniques shown in Theorem 9, we can discover the entire s. So if the output sequence of our generator is ε-distinguishable, then in poly(n) time we can discover the exponent s of our exponentiation function. Assuming it is intractable to invert the function g^s mod p where s has ω(log n) bits (i.e., a short exponent), the output sequence of our generator is polynomially indistinguishable.
6
Conclusion
We have shown that the discrete logarithm mod a prime p hides n − ω(log n) bits by showing the simultaneous hardness of those bits. The hardness in this result is with respect to the discrete logarithm problem with short exponents, i.e., the bits are DLSE-simultaneously hard (as defined in Section 2 of this paper). This allows us to extract n − ω(log n) bits at a time for pseudo-random generation and other applications. As an example, for n of size 1024 bits and s of size 128 bits, this allows us to extract almost 900 bits per exponentiation. Informally speaking, we note that the security of this example is 2^64, since it takes O(2^64) steps for the best known algorithm to crack a modular exponentiation with a 128-bit exponent. Also, if one desires more security at every step, then we can decrease the number of bits extracted at every stage. This generator outputs the maximal number of bits from a single iteration. Extracting any more bits in any iteration leads to a prediction of bits, since we would then be hiding O(log n) or fewer bits, and hence in a polynomial number of guesses we would know the complete exponent in every iteration.
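The arithmetic behind the example can be checked directly; the 2^64 figure corresponds to the square-root running time of generic discrete-log algorithms on a 128-bit exponent.

```python
# parameters from the example in the conclusion
n_bits, s_bits = 1024, 128
assert n_bits - s_bits == 896        # "almost 900" extracted bits per step
assert s_bits // 2 == 64             # generic sqrt-time attacks cost ~2^64
```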
References

1. W. Alexi, B. Chor, O. Goldreich, and C. P. Schnorr, RSA/Rabin bits are 1/2 + 1/poly(log N) secure, Proceedings of 25th FOCS, 449–457, 1984.
2. M. Ben-Or, B. Chor, and A. Shamir, On the cryptographic security of single RSA bits, Proceedings of 15th STOC, 421–430, 1983.
3. L. Blum, M. Blum, and M. Shub, A simple secure pseudo-random number generator, SIAM J. Computing, 15, No. 2:364–383, 1986.
4. M. Blum and S. Micali, How to generate cryptographically strong sequences of pseudo random bits, SIAM J. Computing, 13, No. 4:850–864, 1984.
5. R. B. Boppana and R. Hirschfeld, Pseudorandom generators and complexity classes, Advances in Computing Research, 5 (S. Micali, Ed.), JAI Press, CT.
6. U.S. Department of Commerce / N.I.S.T., Digital Signature Standard, FIPS 186, May 1994.
7. O. Goldreich and L. A. Levin, A hard-core predicate for all one-way functions, Proceedings of 21st STOC, 25–32, 1989.
8. S. Goldwasser and S. Micali, Probabilistic encryption, Journal of Computer and System Sciences, 28:270–299, 1984.
9. J. Hastad, R. Impagliazzo, L. A. Levin, and M. Luby, Construction of a pseudo-random generator from any one-way function, SIAM J. Computing, to appear.
10. J. Hastad, A. W. Schrift, and A. Shamir, The discrete logarithm modulo a composite modulus hides O(n) bits, Journal of Computer and System Sciences, 47:376–404, 1993.
11. R. Impagliazzo, L. A. Levin, and M. Luby, Pseudo-random generation from one-way functions, Proceedings of 20th STOC, 12–24, 1988.
12. B. S. Kaliski, A pseudo-random bit generator based on elliptic logarithms, Advances in Cryptology - CRYPTO '86 (LNCS 263), 84–103, 1987.
13. J. Kilian, S. Micali, and R. Ostrovsky, Minimum resource zero-knowledge proofs, Proceedings of 30th FOCS, 474–489, 1989.
14. D. E. Knuth, The Art of Computer Programming (vol. 3): Sorting and Searching, Addison-Wesley, 1973.
15. N. Koblitz, Elliptic curve cryptosystems, Mathematics of Computation, 48:203–209, 1987.
16. D. L. Long and A. Wigderson, The discrete log hides O(log n) bits, SIAM J. Computing, 17:363–372, 1988.
17. V. Miller, Elliptic curves and cryptography, Advances in Cryptology - CRYPTO '85 (LNCS 218), 417–426, 1986.
18. M. Naor, Bit commitment using pseudo-randomness, Advances in Cryptology - CRYPTO '89 (LNCS 435), 128–136, 1989.
19. P. van Oorschot and M. Wiener, On Diffie-Hellman key agreement with short exponents, Advances in Cryptology - EUROCRYPT '96 (LNCS 1070), 332–343, 1996.
20. R. Peralta, Simultaneous security of bits in the discrete log, Advances in Cryptology - EUROCRYPT '85 (LNCS 219), 62–72, 1986.
21. S. C. Pohlig and M. E. Hellman, An improved algorithm for computing logarithms over GF(p) and its cryptographic significance, IEEE Trans. Information Theory, 24:106–110, 1978.
22. J. M. Pollard, Monte Carlo methods for index computation (mod p), Mathematics of Computation, 32, No. 143:918–924, 1978.
24. U. V. Vazirani and V. V. Vazirani, Efficient and secure pseudo-random number generators, Proceedings of 25th FOCS, 458–463, 1984.
25. A. C. Yao, Theory and applications of trapdoor functions, Proceedings of 23rd FOCS, 80–91, 1982.
7
Appendix
In this section we discuss some extensions of our results which will be addressed in the future.

7.1
Improving Efficiency of Computations
Let us focus on the mechanics of the generator. We start with a finite field and a generator g of its multiplicative cyclic group. Let x_0 be a secret seed. Then we define x_{i+1} = g^{x_i} mod p iteratively. The output of the generator is the trailing n − ω(log n) bits of x_i for all i > 0, where n = log p. Although the number of bits generated per iteration is large, each iteration involves a large exponent, and this could impact the speed of the generator. Instead, we could start with p, g, and x_0 as before, but at each stage define x_{i+1} = g^{s_i}, where s_i is the leading ω(log n) bits of x_i. This ensures that at each stage we are using short exponents, and hence guarantees a significant speed-up. This raises some interesting questions.

Question 10. Will this speed-up impact the security of the generator?

Note that when we restrict our exponents we no longer have a permutation. Hence the simple construction used here is inapplicable. A possible method of settling this problem is outlined by Hastad et al. in the context of discrete logarithms over composite moduli [10]. In particular, exploiting a certain hashing lemma proved in [11], they construct a perfect extender, and pseudo-random generation is achieved through repeated applications of the extender to a random seed.

Question 11. Are there efficient extenders which guarantee the same level of security (as the DLSE) but perform a short exponent exponentiation at each step?

7.2
Discrete Logarithms in Abelian Groups
Let G be a finite Abelian group. Let g ∈ G and let y = g^x (where x is unknown and we use multiplicative notation for the group operation). The discrete logarithm problem in the subgroup generated by g asks for the value of x given g and y. In this context, Kaliski [12] has shown that, under the intractability assumption for the discrete log in the subgroup generated by g, the individual bits of x are hard. In that paper the Blum-Micali notion of bits is employed, and the proof of individual hardness is based on a novel oracle proof technique. The main idea is that the identification of bits is based on a correlation function which automatically accommodates cycling and changes in bits due to randomization. In addition, he completely avoids the computation of square roots, which is central to several of the other works on individual bit security. This paper also
states that log n bits are simultaneously hard. Presumably the techniques of Long-Wigderson, once applied in the framework of generic Abelian groups, yield this result. Now, we note that, assuming the discrete logarithm problem with short exponents is also hard in the chosen Abelian group, our results on the simultaneous hardness of the trailing bits may be applicable. This result would be very useful when applied to the group of points on an elliptic curve over a finite field.

7.3
Discrete Logarithms in Small Subgroups
The security of the digital signature standard (DSS) is based on the intractability of the discrete logarithm in small subgroups (DLSS). This leads to a natural question:

Question 12. Are there k-bit predicates attached to the input of the discrete exponentiation function that are simultaneously hard with respect to DLSS? In particular, is k = n − ω(log n)?
Fast RSA-Type Cryptosystem Modulo p^k q

Tsuyoshi Takagi
NTT Software Laboratories
3-9-11, Midori-cho, Musashino-shi, Tokyo 180-0012, Japan
[email protected]
Abstract. We propose a cryptosystem modulo p^k q based on the RSA cryptosystem. We choose an appropriate modulus p^k q which resists two of the fastest factoring algorithms, namely the number field sieve and the elliptic curve method. We also apply the fast decryption algorithm modulo p^k proposed in [22]. The decryption process of the proposed cryptosystem is faster than that of the RSA cryptosystem using the Chinese remainder theorem, known as the Quisquater-Couvreur method [17]. For example, if we choose the 768-bit modulus p^2 q for 256-bit primes p and q, then the decryption process of the proposed cryptosystem is about 3 times faster than that of the RSA cryptosystem using the Quisquater-Couvreur method.

Keywords: RSA cryptosystem, Quisquater-Couvreur method, fast decryption, factoring algorithm
1
Introduction
The RSA cryptosystem is one of the most practical public key cryptosystems and is used throughout the world [19]. Let n be a public key, which is the product of two appropriate primes, e be an encryption key, and d be a decryption key. The encryption and decryption algorithms consist of exponentiation to the eth and dth powers modulo n, respectively. We can make e small, but must consider low exponent attacks [3,4,6]. The encryption process takes little computation and is fast. On the other hand, the decryption key d must have more than one fourth the number of bits of the public key n to preclude Wiener's attack [24] and its extension [23]. Therefore, the cost of the decryption process is dominant for the RSA cryptosystem.

In this paper, we propose an RSA-type cryptosystem modulo n = p^k q. Even though the modulus is not of the form pq, we choose appropriate sizes for the secret primes p and q to preclude both the number field sieve and the elliptic curve method. Using this modulus p^k q, we construct a fast decryption public-key cryptosystem. In the key generation, we generate the public key e and secret key d using the relation ed ≡ 1 (mod L), where L = LCM(p − 1, q − 1). Note that L is not the same as φ(n) = p^{k−1}(p − 1)(q − 1), or even λ(n) = LCM(p^{k−1}(p − 1), q − 1). Thus, the secret exponent d becomes much smaller than n = p^k q. Moreover, for decrypting M_p ≡ M (mod p^k) we show that it is possible to apply the fast decryption algorithm proposed in [22]. The running time for computing M_p is

H. Krawczyk (Ed.): CRYPTO'98, LNCS 1462, pp. 318–326, 1998.
© Springer-Verlag Berlin Heidelberg 1998
essentially equivalent to that of computing C^d (mod p). Therefore, the decryption process is much faster than in the RSA cryptosystem using the Chinese remainder theorem [17]. The paper is organized as follows. In Section 2, we describe the algorithm of the proposed cryptosystem. We discuss the size of the secret primes which prevents the use of both the number field sieve and the elliptic curve method in Section 3. Then, we show the running time of the proposed cryptosystem in comparison with the RSA cryptosystem using the Quisquater-Couvreur method in Section 4. We explain the effectiveness of Wiener's attack in Section 5. We show some properties of our cryptosystem related to some attacks in Section 6.

Notation: Z is the ring of integers. Z_n is the residue ring Z/nZ, and its complete residue class is {0, 1, 2, . . . , n − 1}. Z_n^× is the reduced residue group modulo n. LCM(m_1, m_2) is the least common multiple of m_1 and m_2. GCD(m_1, m_2) is the greatest common divisor of m_1 and m_2.
2
Proposed Public-Key Cryptosystem
In this section, we describe an RSA-type cryptosystem modulo p^k q, and discuss the size of its secret keys and its running time.

2.1
Algorithm
1. Generation of the keys: Generate two random primes p, q, and let n = p^k q. Compute L = LCM(p − 1, q − 1), and find e, d which satisfy ed ≡ 1 (mod L) and GCD(e, p) = 1. Then e, n are the public keys, and d, p, q are the secret keys.

2. Encryption: Let M ∈ Z_n^× be the plaintext. We encrypt the plaintext by the equation:

C ≡ M^e (mod n).   (1)
3. Decryption: We decrypt M_p ≡ M (mod p^k) and M_q ≡ M (mod q) using the secret keys d, p, q. The plaintext M can then be recovered by the Chinese remainder theorem. Here, M_q is computed as M_q ≡ C^d (mod q), and M_p is computed by the fast algorithm described in [22].

2.2
Details of the Decryption Algorithm
The order of the group Z_{p^k}^× is p^{k−1}(p − 1). When M_p ≡ M (mod p^k) is recovered using the standard RSA algorithm, we have to compute M_p ≡ C^d (mod p^k) for d ≡ e^{−1} (mod LCM(p^{k−1}(p − 1), q − 1)). The running time is then slower than that of the method using the Chinese remainder theorem for n = pq [17], so there is no significant advantage in using the modulus p^k q. Instead, we apply the method described in [22], where the author presents a fast algorithm for computing RSA decryption modulo n^k using n-adic expansion. Then, the
running time for computing M_p becomes essentially equivalent to computing M_p ≡ C^d (mod p) for d ≡ e^{−1} (mod LCM(p − 1, q − 1)). First, we modify the algorithm into a more efficient form. We denote the ciphertext reduced modulo p^k by C_p. Then the relationship between the ciphertext C_p and the plaintext is C_p ≡ M_p^e (mod p^k). Note that M_p, the plaintext modulo p^k, has the p-adic expansion

M_p ≡ K_0 + pK_1 + p^2 K_2 + . . . + p^{k−1} K_{k−1} (mod p^k).   (2)

Here, we define the function F_i(X_0, X_1, . . . , X_i) as follows:

F_i(X_0, X_1, . . . , X_i) = (X_0 + pX_1 + . . . + p^i X_i)^e,

where i = 0, 1, . . . , k − 1. F_{k−1}(X_0, X_1, . . . , X_{k−1}) = (X_0 + pX_1 + . . . + p^{k−1} X_{k−1})^e is the same as the function that encrypts the plaintext M_p in equation (2). By reducing modulo p^{i+1}, we get the relationship

F_i(X_0, X_1, . . . , X_i) ≡ F_{i−1} + p^i G_{i−1} X_i (mod p^{i+1}),

where F_{i−1} = (X_0 + pX_1 + . . . + p^{i−1} X_{i−1})^e and G_{i−1} = e(X_0 + pX_1 + . . . + p^{i−1} X_{i−1})^{e−1}, for i = 1, 2, . . . , k − 1. From this relationship, we can recursively calculate K_1, . . . , K_{k−1}. For i = 1, K_1 is the solution of the following linear equation in X_1:

C ≡ F_0(K_0) + pG_0(K_0) X_1 (mod p^2).   (3)

Assume we have already calculated K_1, K_2, . . . , K_{i−1}. Using these values, we compute F_{i−1}(K_0, K_1, . . . , K_{i−1}) and G_{i−1}(K_0, K_1, . . . , K_{i−1}) in Z, and denote them by F_{i−1} and G_{i−1}, respectively. Then K_i is the solution of the following linear equation in X_i:

C ≡ F_{i−1} + p^i G_{i−1} X_i (mod p^{i+1}).   (4)

Note that GCD(G_{i−1}, p) = 1, because GCD(K_0, p) = GCD(e, p) = 1, so we can uniquely recover K_i. After computing K_0, K_1, . . . , K_{k−1}, we can evaluate M_p (mod p^k) from equation (2). Finally, the plaintext M (mod p^k q) is computed from the values M_p (mod p^k) and M_q (mod q) by the Chinese remainder theorem. Moreover, note that we do not have to use the secret exponent d for evaluating K_1, K_2, . . . , K_{k−1}. Thus, when we compute the two values K_0 ≡ C^d (mod p) and M_q ≡ C^d (mod q), the secret exponent d can be reduced modulo p − 1 and q − 1. Indeed, C^d ≡ C^{d_p} (mod p) and C^d ≡ C^{d_q} (mod q) hold, where d_p ≡ d (mod p − 1) and d_q ≡ d (mod q − 1). In Appendix A, we describe the decryption program written in pseudo-code. For x ∈ Z and a positive integer N, [x]_N denotes the remainder of x modulo N, which is in {0, 1, . . . , N − 1}.
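The decryption procedure can be sketched end to end for k = 2; the parameters below are toy values of our choosing (real primes are at least 256 bits), and the sketch follows equations (2) and (3) above. `pow(x, -1, m)` (modular inverse) requires Python 3.8+.

```python
from math import gcd

# Toy instance of the proposed cryptosystem for k = 2 (illustrative only).
p, q, k, e = 11, 7, 2, 13
n = p**k * q                                  # n = p^2 q = 847
L = (p - 1) * (q - 1) // gcd(p - 1, q - 1)    # LCM(p - 1, q - 1) = 30
assert gcd(e, L) == 1 and gcd(e, p) == 1
d = pow(e, -1, L)                             # ed ≡ 1 (mod L)

def encrypt(M):
    return pow(M, e, n)

def decrypt(C):
    Mq = pow(C, d % (q - 1), q)       # M mod q, with d reduced mod q - 1
    K0 = pow(C, d % (p - 1), p)       # K_0 = M mod p, with d reduced mod p - 1
    # Lift: solve C ≡ K0^e + p * e * K0^(e-1) * K1 (mod p^2) for K1, as in (3).
    F0 = pow(K0, e, p * p)
    G0 = e * pow(K0, e - 1, p) % p
    K1 = (C - F0) // p % p * pow(G0, -1, p) % p
    Mp = (K0 + p * K1) % (p * p)      # M mod p^2 via the p-adic expansion (2)
    # Chinese remainder theorem on (Mp mod p^2, Mq mod q).
    u = pow(p * p, -1, q)
    return (Mp + p * p * ((Mq - Mp) * u % q)) % n

for M in (2, 123, 500, 846):
    if gcd(M, n) == 1:
        assert decrypt(encrypt(M)) == M
```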
3
Size of Secret Parameters
Here, we discuss the size of the secret parameters p and q. The RSA cryptosystem uses a composite number of the symmetric form pq, where p and q are of the same
bit size. The cryptosystem proposed in this paper depends on the security of factoring the modulus p^k q. We have to carefully choose the sizes of p and q. There are two types of fast factoring algorithm to consider: the number field sieve [11] and the elliptic curve method [10]. Other factoring algorithms have the same or slower running times, so the size of the RSA modulus can be estimated by these two factoring algorithms [7,13,20]. Let L_N[s, c] = exp((c + o(1)) log^s(N)(log log N)^{1−s}). The number field sieve is the fastest factoring algorithm, and its running time is estimated from the total bit size of the integer n to be factored; it is expected to be L_n[1/3, (64/9)^{1/3}]. If we choose n to be larger than 768 bits, the number field sieve becomes infeasible. In our case, we have to make the modulus n = p^k q larger than 768 bits. The elliptic curve method is effective for finding primes which are divisors of the integer n to be factored. Its running time is estimated in terms of the bit size of the prime divisor p; its expected value is L_p[1/2, 2^{1/2}]. Note that the running time of the elliptic curve method is different from that of the number field sieve, and the order is much different. If we choose p to be larger than 256 bits, the elliptic curve method becomes infeasible. In our case, we have to make the primes p and q of the modulus larger than 256 bits. The performance of a factoring algorithm strongly depends on the implementation. To my knowledge, the fastest implementation record for the number field sieve factored a 130-digit RSA modulus [5], and that for the elliptic curve method found a 48-digit prime factor [8]. Here, we again emphasize that there is a big difference in cost between the number field sieve and the elliptic curve method. Therefore, if we choose the 768-bit modulus p^2 q with 256-bit primes p and q, neither of the factoring algorithms is feasible, so the scheme is secure for cryptographic purposes.
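The two cost formulas can be evaluated numerically; the sketch below drops the o(1) term, so the values are only indicative, and the function and variable names are ours.

```python
from math import exp, log

def L(bits, s, c):
    """Indicative subexponential cost L_N[s, c] with the o(1) term dropped:
    exp(c * (ln N)^s * (ln ln N)^(1 - s)) for an N of the given bit size."""
    ln_n = bits * log(2)
    return exp(c * ln_n**s * log(ln_n)**(1 - s))

# Number field sieve: cost driven by the total size of n = p^2 q (768 bits).
nfs_cost = L(768, 1 / 3, (64 / 9) ** (1 / 3))
# Elliptic curve method: cost driven by the size of the prime divisor p (256 bits).
ecm_cost = L(256, 1 / 2, 2 ** 0.5)

# For these sizes both costs are large, and the NFS bound dominates.
assert nfs_cost > ecm_cost > 2 ** 50
```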
But the size of the secret primes must be thoroughly discussed for practical usage of our proposed cryptosystem, and this is work in progress. Here, we ask whether there exist factoring algorithms specific to a modulus with a square factor p^2 q. This factoring problem appeared in the list of open problems in number-theoretic complexity by Adleman and McCurley [1], and it is unknown whether there exists an L_p[1/3]-type sub-exponential algorithm which finds the prime factors of the composite number p^2 q. Recently, Peralta and Okamoto proposed a factoring algorithm for numbers of the form p^2 q based on the elliptic curve method [16]. They used the fact that the Jacobi symbol is equal to one for a square integer, and the running time becomes a little faster than that of the original elliptic curve method.

Remark 1. A digital signature scheme [14] and two public-key cryptosystems [9,15] which rely on the difficulty of factoring numbers of the type p^2 q have been proposed. These cryptosystems are fast and practical. For secure usage of these cryptosystems and our proposed cryptosystem, research on factoring algorithms for composite numbers with a square factor is desirable.
Tsuyoshi Takagi

4 Running Time
In this section, we estimate the running time of the proposed cryptosystem. We assume in the following that the public modulus n = p^2 q is 768 bits with 256-bit primes p and q. We also assume the running time for computing Z^a (mod b) is O(log_2^2(b) log_2(a)). Below, we estimate the worst-case running time. In the decryption process of the proposed cryptosystem, the algorithm does not depend on the secret exponent d except when we compute

C^d (mod p),  C^d (mod q).  (5)
After calculating C^d (mod p), we compute only a few multiplications to obtain M_p ≡ M (mod p^k). This costs about the same as the encryption process. If we choose a very small e, this algorithm is very efficient. For example, if the modulus is p^2 q, then we only compute at most ⌊log_2 e⌋ multiplications modulo p^2, one division by p, two multiplications modulo p, and one inversion modulo p. Moreover, when we compute the two values of equation (5), the secret exponent d can be reduced modulo p − 1 and q − 1. In other words, C^d ≡ C^{d_p} (mod p) and C^d ≡ C^{d_q} (mod q) hold, where d_p ≡ d (mod p − 1) and d_q ≡ d (mod q − 1). Thus, the size of the secret exponent can be reduced.

Denote by T the running time of the decryption algorithm of the original RSA cryptosystem, i.e., C^{d'} (mod n), where d' is as large as n. Then the running time of the proposed cryptosystem for a 768-bit modulus is about (2(1/3)^3 + α_e)T = (0.074 + α_e)T, where α_e depends only on the encryption exponent e. When we make the encryption exponent e very small, α_e becomes negligible. A similar decryption algorithm for the RSA cryptosystem using the Chinese remainder theorem, the Quisquater-Couvreur method, mainly computes C^d (mod p) and C^d (mod q), where n = pq is the RSA modulus, both p and q are as large as (log_2 n)/2 bits, and we assume d is as large as p and q. So, the Quisquater-Couvreur method is about 4 times faster than the original RSA cryptosystem. Here, we compare the running time of our proposed cryptosystem with that of the Quisquater-Couvreur method. The comparison is carried out based on a common bit length of the modulus. The proposed cryptosystem with a small encryption exponent e is about 3 times faster than the RSA cryptosystem applying the Quisquater-Couvreur method for a 768-bit modulus.

In addition, consider the RSA cryptosystem with the square-free modulus n = p_1 p_2 ··· p_l, where we assume that the p_i are as large as (log_2 n)/l bits for i = 1, 2, ..., l. As we discussed in Section 3, we can use a 768-bit modulus n = p_1 p_2 p_3 with 256-bit primes p_i (i = 1, 2, 3) for cryptographic purposes. This version of RSA will be faster when we use the decryption technique based on the Chinese remainder theorem. Indeed, the decryption time with this modulus is dominated by computing C^{d_i} (mod p_i), where we assume the d_i are as large as p_i for i = 1, 2, 3. So, the running time of this RSA variant is about 9 times faster than the original RSA cryptosystem. Here, we compare this RSA variant with
our proposed cryptosystem. Our proposed cryptosystem is about 1.5 times faster for a 768-bit modulus.
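All of the speedups discussed in this section come from exponentiating with exponents reduced modulo p − 1 and q − 1 and recombining the results by the Chinese remainder theorem. A minimal sketch of this idea for ordinary two-prime RSA (the Quisquater-Couvreur method); the primes here are toy values, far too small for real use:

```python
# Toy illustration of Quisquater-Couvreur CRT decryption for ordinary RSA
# with n = pq; parameters are illustrative only.
from math import lcm

p, q = 10007, 10009
n = p * q
e = 65537
d = pow(e, -1, lcm(p - 1, q - 1))

def decrypt_plain(C):
    return pow(C, d, n)                # one full-size exponentiation

def decrypt_crt(C):
    dp, dq = d % (p - 1), d % (q - 1)  # reduced exponents
    mp, mq = pow(C, dp, p), pow(C, dq, q)
    # recombine the two half-size results by the Chinese remainder theorem
    return (mp * q * pow(q, -1, p) + mq * p * pow(p, -1, q)) % n

M = 1234567
C = pow(M, e, n)
assert decrypt_plain(C) == decrypt_crt(C) == M
```

The two half-size exponentiations cost roughly (1/2)^2 · (1/2) = 1/8 of the full one each, which is the source of the "about 4 times faster" figure quoted above.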
5 Short Secret Exponent d
A short secret exponent is desirable for a fast decryption algorithm. However, Wiener reported an attack based on the continued fraction algorithm which detects a short secret exponent d [24]. This attack is effective for d < n^{1/4}. The secret key d and the public key e of the proposed cryptosystem satisfy ed ≡ 1 (mod LCM(p − 1, q − 1)), and the primes p and q are much smaller than n. So we must ask whether Wiener's attack is applicable to larger secret exponents d. Moreover, if the attacker can compute d' such that

ed' ≡ 1 (mod LCM(p^{k−1}(p − 1), q − 1)),  (6)

then the proposed cryptosystem will also be broken.

Here, we discuss Wiener's attack for relation (6). From LCM(p^{k−1}(p − 1), q − 1) = p^{k−1}(p − 1)(q − 1)/GCD(p^{k−1}(p − 1), q − 1), we have ed' = 1 + m p^{k−1}(p − 1)(q − 1)/GCD(p^{k−1}(p − 1), q − 1) for some integer m. Generally, GCD(p^{k−1}(p − 1), q − 1) is very small compared with p and q. Let m/GCD(p^{k−1}(p − 1), q − 1) = h/g, where GCD(h, g) = 1. Then we get the relation

h/(gd') − e/(p^k q) = δ',  (7)

where δ' = (h/(gd')) · (p^k + p^{k−1}q − p^{k−1} − g/h)/(p^k q). From h/(gd') ≤ 1, the upper bound of δ' is of the size n^{−1/(k+1)}. It is known that for a rational number x such that |x − P/Q| < 1/(2Q^2), P/Q is a convergent in the continued fraction expansion of x, where P and Q are relatively prime integers. Therefore, if n^{−1/(k+1)} < 1/(2(gd')^2) holds, then Wiener's attack is applicable by computing the continued fraction expansion of e/(p^k q). Hence Wiener's attack is effective for d' < n^{1/(2(k+1))}. During key generation one must ensure that d' ≡ e^{−1} (mod LCM(p^{k−1}(p − 1), q − 1)) is sufficiently large.

In the same manner, we can discuss Wiener's attack for the relation ed ≡ 1 (mod LCM(p − 1, q − 1)). In this case, we get the relation

h/(gdp^{k−1}) − e/(p^k q) = δ,  (8)

where δ = (h/(gd)) · (p + q − 1 − g/h)/(p^k q). The lower bound on δ is of the size 1/(gd n^{k/(k+1)}), which is larger than the upper bound 1/(2(gdp^{k−1})^2) ∼ 1/(2(gd n^{(k−1)/(k+1)})^2) that the continued fraction can detect. So, Wiener's attack seems infeasible for the relation ed ≡ 1 (mod LCM(p − 1, q − 1)). Further work on this is in progress.
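For the classical case k = 1 (ordinary RSA with n = pq), Wiener's continued-fraction attack is easy to demonstrate. The following sketch uses illustrative toy primes and a deliberately short secret exponent below the n^{1/4}/3 bound; the helper names are ours, not the paper's:

```python
# Toy demonstration of Wiener's continued-fraction attack on ordinary RSA
# (n = pq) with a deliberately short secret exponent d < n^(1/4)/3.
p, q = 104729, 104723
n, phi = p * q, (p - 1) * (q - 1)
d = 71                       # short secret exponent (well below n^(1/4)/3)
e = pow(d, -1, phi)          # matching public exponent

def convergents(num, den):
    # yield the continued fraction convergents h_i/k_i of num/den
    h0, h1, k0, k1 = 0, 1, 1, 0
    while den:
        a, (num, den) = num // den, (den, num % den)
        h0, h1 = h1, a * h1 + h0
        k0, k1 = k1, a * k1 + k0
        yield h1, k1

def wiener(e, n):
    # the fraction t/d hides among the convergents of e/n; test each denominator
    for t, cand in convergents(e, n):
        if t and cand and pow(pow(2, e, n), cand, n) == 2:
            return cand
    return None

assert wiener(e, n) == d
```

The attack in Section 5 is the same computation applied to the continued fraction expansion of e/(p^k q).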
6 Other Properties
In this section, we describe some attacks against our proposed cryptosystem and some other properties of it.

Permutation: Let S be a finite set, and let F(x) be a function from S to S. The function F(x) is called a permutation function if every pair x, y ∈ S that satisfies F(x) = F(y) also satisfies x = y. The encryption function must be a permutation function in order to have unique decryption. The encryption function of the proposed cryptosystem is F(X) ≡ X^e (mod p^k q). This function is a permutation function if and only if GCD(p − 1, e) = GCD(q − 1, e) = GCD(p, e) = 1. The last condition is always satisfied for small e, so this condition becomes the same as that for the original RSA cryptosystem.

Message concealing: A function F(x) is called unconcealed when F(x) = x holds for some x. If the encryption function is unconcealed, some plaintexts are not encrypted. Blakley and Borosh showed that the encryption function of the RSA cryptosystem is always unconcealed [2]. They also estimated the number of unconcealed messages for a modulus of the form p^k q, and proved N = (1 + GCD(e − 1, p^{k−1}(p − 1)))(1 + GCD(e − 1, q − 1)). This number is negligible because we choose e to be small in our proposed cryptosystem.

Cycling attack: The cycling attack is to find an integer s such that C^{e^s} ≡ C (mod p^k q) [12,25]. If we find such an integer, then the modulus p^k q can be factored with probability greater than 1/2. From a recent result by Rivest and Silverman, it is known that the success probability of the cycling attack is negligible [20]. This analysis also holds for our proposed cryptosystem, because p and q must be chosen to be primes of more than 256 bits. Here, denote by ord_m(Q) the order of the element Q in the multiplicative group Z_m^* for some integer m; then ord_{ord_n(C)}(e) | s holds. Note that ord_m(Q) | ord_n(Q) for m | n and Q in Z_n^*.
The probability that p | ord_{p^k}(Q) for a random element Q in Z_{p^k}^* is 1 − 1/p, so p | ord_n(C) holds for a random ciphertext C in Z_n^* with high probability, and ord_p(e) is typically divisible by the largest prime factor of p − 1, which is more than 50 bits with high probability. Therefore, the integer s is greater than 50 bits with high probability.

Other attacks: All other attacks are applicable, for example, the low exponent attacks [3,4,6], the common modulus attack, and the chosen message attack (see, for example, [7,13]).

Digital signature: Of course, the proposed algorithm can be used for a digital signature.^1 The prominent property of our proposed cryptosystem is the running time for generating the signature, which is faster than that of the RSA cryptosystem using the Chinese remainder theorem.

Rabin-type cryptosystem: We can construct a Rabin-type cryptosystem by applying the algorithm proposed in this paper. We can also prove that the extended Rabin-type cryptosystem is as intractable as factoring the modulus p^k q.

^1 Shamir proposed a variation of the RSA cryptosystem with an unbalanced modulus [21]. As he stated in the paper, Shamir's RSA cannot be used for digital signatures.
Acknowledgments I wish to thank Shozo Naito for his helpful discussion. I would also like to thank the anonymous referees for their valuable comments.
References

1. L. M. Adleman and K. S. McCurley, "Open problems in number theoretic complexity, II," Proceedings of ANTS-I, LNCS 877, (1994), pp. 291-322.
2. G. R. Blakley and I. Borosh, "Rivest-Shamir-Adleman public key cryptosystems do not always conceal messages," Comput. & Maths. with Appls., 5, (1979), pp. 169-178.
3. D. Coppersmith, M. Franklin, J. Patarin and M. Reiter, "Low-exponent RSA with related messages," Advances in Cryptology – EUROCRYPT '96, LNCS 1070, (1996), pp. 1-9.
4. D. Coppersmith, "Finding a small root of a univariate modular equation," Advances in Cryptology – EUROCRYPT '96, LNCS 1070, (1996), pp. 155-165.
5. J. Cowie, B. Dodson, R. Elkenbracht-Huizing, A. K. Lenstra, P. L. Montgomery, and J. Zayer, "A world wide number field sieve factoring record: on to 512 bits," Advances in Cryptology – ASIACRYPT '96, LNCS 1163, (1996), pp. 382-394.
6. J. Håstad, "Solving simultaneous modular equations of low degree," SIAM Journal on Computing, 17, (1988), pp. 336-341.
7. B. S. Kaliski Jr. and M. Robshaw, "Secure use of RSA," CryptoBytes, 1 (3), (1995), pp. 7-13.
8. ECMNET Project; http://www.loria.fr/ zimmerma/records/ecmnet.html
9. D. Hühnlein, M. J. Jacobson, S. Paulus, and T. Takagi, "A cryptosystem based on non-maximal imaginary quadratic orders with fast decryption," Advances in Cryptology – EUROCRYPT '98, LNCS 1403, (1998), pp. 294-307.
10. H. W. Lenstra, Jr., "Factoring integers with elliptic curves," Annals of Mathematics, 126, (1987), pp. 649-673.
11. A. K. Lenstra and H. W. Lenstra, Jr. (Eds.), "The development of the number field sieve," Lecture Notes in Mathematics, 1554, Springer, (1991).
12. U. M. Maurer, "Fast generation of prime numbers and secure public-key cryptographic parameters," Journal of Cryptology, Vol. 8, (1995), pp. 123-155.
13. A. J. Menezes, P. C. van Oorschot and S. A. Vanstone, "Handbook of Applied Cryptography," CRC Press, (1996).
14. T. Okamoto, "A fast signature scheme based on congruential polynomial operations," IEEE Transactions on Information Theory, IT-36, (1990), pp. 47-53.
15. T. Okamoto and S. Uchiyama, "A new public-key cryptosystem as secure as factoring," Advances in Cryptology – EUROCRYPT '98, LNCS 1403, (1998), pp. 308-318.
16. R. Peralta and E. Okamoto, "Faster factoring of integers of a special form," IEICE Trans. Fundamentals, Vol. E79-A, No. 4, (1996), pp. 489-493.
17. J.-J. Quisquater and C. Couvreur, "Fast decipherment algorithm for RSA public-key cryptosystem," Electronics Letters, 18, (1982), pp. 905-907.
18. M. O. Rabin, "Digitalized signatures and public-key functions as intractable as factorization," Technical Report No. 212, MIT Laboratory for Computer Science, Cambridge, (1979), pp. 1-16.
19. R. Rivest, A. Shamir and L. M. Adleman, "A method for obtaining digital signatures and public-key cryptosystems," Communications of the ACM, 21 (2), (1978), pp. 120-126.
20. R. Rivest and R. D. Silverman, "Are 'strong' primes needed for RSA," The 1997 RSA Laboratories Seminar Series, Seminar Proceedings, (1997).
21. A. Shamir, "RSA for paranoids," CryptoBytes, 1, Autumn, (1995), pp. 1-4.
22. T. Takagi, "Fast RSA-type cryptosystem using n-adic expansion," Advances in Cryptology – CRYPTO '97, LNCS 1294, (1997), pp. 372-384.
23. E. R. Verheul and H. C. A. van Tilborg, "Cryptanalysis of 'less short' RSA secret exponents," Applicable Algebra in Engineering, Communication and Computing, 8, (1997), pp. 425-435.
24. M. J. Wiener, "Cryptanalysis of short RSA secret exponents," IEEE Transactions on Information Theory, IT-36, (1990), pp. 553-558.
25. H. C. Williams and B. Schmid, "Some remarks concerning the M.I.T. public-key cryptosystem," BIT, 19, (1979), pp. 525-538.
A Decryption Algorithm
In this appendix, we describe the decryption program written in pidgin ALGOL. For x ∈ Z and a positive integer N, [x]_N denotes the remainder of x modulo N, which lies in {0, 1, ..., N − 1}. The plaintext M is encrypted by C ≡ M^e (mod p^k q). The relation between the encryption exponent e and the decryption exponent d is ed ≡ 1 (mod LCM(p − 1, q − 1)).

procedure DECRYPTION:
INPUT: d, p, q, e, k, C
OUTPUT: M
(1) d_p := [d]_{p−1}; d_q := [d]_{q−1};
(2) K_0 := [C^{d_p}]_p; M_q := [C^{d_q}]_q;
(3) A_0 := K_0;
    FOR i = 1 TO (k − 1) DO
        F_i := [A_{i−1}^e]_{p^{i+1}};
        E_i := [C − F_i]_{p^{i+1}};
        B_i := E_i / p^i in Z;
        K_i := [(e F_i)^{−1} A_{i−1} B_i]_p;
        A_i := A_{i−1} + p^i K_i in Z;
(4) M_p := A_{k−1};
(5) p_1 := [(p^k)^{−1}]_q; q_1 := [q^{−1}]_{p^k};
(6) M := [q_1 q M_p + p_1 p^k M_q]_{p^k q}.
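A direct transcription of the procedure above into Python; the parameters below are toy values of our own choosing, and the sketch assumes gcd(M, p) = 1 so that the inverse in step (3) exists:

```python
def rsa_pkq_decrypt(d, p, q, e, k, C):
    """Python transcription of procedure DECRYPTION (toy parameters only)."""
    dp, dq = d % (p - 1), d % (q - 1)                 # step (1)
    A = pow(C, dp, p)                                 # step (2): K0 = M mod p
    Mq = pow(C, dq, q)                                #           Mq = M mod q
    for i in range(1, k):                             # step (3): lift mod p^(i+1)
        F = pow(A, e, p ** (i + 1))
        E = (C - F) % p ** (i + 1)
        B = E // p ** i                               # exact division in Z
        K = pow(e * F, -1, p) * A * B % p
        A = A + p ** i * K                            # now A = M mod p^(i+1)
    Mp = A                                            # step (4)
    p1, q1 = pow(p ** k, -1, q), pow(q, -1, p ** k)   # step (5)
    return (q1 * q * Mp + p1 * p ** k * Mq) % (p ** k * q)   # step (6)

# toy check with k = 2: e coprime to p - 1, q - 1 and p; M coprime to p
p, q, k, e = 101, 61, 2, 7
d = pow(e, -1, 300)          # LCM(p - 1, q - 1) = 300, so d = 43
n = p ** k * q
M = 123456
assert rsa_pkq_decrypt(d, p, q, e, k, C=pow(M, e, n)) == M
```

Step (3) is a Hensel lift: A already equals M mod p^i, and the correction term K recovers the next base-p digit of M from the linearization of X^e around A.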
An Elliptic Curve Implementation of the Finite Field Digital Signature Algorithm

Neal Koblitz
Dept. of Mathematics, Box 354350, Univ. of Washington, Seattle, WA 98195 USA
[email protected]
Abstract. We construct a supersingular implementation of the Elliptic Curve Digital Signature Algorithm (ECDSA) that is essentially equivalent to a finite field implementation of the Digital Signature Algorithm (DSA), and then we compare the efficiency of the two systems. The elliptic curve method is about 12 times faster. In the last section we use the same ideas to give a particularly efficient nonsupersingular implementation of elliptic curve cryptography in characteristic 7. Keywords: Digital Signature, Elliptic Curve, Supersingular, Nonadjacent Form
1 Introduction
H. Krawczyk (Ed.): CRYPTO'98, LNCS 1462, pp. 327–337, 1998. © Springer-Verlag Berlin Heidelberg 1998

The security of elliptic curve cryptosystems depends on the presumed intractability of the discrete logarithm problem in the group of points on the curve. Aside from the exponential-time algorithms that apply to an arbitrary group (all of which are impractical if the order of the group is divisible by a prime of more than 40 decimal digits), the only discrete log algorithms that have been found for an elliptic curve group are the algorithm of Semaev–Smart–Satoh–Araki [20,22,17], which applies only to an elliptic curve over a prime field F_p whose order is equal to p, and the algorithm of Menezes–Okamoto–Vanstone (MOV) [12]. The MOV algorithm uses the Weil pairing to imbed the group of points of an elliptic curve E over a finite field F_q into the multiplicative group F_{q^K}^× of an extension field F_{q^K}; the elliptic curve discrete log problem then reduces to the discrete log problem in F_{q^K}^×. This algorithm is practical if K can be taken to be small. If E is a supersingular elliptic curve, then K can always be chosen equal to 1, 2, 3, 4, or 6 [12]; whereas if E is nonsupersingular, then K is almost always much too large [1]. For this reason it is usually assumed that supersingular curves should not be used in cryptography. The purpose of this article is to give a cryptographic application of a family of supersingular elliptic curves for which K = 6 in the MOV algorithm. Suppose that #E(F_q) is a prime l (or a very small integer factor times a prime l) of between 40 and 80 decimal digits (which is the range one would use with a nonsupersingular curve). Then q^K = q^6 is roughly in the 250- to 500-digit range,
which is beyond the practical limits of algorithms for the discrete log in F_{q^K}^×. Thus, such a family of curves can be used in cryptography. Moreover, the family of curves that we study lends itself to particularly efficient computation of a multiple of a point, which is the basic operation in elliptic curve cryptosystems. Because the curves have complex multiplication by cube roots of unity, this family can be treated in a manner similar to the family of anomalous binary curves that was studied in [6], [10], and [23]. §§2–3 are devoted to the properties of the curves we are proposing and to the use of a special type of ternary expansion of an integer k that allows one to compute kP with only ≈ (2/5) log_3 q elliptic curve additions. In §§4–5 we describe our main motivation for looking at this family of supersingular elliptic curves: it enables us in characteristic 3 to make a very direct comparison of efficiency between the Digital Signature Algorithm (DSA) using finite fields (see [16]) and the Elliptic Curve Digital Signature Algorithm (ECDSA) (see, for example, [9]). Recall that in DSA one works in a cyclic subgroup of prime order l inside a finite field whose bitlength is between 3 and 6 times that of l. Thus, it would be completely consistent with the Digital Signature Standard to take F_{q^6} as one's finite field and the image of E(F_q) under the MOV imbedding as one's cyclic subgroup of order l. Then, conjecturally, the ECDSA and the corresponding DSA have identical security, and so it is interesting to compare efficiency. We show that the elliptic curve implementation is about 12 times as fast. In other words, even though the two groups of order l are apparently cryptographically equivalent, the elliptic curve "exponentiation" can be carried out more rapidly than exponentiation in the finite field.

Remark. We say "conjecturally" and "apparently" because we do not know how to prove that the discrete log problem on the elliptic curve group could not be easier than the discrete log problem in the corresponding subgroup of F_{q^6}^×. This is because we do not know how to compute the inverse of the imbedding E(F_q) ↪ F_{q^6}^× given by the Weil pairing.

Finally, in §6 we use the same ideas as in §§2–3 to give a family of nonsupersingular elliptic curves in characteristic 7 for which one also has a particularly efficient method to compute multiples of points.
2 The Curves
Let q = 3^m, where m is not divisible by 2 or 3, and let a = 0 or 1. Let E be the elliptic curve

Y^2 = X^3 − X − (−1)^a  (1)

over the field of 3 elements F_3, and let N_m denote the number of F_q-points on E. Because x^3 − x = 0 for all x ∈ F_3, it is easy to see that N_1 = 4 − (−1)^a 3. We can also write N_1 = 4 − τ − τ̄, where

τ = ((−1)^a 3 + i√3)/2
is the root with positive imaginary part of the characteristic polynomial T^2 − (−1)^a 3T + 3 of the Frobenius map Φ : (x, y) ↦ (x^3, y^3).^1 In other words, τ satisfies the relation

3 = (−1)^a 3τ − τ^2.  (2)

Then, by Weil's theorem,

N_m = |τ^m − 1|^2 = 3^m − (−1)^a (3/m) 3^{(m+1)/2} + 1,  (3)

where (3/m) is the Jacobi symbol, which is defined as follows:

(3/m) = 1 if m ≡ ±1 (mod 12);  −1 if m ≡ ±5 (mod 12).
Since N_m is divisible by N_{m'} whenever m' | m, we have the best chance of getting a large prime factor of N_m when m is prime. In that case N_1 | N_m, but it may happen that N_m/N_1 is prime. In other words, when m is prime N_m could be a prime in the case a = 0 and 7 times a prime in the case a = 1. For example, when a = 0 we find that N_163 = 3^163 + 3^82 + 1 is a prime of 78 decimal digits (259 bits); and when a = 1 we find that N_97 = 3^97 + 3^49 + 1 is 7 times a prime of 46 decimal digits (154 bits).

Remark. One might want to use composite m in order to be able to perform multiplications and inversions in F_{3^m} more efficiently using a tower of subfields. It is still possible to get a large prime factor of N_m with m not much larger than in the case when m is prime. For example, when a = 0, a 66-digit prime divides N_169; and when a = 1, a 47-digit prime divides N_121, and a 74-digit prime divides N_187.

We let ω denote the 6th root of unity

ω = τ − (−1)^a = ((−1)^a + i√3)/2,  (4)

and we let Z[ω] denote the ring of integers of the form u + vω, u, v ∈ Z. Then when m is prime we are interested in primality of the element (ω + 1)^m − 1 when a = 0 and primality of the element ((ω − 1)^m − 1)/(ω − 2) when a = 1, since it is a prime element of Z[ω] if and only if

N_m/N_1 = |(ω + 1)^m − 1|^2, if a = 0;  (1/7)|(ω − 1)^m − 1|^2, if a = 1,

is a prime in Z. When a = 0 this is a close analogue of the Mersenne prime problem, as we see by replacing ω by 1. (This example of an elliptic curve

^1 This means that (Φ^2 − (−1)^a 3Φ + 3)P = O for any point P on the curve. This polynomial (more precisely, its reciprocal polynomial 1 − (−1)^a 3T + 3T^2) is also the numerator of the zeta-function of the curve. For details on this and other properties of elliptic curves, see §VI.1 of [7] and Ch. V of [21].
for cryptography and the analogy with the Mersenne prime problem were first mentioned in Exercise 11 of §VI.1 and Exercise 6 of §VI.2 in [7].)

As always, the Frobenius map Φ : (x, y) ↦ (x^3, y^3) takes negligible time, provided that we are working in a normal basis of F_q over F_3; and the negation map (x, y) ↦ (x, −y) is also trivial to carry out. The Frobenius map Φ acting on points P ∈ E(F_q) may be regarded as the element τ ∈ Z[ω], because it satisfies the same quadratic equation Φ^2 − (−1)^a 3Φ + 3 = 0.

In the case of the particular equation (1), it is also extremely easy to describe the action on points P ∈ E(F_q) of the cube roots of unity. Let us take a = 1; the case a = 0 is virtually identical. Then we are interested in how the nontrivial cube root of unity ω = (−1 + i√3)/2 = τ + 1 acts on P = (x, y) ∈ E(F_q). That is, we want to find the coordinates of (Φ + 1)P = P_{x,y} + P_{x^3,y^3}. Using the addition law for P_{x1,y1} + P_{x2,y2} = P_{x3,y3}, which takes the following form when P_{x2,y2} ≠ ±P_{x1,y1}:

x3 = ((y2 − y1)/(x2 − x1))^2 − x1 − x2;
y3 = y1 + y2 − ((y2 − y1)/(x2 − x1))^3,

and the relation x^3 − x = y^2 − 1 from (1), we obtain:

P_{x,y} + P_{x^3,y^3} = P_{x+1,y}.

(It is easy to check that this formula also holds when P_{x^3,y^3} = P_{x,y}, i.e., when P_{x,y} is an F_3-point.) Thus, the action on points of any power of τ and any sixth root of unity can be computed in trivial time.

Remark. Another convenient feature of the curves (1) in characteristic 3 is that, if we use a normal F_3-basis {β, β^3, ..., β^{3^{m−1}}} of F_q, then there is an easy compression technique for storing a point P_{x,y}, by analogy with the characteristic 2 method in [13]. Namely, we represent P as (x_0, y), where x_0 ∈ {0, 1, −1} is the first coordinate of x. Then x = Σ x_i β^{3^i} can be recovered by setting x_i = x_{i−1} + z_i, i = 1, 2, ..., m − 1, where the z_i are the coordinates of −y^2 − (−1)^a = Σ z_i β^{3^i}.
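The point counts and primality claims of this section can be checked numerically. The sketch below evaluates formula (3) and tests the examples N_163 and N_97; the Miller-Rabin test is probabilistic, so "prime" here means "probable prime":

```python
import random

def is_probable_prime(n, rounds=20):
    # Miller-Rabin probabilistic primality test
    if n < 2:
        return False
    for p in (2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37):
        if n % p == 0:
            return n == p
    d, s = n - 1, 0
    while d % 2 == 0:
        d, s = d // 2, s + 1
    for _ in range(rounds):
        x = pow(random.randrange(2, n - 1), d, n)
        if x in (1, n - 1):
            continue
        for _ in range(s - 1):
            x = x * x % n
            if x == n - 1:
                break
        else:
            return False
    return True

def jacobi_3(m):
    # the Jacobi symbol (3/m), as defined below equation (3)
    return 1 if m % 12 in (1, 11) else -1

def N(m, a):
    # equation (3): N_m = 3^m - (-1)^a (3/m) 3^((m+1)/2) + 1
    return 3 ** m - (-1) ** a * jacobi_3(m) * 3 ** ((m + 1) // 2) + 1
```

For instance, N(163, 0) equals 3^163 + 3^82 + 1 and passes the primality test, while N(97, 1) is 7 times a probable prime, matching the examples in the text.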
3 Base-τ Expansions
Suppose that we want to compute a multiple kP of an F_{3^m}-point on the elliptic curve (1). As in [10] and [23], our first step is to divide k by τ^m − 1 in the ring Z[ω], and then replace k by its remainder k' modulo τ^m − 1. This is justified because (τ^m − 1)P = Φ^m P − P = O. Our next step is to find a base-τ expansion of k' with digits {0, ±1, ±ω, ±ω^2} that is in nonadjacent form (NAF), where, following [23], we define "nonadjacent form" to mean that no two consecutive coefficients are nonzero.

Theorem 1. Every element of Z[ω] reduced modulo τ^m − 1 has a unique NAF base-τ expansion with digits {0, ±1, ±ω, ±ω^2}, in which at most (m + 1)/2 digits are nonzero. Asymptotically on the average 60% of the digits are zero.
Proof. We first recall the algorithm for finding the usual base-τ expansion of an element u + vω ∈ Z[ω] with digits ε_j ∈ {0, 1, −1}. By (4) we have u + vω = (u − (−1)^a v) + vτ. Dividing the integer u − (−1)^a v by 3, we can write u − (−1)^a v = 3w + ε_0 for some ε_0 ∈ {0, 1, −1}. Then we use (2) to write

u + vω = (3w + ε_0) + vτ = (((−1)^a 3w + v) − wτ)τ + ε_0.

We then take the quotient ((−1)^a 3w + v) − wτ and repeat the process to find ε_1, ε_2, and so on.

Now we describe the algorithm for finding the NAF base-τ expansion of an element of Z[ω]. In each step we divide our previous quotient q_{j−1} by τ, getting a quotient u + vτ and a remainder ε ∈ {0, 1, −1}, as we did in the previous paragraph: q_{j−1} = (u + vτ)τ + ε. If ε = 0 or if 3 | u, then we leave the above equality unchanged and set q_j = u + vτ, η_j = ε. Otherwise, we modify the above equation as follows: q_{j−1} = q_j τ + η_j, where

q_j = (u + (−1)^a 2ε) + (v − ε)τ and η_j = εω^2, if u ≡ (−1)^a ε (mod 3);
q_j = (u + (−1)^a ε) + vτ and η_j = −(−1)^a εω, if u ≡ −(−1)^a ε (mod 3).
Uniqueness of the NAF expansion is clear from the construction. Finally, the asymptotic expectation is that every nonzero digit is followed by 1 + 1/3 + 1/3^2 + ··· = 1.5 zero digits, in which case 60% of the digits are zero. □

Here is an example. Let us take a = 0 and find the expansion of 10 + 2i√3. We have:

10 + 2i√3 = (7 − τ)τ + 1 = (9 − 2τ)τ + ω^2;
9 − 2τ = (7 − 3τ)τ + 0;
7 − 3τ = (3 − 2τ)τ + 1;
3 − 2τ = (1 − τ)τ + 0;
1 − τ = 0 · τ + ω^4,

and hence the digits are η_4 = ω^4, η_3 = 0, η_2 = 1, η_1 = 0, η_0 = ω^2.

Remark. The expected number (2/5) log_3 q of elliptic curve additions given by Theorem 1 is about 25% less than the previous lowest result for the number of additions of points in computing kP, which was (1/3) log_2 q, due to Solinas [23]. However, from a practical point of view this improvement in the number of elliptic curve additions might be offset by the decreased efficiency of working in characteristic 3 rather than 2. For example, in characteristic 2 one can often minimize the time for a field operation by using an optimal normal basis [15].

In order to avoid field inversions and determine the time required to compute a multiple of a point in terms of field multiplications alone, we introduce projective coordinates. (See §6.3 of [11] for a discussion of this in characteristic 2.)
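The proof's algorithm can be sketched directly for the case a = 0, representing elements of Z[ω] as integer pairs c_0 + c_1 τ with τ^2 = 3τ − 3; the pair encoding of the digits is our own convention, with ω = τ − 1 stored as (−1, 1) and ω^2 = τ − 2 as (−2, 1):

```python
# Sketch of the NAF base-tau expansion for a = 0, where tau^2 = 3*tau - 3.
# Elements of Z[omega] are pairs (c0, c1) meaning c0 + c1*tau; digits are
# returned least significant first, also as (c0, c1) pairs.

def tau_mul(x, y):
    # multiply two elements written in the 1, tau basis
    (a0, a1), (b0, b1) = x, y
    return (a0 * b0 - 3 * a1 * b1, a0 * b1 + a1 * b0 + 3 * a1 * b1)

def naf_tau(c0, c1):
    digits = []
    while (c0, c1) != (0, 0):
        eps = (c0 % 3 + 1) % 3 - 1              # remainder in {0, 1, -1}
        w = (c0 - eps) // 3
        u, v = 3 * w + c1, -w                   # (c0 + c1*tau - eps) / tau
        if eps == 0 or u % 3 == 0:
            digits.append((eps, 0))             # digit eps
            c0, c1 = u, v
        elif (u - eps) % 3 == 0:                # u = eps (mod 3)
            digits.append((-2 * eps, eps))      # digit eps * omega^2
            c0, c1 = u + 2 * eps, v - eps
        else:                                   # u = -eps (mod 3)
            digits.append((eps, -eps))          # digit -eps * omega
            c0, c1 = u + eps, v
    return digits

def evaluate(digits):
    # reconstruct sum_j digit_j * tau^j
    total, power = (0, 0), (1, 0)
    for dgt in digits:
        s = tau_mul(dgt, power)
        total = (total[0] + s[0], total[1] + s[1])
        power = tau_mul(power, (0, 1))
    return total
```

On the paper's example 10 + 2i√3 = 4 + 4τ, naf_tau(4, 4) reproduces the digits η_0 = ω^2, η_1 = 0, η_2 = 1, η_3 = 0, η_4 = −ω = ω^4.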
When converted to homogeneous coordinates, the equations for point addition (see §2) become

z3 = (x2 z1 − x1 z2)^3 z1 z2;
x3 = (y2 z1 − y1 z2)^2 (x2 z1 − x1 z2) z1 z2 − (x2 z1 − x1 z2)^3 x1 z2 − (x2 z1 − x1 z2)^3 x2 z1;
y3 = −(x2 z1 − x1 z2)^3 y1 z2 + (y2 z1 − y1 z2)(x2 z1 − x1 z2)^2 x1 z2 − x3 (y2 z1 − y1 z2)/(x2 z1 − x1 z2).

(Note that the last expression is a polynomial, because x3 is divisible by x2 z1 − x1 z2.) In each stage of the computation of kP one adds a partial sum to a point of the form η_j τ^j P (in which the NAF digit η_j is a sixth root of unity). The latter point is computed in negligible time in affine (i.e., non-homogeneous) coordinates; so we may assume that its projective coordinates are (x2, y2, 1), that is, z2 = 1.

Assuming now that z2 = 1, the above formulas can be computed as follows. Successively set

A = x2 z1;  B = y2 z1;  C = (A − x1)^2;  D = (A − x1)^3;
E = (B − y1)^2;  F = x1 C;  G = z1 E − (D + 2F).

Then

z3 = z1 D;  x3 = (A − x1) G;  y3 = −y1 D + (B − y1)(F − G).
This all takes 10 field multiplications. (Note that D is computed in negligible time, since we are in characteristic 3.) Since on the average (2/5)m point additions are needed to compute a multiple of a point, it follows that in projective coordinates one expects to compute a multiple of a point with 4m field multiplications. From the formulas for adding points in affine coordinates (see §2) we see that, alternatively, a point addition can be accomplished with 1 field inversion and 2 field multiplications. Thus, if an inversion can be done in less time than 8 field multiplications, we should use affine rather than projective coordinates. In characteristic 2 there are implementations of field inversion that take time roughly equal to that of 3 field multiplications (see [19] and [24]); and it is reasonable to expect that the same equivalence can be achieved in characteristic 3 [18]. We have obtained the following corollary of Theorem 1.

Corollary 1. If one uses projective coordinates, the expected number of field multiplications in F_{3^m} needed to compute a multiple of a point on the curve (1) is 4m. Using affine coordinates, on the average one can compute a multiple of a point on (1) with (4/5)m field multiplications and (2/5)m field inversions. If a field inversion can be carried out in time equivalent to that of three field multiplications, then in affine coordinates one has a time estimate of 2m field multiplications for computing a multiple of a point.
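As a toy consistency check, the z2 = 1 recipe can be compared against the affine chord rule in the smallest case m = 1, i.e., the prime field F_3 with a = 1. This checks only the algebra of the formulas, not arithmetic in a genuine extension field F_{3^m}:

```python
# Toy check of the z2 = 1 projective addition formulas against the affine
# chord rule, over the prime field F_3 (m = 1), curve Y^2 = X^3 - X + 1 (a = 1).
P3 = 3

def inv(x):
    return pow(x % P3, P3 - 2, P3)

def add_affine(pt1, pt2):
    # characteristic-3 chord rule from Section 2 (requires pt2 != +-pt1)
    (x1, y1), (x2, y2) = pt1, pt2
    lam = (y2 - y1) * inv(x2 - x1) % P3
    return ((lam * lam - x1 - x2) % P3, (y1 + y2 - lam ** 3) % P3)

def add_projective(pt1, pt2):
    # the 10-multiplication recipe A..G, with z2 = 1
    (x1, y1, z1), (x2, y2) = pt1, pt2
    A, B = x2 * z1, y2 * z1
    C = (A - x1) ** 2
    D = (A - x1) ** 3        # a cube: essentially free in characteristic 3
    E = (B - y1) ** 2
    F = x1 * C
    G = z1 * E - (D + 2 * F)
    return ((A - x1) * G % P3, (-y1 * D + (B - y1) * (F - G)) % P3, z1 * D % P3)

# the six affine points of the curve over F_3 (together with O, that is N_1 = 7)
points = [(x, y) for x in range(P3) for y in range(P3)
          if (y * y - (x ** 3 - x + 1)) % P3 == 0]

for pt1 in points:
    for pt2 in points:
        if pt1[0] != pt2[0]:                 # guarantees pt2 != +-pt1
            x3, y3, z3 = add_projective((pt1[0], pt1[1], 1), pt2)
            zi = inv(z3)
            assert (x3 * zi % P3, y3 * zi % P3) == add_affine(pt1, pt2)
```

Normalizing the projective result by z3^{-1} reproduces the affine sum for every admissible pair, as the derivation in the text predicts.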
4 DSA and ECDSA
We shall use DSA in a slightly generalized form, in which the finite field F_q, q = p^m, is not necessarily a prime field. Here q has at least 500 bits, and q − 1 is divisible by a prime l of at least 160 bits. Let f : F_q → F_l be a fixed, easily computable function such that #f^{−1}(y) ≈ q/l for each y ∈ F_l; that is, f spreads F_q fairly evenly over F_l. If q = p, then we represent elements of F_q by integers x ∈ {0, 1, ..., p − 1}, and we usually take f(x) to be the least nonnegative residue of x modulo l. If m > 1, and if {β_0, ..., β_{m−1}} is our F_p-basis of F_q, then for x = Σ x_i β_i, x_i ∈ {0, 1, ..., p − 1}, we could, for example, define f(x) to be the least nonnegative residue modulo l of the integer Σ x_i p^i.

Let g ∈ F_q be a generator of the unique subgroup of F_q^× of order l, and let H be a hash function taking values in F_l. Here q, l, {β_i}, g, f, and H are publicly known. Alice's secret key is a random integer x in the range 1 < x < l, and her public key is y = g^x ∈ F_q. To sign a message M, Alice does the following:

1) She selects a random integer k in the range 1 < k < l.
2) She computes g^k ∈ F_q and r = f(g^k). If r = 0, she returns to step 1).
3) She computes k^{−1} ∈ F_l and s = k^{−1}(H(M) + xr) ∈ F_l. If s = 0, she returns to step 1).
4) Her signature for the message M is the pair (r, s).

To verify the signature, Bob computes u_1 = s^{−1}H(M) ∈ F_l, u_2 = s^{−1}r ∈ F_l, and then g^{u_1} y^{u_2} ∈ F_q. If f(g^{u_1} y^{u_2}) = r, he accepts the signature.

We now describe the elliptic curve version ECDSA. Let E be an elliptic curve defined over F_q such that #E(F_q) is equal to a prime l of at least 160 bits (or to a small integer factor times such a prime l). Let P be an F_q-point of E of order l. Let f_E : E(F_q) → F_l be a fixed, easily computable function that spreads the points over F_l fairly evenly (for instance, we might require that #f_E^{−1}(y) be bounded by a small constant for y ∈ F_l).
One way to define the elliptic curve function f_E, for example, would be to take the x-coordinate of a point and then apply to it the function f : F_q → F_l in the above description of DSA.

Alice's secret key is an integer x in the range 1 < x < l, and her public key is the point Q = xP ∈ E(F_q). To sign a message M, Alice does the following:

1) She selects a random integer k in the range 1 < k < l.
2) She computes kP and r = f_E(kP). If r = 0, she returns to step 1).
3) She computes k^{−1} ∈ F_l and s = k^{−1}(H(M) + xr) ∈ F_l. If s = 0, she returns to step 1).
4) Her signature for the message M is the pair (r, s).

To verify the signature, Bob computes u_1 = s^{−1}H(M) ∈ F_l, u_2 = s^{−1}r ∈ F_l, and then u_1 P + u_2 Q ∈ E(F_q). If f_E(u_1 P + u_2 Q) = r, he accepts the signature.
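The DSA steps above can be sketched generically with toy parameters; the tiny p and l and the stand-in "hash" below are our own illustrative choices, nowhere near the 500-bit / 160-bit sizes the text requires:

```python
# Toy DSA over a subgroup of F_p^x; parameters are illustrative only.
p, l = 467, 233              # l is prime and l | p - 1 = 2 * 233
g = pow(2, (p - 1) // l, p)  # generator of the unique order-l subgroup (g = 4)

def H(msg):                  # stand-in hash into F_l (a real scheme uses SHA-like H)
    return sum(msg) % l

def f(x):                    # f : F_p -> F_l, least nonnegative residue mod l
    return x % l

def sign(x, msg, k):
    r = f(pow(g, k, p))
    s = pow(k, -1, l) * (H(msg) + x * r) % l
    if r == 0 or s == 0:     # the text restarts with a fresh k in this case
        raise ValueError("retry with a new k")
    return r, s

def verify(y, msg, r, s):
    u1 = pow(s, -1, l) * H(msg) % l
    u2 = pow(s, -1, l) * r % l
    return f(pow(g, u1, p) * pow(y, u2, p) % p) == r

x = 123                      # Alice's secret key
y = pow(g, x, p)             # her public key
r, s = sign(x, b"abc", k=5)
assert verify(y, b"abc", r, s)
```

Verification works because g^{u1} y^{u2} = g^{s^{-1}(H(M) + xr)} = g^k in the order-l subgroup, so f of it reproduces r. The ECDSA variant replaces the subgroup of F_p^x by the group generated by P on the curve, with the same arithmetic mod l.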
Neal Koblitz

5 Comparison of DSA and ECDSA
We set up ECDSA using the curve E in (1) over F_q, q = 3^m. We assume that

l = N_m/N_1 = |(τ^m − 1)/(τ − 1)|^2 =
    3^m − 3^{(m+1)/2} + 1,         if a = 0;
    (3^m + 3^{(m+1)/2} + 1)/7,     if a = 1,
is prime. Let P ∈ E(F_q) be a point of order l. Let F : E(F_q) → F_{q^6}^× be an MOV imbedding of the elliptic curve group into the multiplicative group of F_{q^6}, constructed using the Weil pairing [12]. Let g = F(P), which is a generator of the unique subgroup of F_{q^6}^× of order l.
We set up DSA in F_{q^6}^× and ECDSA in E(F_q) so as to be equivalent to one another by means of F. Thus, if f : F_{q^6} → F_l is the function in DSA, then we define f_E : E(F_q) → F_l by the formula f_E = f ∘ F.
Remark. In a practical situation it would be more efficient to define f_E without using the MOV imbedding F (for example, by applying f to the x-coordinate of a point, as suggested in §4), because even though the computation of F is polynomial time, it is not very fast. We have chosen to set f_E = f ∘ F for a theoretical rather than practical reason: to make the DSA and ECDSA implementations completely equivalent.
We can now easily verify that the MOV imbedding F gives an equivalence between the two signature schemes. In both cases Alice's secret key is an integer x in the range 1 < x < l; her public key is Q = xP in ECDSA and F(Q) = F(xP) = F(P)^x = g^x = y in DSA. The k, r, and s are the same in both cases. So are the u_1 and u_2 in signature verification. In ECDSA the signature is verified by computing u_1 P + u_2 Q, and in DSA by computing g^{u_1} y^{u_2}. The signature is accepted if r = f_E(u_1 P + u_2 Q) = f(F(u_1 P + u_2 Q)) = f(g^{u_1} y^{u_2}). Thus, the DSA and ECDSA implementations are equivalent.
In order to get an approximate idea of the relative efficiency of the two systems, let us compare the times to compute 1) kP ∈ E(F_q) and 2) g^k ∈ F_{q^6}, where k is a random integer in the range 1 < k < l, i.e., k has about the same bitlength as q = 3^m. We shall neglect possible speed-ups using precomputations, fast multiplication techniques, etc., and shall assume that a field multiplication in F_q takes time proportional to (log_2 q)^2.
We shall also assume that a field inversion in F_q takes approximately the same amount of time as 3 field multiplications; in that case the computation of kP on the average takes the equivalent of 2m field multiplications in F_q, by the corollary to Theorem 1 (see §3). On the DSA side, we have a significant efficiency advantage because we are working in characteristic 3. Namely, we first write the exponent k in ternary form as k = Σ_j ε_j 3^j, where ε_j ∈ {0, 1, 2}. For ν = 0, 1, 2 let J_ν be the set of j for which ε_j = ν. Since the computation of g^{3^j} takes negligible time, the computation of
An Elliptic Curve Implementation
g^k = (Π_{j∈J_1} g^{3^j}) · (Π_{j∈J_2} g^{3^j})^2 takes just #(J_1) + #(J_2) field multiplications. We expect about one third of the digits in k to be zero, so we conclude that the computation of g^k takes roughly (2/3)m field multiplications in F_{q^6}, each of which takes about 36 times as long as a field multiplication in F_q. Thus, the ratio of time for g^k to time for kP is roughly

36 · (2/3)m / 2m = 12.

In other words, when we choose parameters for ECDSA and for DSA in such a way as to make the two systems equivalent, we find that ECDSA is approximately 12 times faster than DSA, independently of the value of m.
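The ternary exponentiation just described can be sketched as follows. This is an illustrative stand-in using ordinary modular integer arithmetic: in F_{3^m} the repeated cubings g ↦ g^3 would be nearly free Frobenius maps (which is the whole point), so only the multiplications counted in `mults` matter.

```python
# Sketch of base-3 exponentiation: g^k = (prod over J_1 of g^{3^j})
#                                        * (prod over J_2 of g^{3^j})^2,
# where J_v is the set of ternary digit positions with digit v.
def ternary_pow(g, k, mod):
    digits = []
    while k:
        k, eps = divmod(k, 3)
        digits.append(eps)          # ternary digits, least significant first
    prod1 = prod2 = 1
    mults = 0                       # multiplications by some g^{3^j}
    power = g                       # g^{3^j}, advanced by cubing each round
    for eps in digits:
        if eps == 1:
            prod1 = prod1 * power % mod; mults += 1
        elif eps == 2:
            prod2 = prod2 * power % mod; mults += 1
        power = pow(power, 3, mod)  # "free" Frobenius step in characteristic 3
    # final combination (a couple of extra multiplications, ignored in the count)
    return prod1 * prod2 * prod2 % mod, mults
```

On average about 2/3 of the digits are nonzero, matching the (2/3)m multiplication count used above.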
6 A Nonsupersingular Family
Consider the curve

Y^2 = X^3 − b,   b ≠ 0,

defined over F_7. This elliptic curve is nonsupersingular. The number N_1 = 8 − t of F_7-points and the root τ with positive imaginary part of the characteristic polynomial T^2 − tT + 7 are given in the following table:

b    t    τ
±1   ±4   ±2 + √3 i
±2   ±1   (±1 + 3√3 i)/2
±3   ±5   (±5 + √3 i)/2

As usual, we choose b and a prime m so that N_m/N_1 = |(τ^m − 1)/(τ − 1)|^2 is prime. For instance, when b = −1 the number N_59 is 12 times a 49-digit prime; when b = 3 the number N_61 is 3 times a 52-digit prime, and the number N_71 is 3 times a 60-digit prime. Note that, up to complex conjugation, the six values of τ in the table differ from one another by a factor of ±1, ±ω, or ±ω^2, where ω = (−1 + √3 i)/2. As before, we define the action of τ on a point P ∈ E(F̄_7), where F̄_7 = ∪_m F_{7^m} is the algebraic closure of F_7, to be the Frobenius map τP_{x,y} = ΦP_{x,y} = P_{x^7,y^7}. In this way Z[ω] imbeds in the ring of endomorphisms of E(F̄_7); and it follows from the properties of nonsupersingular curves (see p. 137 of [21]) that the image of Z[ω] is all of the endomorphism ring of E(F̄_7). It is easy to check that the maps P_{x,y} ↦ P_{2x,y} and P_{x,y} ↦ P_{4x,y} are order-3 endomorphisms of E. Since ω = (−1 + √3 i)/2 and ω^2 = ω̄ are the only nontrivial cube roots of unity, it follows that in each case ωP must be given by one of these maps; one can quickly determine which of the two by testing on an F_7- or F_{7^2}-point of E. Thus, the action on F_{7^m}-points of any of the sixth roots of unity ±1, ±ω, ±ω^2 is trivial to compute. Suppose that we want to compute a multiple kP for P ∈ E(F_{7^m}). As usual, we first replace k by its remainder k′ ∈ Z[ω] after division by τ^m − 1. We
then compute the base-τ expansion of k′ using {0, ±1, ±ω, ±ω^2} rather than {0, ±1, ±2, ±3} as digits; this is easy to do using the equality τ^2 = tτ − 7 and the simple relations between τ, ω, and ±2, ±3. We cannot obtain an NAF expansion, but we have the advantage that k′ has fewer digits in characteristic 7, where the base τ has larger norm (7 rather than 2 or 3). Since 1/7 of the digits are expected to be 0, we conclude that on the average the computation of kP requires ≈ (6/7) log_7 q = 0.3052 log_2 q elliptic curve additions. This estimate for the number of elliptic curve additions is slightly lower than Solinas' value of (1/3) log_2 q on an anomalous binary curve [23]. But in practice the improvement from (1/3) log_2 q to 0.3052 log_2 q is not enough to compensate for the lower efficiency of working in characteristic 7 rather than in characteristic 2.
Remark. A disadvantage of this family of curves is that there are not many curves and fields to choose from. The same applies to the curves in §2, and to the anomalous binary curves in [6,10,23]. Random curves allow far more choice, but less efficient implementation.
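The operation count above converts between logarithm bases; a quick numeric check of the constants, using only the figures stated in the text:

```python
import math

# 1/7 of the base-tau digits are expected to be 0, so 6/7 are nonzero;
# each nonzero digit costs one elliptic curve addition.
nonzero_fraction = 6 / 7
# (6/7) log_7 q additions, expressed as a multiple of log_2 q:
cost_per_bit = nonzero_fraction / math.log2(7)
# Solinas' count of (1/3) log_2 q on an anomalous binary curve [23]:
solinas = 1 / 3
```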
Acknowledgments I would like to thank Arjen Lenstra, Richard Schroeppel, and Alfred Menezes for several very helpful comments and suggestions.
References

1. R. Balasubramanian and N. Koblitz, The improbability that an elliptic curve has subexponential discrete log problem under the Menezes–Okamoto–Vanstone algorithm, J. Cryptology 11 (1998), 141–145.
2. I. Blake, X. H. Gao, R. C. Mullin, S. A. Vanstone, and T. Yaghoobian, Applications of Finite Fields, Kluwer Acad. Publ., 1993.
3. S. Gao and H. W. Lenstra, Jr., Optimal normal bases, Designs, Codes and Cryptography 2 (1992), 315–323.
4. K. Ireland and M. I. Rosen, A Classical Introduction to Modern Number Theory, 2nd ed., Springer-Verlag, 1990.
5. N. Koblitz, Elliptic curve cryptosystems, Math. Comp. 48 (1987), 203–209.
6. N. Koblitz, CM-curves with good cryptographic properties, Advances in Cryptology – Crypto '91, Springer-Verlag, 1992, 279–287.
7. N. Koblitz, A Course in Number Theory and Cryptography, 2nd ed., Springer-Verlag, 1994.
8. N. Koblitz, Algebraic Aspects of Cryptography, Springer-Verlag, 1998.
9. N. Koblitz, A. Menezes, and S. A. Vanstone, The state of elliptic curve cryptography, to appear in Designs, Codes and Cryptography.
10. W. Meier and O. Staffelbach, Efficient multiplication on certain non-supersingular elliptic curves, Advances in Cryptology – Crypto '92, Springer-Verlag, 1993, 333–344.
11. A. Menezes, Elliptic Curve Public Key Cryptosystems, Kluwer Acad. Publ., 1993.
12. A. Menezes, T. Okamoto, and S. A. Vanstone, Reducing elliptic curve logarithms to logarithms in a finite field, IEEE Trans. Information Theory 39 (1993), 1639–1646.
13. A. Menezes and S. A. Vanstone, Elliptic curve cryptosystems and their implementation, J. Cryptology 6 (1993), 209–224.
14. V. Miller, Uses of elliptic curves in cryptography, Advances in Cryptology – Crypto '85, Springer-Verlag, 1986, 417–426.
15. R. Mullin, I. Onyszchuk, S. A. Vanstone, and R. Wilson, Optimal normal bases in GF(p^n), Discrete Applied Math. 22 (1988/89), 149–161.
16. National Institute for Standards and Technology, Digital signature standard, FIPS Publication 186, 1993.
17. T. Satoh and K. Araki, Fermat quotients and the polynomial time discrete log algorithm for anomalous elliptic curves, preprint.
18. R. Schroeppel, personal communication, Dec. 2, 1997.
19. R. Schroeppel, H. Orman, S. O'Malley, and O. Spatscheck, Fast key exchange with elliptic curve systems, Advances in Cryptology – Crypto '95, Springer-Verlag, 1995, 43–56.
20. I. A. Semaev, Evaluation of discrete logarithms in a group of p-torsion points of an elliptic curve in characteristic p, Math. Comp. 67 (1998), 353–356.
21. J. Silverman, The Arithmetic of Elliptic Curves, Springer-Verlag, 1986.
22. N. Smart, The discrete log problem on elliptic curves of trace 1, preprint.
23. J. Solinas, An improved algorithm for arithmetic on a family of elliptic curves, Advances in Cryptology – Crypto '97, Springer-Verlag, 1997, 357–371.
24. E. De Win, A. Bosselaers, S. Vandenberghe, P. De Gersem, and J. Vandewalle, A fast software implementation for arithmetic operations in GF(2^n), Advances in Cryptology – Asiacrypt '96, Springer-Verlag, 1996, 65–76.
Quantum Bit Commitment From a Physical Assumption

Louis Salvail⋆

BRICS, Basic Research in Computer Science of the Danish National Research Foundation, Department of Computer Science, University of Århus, Ny Munkegade, Building 540, DK-8000 Århus C, Denmark
[email protected]
Abstract. Mayers and, independently, Lo and Chau have shown that unconditionally secure quantum bit commitment is impossible. In this paper we show that, under the assumption that the sender is not able to perform generalized measurements involving more than n qubits coherently (n-coherent measurements), quantum bit commitment is possible. A commitment scheme is δ-binding if for each execution there is an x̃ ∈ {0, 1} that cannot be unveiled with probability of success better than δ. Our bit commitment scheme requires the transmission of N qubits and is δ-binding, for any δ > 0, if the committer can only carry out n-coherent measurements for some n ∈ Ω(N). For some α > 0, the scheme is 2^{−αN}-binding against n-coherent measurements for some n ∈ Ω(√N). The security against malicious receivers is unconditional.
1 Introduction
The first application of quantum mechanics in cryptography was proposed by Wiesner [34] in the late 1960's through what he called "quantum multiplexing". Classically, this primitive was reinvented a decade later by Rabin [32] as one-out-of-two oblivious transfer. Oblivious transfer is known to provide the sufficient and necessary tool for solving the very general secure two-party computation problem [20,15]. In his original paper [34], Wiesner describes an attack based on generalized quantum measurements against his own scheme. Although proven insecure, Wiesner's scheme requires a quantum attacker with technology far beyond what is achievable today. In 1984, Bennett and Brassard proposed two new cryptographic applications of quantum mechanics: secret-key exchange and coin flipping by telephone [3]. Whilst the former is still strongly believed to be secure [25,7], the latter was already known to be breakable using EPR pairs [16,3]. The proposed coin flipping protocol can be modified easily to implement a quantum bit commitment scheme that can indeed be defeated by the same EPR attack [9]. Unlike Wiesner's protocol, the attack is conceivable using today's technology [1]. Some attempts to find a quantum bit commitment
⋆ Part of this work was done while the author was at CWI.
H. Krawczyk (Ed.): CRYPTO '98, LNCS 1462, pp. 338–354, 1998. © Springer-Verlag Berlin Heidelberg 1998
scheme not suffering from the same weaknesses have since been made [9,10]. In 1993, Brassard, Crépeau, Jozsa and Langlois [10] proposed a quantum commitment scheme (the BCJL scheme) that was claimed to be unconditionally secure until Mayers discovered a subtle flaw in 1995 [26]. This was bad news, considering that Yao [35] had provided a proof that, under the assumption that a secure bit commitment scheme exists, the BBCS protocol [5] for quantum oblivious transfer (QOT) is secure. Although BCJL was known to be insecure, it was still conceivable that a secure quantum bit commitment scheme could be found. The situation turned out to be a dead end when Mayers [29], and independently Lo and Chau [22], showed that no quantum bit commitment scheme whatsoever exists. It was shown that the EPR attack can be generalized against any quantum bit commitment provided the committer can deal with large entangled quantum states. Different approaches have since been tried in order to escape the no-go theorem [13]. All these attempts aimed at taking advantage of subtle assumptions in the theorem statement. The common feature of most approaches is the use of a classical assumption that has to hold only temporarily, the goal being to build from such a temporary assumption a commitment scheme that is both concealing and binding even after the assumption is withdrawn. Unfortunately, none of these attempts has produced a scheme achieving more than what classical cryptography alone can provide. Quantum bit commitment is now known to be impossible in scenarios lying beyond the initial statement of the no-go theorem [11]. This naturally raises the question of what assumptions are needed in order for secure quantum bit commitment to exist, and whether these assumptions can be made independent of the classical ones whilst remaining meaningful. In other words, does quantum mechanics help in providing secure two-party computation?
In this paper we consider a physical limitation upon which the security of quantum bit commitment, and QOT [35], can be based. The assumption does not restrict the computing power and therefore makes sense whether or not one-way functions exist in the classical world [19]. To this end, we restrict the ability of one party to carry out arbitrary quantum coherent measurements. We say that a measurement is n-coherent if it involves no more than n qubits coherently. We propose a variant of BCJL that is shown to be secure under this restriction. One reason for considering this assumption is that large coherent measurements are not known to be realizable by a reliable physical process. As an example, consider the simplest interesting case n = 2. Perhaps the most important 2-coherent measurement that is not 1-coherent is the Bell measurement which, together with the ability to produce EPR pairs, leads to quantum teleportation [6]. Interestingly, although quantum teleportation has been shown to work experimentally [8], the Bell measurements could only be approximated. It is in general more difficult to make several qubits interact in a measurement than to produce entangled states [31,24,8]. Whereas EPR pairs can easily be produced experimentally, measuring in the Bell basis requires more work. Even though Bell measurements will probably be accomplished in the near future, large coherent measurements are very challenging even in a controlled environment. The complexity and reliability required for the physical process implementing large
n-coherent measurements might well not be achievable in the foreseeable future. A coherent measurement can be seen as a unitary transformation acting on the observed system plus an ancilla, followed by a standard von Neumann measurement. This process is exactly what is meant by a quantum algorithm. The ability to perform n-coherent measurements suggests that quantum computers working on n qubits can also be realized. However, it might be the case that n-qubit quantum computers exist but n-coherent measurements against quantum protocols don't. One reason could be that quantum cryptography, unlike quantum computation, can take place in an environment extremely hostile to the survival of large entangled quantum states [17]. Our result shows that large coherent measurements are necessary in order to apply Mayers' attack against our scheme. A commitment scheme is δ-binding if for each execution there is a bit x̃ ∈ {0, 1} that cannot be unveiled with probability of success better than δ. Our bit commitment scheme requires the transmission of N qubits and is δ-binding, for any δ > 0, provided the committer can only carry out n-coherent measurements for some n ∈ Ω(N). For some α > 0, the scheme is 2^{−αN}-binding against n-coherent measurements for some n ∈ Ω(√N). The commitment is also shown to conceal the committed bit unconditionally. In section 2 we give the preliminary ingredients. Section 3 presents a variation of the BCJL protocol, called LJCB. In section 4 we introduce the definitions and tools about quantum measurements and outcomes. In section 5, we define the class of n-coherent strategies against the binding condition. We show that LJCB is binding against the class of n-coherent strategies for some n ∈ Ω(N), where N is the total number of qubits sent through the quantum channel. In section 6, LJCB is shown to be unconditionally concealing. We conclude in section 7.
2 Preliminaries
We write x ∈_R X for "the element x is picked uniformly and randomly from the set X". The notation x ⊙ y for x, y ∈ {0, 1}^n means ⊕_{i=1}^n x_i · y_i. For sets X = {x_0, x_1, …, x_n} and s ∈ {0, …, n} we write X_s for the s-th element x_s in X. If y represents the outcome of some random experiment then we write y for the random variable associated with the experiment. We denote the Shannon entropy and information functions by H(y) and I(y) respectively. For any strings c, c′ ∈ {0, 1}^n we define ∆(c, c′) as the Hamming distance between c and c′. When the context allows, we also write ∆(c, c′) for the set of positions where c and c′ differ. For X ⊆ {1, …, n} and b ∈ {0, 1}^n we denote by b_X the substring of b defined by the positions in X.¹
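The notation above can be made concrete with a few hedged helper functions (bit strings represented as Python lists of bits; position sets 1-indexed, as in the text; function names are our own):

```python
# Helpers mirroring the preliminaries: the inner product mod 2 (x "dot" y),
# the set-valued Hamming distance Delta(c, c'), and the substring b_X.
def inner_product(x, y):
    # XOR over i of x_i * y_i
    return sum(xi & yi for xi, yi in zip(x, y)) & 1

def hamming_set(c, cp):
    # Delta(c, c') as the set of (1-indexed) positions where c and c' differ;
    # its cardinality is the Hamming distance
    return {i + 1 for i, (ci, cpi) in enumerate(zip(c, cp)) if ci != cpi}

def substring(b, X):
    # b_X: the substring of b restricted to the positions in X
    return [b[i - 1] for i in sorted(X)]
```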
2.1 Bit Commitment
A bit commitment scheme allows Alice to send a piece of evidence to Bob that she has a bit x ∈ {0, 1} in mind. Given what he receives, Bob cannot tell what x

¹ If b, c, c′ ∈ {0, 1}^n are any n-bit strings then b_{∆(c,c′)} ∈ {0, 1}^{#∆(c,c′)} is the substring of b restricted to the positions where c and c′ differ.
is. This phase of the bit commitment scheme is called the committing phase. After a while, Bob can ask Alice to unveil x in such a way that it is not possible for her to unveil 1 − x without being detected. This phase is called the opening phase. The security of such a scheme is captured by the following definition:

Definition 1. A bit commitment scheme is
– statistically concealing if the information V the receiver gets about the committed bit x ∈ {0, 1} after the committing phase (and before opening) is such that I(x|V) ≤ 2^{−αN} for some α > 0 and N a security parameter,
– δ-binding, for 0 < δ < 1, if after the execution of the committing phase there exists x̃ ∈ {0, 1} such that the probability to unveil x̃ with success is less than δ,
– δ-secure if it is both concealing and δ-binding.

In this paper we are concerned with a slightly weaker form of the binding property than what is usually considered. Namely, we allow the sender to change her mind with some bounded probability of success δ. Nevertheless, a δ-secure bit commitment scheme is sufficient for secure quantum oblivious transfer [35,5]. Mayers' theorem shows how to break any concealing commitment by constructing an attack allowing any bit to be revealed with probability of success almost 1. The attack also applies to concealing but δ-binding commitment schemes whenever δ < 1 [28,29].
2.2 Quantum Coding
The essential quantum ingredient is the BB84 coding scheme [3]. In order to transmit the bit b = 0, one of the two non-orthogonal quantum states |0⟩_+ = (1, 0)^T and |0⟩_× = (1/√2)(1, 1)^T is chosen and sent through the quantum channel.² For the transmission of b = 1, the two non-orthogonal quantum states are |1⟩_+ = (0, 1)^T and |1⟩_× = (1/√2)(1, −1)^T. If for transmitting b ∈ {0, 1} the quantum state |b⟩_+ is chosen then we say that b is transmitted in the rectilinear basis "+". If b is encoded in |b⟩_× we say that b is transmitted in the diagonal basis "×". Let ρ_b be the quantum mixture associated with the transmission of bit b in a random basis θ ∈_R {+, ×}. Let {γ_0, γ_1} be the unit vectors of the Breidbart basis (i.e. γ_0 = (cos π/8, sin π/8) and γ_1 = (−sin π/8, cos π/8)). We have, for any b ∈ {0, 1}, that (see [10] for more information)

ρ_b = cos²(π/8) |γ_b⟩⟨γ_b| + sin²(π/8) |γ_{1−b}⟩⟨γ_{1−b}|.   (1)

Equation 1 stands as long as the coding basis θ is random and independent. One interpretation of equation 1 is that the BB84 coding scheme is inherently ambiguous. Given any outcome of any quantum measurement, the transmitted

² Notation |b⟩ for b ∈ {0, 1} means |b⟩_+, i.e., the computational basis.
bit b cannot be known with probability better than cos²(π/8). The intrinsic (Von Neumann) entropy H^{VN}(ρ_b) about b ∈ {0, 1} is

H^{VN}(ρ_b) = H(cos²(π/8), sin²(π/8)) ≥ 0.4157611883.   (2)

No possible outcome of any measurement can give more information about b than 1 − H^{VN}(ρ_b), simply because the quantum state does not carry more information than that. For any X ⊆ {1, …, n} and b ∈ {0, 1}^n we define ρ_X(b) = ⊗_{i∈X} ρ_{b_i} as the density matrix associated with b_X when b is transmitted according to the BB84 coding scheme. As for equation 2 we have that

H^{VN}(ρ_X(b)) = #X · H(cos²(π/8), sin²(π/8)) ≥ 0.4157611883 · #X.   (3)
In addition, since ρ_+ = ρ_×, it follows that no outcome of any measurement gives information about the transmission basis.
2.3 Generalized Measurements
It is shown in [27] (see also section 2.2 of [33]) that any possible measurement can be represented by a single IPP (Inner Product Preserving) transformation from the initial space of states to a larger space of states, followed by an ordinary von Neumann measurement on the latter. An m-outcome generalized measurement on a space V is described by m operators M_k : V → W_k, k = 1, …, m, such that if the initial state is |φ⟩ and the observed classical outcome is k then the state after the measurement, up to normalization, is M_k|φ⟩. The probability to observe k when |φ⟩ is transmitted is ‖M_k|φ⟩‖². The operator M_k is IPP if it is represented as a matrix of orthonormal columns. An IPP operator M_k for the measurement of an n-qubit system has 2^n columns. The value ⟨k⟩ is called the classical outcome for M_k. From M_k, we define the column vector Φ^θ(⟨k⟩|b) = M_k|b⟩_θ containing the transition amplitudes from state |b⟩_θ to any of the final states in M_k. The probability of observing ⟨k⟩ when |b⟩_θ is the initial state is ‖Φ^θ(⟨k⟩|b)‖². If the measurement is complete then M_k is one-dimensional and Φ^θ(⟨k⟩|b) is not a vector but a complex number. We use the IPP representation because, as in [30], we want to analyze measurements acting on a fixed number n of qubits independently of the degree of freedom provided by appending an ancilla to the system (unlike the POVM model). When we say that a measurement is n-coherent, we mean that it measures a quantum state of dimension 2^n regardless of the dimension of the ancilla.
3 The Protocol
The protocol we describe works on the same principles as BCJL [10]. The main difference is the direction of the quantum transmission that allows Alice to commit. For this reason our scheme is called LJCB. Unlike the BCJL scheme, the commitment is made by choosing how to measure the received qubits. The
commitment is initiated by Bob, who sends to Alice N qubits in state |b⟩_θ for b ∈_R {0, 1}^N and θ ∈_R {+, ×}^N. For each qubit she receives, one of the two incompatible von Neumann measurements + and × is chosen and the result is announced to Bob. Since the two measurements are incompatible, even knowing the outcome does not reveal all the information about which one has actually been performed. Let C be an error-correcting code of length N, dimension k and minimum distance d. The code C does not need to have an efficient decoding algorithm. In order to commit (see protocol 1), Alice picks c ∈_R C, measures the i-th photon π_i with the von Neumann measurement {+, ×}_{c_i} and announces the classical outcome β_i ∈ {0, 1}. Alice also chooses and announces a random r ∈ {0, 1}^N subject to r ⊙ c = x. This completes the committing phase. In order to open x (see protocol 2), Alice simply announces c and x, allowing Bob to verify (for each π_i) that whenever she measured in the basis he had chosen, the announced outcome corresponds to the bit originally sent. In this paper, we assume a noiseless quantum channel, allowing Bob to reject Alice's commitment as soon as one position i is found such that θ_i = c_i but b_i ≠ β_i. The case of a noisy quantum channel will be addressed in the final version.

Protocol 1 (commit(x))
1: Bob picks and announces a random boolean generating matrix G for a linear [N, k, d]-code C, with N and k chosen according to theorem 3,
2: Alice picks m ∈_R {0, 1}^k, sets c = G · m and picks r ∈_R {0, 1}^N subject to c ⊙ r = x. Alice announces r to Bob,
3: Bob chooses randomly b ∈_R {0, 1}^N and θ ∈_R {+, ×}^N,
4: For i = 1, …, N:
   – Bob sends a photon π_i in polarization state |b_i⟩_{θ_i},
   – Alice measures π_i in basis {+, ×}_{c_i} and obtains the classical outcome β_i ∈ {0, 1},
5: Alice announces β = β_1, …, β_N to Bob.

Protocol 2 (open(r, β, θ, b)(c, x))
1: Alice announces c and x to Bob,
2: Bob accepts if and only if
   1. c ∈ C,
   2. (∀i ∈ {1, …, N})[θ_i = c_i ⇒ b_i = β_i], and
   3. x = c ⊙ r.
4 Tools
In this section, we give general properties applicable to any quantum measurement Alice may apply when she commits and opens the bit x. These properties are tools that will be used to deal with Alice's general strategy against the binding condition.
When Alice commits, she measures the initial state |b⟩_θ in order to get the classical outcome ⟨r, β⟩. When she opens x, she refines her measurement and gets the final classical outcome ⟨r, β, c⟩. The bit x need not appear in the final outcome description since it is uniquely defined as c ⊙ r. It is convenient to write ⟨r, β, v⟩ to represent a partial outcome with an extra piece of information v ∈ V for an arbitrary set V. The extra outcome v will be used in section 5 to model successive steps in Alice's opening strategy. The final outcome ⟨r, β, c⟩ is accepted by Bob if and only if c ∈ C and the string b ∈ {0, 1}^N is in the set S(β, c, θ) = {b ∈ {0, 1}^n | (∀i ∈ {1, …, n})[θ_i = c_i ⇒ b_i = β_i]}. The following definition characterizes the partial outcomes ⟨r, β, v⟩ that allow the codeword c to be announced safely.

Definition 2. A partial result with classical outcome ⟨r, β, v⟩ is (θ, c, p)-safe if, for 1/2 < p < 1, θ ∈ {+, ×}^n and c ∈ {0, 1}^n, we have

P(b ∈ S(β, c, θ) | β = β ∧ θ = θ ∧ v = v) ≥ p.   (4)
We also say that ⟨r, β, v⟩ is (c, p, q)-safe if there exists a subset Θ ⊆ {+, ×}^n such that #Θ/2^n ≥ q and for each θ ∈ Θ the partial outcome ⟨r, β, v⟩ is (θ, c, p)-safe.
Suppose the result ⟨r, β, v⟩ is (θ, c, p)-safe. The IPP operator implementing the measurement that produces ⟨r, β, v⟩ can be written in terms of transition amplitudes via the following identity (see section 2.3):

P(b ∈ S(β, c, θ) | ⟨r, β, v⟩ ∧ θ = θ) = Σ_{b∈S(β,c,θ)} ‖Φ^θ(⟨r, β, v⟩|b)‖² / Σ_{b∈{0,1}^n} ‖Φ^θ(⟨r, β, v⟩|b)‖².

This allows us to rewrite equation 4 as

Σ_{b∈S(β,c,θ)} ‖Φ^θ(⟨r, β, v⟩|b)‖² ≥ p · Σ_{b∈{0,1}^n} ‖Φ^θ(⟨r, β, v⟩|b)‖².   (5)
If ⟨r, β, v⟩ is (c, p, q)-safe then there exists Θ ⊆ {+, ×}^n such that #Θ/2^n ≥ q and equation 5 holds for all θ ∈ Θ. In section 5, we shall see that the next definition characterizes the partial outcomes of n-coherent measurements that Alice needs in order to attack the binding condition of LJCB. Lemma 1 will then put restrictions on what Alice can physically achieve.

Definition 3. Let θ ∈ {+, ×}^n and r, β, c, c′ ∈ {0, 1}^n. A partial result with classical outcome ⟨r, β, v⟩ is (θ, c, c′, p)-promising if ⟨r, β, v⟩ is both (θ, c, p)-safe and (θ, c′, p)-safe. We also say that ⟨r, β, v⟩ is (c, c′, p, q)-promising if there exists a subset Θ ⊆ {+, ×}^n such that #Θ/2^n ≥ q and for each θ ∈ Θ the partial outcome ⟨r, β, v⟩ is (θ, c, c′, p)-promising.

Let S(β, c, c′, θ) = S(β, c, θ) ∩ S(β, c′, θ) be the set of initial strings b ∈ {0, 1}^n such that, from ⟨r, β⟩, both c and c′ can be announced without error. Using equation 5, we easily get that if ⟨r, β, v⟩ is (θ, c, c′, p)-promising then

Σ_{b∈S(β,c,c′,θ)} ‖Φ^θ(⟨r, β, v⟩|b)‖² ≥ (2p − 1) · Σ_{b∈{0,1}^n} ‖Φ^θ(⟨r, β, v⟩|b)‖².   (6)
The next lemma shows that promising partial results do not always exist.
Lemma 1. Let c, c′ ∈ {0, 1}^n be such that #∆(c, c′) ∈ Ω(n) and let r, β ∈ {0, 1}^n. Then there exists no (c, c′, p, q)-promising partial result with classical outcome ⟨r, β, v⟩ whenever q(2p − 1) ≥ p_max = 0.586.

Proof. Let Θ ⊆ {+, ×}^n be a set of bases such that #Θ/2^n ≥ q and for all θ ∈ Θ the partial outcome ⟨r, β, v⟩ is (θ, c, c′, p)-promising. From equation 6, for all θ ∈ Θ,

P(b ∈ S(β, c, c′, θ) | ⟨r, β, v⟩ is (θ, c, c′, p)-promising) ≥ 2p − 1.

By construction we also have that P(b_{∆(c,c′)} = β_{∆(c,c′)} | b ∈ S(β, c, c′, θ)) = 1. It follows that

P(b_{∆(c,c′)} = β_{∆(c,c′)} | ⟨r, β, v⟩ is (θ, c, c′, p)-promising) ≥ 2p − 1.   (7)

Since no measurement outcome gives information about the transmission basis, we have that P(θ ∈ Θ | ⟨r, β⟩) ≥ q. It follows from Bayes' law that

P(b_{∆(c,c′)} = β_{∆(c,c′)} | ⟨r, β, v⟩ is (c, c′, p, q)-promising) ≥ q(2p − 1).

The amount of uncertainty about b_{∆(c,c′)} can therefore be upper bounded as follows:

H(b_{∆(c,c′)} | ⟨r, β, v⟩ is (c, c′, p, q)-promising)
  ≤ H(q(2p − 1), (1 − q(2p − 1))/(2^{#∆(c,c′)} − 1), …, (1 − q(2p − 1))/(2^{#∆(c,c′)} − 1))   [with 2^{#∆(c,c′)} − 1 equal terms]
  ≤ H(q(2p − 1), 1 − q(2p − 1)) + (1 − q(2p − 1)) · #∆(c, c′).

The above bound contradicts the lower bound expressed in equation 3, since q(2p − 1) ≥ p_max implies

H(b_{∆(c,c′)} | ⟨r, β, v⟩ is (c, c′, p, q)-promising) ≤ H(0.586, 0.414) + 0.414 · #∆(c, c′) ≤ H^{VN}(ρ_{∆(c,c′)}(b))

when #∆(c, c′) ∈ Ω(n) is large enough.  □
In other words, any outcome ⟨r, β, v⟩ that is (c, c′, p, q)-promising for p and q such that q(2p − 1) ≥ p_max conveys more information about b_{∆(c,c′)} than is allowed by equation 3. This holds regardless of the extra outcome v.
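The constants in Lemma 1 can be checked numerically. The sketch below computes how large #∆(c, c′) must be before the upper bound H(0.586, 0.414) + 0.414 · #∆(c, c′) drops below the Von Neumann lower bound 0.4157611883 · #∆(c, c′) of equation 3; the resulting crossover value is our own computation, not a figure stated in the text.

```python
import math

def h(p):
    # binary Shannon entropy H(p, 1 - p)
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

p_max = 0.586
upper = lambda d: h(p_max) + (1 - p_max) * d   # bound from the proof of Lemma 1
lower = lambda d: 0.4157611883 * d             # Von Neumann bound of equation 3
# smallest #Delta(c, c') at which the contradiction kicks in
threshold = next(d for d in range(1, 10_000) if upper(d) < lower(d))
```

Since the per-position gap 0.4157611883 − 0.414 is small, "large enough" here means a few hundred differing positions.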
5 The Binding Condition

5.1 n-Coherent Opening Strategies
Alice's opening strategies are of the following form:
– During the committing phase, Alice incompletely measures the N qubits in order to get the partial outcome ⟨r, β⟩ for r ∈ {0, 1}^N and β ∈ {0, 1}^N. She announces r and β to Bob.
– During the opening phase, Alice completes her previous measurement according to the bit x she wants to unveil. The outcome of the refinement is a codeword c ∈ C and the unveiled bit is x = c ⊙ r. The final and complete outcome ⟨r, β, c⟩ allows Bob to learn x.
An opening strategy is n-coherent if all measurements performed by Alice during both phases are n-coherent. Unlike fully coherent strategies, an n-coherent strategy is made out of t ≥ ⌈N/n⌉ measurements that depend only classically upon each other. Each possible measurement must be expressible as an IPP operator with no more than 2^n columns. However, the description of each IPP operator may depend upon partial outcomes obtained from previous measurements and can therefore change dynamically as the opening strategy evolves. In order to model arbitrary n-coherent opening strategies, it is convenient to use a tree structure T_N^n. Each node in T_N^n represents the current state and the next measurement to be applied. The relevant operations are quantum measurements and classical announcements. For the sake of simplicity, we only represent in T_N^n the opening part of Alice's strategy. In other words, the root of T_N^n represents the first refinement Alice applies from the partial outcome ⟨r, β⟩ when the opening phase is initiated. We require that the measurements along any path P of T_N^n can be expressed as a set of measurements M_P = {M_1, …, M_t} where each M ∈ M_P is an IPP operator of at most 2^n columns acting on a subset B ⊆ {1, …, N} of the received qubits. Without loss of generality we assume that all announcements are made at the very end of the measurement process, i.e. they are leaves of T_N^n.
We also assume that each internal node defines a binary-outcome refinement. The outgoing edges are labelled according to the possible outcomes and lead to the next node. At the end of each path, a final announcement c ∈ C is made. Each path P in T_N^n defines a complete final outcome ⟨r, β, c⟩ which is the concatenation of all t measurement outcomes defined along P. Since each measurement M_i ∈ M_P is applied to a block B_i ⊆ {1, …, N} of at most n qubits, P defines a partition B = {B_1, …, B_t} such that for all i ∈ {1, …, t}, #B_i ≤ n. Each measurement M_i may act coherently on photons {π_j}_{j∈B_i}. We call B the block decomposition of P and each B ∈ B is called a block. The partial and final outcomes for a measurement M ∈ M_P acting on block B ∈ B are denoted by ⟨r, β⟩_B = ⟨r_B, β_B⟩ and ⟨r, β, c⟩_B = ⟨r_B, β_B, c_B⟩ respectively. It is also convenient to define the block decomposition B(d) at node d, which is the block decomposition for measurements along the path from the root to node d. Once the measurement in node d is completed during the execution of T_N^n with root ⟨r, β⟩, Alice gets the partial outcome ⟨r, β, v(d)⟩ where v(d) represents
Quantum Bit Commitment From a Physical Assumption
the composite partial outcome (or view) for refinements down to d. We denote the final outcome by ⟨r, β, c⟩ with c ∈ C, dropping the irrelevant auxiliary view v(d). Let d′ be a node in T_N^n reachable from d. We write ⟨r, β, v(d)⟩ →^u ⟨r, β, v(d′)⟩ if the probability to go from d to d′ in T_N^n is at least u. We write ⟨r, β, v(d)⟩ → ⟨r, β, v(d′)⟩ to indicate that the probability of transition from d to d′ is nonzero. We denote by L(T_N^n, s) the set of nodes at level s in T_N^n.

Definition 4. Let T_N^n be an n-coherent opening strategy from partial outcome ⟨r, β⟩. We say that T_N^n is (u, γ)-successful if there exists C* ⊆ C such that P(c ∈ C* | ⟨r, β⟩ ∧ T_N^n) ≥ u and, for all c ∈ C*, P(b ∈ S(θ, β, c) | ⟨r, β⟩ ∧ T_N^n) ≥ γ. Similarly, a node d in T_N^n is said to be (u, γ)-successful if the subtree T_N^n(d) of T_N^n is (u, γ)-successful.

The next lemma gives some simple properties that any n-coherent opening strategy T_N^n must have. The proof is omitted but follows easily from Definition 4 and the above discussion.

Lemma 2. Let T_N^n be an n-coherent opening strategy with root ⟨r, β⟩. Let γ = 1 − (1 − ϱ)(1 − q) for 0 < ϱ, q < 1 and let l > 0 be an integer. Let d ∈ T_N^n and t = ⌈N/n⌉. The following holds:
1. If d′ is a son of d in T_N^n then #(B(d′) ∩ B(d)) ≥ t − 1,
2. If ⟨r, β, v(d)⟩ is both (c, ϱ, q)-safe and (c′, ϱ, q)-safe then ⟨r, β, v(d)⟩ is (c, c′, ϱ, 2q − 1)-promising,
3. If for B ∈ B(d), ⟨r, β, v(d)⟩_B →^u ⟨r, β, c⟩_B and ⟨r, β, c⟩_B is (c, ϱ, q)-safe then ⟨r, β, v(d)⟩_B is (c, ϱu, q)-safe,
4. If ⟨r, β, v(d)⟩ →^{ϱ^l} ⟨r, β, c⟩ then (∃B̂(d) ⊆ B(d))(∀B ∈ B̂(d))[⟨r, β, v(d)⟩_B →^ϱ ⟨r, β, c⟩_B ∧ #B̂(d) ≥ t − l],
5. If P(b ∈ S(θ, β, c) | ⟨r, β, c⟩) ≥ γ^l then (∃D̂(d) ⊆ B(d))(∀B ∈ D̂(d))[⟨r, β, c⟩_B is (c, ϱ, q)-safe ∧ #D̂(d) ≥ t − l].

5.2 LJCB is Binding
In this section we prove that whenever n is small with respect to C’s minimum distance d, Alice cannot change her mind with arbitrarily good probability of success. The smaller n is compared to d, the better the probability is for Bob to detect Alice changing her mind. The next lemma shows that any successful strategy allows Alice to unveil only one c ∈ C with good probability of success. The binding condition will then follow.

Lemma 3. Let ϱ = 0.93, γ = 0.9937 and n ≤ d/(4l + 5). If T_N^n is a (ϱ^l, γ^l)-successful n-coherent opening strategy from partial outcome ⟨r, β⟩ then the following predicate holds:

H(s) ≡ [(∀d ∈ L(T_N^n, s))(∃!c* ∈ C)[⟨r, β, v(d)⟩ → ⟨r, β, c*⟩ ∧ P(b ∈ S(θ, c*, β) | ⟨r, β, c*⟩) ≥ γ^l]].
Proof. Let q = 0.91 be such that γ = 1 − (1 − ϱ)(1 − q). Let t = ⌈N/n⌉ be a lower bound on the number of n-coherent measurements. The proof proceeds by mathematical induction. In the first place, it is easy to see that H(0) holds since all nodes at level 0 are announcements. Second, assuming H(s) holds, we show that H(s + 1) also holds. Let d ∈ L(T_N^n, s + 1). Let d_0 and d_1 be the left and right sons of d respectively. If T_N^n(d_0) or T_N^n(d_1) is not (ϱ^l, γ^l)-successful then H(s + 1) follows directly from H(s). Now suppose both T_N^n(d_0) and T_N^n(d_1) are (ϱ^l, γ^l)-successful. By the induction hypothesis, T_N^n(d_0) and T_N^n(d_1) are such that ⟨r, β, v(d_0)⟩ →^{ϱ^l} ⟨r, β, c_0⟩ and ⟨r, β, v(d_1)⟩ →^{ϱ^l} ⟨r, β, c_1⟩ respectively, for c_0, c_1 ∈ C. If c_0 = c_1 then H(s + 1) follows from H(s). Assume for a contradiction that c_0 ≠ c_1. Let B̂(d_0) and B̂(d_1) be defined according to Lemma 2-4). We have that for all w ∈ {0, 1}, #B̂(d_w) ≥ t − l. Let D̂(d_0) and D̂(d_1) be defined as in Lemma 2-5), ensuring that for all w ∈ {0, 1}, #D̂(d_w) ≥ t − l. Let Γ^w = B̂(d_w) ∩ D̂(d_w) ∩ B(d) be the set of blocks B ∈ B(d) such that ⟨r, β, v(d_w)⟩_B →^ϱ ⟨r, β, c_w⟩_B and ⟨r, β, c_w⟩_B is (c_B^w, ϱ, q)-safe. From property 2-1), we get that #Γ^w ≥ t − 2(l + 1) and, from Lemma 2-3), all B ∈ Γ^w are such that ⟨r, β, v(d_w)⟩_B is (c_B^w, ϱ², q)-safe. Let Γ^{0,1} = Γ^0 ∩ Γ^1 be the set of blocks B ∈ B(d) such that ⟨r, β, v(d)⟩_B is (c_B^0, c_B^1, ϱ², 2q − 1)-promising. Since both #Γ^0 and #Γ^1 are greater than t − 2(l + 1), it follows that #Γ^{0,1} ≥ t − 4(l + 1). Let B_∆ = {B ∈ B(d) | ∆(c_B^0, c_B^1) ∈ Ω(n)} and note that #B_∆ ≥ 4l + 5 from the fact that n ≤ d/(4l + 5). From Lemma 2-2), all B ∈ Γ_∆ = (Γ^0 ∩ Γ^1) ∩ B_∆ are such that ⟨r, β, v(d)⟩_B is (c_B^0, c_B^1, ϱ², 2q − 1)-promising in addition to ∆(c_B^0, c_B^1) ∈ Ω(n). To get a contradiction, it suffices to show that Γ_∆ is not empty, since any B ∈ Γ_∆ is such that ⟨r, β, v(d)⟩_B is (c_B^0, c_B^1, ϱ², 2q − 1)-promising, contradicting Lemma 1 since (2ϱ² − 1)(2q − 1) > p_max. By the pigeonhole principle, since #(B(d) \ Γ^{0,1}) ≤ 4l + 4 and #B_∆ ≥ 4l + 5, there must exist a block B ∈ B_∆ that is also in Γ^{0,1}, and therefore #Γ_∆ ≥ 1. We must conclude that c_0 ≠ c_1 is impossible, and H(s + 1) follows. □

The next theorem uses Lemma 3 in order to conclude that LJCB is δ-binding for any δ > 0 against all n-coherent opening strategies for some n ∈ Ω(N).

Theorem 1. Let N be the number of BB84 qubits transmitted. Let l > 0 be an integer. Let d ∈ Ω(N) be C’s minimum distance. Protocol LJCB is δ(l)-binding against any n-coherent opening strategy for δ(l) = γ^l + ϱ^l provided n ≤ d/(4l + 5) and γ, ϱ are defined as in Lemma 3.
Proof. Assume Alice can open any x ∈ {0, 1} with an appropriate n-coherent opening strategy T_N^n(x). The trees T_N^n(0) and T_N^n(1) cannot both be (ϱ^l, γ^l)-successful, since otherwise the tree T_N^n with T_N^n(0) and T_N^n(1) as left and right subtrees respectively would also be (ϱ^l, γ^l)-successful. By construction, T_N^n would have two codewords c_0 ≠ c_1 such that for all x ∈ {0, 1}, ⟨r, β⟩ → ⟨r, β, c_x⟩ and P(b ∈ S(θ, c_x, β) | ⟨r, β, c_x⟩) ≥ γ^l, contradicting Lemma 3. It follows that there exists x̃ ∈ {0, 1} having probability less than δ(l) ≤ (1 − ϱ^l)γ^l + ϱ^l ≤ γ^l + ϱ^l of being unveiled with success. □
6 The Concealing Condition
In this section we show how to choose the code C such that Bob gets almost no Shannon information about the committed bit x. The technique is similar to the one introduced in [10] to deal with the concealing condition of BCJL. Here, we sketch the proof that LJCB is concealing along the same lines. We first define the density matrix ρ_c that characterizes Bob’s view about c ∈ C given the announcement ⟨r, β⟩. We then show that Bob’s view about c is equivalent to receiving c through a noisy channel. This is done by introducing a fictitious protocol used by Alice to send c ∈ C in such a way that Bob gets the same view as after the committing phase of LJCB. We finally show, using privacy amplification techniques [4,12], that the fictitious protocol conceals x, and therefore so does LJCB.

The most general attack for Bob is to prepare a quantum system initially in pure state |ψ⟩ ∈ H_2^N ⊗ H_B, where H_2^N is the Hilbert space of N qubits and H_B is an auxiliary Hilbert space helping Bob in his quest for x. The quantum state |ψ⟩ can be written, for some I ∈ ℕ, as |ψ⟩ = Σ_{1≤i≤I} a_i |ψ_i^A⟩ ⊗ |ψ_i^B⟩ where |ψ_i^A⟩ ∈ H_2^N, |ψ_i^B⟩ ∈ H_B and the a_i’s are complex numbers such that Σ_i |a_i|² = 1. We do not require the |ψ_i^A⟩’s (resp. |ψ_i^B⟩’s) to be orthogonal. Bob then sends ρ_A = Tr_{H_B}(|ψ⟩⟨ψ|) to Alice and keeps the remaining part ρ_B = Tr_{H_2^N}(|ψ⟩⟨ψ|) for later use. Once β, r ∈ {0, 1}^N have been announced, Bob determines a unitary transformation U(r, β) which he applies to ρ_B. The strategy is completed after a standard measurement M is finally applied to the final state U(r, β) ρ_B U(r, β)†.

First, we show that Bob has no advantage in preparing ρ_A in a mixed state. Consider that, instead of preparing state |ψ⟩ as described above, Bob follows the procedure Simulate(ψ) defined as:
1. Bob picks i ∈ {1, …, I} with probability |a_i|²,
2. Bob sends Alice the quantum state |ψ_i^A⟩ and keeps |ψ_i^B⟩ for later,
3. Bob waits for r, β ∈ {0, 1}^N and applies |ψ̂_i^B⟩ = U(r, β)|ψ_i^B⟩,
4. Bob measures |ψ̂_i^B⟩ with measurement M.
The above procedure gives exactly the same view as what Bob would get if he had prepared the entangled state |ψ⟩, since |ψ⟩ is a purification of Simulate(ψ) [18]. The density matrices for Alice’s and Bob’s systems before M is applied are identical in both cases. It follows that M behaves the same way in both scenarios and therefore, if the initial preparation |ψ⟩ helps Bob in cheating, then so does Simulate(ψ). By the same argument, each qubit π_i can be assumed to be in a pure state |φ_i⟩ ∈ H_2, allowing us to restrict the analysis to the strategy in which Bob sends N qubits in state ⊗_{i=1}^N |φ_i⟩.

Let Bob’s qubits π_i, for i ∈ {1, …, N}, be in quantum state |φ_i⟩ = cos α_i |0⟩ + sin α_i |1⟩ where α_i is an arbitrary angle. For m, w ∈ {0, 1}, let p_{wm}(α_i) be the probability that Alice observes the classical outcome m whenever |φ_i⟩ is measured in basis {+, ×}^w. We have that p_{00}(α_i) = cos²α_i, p_{01}(α_i) = sin²α_i, p_{10}(α_i) = (cos α_i + sin α_i)²/2 and p_{11}(α_i) = (sin α_i − cos α_i)²/2.
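As a numerical sanity check (a sketch added here, not part of the paper), the four probabilities above can be tabulated for an arbitrary angle; for each basis choice w the two outcome probabilities must sum to 1.

```python
import math

def p(w, m, alpha):
    """Probability that measuring |phi> = cos(alpha)|0> + sin(alpha)|1>
    in basis {+, x}^w yields the classical outcome m (formulas p_{wm}
    from the text)."""
    if w == 0:  # rectilinear basis {|0>, |1>}
        return math.cos(alpha) ** 2 if m == 0 else math.sin(alpha) ** 2
    # diagonal basis: |0_x> = (|0>+|1>)/sqrt(2), |1_x> = (|0>-|1>)/sqrt(2)
    if m == 0:
        return (math.cos(alpha) + math.sin(alpha)) ** 2 / 2
    return (math.sin(alpha) - math.cos(alpha)) ** 2 / 2

alpha = 0.3  # arbitrary polarization angle
for w in (0, 1):
    total = p(w, 0, alpha) + p(w, 1, alpha)
    print(w, round(total, 12))  # each basis gives a proper distribution
```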
Let ρ_i^{c_i} be the density matrix describing what Bob gets when Alice chooses to measure π_i in basis {+, ×}^{c_i}:

ρ_i^{c_i}(α_i) = p_{c_i 0}(α_i)|0⟩⟨0| + p_{c_i 1}(α_i)|1⟩⟨1|.   (8)

The density matrix ρ_x(α) associated with the commitment of bit x, given the polarization angles α = α_1, α_2, …, α_N, is such that (see [10] for details)

ρ_x(α) = Σ_{c∈C | c⊙r=x} 2^{−k+1} ⊗_{i=1}^{N} ρ_i^{c_i}(α_i).
Consider the following fictitious protocol for transmitting c ∈_R C from Alice to Bob. It is easy to verify that the density matrix ρ̂_x(α), corresponding to the transmission of a random codeword from C in fictitious(x), satisfies ρ̂_x(α) = ρ_x(α).

Protocol 3 (fictitious(x))
1: Alice chooses c ∈_R C,
2: For each i ∈ {1, …, N}, Alice sends to Bob a photon π_i in state:
– If c_i = 0 then she sends |0⟩ with probability p_{00}(α_i) and |1⟩ with probability p_{01}(α_i),
– If c_i = 1 then she sends |0⟩ with probability p_{10}(α_i) and |1⟩ with probability p_{11}(α_i).
3: Alice announces a random r ∈ {0, 1}^N such that c ⊙ r = x.
Protocol fictitious(x) does not require the transmission of qubits. Classical communication is enough since only orthogonal states are sent. Given α, Bob’s view about c in LJCB is the same as if c was sent through a classical noisy channel. Let ω_i be the bit received by Bob in the i-th transmission. In general, for any c, w ∈ {0, 1} and any actual view V_i up to the i-th transmission, we have

P(c_i = c | ω_i = w ∧ α_i ∧ V_i) = P(c_i = c | V_i) p_{cw}(α_i) / (P(c_i = 0 | V_i) p_{0w}(α_i) + P(c_i = 1 | V_i) p_{1w}(α_i)).   (9)

An easy calculation shows that for any actual view V_i, the best choice for α_i is α_i = vπ/4 for some v ∈ ℕ. Whenever P(c_i = c | V_i) = 1/2, any α_i = vπ/4 for v ∈ ℕ works equally well. In order to simplify the analysis, we assume that C is an [N, k] systematic random code. This ensures that for all i ∈ {1, …, k}, P(c_i = c | V_i) = 1/2, allowing us to set α_i = 0 without loss of generality. In addition, we also assume that the redundancy part c̃ ∈ {0, 1}^{N−k} of c ∈ C is sent perfectly to Bob. This new procedure is called fictitious*(x) and is identical to Protocol 3 except that C is systematic and only the message part m ∈ {0, 1}^k of a codeword c is sent imperfectly. Obviously, if Bob does not get much information when c is sent according to fictitious*(x), then he gets no more information when c is received according to fictitious(x).
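The Bayes update of equation (9) can be sketched numerically (an illustration added here, using the outcome distributions p_{wm} from the text). With α_i = 0 and a uniform prior, the outcome ω_i = 0 leaves Bob with P(c_i = 1) = 1/3, which is the bound of equation (10) below.

```python
import math

def p(basis, outcome, alpha):
    # outcome distributions p_{wm}(alpha) from the text
    if basis == 0:
        return math.cos(alpha) ** 2 if outcome == 0 else math.sin(alpha) ** 2
    if outcome == 0:
        return (math.cos(alpha) + math.sin(alpha)) ** 2 / 2
    return (math.sin(alpha) - math.cos(alpha)) ** 2 / 2

def posterior(c, w, alpha, prior0=0.5):
    """Equation (9): P(c_i = c | omega_i = w), Alice measuring in basis c_i."""
    prior = {0: prior0, 1: 1.0 - prior0}
    num = prior[c] * p(c, w, alpha)
    den = prior[0] * p(0, w, alpha) + prior[1] * p(1, w, alpha)
    return num / den

print(posterior(1, 0, 0.0))  # 1/3: Bob's residual uncertainty, eq. (10)
print(posterior(0, 0, 0.0))  # 2/3
```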
The first step consists of finding a lower bound on Bob’s Rényi (or collision) entropy about c before Alice announces the redundancy part c̃ and r ∈ {0, 1}^N in fictitious*(x). Setting α_i = 0 in equation (9) gives that for all c ∈ {0, 1}:

P(c_i = c | ω_i = 0 ∧ V_i) ≥ 1/3.   (10)

The subset of positions J ⊆ {i | ω_i = 0} is, except with negligible probability 2^{−λ²k}, such that #J ≥ P(ω_i = 0 | V_i)k − λk = (3/4 − λ)k. Bob’s Rényi entropy R(c|V), given the view V = ∪_{1≤i≤k} V_i after the transmission of the k message bits of c, is such that

R(c|V) ≥ −(3/4 − λ)k lg(5/9) = 0.848(3/4 − λ)k.   (11)

Next, Bob learns perfectly N − k parity bits about c_J. The situation is identical to receiving the bits in c_J over a binary symmetric channel with error probability 1/3 plus u = N − k parity bits. This situation has been analyzed extensively in [12]. It is shown that, except with probability 2^{−λk}, the Rényi entropy R(c | V ∧ U = U), given the complete view V and the parity bits U, satisfies:

R(c | V ∧ U = U) ≥ R(c|V) − 2u − 2λk ≥ 2.63k − 2N − 3λk.   (12)
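The numerical constants in (11) and (12) can be checked directly (a quick sketch added here, not part of the proof): a bit known with probability at most 2/3 has collision probability at most (1/3)² + (2/3)² = 5/9, hence per-bit Rényi entropy −lg(5/9) ≈ 0.848; substituting u = N − k into (12) with λ → 0 yields the 2.63 coefficient.

```python
import math

# Per-bit collision probability when P(c_i = c) <= 2/3 (equation 10):
pc = (1 / 3) ** 2 + (2 / 3) ** 2       # = 5/9
renyi_bit = -math.log2(pc)             # -lg(5/9), the 0.848 of (11)
print(round(renyi_bit, 3))             # 0.848

# Coefficient of k in (12): 0.848 * (3/4) + 2 = 2.636, truncated to
# 2.63 in the text (from R(c|V) - 2u with u = N - k and lambda -> 0).
print(round(renyi_bit * 0.75 + 2, 3))  # 2.636
```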
Equation (12) and the privacy amplification theorem (PAT) of [4] allow us to conclude that the committed bit x = c ⊙ r is statistically hidden from Bob.

Theorem 2. There exists λ̂ > 0 such that, except with negligible probability, the information Bob gets about x after the commit phase of LJCB is less than 2^{−λ̂N} provided k/N ≥ 0.77.

Proof sketch. According to the PAT [4], the amount of Shannon information I(x | V ∧ U = U ∧ r = r) about x after the execution of fictitious*(x) is such that I(x | V ∧ U = U ∧ r = r) ≤ 2^{−R(c | U = U ∧ V) + 1} / ln 2. Plugging k/N ≥ 0.77 and setting λ small enough in equation (12) gives I(x | V ∧ U = U ∧ r = r) ≤ 2^{−λ̂N} for some λ̂ > 0. This also holds for LJCB since fictitious*(x) always gives more information about x. □
7 Conclusion
Theorems 1 and 2 ensure that LJCB can be tuned to provide both the binding and the concealing conditions. Using the Gilbert–Varshamov bound (GVB) on random binary codes allows us to conclude that the same tuning can satisfy both conditions simultaneously. According to the GVB [23], a random N × k matrix with k/N = 0.77 defines an [N, 0.77N, 0.035N] code except with negligible probability. Theorems 1 and 2 together with the GVB allow us to conclude with our main result:

Theorem 3. Let C be an [N, 0.77N] random binary code. Let l > 0 be an integer and let n ≤ 0.035N/(4l + 5). Protocol LJCB is δ(l)-secure against all n-coherent opening strategies for γ = 0.9937, ϱ = 0.93 and δ(l) = γ^l + ϱ^l.
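Both tunings in Theorem 3 can be verified numerically (a sketch added here; `h2` is the standard binary entropy function, introduced for the check): the Gilbert–Varshamov condition h2(δ) ≤ 1 − k/N holds for rate 0.77 and relative distance 0.035, and the binding error δ(l) = γ^l + ϱ^l vanishes exponentially in l, at the price of n ≤ 0.035N/(4l + 5).

```python
import math

def h2(x):
    """Binary entropy function."""
    return -x * math.log2(x) - (1 - x) * math.log2(1 - x)

# Gilbert-Varshamov: a random [N, k] code has relative minimum
# distance delta w.h.p. whenever h2(delta) <= 1 - k/N.
rate, delta = 0.77, 0.035
print(h2(delta) <= 1 - rate)  # True: [N, 0.77N, 0.035N] is achievable

# Binding error delta(l) = gamma^l + rho^l from Theorem 3:
gamma, rho = 0.9937, 0.93
for l in (10, 100, 500):
    print(l, gamma ** l + rho ** l)
```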
The binding condition, which is the target of Mayers’ attack, holds because if Alice could succeed in changing her mind, it would imply that some measurement outcomes have given more information than what is physically achievable. Even though our analysis gives n ∈ Ω(N) for any δ(l) > 0, the constant n/N ≈ 0.035/(4l + 5) is small even for relatively large values of δ(l). It is important for practical applications to improve the constants appearing in the statement of Theorem 3. Bootstrapping the BBCS protocol with LJCB leads to secure QOT provided the receiver cannot carry out n-coherent opening strategies against the commitments [35]. In BBCS, the receiver must commit to measurement outcomes. Two commitments are produced for each of the N qubits received. From Theorem 3, and assuming each commitment requires the transmission of N qubits, we get that BBCS is secure against n-coherent measurements for some n ∈ Ω(√N). Moreover, one call to BBCS is sufficient to get a 2^{−αN}-secure commitment scheme for some α > 0. The resulting commitment is therefore 2^{−αN}-secure for some n ∈ Ω(√N) as well. This leads to our main open question: Is LJCB 2^{−αN}-binding against any n-coherent opening strategy for some n ∈ Ω(N)? When used in BBCS, LJCB allows one to realize QOT using only unidirectional quantum transmission. If QOT is used for quantum identification [14] then the scheme achieves unconditional security for the client and conditional security for the server. All quantum transmissions taking place are from the client to the server. This is interesting in practice because only the technology for sending photons (which is simpler than that for receiving) is required on the client side. However, in other scenarios it might be better to have a commitment scheme where the committer sends the qubits. In such a case BCJL would be a better choice. Theorem 3 should also hold for BCJL but with different constants. It would be interesting to prove Theorem 3 for BCJL as well.
Different experiments in quantum information theory (see [17,24,8,31]) have given strong evidence that our assumption is realistic. It appears that the physical complexity of implementing n-coherent measurements grows very quickly as n increases. Today’s technology only allows one to deal imperfectly with the simple case n = 2. Future experiments will be important in order to capture more precisely the inherent difficulty of implementing arbitrarily large coherent measurements. Despite the fact that quantum cryptography does not provide unconditionally secure two-party computation, it allows one to base cryptography upon physical, realistic and well-defined assumptions. In this paper, we have shown how quantum mechanics can help in providing an alternative framework to complexity-based cryptography.
Acknowledgements. The author is very grateful to Peter Høyer for indispensable help. I would like to thank Ivan Damgård, Jeroen van de Graaf and Claude Crépeau for helpful discussions and comments. I am also indebted to Gilles Brassard for having proposed the problem in the first place. Finally, thanks to the anonymous referees for valuable remarks.
References

1. Aspect, A., P. Grangier and G. Roger, “Experimental realization of the Einstein-Podolsky-Rosen-Bohm gedankenexperiment: A new violation of Bell’s inequalities”, Physical Review Letters, vol. 49, no. 2, 1982, pp. 91–94.
2. Bell, J.S., “On the Einstein Podolsky Rosen Paradox”, Physics, vol. 1, no. 1, 1964, p. 195.
3. Bennett, C.H. and G. Brassard, “Quantum cryptography: Public key distribution and coin tossing”, Proceedings of the IEEE International Conference on Computers, Systems and Signal Processing, Bangalore, India, December 1984, pp. 175–179.
4. Bennett, C.H., G. Brassard, C. Crépeau and U. Maurer, “Generalized Privacy Amplification”, IEEE Transactions on Information Theory, vol. 41, 1995, pp. 1915–1923.
5. Bennett, C.H., G. Brassard, C. Crépeau and M.-H. Skubiszewska, “Practical quantum oblivious transfer”, Advances in Cryptology — Proceedings of Crypto ’91, August 1991, Springer-Verlag, pp. 351–366.
6. Bennett, C.H., G. Brassard, C. Crépeau, R. Jozsa, A. Peres and W.K. Wootters, “Teleporting an Unknown Quantum State via Dual Classical and EPR Channels”, Physical Review Letters, vol. 70, no. 13, 1993, pp. 1895–1899.
7. Biham, E., G. Brassard, M. Boyer, J. van de Graaf and T. Mor, “Security of Quantum Key Distribution Against All Collective Attacks”, Los Alamos preprint archive quant-ph/9801022, January 1998.
8. Bouwmeester, D., J.W. Pan, K. Mattle, M. Eibl, H. Weinfurter and A. Zeilinger, “Experimental Quantum Teleportation”, Nature, vol. 390, 1997, p. 575.
9. Brassard, G. and C. Crépeau, “Quantum bit commitment and coin tossing protocols”, Advances in Cryptology — Proceedings of Crypto ’90, August 1990, Springer-Verlag, pp. 49–61.
10. Brassard, G., C. Crépeau, R. Jozsa and D. Langlois, “A quantum bit commitment scheme provably unbreakable by both parties”, Proceedings of the 34th Annual IEEE Symposium on the Foundations of Computer Science, November 1993, pp. 362–371.
11. Brassard, G., C. Crépeau, D. Mayers and L. Salvail, “A Brief Review on the Impossibility of Quantum Bit Commitment”, Los Alamos preprint archive quant-ph/9712023, December 1997.
12. Cachin, C. and U. Maurer, “Linking Information Reconciliation and Privacy Amplification”, Journal of Cryptology, vol. 10, no. 2, 1997, pp. 97–110.
13. Crépeau, C., “What is going on with quantum bit commitment?”, Proceedings of Pragocrypt ’96: 1st International Conference on the Theory and Applications of Cryptology, Prague, October 1996.
14. Crépeau, C. and L. Salvail, “Quantum oblivious mutual identification”, Advances in Cryptology — Proceedings of Eurocrypt ’95, May 1995, Springer-Verlag, pp. 133–146.
15. Crépeau, C., J. van de Graaf and A. Tapp, “Committed Oblivious Transfer and Private Multi-Party Computation”, Advances in Cryptology — Proceedings of Crypto ’95, Springer-Verlag, Berlin, 1995, vol. 963, pp. 110–123.
16. Einstein, A., B. Podolski and N. Rosen, “Can Quantum-Mechanical Description of Physical Reality be Considered Complete?”, Physical Review, no. 47, 1935, pp. 777–780.
17. Hughes, R.J., D.F.V. James, J.J. Gomez, M.S. Gulley, M.H. Holzscheiter, P.G. Kwiat, S.K. Lamoreaux, C.G. Peterson, V.D. Sandberg, M.M. Schauer, C.M. Simmons, C.E. Thorburn, D. Tupa, P.Z. Wang and A.G. White, “The Los Alamos Trapped Ion Quantum Computer Experiment”, Los Alamos preprint archive quant-ph/9708050, August 1997.
18. Hughston, L.P., R. Jozsa and W.K. Wootters, “A complete classification of quantum ensembles having a given density matrix”, Physics Letters A, vol. 183, 1993, pp. 14–18.
19. Impagliazzo, R. and M. Luby, “One-way Functions are Essential for Complexity Based Cryptography”, Proceedings of the 30th Annual IEEE Symposium on the Foundations of Computer Science, 1989, pp. 230–235.
20. Kilian, J., “Founding Cryptography on Oblivious Transfer”, Proceedings of the 20th Annual ACM Symposium on Theory of Computing, Chicago, 1988, pp. 20–31.
21. Kranakis, E., Primality and Cryptography, John Wiley and Sons, 1986.
22. Lo, H.-K. and H.F. Chau, “Is quantum bit commitment really possible?”, Los Alamos preprint archive quant-ph/9603004, March 1996.
23. MacWilliams, F.J. and N.J.A. Sloane, The Theory of Error-Correcting Codes, North-Holland, 1977.
24. Mattle, K., H. Weinfurter, P.G. Kwiat and A. Zeilinger, “Dense coding in experimental quantum communication”, Physical Review Letters, vol. 76, 1996, pp. 4656–4659.
25. Mayers, D., “On the security of the quantum oblivious transfer and key distribution protocols”, Advances in Cryptology — Proceedings of Crypto ’95, Lecture Notes in Computer Science, 1995.
26. Mayers, D., “The trouble with quantum bit commitment”, presented at a workshop on quantum information theory, Montréal, October 1995. Available as Los Alamos preprint archive quant-ph/9603015, March 1996.
27. Mayers, D., “La sécurité des protocoles de la cryptographie quantique”, PhD dissertation, Université de Montréal, 1996.
28. Mayers, D., “Unconditionally secure quantum bit commitment is impossible”, presented at the Fourth Workshop on Physics and Computation — PhysComp ’96, Boston, November 1996.
29. Mayers, D., “Unconditionally secure quantum bit commitment is impossible”, Physical Review Letters, vol. 78, 1997, pp. 3414–3417.
30. Mayers, D. and L. Salvail, “Quantum oblivious transfer is secure against all individual measurements”, Proceedings of the Third Workshop on Physics and Computation — PhysComp ’94, Dallas, November 1994, IEEE Computer Society Press, pp. 69–77.
31. Michler, M., K. Mattle, H. Weinfurter and A. Zeilinger, “Interferometric Bell-state analysis”, Physical Review A, vol. 53, 1996, pp. 1209–1212.
32. Rabin, M.O., “How to exchange secrets by oblivious transfer”, Technical Memo TR-81, Aiken Computation Laboratory, Harvard University, 1981.
33. Schumacher, B., “Sending quantum entanglement through noisy channels”, Los Alamos preprint archive quant-ph/9604023, April 1996.
34. Wiesner, S., “Conjugate coding”, Sigact News, vol. 15, no. 1, 1983, pp. 78–88; original manuscript written circa 1969.
35. Yao, A.C.-C., “Security of quantum protocols against coherent measurements”, Proceedings of the 27th Annual ACM Symposium on the Theory of Computing, 1995, pp. 67–75.
On Concrete Security Treatment of Signatures Derived from Identification

Kazuo Ohta and Tatsuaki Okamoto
NTT Laboratories, Nippon Telegraph and Telephone Corporation
1-1 Hikari-no-oka, Yokosuka, Kanagawa, 239-0847 Japan
{ohta,okamoto}@isl.ntt.co.jp
Abstract. Signature schemes that are derived from three move identification schemes such as the Fiat-Shamir, Schnorr and modified ElGamal schemes are a typical class of the most practical signature schemes. The random oracle paradigm [1,2,12] is useful to prove the security of such a class of signature schemes [4,12]. This paper presents a new key technique, “ID reduction”, to show the concrete security result of this class of signature schemes under the random oracle paradigm. First, we apply this technique to the Schnorr and modified ElGamal schemes, and show the “concrete security analysis” of these schemes. We then apply it to the multi-signature schemes.
1 Introduction

1.1 Background
H. Krawczyk (Ed.): CRYPTO’98, LNCS 1462, pp. 354–370, 1998. © Springer-Verlag Berlin Heidelberg 1998

To realize a practical and provably secure cryptosystem is one of the most important research topics, and digital signatures are a very important ingredient of cryptography. This paper focuses on practical and provably secure signature schemes.

1.1.1 Standard Security Paradigm versus Random Oracle Paradigm

The first formal definition of security for digital signatures (“existentially unforgeable against adaptively chosen-message attacks”) was given by Goldwasser, Micali and Rivest [7], and a concrete signature scheme satisfying this security definition was shown by assuming the existence of a claw-free pair of functions [7]. Hereafter, this formal definition and model for signatures is called the “standard security paradigm”, and a signature scheme secure under the standard security paradigm is simply called a “provably secure” signature scheme. An ultimate target in the standard security paradigm was to realize a provably secure signature scheme assuming the weakest computational assumption, the existence of a one-way function. This target was finally solved affirmatively by Naor, Yung and Rompel [9,13]. Their solution, however, was geared towards a feasibility result and was thus very inefficient and far from practical. In addition, even the scheme of [7] is much less efficient than typical practical schemes such
as the RSA [14] and Schnorr [15] schemes. Therefore, no provably secure scheme as efficient as typical practical schemes had been proposed. To realize provable security and efficiency simultaneously, another paradigm for proving the security of cryptographic schemes has been proposed [1,2,12]. This is called the “random oracle paradigm”, in which an ideally random and imaginary oracle, the “random oracle”, is assumed when proving security, and the random oracle is replaced by a practical random-like function such as a one-way hash function (e.g., SHA) when realized in practice. Here, the random oracle F generates a random answer to a query posed to F for the first time. If the same query is asked later, F answers with the same value as was provided for the first query. Although security under the random oracle paradigm cannot be guaranteed formally when a practical random-like function is used in place of the random oracle, this paradigm yields much more efficient schemes than the standard security paradigm. Security with the random oracle gives an informal guarantee for the security of practical random-like functions. In addition, the random oracle model not only provides a methodology for constructing efficient and secure schemes, but also gives some security guarantee for schemes that practitioners have intuitively constructed using random-like functions in actual systems.

1.1.2 Asymptotic Security Analysis versus Concrete Security Analysis

The random oracle paradigm has another advantage over the standard security paradigm: it can much more easily provide “concrete security analysis”, which avoids complexity theory and asymptotic properties when proving security (i.e., when reducing the breaking of a primitive problem to the breaking of a signature scheme).
Such concrete security analysis provides a much better guarantee than asymptotic security analysis, since the computational complexity currently required to break a signature scheme with a “fixed size” (e.g., 1024 bits) and “fixed key” can be estimated by the assumed lower bound on the complexity of breaking the underlying primitive with the “fixed size” and “fixed key”. Note that asymptotic security gives no useful information on the security of a fixed-size, fixed-key system. The concrete security analysis of the reduction from breaking a signature scheme to solving a primitive problem is usually trivial and optimal (i.e., optimally efficient). Hence, we have to make the concrete security analysis of the opposite direction of the reduction as close to optimal as possible. If the opposite direction is as efficient as the trivial direction, then we call such a reduction exact. That is, an exact reduction implies that the required time (and success probability) of breaking the signature scheme is exactly equivalent to that of breaking the primitive problem. (In other words, the signature scheme is exactly as secure as the primitive problem.) The (almost) exact security of the RSA signature scheme along with random functions has been shown under the random oracle paradigm [2]. The asymptotic security of the Schnorr and modified ElGamal schemes has been proven under the same paradigm [12].
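The random oracle F used throughout, a fresh random answer for each new query and the same answer on repetition, can be sketched as a lazily sampled table (a toy illustration added here; the output length k is a parameter we introduce, not from the paper):

```python
import secrets

class RandomOracle:
    """Lazily sampled random function F: a fresh uniform k-bit answer
    for each new query, and the same answer for repeated queries."""
    def __init__(self, k=128):
        self.k = k
        self.table = {}

    def query(self, x):
        if x not in self.table:
            self.table[x] = secrets.randbits(self.k)
        return self.table[x]

F = RandomOracle()
a1 = F.query(b"message")
a2 = F.query(b"message")
print(a1 == a2)  # True: consistent on repeated queries
```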
1.2 Main Result
This paper shows the concrete security analysis of the Schnorr, modified ElGamal (MEG) and multi-signature schemes under the random oracle paradigm. (The concrete security analysis of the other signature schemes based on the Fiat-Shamir conversion technique can be proven similarly.) In order to show the concrete security analysis of these signature schemes, we have developed a new technique, “ID reduction”, in which the identification scheme corresponding to the signature scheme is used when showing the reduction from breaking the underlying primitive to breaking the signature scheme. There are two stages of reduction. The first stage is from breaking the corresponding identification scheme to breaking the signature scheme, and the second stage is from breaking the underlying primitive to breaking this identification scheme. In order to obtain a tighter (i.e., close to optimal) reduction, and a tighter evaluation, from breaking the underlying primitive to breaking the signature scheme, our “ID reduction” technique has an advantage over the previous technique, the “forking lemma” of Pointcheval and Stern [12]. This is because the first stage of ID reduction (the ID reduction lemma: Lemma 9) is optimal¹ in our signature scheme model, and the second stage of this reduction (Lemma 13 and Lemma 15) may be more efficient than the reduction in the forking lemma of [12], since analyzing the corresponding identification scheme is easier than analyzing the signature scheme directly. Here, finding a forking pair of signatures in the forking lemma of [12] corresponds to finding two success entries in a heavy row in our approach. Therefore, the ID reduction technique seems to be more appropriate for obtaining a tighter reduction than the previous technique.
In addition, the asymptotic result of the Fiat-Shamir signature scheme proven in [12] can be trivially obtained just by combining the ID reduction lemma as the first stage reduction and the well-known techniques given by [5] as the second stage reduction.
2 Framework
In this paper, we investigate a specific class of signature schemes that are derived from three move identification schemes, where the identification schemes are perfect zero-knowledge against an honest verifier [6]. This section shows the models and notations of such signature and identification schemes.
2.1 Signature Scheme
In the signature scheme, signer P publishes a public key Kp while keeping a secret key Ks. In this paper, we adopt the following model of a signature scheme, which covers the class of the Fiat-Shamir scheme [4], the Schnorr scheme [15] and the modified ElGamal scheme [12]:
¹ We explain the meaning of "optimal" at the end of Section 3.
On Concrete Security Treatment of Signatures
Model 1. (Signature Model)
Key generation: Each signer P generates a pair (Kp, Ks) of a public key and a secret key using a key generation algorithm G which, on input 1^k, where k is the security parameter, produces (Kp, Ks).
Signature generation: P generates the signature of his message m using a public random oracle function F as follows: P generates X from both Ks and a random string R, accesses the random oracle function F to get E = F(X, m) ∈ E, calculates Y using Ks, R and E, and sends (X, m, Y) to V.
Verification: A verifier V checks the validity of the signature of the message via the relations on (Kp, X, E, Y) and E = F(X, m).
Remark 1. We assume that this signature scheme is derived from the following identification scheme.

2.2 Identification Scheme
Here we can define an identification scheme that produces the above-mentioned signature scheme. In an identification scheme, prover P publishes a public key while keeping the corresponding secret key, and proves his identity to verifier V.
Model 2. (Identification Scheme)
Key generation: Prover P generates a pair (Kp, Ks) of a public key and a secret key using a key generation algorithm G which, on input 1^k, where k is the security parameter, produces (Kp, Ks).
Identification Protocol: P proves his identity, and verifier V checks the validity of P's proof as follows:
Step 1 P generates X from both Ks and a random string R and sends it to V.
Step 2 V generates a random challenge E ∈ E and sends it to P.
Step 3 P generates an answer Y from (Ks, R, E) and sends it to V.
Step 4 V checks the validity of the relations on (Kp, X, E, Y).
Remark 2. We assume that this three-move protocol is perfect zero-knowledge against an honest verifier.

2.3 Security
We will adopt the quantifiable notion of exact security proposed in Reference [2].

2.3.1 Security of Key Searching Problem
Definition 3. A probabilistic Turing machine (adversary) A breaks a key searching problem with (t, ε) if and only if A can find a secret key from a public key with success probability greater than ε within processing time t. The probability is taken over the coin flips of A.
Definition 4. A key searching problem is (t, ε)-secure if and only if there is no adversary that can break it with (t, ε).
2.3.2 Security of Identification Schemes
Definition 5. A probabilistic Turing machine (adversary) A breaks an identification scheme with (t, ε) if and only if A, as a prover, can cheat the honest verifier V with success probability greater than ε within processing time t. Here, A does not conduct any active attack². The probability is taken over the coin flips of A and V.
Definition 6. An identification scheme is (t, ε)-secure if and only if there is no adversary that can break it with (t, ε).

2.3.3 Security of Signature Schemes
Next we quantify the security of a signature scheme. Here we assume that the attacker can dynamically ask the legitimate user P to sign any message m, using him as a kind of oracle. This model covers the most general attack on signatures, the adaptive chosen message attack.
Definition 7. A probabilistic Turing machine (adversary) A breaks a signature scheme with (t, qsig, qF, ε) if and only if A can forge a signature of a message with success probability greater than ε. We allow chosen-message attacks in which A can see up to qsig legitimate chosen message-signature pairs produced by the signature generating procedure, and allow qF invocations of F, all within processing time t. The probability is taken over the coin flips of A, F and the signing oracle P.
Definition 8. A signature scheme is (t, qsig, qF, ε)-secure if and only if there is no adversary that can break it with (t, qsig, qF, ε).
3 ID Reduction Lemma
The general techniques by which we can derive signature schemes from three-move interactive protocols were proposed in [4]; hash functions are used in order to create a kind of virtual verifier, which gives the conversion from an identification scheme to a signature scheme. To analyze the security of such a class of signature schemes, we will examine the opposite direction of the conversion for adversaries in Lemma 9, in order to prove the security of signature schemes as the first stage of the ID Reduction Technique. Note that a signature scheme and an identification scheme in this section mean those defined in the previous section. We assume that uniform coin flips over E (i.e., Pr[E occurs] = 1/#E) are provided.

Lemma 9. (ID Reduction Lemma) Let ε ≥ (qF(qsig + 4) + 1)/#E (i.e., ε' ≥ (qsig + 4)/#E, where ε' = (ε − 1/#E)/qF).
1) If A1 breaks a signature with (t, qsig, qF, ε), there exists A2 which breaks the
As the result of Lemma 9 3), it is enough to cover this case only for discussion of the security of identification schemes, where the honest verifier is assumed.
signature with (t, qsig, 1, ε'), where ε' = (ε − 1/#E)/qF.
2) If A2 breaks a signature with (t, qsig, 1, ε'), there exists A3 which breaks the signature with (t', 0, 1, ε''), where ε'' = ε' − qsig/#E and t' = t + (the simulation time of qsig signatures).
3) If A3 breaks a signature with (t', 0, 1, ε''), there exists A4 which breaks the corresponding identification scheme with (t', ε'')³.
Here we assume that the values of qF and qsig can be employed by these reductions⁴. We neglect the time of reading/writing data on (random, communication, etc.) tapes, simple counting, and if-then-else controls. (Hereafter in this paper, we make the same assumptions.)
³ From the condition on ε, ε'' ≥ 4/#E holds. This makes the heavy row technique available in Lemma 13 and Lemma 15, since there are at least two 1's in a heavy row of the Boolean matrix H defined in Section 4.2.2.
⁴ For simplicity, we also assume that these values do not depend on the adversary's coin flips but only on the length of its input.
Sketch of Proof: 1) Let Qi be the i-th query from A1 to the random oracle F and ρi the i-th answer from F to A1. Construct a machine B using A1 as follows:
Step 1 Select an integer i satisfying 1 ≤ i ≤ qF uniformly at random.
Step 2 Run A1 with a random oracle F and get (X, m, E, Y).
Step 3 If (X, m) = Qi and E = ρi, then output (X, m, E, Y). Otherwise output (Qi, ρi, ri), where ri is a random element of the range of Y.
If A1 succeeds in forging a signature (X, m, E, Y), there are two cases: 1) (X, m) was not asked to the random oracle F, and 2) (X, m) was asked as the i-th query to the random oracle F (1 ≤ i ≤ qF). In the former case, the success probability of A1 is at most 1/#E, because of the randomness of the random oracle. Thus

  Pr[B succeeds] ≥ Σ_{i=1..qF} Pr[i is selected] · Pr[A1 succeeds ∧ (X, m) = Qi]
               = (1/qF) Σ_{i=1..qF} Pr[A1 succeeds ∧ (X, m) = Qi]
               = (1/qF) (Pr[A1 succeeds] − Pr[A1 succeeds ∧ (X, m) is not a query to F])
               ≥ (1/qF) (ε − 1/#E),

because Pr[A1 succeeds] ≥ ε.
Construct a machine B̃ using A1 as follows:
Step 1 Select an integer i satisfying 1 ≤ i ≤ qF uniformly at random.
Step 2 Run A1 with a random oracle F and a random working tape Θ, and get (X, m, E, Y), where only the i-th query is asked to F and the remaining (qF − 1) queries are answered from Θ. Here Θ consists of (qF − 1) random blocks used as answers.
Step 3 If (X, m) = Qi and E = ρi, then output (X, m, E, Y). Otherwise output (Qi, ρi, ri), where ri is a random element of the range of Y.
A1 cannot distinguish the (qF − 1) random blocks of Θ from (qF − 1) answers from F, because of the randomness of F. Thus Pr[B succeeds] = Pr[B̃ succeeds] holds. Therefore,

  Pr[B̃ succeeds] ≥ (ε − 1/#E)/qF.

Put A2 = B̃.
2) Construct a machine A3 using A2 as follows:
Step 1 For j = 1 to qsig do:
Step 1-1 Run A2 with the simulated (Xi, mi, Ei, Yi) (1 ≤ i ≤ j − 1), and get a message mj chosen by A2 whose signature is requested from the signer.
Step 1-2 Simulate (Xj, mj, Ej, Yj) by the standard perfect ZKIP simulation technique of the corresponding identification scheme with an honest verifier. If there exists an integer i (< j) satisfying Xj = Xi, discard Xj and repeat this step.
Step 2 Run A2 with a random oracle F and the simulated (Xi, mi, Ei, Yi) (1 ≤ i ≤ qsig), and get (X, m, E, Y).
Step 3 Output (X, m, E, Y).
If A2 does not ask (Xi, mi) (1 ≤ i ≤ qsig) to F, then A2 cannot distinguish the simulated message-signature pairs from legitimate pairs, because of the perfect indistinguishability described in Section 2 and the randomness of F's output. The success probability of A3 is given as follows:

  ε'' = Pr[A3 succeeds]
      = Pr[A2 succeeds ∧ (Xj, mj) ≠ (the query from A2 to F) for all 1 ≤ j ≤ qsig]
      = Pr[A2 succeeds] − Pr[∃i such that 1 ≤ i ≤ qsig ∧ (Xi, mi) = (the query from A2 to F)]
      ≥ ε' − qsig/#E,

while t' = t + (the simulation time of qsig signatures in Step 1-2).
3) Let Q be a query from A3 to the random oracle F and ρ the answer from F to A3.
Construct a machine A4 using A3 interacting with an honest verifier V as follows:
Step 1 Run A3 and get a query Q = (X, m) which is sent to the random oracle F.
Step 2 Send Q to V and get a challenge E from V.
Step 3 Run A3 with an input ρ = E and get (X, m, E, Y).
Step 4 Output Y to V.
Note that a valid signature (X, m, E, Y) satisfies a relation of (Kp, X, E, Y) and E = F(X, m). When the verifier V checks the validity of this relation, V accepts A4's proof with (t', ε''). □
Remark 10. When ignoring the minor terms (the simulation time and ε' − ε''), the first stage of ID reduction for the signature schemes in this paper is optimal in the following sense. For any strategy of A1, ε' = (ε − 1/#E)/qF. On the other hand, consider a specific Ã1 with Pr[Ã1 succeeds ∧ (X, m) = Qi] = (ε − 1/#E)/qF. Then, for any strategy of the first stage reduction (signature to identification), ε' = (ε − 1/#E)/qF. Since we cannot neglect the existence of such a specific Ã1, we cannot obtain a first stage reduction whose value of ε' is better than (ε − 1/#E)/qF. Note that this does not mean "exact" security, since ε' ≈ ε in "exact" security, while ε' ≈ ε/qF in our "optimal" reduction. In addition, note that this observation depends on the signature scheme model shown in Section 2.
4 Schnorr Signature Scheme
We discuss here the Schnorr scheme [15] as an example, though similar results can be obtained for the Fiat-Shamir scheme [4,5], etc. The schemes can also be implemented using an elliptic curve [8].

4.1 Scheme
Key generation: A trusted center publishes two large primes, p and q, such that q | (p − 1), and an element g ∈ (Z/pZ)* of order q. A signer P chooses a secret key s ∈ Z/qZ and publishes the public key I, where I = g^s mod p.
Signature generation: A signer P generates the signature of his message m using a public hash function F as follows: P generates a random integer r ∈ Z/qZ, calculates X = g^r mod p, e = F(X, m) ∈ Z/qZ and y = r + es mod q, and sends (X, m, y) to V.
Verification: V checks the validity of a signature of the message by the following equations: g^y ≡ X · I^e (mod p) and e = F(X, m).

4.2 Security
The following identification scheme is reduced to the Schnorr signature scheme in Section 4.1, and it will be analyzed adopting the scenario given in Section 3.
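Before turning to the identification scheme, the signing and verification equations of Section 4.1 can be exercised directly. The sketch below uses toy parameters (p = 607, q = 101, g of order q) and SHA-256 as a stand-in for the random oracle F; it illustrates the equations only and is not a secure implementation:

```python
import hashlib
import secrets

# Toy parameters (illustrative only): q | p - 1 and g has order q in (Z/pZ)*.
p, q, g = 607, 101, 122  # g = 3**((p - 1) // q) % p


def F(X, m):
    """SHA-256 standing in for the random oracle F, with output in Z/qZ."""
    return int.from_bytes(hashlib.sha256(f"{X}|{m}".encode()).digest(), "big") % q


s = secrets.randbelow(q)   # secret key
I = pow(g, s, p)           # public key I = g^s mod p


def sign(m):
    r = secrets.randbelow(q)
    X = pow(g, r, p)
    e = F(X, m)
    y = (r + e * s) % q    # y = r + e*s mod q
    return X, y


def verify(m, X, y):
    """Check g^y == X * I^e (mod p) with e = F(X, m)."""
    return pow(g, y, p) == (X * pow(I, F(X, m), p)) % p


X, y = sign("hello")
assert verify("hello", X, y)
print("Schnorr signature verified")
```

The verification identity holds because g^y = g^(r + es) = g^r · (g^s)^e = X · I^e (mod p).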
4.2.1 Identification Scheme
Key generation: A trusted center publishes two large primes p and q such that q | (p − 1), and an element g ∈ (Z/pZ)* of order q. A prover P chooses a secret key s ∈ Z/qZ and publishes the public key I, where I = g^s mod p.
Identification Protocol: P proves his identity and a verifier V checks the validity of P's proof as follows:
Step 1 P generates a random integer r ∈ Z/qZ, calculates X = g^r mod p, and sends X to verifier V.
Step 2 V generates a random integer e ∈ Z/qZ and sends it to P.
Step 3 P calculates y = r + es mod q and sends it to V.
Step 4 V checks the following equation: g^y ≡ X · I^e (mod p).
4.2.2 Heavy Row Lemma
A Boolean matrix and heavy rows will be introduced in order to analyze the security of one-round identification schemes. Assume that there is a cheater A who can break a one-round identification scheme with (t, ε), where ε ≥ 4/q.
Definition 10. (Boolean Matrix of (A, V)) Consider the possible outcomes of the execution of (A, V) as a Boolean matrix H(RA, e) whose rows correspond to all possible choices of RA, where RA is the private random tape of A; its columns correspond to all possible choices of e, where e ∈ RV. An entry is 0 if V rejects A's proof, and 1 if V accepts A's proof. Note that RV = Z/qZ in Schnorr's case.
Definition 11. (Heavy Row) A row of the matrix H is heavy if the fraction of 1's along the row is at least ε/2, where ε is the success probability of A.
Lemma 12. (Heavy Row Lemma) The 1's in H are located in heavy rows of H with probability at least 1/2.

4.2.3 Security of Identification Scheme
Lemma 13. (Security of Schnorr Identification Scheme) Let ε ≥ 4/q. Suppose that the key searching problem of (p, g, I), that is, calculating s from I satisfying I = g^s mod p, is (t*, ε*)-secure. Then the Schnorr identification scheme with parameter (p, g, I) is (t, ε)-secure, where

  t* = 3(t + Φ1)/ε + Φ3  and  ε* = (1/2)(1 − 1/e)² > 9/50.

Here Φ1 is the verification time of the identification protocol, Φ3 is the calculation time of s in the final stage of the reduction, and e is the base of the natural logarithm.
Sketch of Proof: Assume that there is a cheater A who can break the identification with (t, ε). We will construct a machine A* which breaks the key searching problem of (p, g, I) with (t*, ε*) using A. We will discuss the following probing strategy of H to find two 1's along the same row in H [5]:
Step 1 Probe random entries in H to find an entry a(0) with 1. We denote the row where a(0) is located in H by H(0).
Step 2 After a(0) is found, probe random entries along H(0) to find another entry with 1. We denote it by a(1).
It is proven that this strategy succeeds with constant probability in just O(1/ε) probes, using Lemma 12 concerning the useful concept of a heavy row defined in Definition 11. Let p1 be the success probability of Step 1 with 1/ε repetitions: p1 ≥ 1 − (1 − ε)^(1/ε) = p'1 > 1 − 1/e > 3/5, because the fraction of 1's in H is ε. Let p2 be the success probability of Step 2 with 2/ε repetitions: p2 ≥ (1/2) × (1 − (1 − ε/2)^(2/ε)) = p'2 > (1/2)(1 − 1/e) > 3/10, because the probability that H(0) is heavy is at least 1/2 by Lemma 12 and the fraction of 1's along a heavy row is at least ε/2. Therefore ε* = p1 × p2 ≥ p'1 × p'2 > (1/2)(1 − 1/e)² > 9/50 and t* = t × (1/ε + 2/ε) = 3t/ε.
Each a(i) represents (X(i), e(i), y(i)). g^(y(i)) ≡ X(i) · I^(e(i)) (mod p) (i = 0, 1) holds, since a(i) is an entry with 1. Two 1's, a(0) and a(1), in the same row H(0) mean X(1) = X(0). Since there are two unknown variables, r(0) and s, and two equations are obtained, the secret key s can be calculated by

  s = (y(0) − y(1))/(e(0) − e(1)) mod q

in Schnorr's scheme, since q is prime and 0 < e(0) − e(1) < q. □
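The final extraction step can be checked numerically: two accepting transcripts that share the same commitment X determine s exactly as in the proof. A sketch with toy parameters (the secret key s = 73 and the two challenges are fabricated for illustration; the modular inverse is taken with Python's three-argument pow):

```python
# Toy parameters (illustrative only): g has order q = 101 in (Z/pZ)* with p = 607.
p, q, g = 607, 101, 122

s = 73                    # the secret key the reduction is trying to recover
I = pow(g, s, p)

# Two accepting transcripts (X, e, y) from the same row of H, i.e. the same r.
r = 40
X = pow(g, r, p)
e0, e1 = 17, 90
y0 = (r + e0 * s) % q
y1 = (r + e1 * s) % q
assert pow(g, y0, p) == (X * pow(I, e0, p)) % p  # a^(0) is a 1-entry
assert pow(g, y1, p) == (X * pow(I, e1, p)) % p  # a^(1) is a 1-entry

# s = (y0 - y1) / (e0 - e1) mod q, well defined since q is prime and e0 != e1.
recovered = (y0 - y1) * pow(e0 - e1, -1, q) % q
assert recovered == s
print("recovered secret key:", recovered)
```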
4.2.4 Security of Signature Scheme
The following theorem is proven by combining Lemma 9 and Lemma 13.
Theorem 14. (Security of Schnorr Signature Scheme) Let ε' ≥ (qsig + 4)/q, where ε' = (ε − 1/q)/qF. Suppose that the key searching problem of (p, g, I) is (t*, ε*)-secure. Then the Schnorr signature scheme with parameter (p, g, I) is (t, qsig, qF, ε)-secure, where

  t* = 3t'/ε'' + Φ3  and  ε* = (1/2)(1 − 1/e)² > 9/50.

Here

  t' = t + Φ1 + Φ2  and  ε'' = (ε − 1/q)/qF − qsig/q,

where Φ1 is the verification time of the identification protocol, Φ2 is the simulation time of qsig signatures, Φ3 is the calculation time of s in the final stage of the reduction, and q is the order of g ∈ (Z/pZ)*.
4.3 Discussion on the Efficiency of Our Reduction
We have proven that if the key searching problem is (t*, ε*)-secure, then the Schnorr signature scheme is (t, qsig, qF, ε)-secure. On the other hand, if the key searching problem is breakable with (t, ε), then the signature scheme is breakable with (t, 0, 1, ε) by the trivial reduction. If our reduction were "exact (optimally efficient)," (t*, ε*) would be the same quantity as (t, ε) for any values of qsig and qF. Note that this does not always imply t = t* and ε = ε*, since (t, ε) and (t*, ε*) are considered to have the same quantity when t* = βt and ε* = 1 − (1 − ε)^β. Here we estimate the degree of "exactness" of our reduction (i.e., how close the above-mentioned reduction is to the exact case) by comparing the quantities of (t*, ε*) and (t, ε). For this purpose, we normalize (t, ε) into (t+, ε+) with ε+ = ε*. Let β = α/ε be the number of repetitions of the (t, ε)-breakable algorithm needed to attain the same success probability as ε*. Since ε* = (1/2)(1 − 1/e)² > 9/50, α ≈ 0.223 holds because of the requirement 1 − (1 − ε)^(α/ε) = ε*. Therefore t+ = αt/ε, and the ratio of t* and t+ gives the degree (via α ≈ 0.223) of exactness of our reduction. If we assume that t ≈ t' and ε'' ≈ ε/qF, since qsig is small and q is large, then this ratio is 3qF/α ≈ 13.5 qF. Thus, our reduction is still efficient, though it is not exact. Note that qF cannot be eliminated from this ratio because of the optimality of the ID reduction lemma.
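The constants in this estimate can be reproduced numerically. Assuming ε is small, 1 − (1 − ε)^(α/ε) ≈ 1 − e^(−α), so α is determined by ε* alone:

```python
import math

# Success probability achieved by the reduction (Theorem 14).
eps_star = 0.5 * (1 - 1 / math.e) ** 2

# For small eps, 1 - (1 - eps)**(alpha/eps) ~ 1 - exp(-alpha),
# so alpha solves 1 - exp(-alpha) = eps_star.
alpha = -math.log(1 - eps_star)

# t* / t+ ~ (3 t / (eps/qF)) / (alpha t / eps) = 3 qF / alpha.
ratio = 3 / alpha  # coefficient of qF

print(f"eps* = {eps_star:.4f}, alpha = {alpha:.4f}, ratio = {ratio:.2f} * qF")
assert abs(alpha - 0.223) < 1e-3   # matches the alpha ~ 0.223 in the text
assert abs(ratio - 13.5) < 0.1     # matches the ~13.5 qF in the text
```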
5 Modified ElGamal Signature Scheme
We will discuss the modified ElGamal (MEG) signature scheme [12] in this section.
5.1 Scheme
Key generation: the same as the Schnorr scheme.
Signature generation: A signer P generates the signature of his message m using a public hash function F as follows: P generates a random integer r ∈ (Z/qZ)*, calculates X = g^r mod p, e = F(X, m) ∈ Z/qZ and y = (e − sX)/r mod q, and sends (X, m, y) to V.
Verification: A verifier V checks the validity of the signature of the message by the following equations: g^e ≡ X^y · I^X (mod p) and e = F(X, m).
Note: In the original ElGamal scheme, the order of g ∈ (Z/pZ)* is p − 1. Although we can prove the security of the MEG with ord(g) = p − 1 in a manner similar to that with ord(g) = q, here for simplicity of description we assume ord(g) = q.
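The MEG signing and verification equations can likewise be exercised with toy parameters (p = 607, q = 101, g of order q; SHA-256 stands in for the hash function F; a sketch, not a secure implementation):

```python
import hashlib
import secrets

# Toy parameters (illustrative only): q | p - 1, g of order q in (Z/pZ)*.
p, q, g = 607, 101, 122


def F(X, m):
    """SHA-256 standing in for the hash function F, with output in Z/qZ."""
    return int.from_bytes(hashlib.sha256(f"{X}|{m}".encode()).digest(), "big") % q


s = secrets.randbelow(q)   # secret key
I = pow(g, s, p)           # public key


def meg_sign(m):
    r = 1 + secrets.randbelow(q - 1)        # r in (Z/qZ)*, so r is invertible
    X = pow(g, r, p)
    e = F(X, m)
    y = (e - s * X) * pow(r, -1, q) % q     # y = (e - sX)/r mod q
    return X, y


def meg_verify(m, X, y):
    """Check g^e == X^y * I^X (mod p) with e = F(X, m)."""
    return pow(g, F(X, m), p) == (pow(X, y, p) * pow(I, X, p)) % p


X, y = meg_sign("hello")
assert meg_verify("hello", X, y)
print("MEG signature verified")
```

The identity holds since X^y · I^X = g^(ry + sX) = g^e (mod p), using ry ≡ e − sX (mod q).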
5.2 Security
5.2.1 Identification Scheme
The following identification scheme is reduced to the MEG signature scheme in Section 5.1, and it will be analyzed adopting the scenario given in Section 3.
Key generation: the same as the Schnorr scheme.
Identification Protocol: P proves his identity and verifier V checks the validity of P's proof as follows:
Step 1 P generates a random integer r ∈ (Z/qZ)*, calculates X = g^r mod p, and sends X to verifier V.
Step 2 V generates a random integer e ∈ Z/qZ and sends it to P.
Step 3 P calculates y = (e − sX)/r mod q, and sends it to V.
Step 4 V checks the following equation: g^e ≡ X^y · I^X (mod p).

5.2.2 Security of Identification Scheme
Lemma 15. (Security of ElGamal Identification Scheme) Let ε ≥ 4/q. Suppose that the key searching problem of (p, g, I) is (t*, ε*)-secure. Then the ElGamal identification scheme with parameter (p, g̃, I)⁵ is (t, ε)-secure, where

  t* = (3(t + Φ1)/ε + Φ3) · √R  and  ε* = ((1/2)(1 − 1/e)²)^√R > (9/50)^√R.

Here Φ1 is the verification time of the identification protocol, Φ3 is the calculation time of r and s (or g̃) at Step 3 and Step 4, R = (p − 1)/q, and q is the order of g ∈ (Z/pZ)*.
⁵ g̃ is an appropriate element of the subgroup ⟨g⟩ generated by g.
Sketch of Proof: Assume that cheater A breaks the ElGamal identification with (t, ε) for (p, I) and all g̃ ∈ ⟨g⟩. We will construct a machine A* that breaks the key searching problem of (p, g, I) with (t*, ε*) using A. We will discuss the following probing strategy of H to find two 1's along the same row in H [5] for the identification scheme with parameter (p, g, I):
Step 1 Probe random entries in H to find an entry a(0) with 1. We denote the row where a(0) is located in H by H(0).
Step 2 After a(0) is found, probe random entries along H(0) to find another entry a(1) with 1.
Step 3 Calculate the value of r as follows,

  r = (e(0) − e(1))/(y(0) − y(1)) mod q,
where a(i) represents (X(i), e(i), y(i)) and X = X(0) = X(1) (i = 0, 1) holds. Note that r is coprime to q.
In Case 1 with gcd(X, q) = 1, calculate the secret value s as follows, output it and halt:

  s = (e(0) − r·y(0))/X mod q.

In Case 2 with gcd(X, q) ≠ 1, obtain b satisfying X = bq (= g^r mod p), where 0 < b < (p − 1)/q = R, and go to Step 4.
Step 4 (For Case 2 only) Run A with input g̃ = I·g^l mod p, applying Step 1 to Step 3, where l ∈ Z/qZ is randomly selected. There are two cases, Case 1 and Case 2.
In Case 1 with gcd(X̃, q) = 1, calculate s in the same way as in Step 3.
In Case 2 with gcd(X̃, q) ≠ 1, obtain b̃ as well as r̃ satisfying b̃q = g̃^r̃ mod p in the same way as in Step 3. If b̃ = b holds, calculate the secret value s as follows, output it and halt:

  s = (r − l·r̃)/r̃ mod q.

Otherwise, repeat Step 4 with another input.
The worst case for finding two colliding values of b is that these R − 1 events occur with equal probability 1/(R − 1) within Case 2. Let p1 be the success probability of Step 1 with 1/ε repetitions, and p2 the success probability of Step 2 with 2/ε repetitions. Let p3-1 be the success probability of Case 1 in Step 3; p3-1 = 0 in the worst case. Let p4 be the success probability of Step 4 with √R repetitions. Then p4 ≈ 1 because of the birthday paradox of finding b = b' satisfying 0 < b, b' < R. Therefore

  ε* = (p1 × p2)^√R · p4 ≥ ((1/2)(1 − 1/e)²)^√R

and

  t* = ((t + Φ1) × (1/ε + 2/ε) + Φ3) · √R = (3(t + Φ1)/ε + Φ3) · √R. □
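The Case 1 extraction in Step 3 can be checked numerically: from two accepting MEG transcripts with the same X, first r and then s fall out. A sketch with toy parameters (the secret key and transcripts are fabricated for illustration):

```python
# Toy parameters (illustrative only): q | p - 1, g of order q in (Z/pZ)*.
p, q, g = 607, 101, 122

s = 29                    # the secret key to be recovered
I = pow(g, s, p)

# Two accepting transcripts sharing the same commitment X (same row of H).
# For MEG, y = (e - sX)/r mod q, i.e. e = r*y + s*X mod q.
r = 17
X = pow(g, r, p)
e0, e1 = 5, 88
y0 = (e0 - s * X) * pow(r, -1, q) % q
y1 = (e1 - s * X) * pow(r, -1, q) % q

# Step 3: recover r from e0 - e1 = r*(y0 - y1) mod q.
r_rec = (e0 - e1) * pow(y0 - y1, -1, q) % q
assert r_rec == r

# Case 1 applies here: gcd(X, q) = 1, so X is invertible mod q.
assert X % q != 0
s_rec = (e0 - r_rec * y0) * pow(X, -1, q) % q
assert s_rec == s
print("recovered r, s:", r_rec, s_rec)
```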
5.2.3 Security of Signature Scheme
The following theorem is proven by combining Lemma 9 and Lemma 15.
Theorem 16. (Security of ElGamal Signature Scheme) Let ε' ≥ (qsig + 4)/q, where ε' = (ε − 1/q)/qF. Suppose that the key searching problem of (p, g, I) is (t*, ε*)-secure. Then the ElGamal signature scheme with parameter (p, g̃, I) is (t, qsig, qF, ε)-secure, where

  t* = (3(t + Φ1 + Φ2)/ε'' + Φ3) · √R  and  ε* = ((1/2)(1 − 1/e)²)^√R > (9/50)^√R.

Here Φ1 is the verification time of the identification protocol, Φ2 is the simulation time of qsig signatures, and Φ3 is the calculation time of r and s (or g̃) at Step 3 and Step 4. ε'' = (ε − 1/q)/qF − qsig/q, where q is the order of g ∈ (Z/pZ)*.
Remark 17. The simulation time of qsig signatures can be obtained in a manner similar to that in Lemma 8 of Reference [12].

5.3 More Efficient Reduction of MEG
Clearly the reduction for the MEG signature scheme is much less efficient than that of the Schnorr scheme, and the reduction does not preserve the parameter, (p, g, I). If we modify the MEG scheme as follows, the reduction can be almost as efficient as that of the Schnorr scheme and can preserve the parameter. The modified version of the MEG scheme is the same as the MEG scheme except: Verifier V checks whether gcd(X, q) = 1, and if it does not hold, V rejects the signature, (m, X, y). Note that when a valid signer generates (m, X, y), the probability that gcd(X, q) 6= 1 is 1/q (negligible probability).
6 Multi-Signature Schemes
Multi-signature schemes are signature schemes in which plural signers (e.g., L signers) jointly generate a signature (multi-signature) of a message under the condition that the length of the multi-signature is less than the total length of ordinary (single) signatures by the plural signers (e.g., L × |s|, where |s| is the ordinary signature length). We can apply our ID reduction technique to the "one-round type" of multi-signature schemes⁶. This section briefly introduces our results regarding multi-signature schemes. Due to space limitations, we omit a detailed description of the results [11].
⁶ The "two-round type" of multi-signature schemes has been proposed [10]. Our technique can also be applied to these schemes easily.
6.1 The Proposed Multi-Signature Schemes
We propose multi-signature schemes provably secure against the most general attack, adaptively chosen message insider attacks [7], in the random oracle model. The proposed schemes are as follows⁷:
Key generation: A trusted center publishes two large primes p and q such that q | (p − 1), and an element g ∈ (Z/pZ)* of order q. Each signer Pi chooses a secret key si ∈ Z/qZ and publishes the public key Ii, where Ii = g^si mod p (1 ≤ i ≤ L) and L is the number of signers.
Multi-Signature: Each signer Pi generates the signature of his message m using two public hash functions Fi and Hi as follows (1 ≤ i ≤ L):
Step 1 For i = 1 to L do, where y0 = 0 and V = P_(L+1): Pi generates a random integer ri ∈ Z/qZ, calculates Xi = g^ri mod p, ei = Fi(X1, ..., Xi, m) ∈ Z/qZ, di = Hi(X1, ..., Xi, m) ∈ Z/qZ and yi = y_(i−1) + di·ri + ei·si mod q, and sends (X1, ..., Xi, m, yi) to P_(i+1).
Step 2 V checks the following equations: g^yL ≡ X1^d1 ··· XL^dL · I1^e1 ··· IL^eL (mod p), and ei = Fi(X1, ..., Xi, m), di = Hi(X1, ..., Xi, m) (1 ≤ i ≤ L).
Remark 1. 1) We call the scheme where di = 1 Type I, the scheme where ei = 1 Type II, and the scheme with no restriction on di, ei Type III. 2) The schemes can also be implemented using an elliptic curve [8]. 3) It is possible for each Pi to check the validity of (I1, ..., I_(i−1), X1, ..., X_(i−1), m, E1, ..., E_(i−1), Y_(i−1)) before generating his signature.
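The generation and verification steps above (Type III, with no restriction on di, ei) can be sketched as follows, with toy parameters and SHA-256 standing in for the oracles Fi and Hi (illustrative only, not a secure implementation):

```python
import hashlib
import secrets

# Toy parameters (illustrative only): q | p - 1, g of order q in (Z/pZ)*.
p, q, g = 607, 101, 122
L = 3  # number of signers


def H(tag, i, Xs, m):
    """SHA-256 standing in for the random oracles F_i ('F') and H_i ('H')."""
    data = f"{tag}|{i}|{Xs}|{m}".encode()
    return int.from_bytes(hashlib.sha256(data).digest(), "big") % q


sks = [secrets.randbelow(q) for _ in range(L)]   # secret keys s_i
pks = [pow(g, sk, p) for sk in sks]              # public keys I_i = g^{s_i}

# Step 1: signers P_1 .. P_L extend the running value y in turn.
m, Xs, y = "joint message", [], 0
for i in range(L):
    r = secrets.randbelow(q)
    Xs.append(pow(g, r, p))                      # X_i = g^{r_i}
    e = H("F", i, Xs, m)                         # e_i = F_i(X_1..X_i, m)
    d = H("H", i, Xs, m)                         # d_i = H_i(X_1..X_i, m)
    y = (y + d * r + e * sks[i]) % q             # y_i = y_{i-1} + d_i r_i + e_i s_i

# Step 2: V checks g^{y_L} == prod X_i^{d_i} * prod I_i^{e_i} (mod p).
lhs = pow(g, y, p)
rhs = 1
for i in range(L):
    e = H("F", i, Xs[: i + 1], m)
    d = H("H", i, Xs[: i + 1], m)
    rhs = rhs * pow(Xs[i], d, p) * pow(pks[i], e, p) % p
assert lhs == rhs
print("multi-signature verified")
```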
6.2 Security of the Schemes
The main results are as follows:
Theorem 17. (Security of the Proposed Multi-Signature Scheme (Type II))
Let ε' ≥ (2^(L+1) + qsig)/q. Here ε' = HL, where H0 = ε and Hi = (H_(i−1) − 1/q)/qHi (1 ≤ i ≤ L). Suppose that the calculation of s from I1, ..., IL satisfying I1 × ··· × IL = g^s mod p is (tII(L), εII(L))-secure. Then the proposed multi-signature scheme with the same parameter is (t, qsig, qH1, ..., qHL, ε)-secure, where

  tII(L) = (2^(2L+1) + 1) · t'/(3ε'') + Φ3,
  εII(L) = ((1/2)(1 − 1/e))^(2^L(2^L − 1)) > (3/10)^(2^L(2^L − 1)).
⁷ For simplicity of explanation, in this paper we use the multiplicative group (Z/pZ)* to present our schemes and the security proofs. Only implementations over elliptic curves [8], however, are feasible in light of the multi-signature size. Note that the security of the elliptic curve versions can be proven in the same manner as that of the multiplicative group versions.
Here t' = t + Φ1 + Φ2 and ε'' = HL − qsig/q. Φ1 is the verification time of the identification protocol, Φ2 is the simulation time of qsig signatures, Φ3 is the calculation time of s in the final stage of the reduction, and q is the order of g ∈ (Z/pZ)*.
Theorem 18. (Security of the Proposed Multi-Signature (Type III))
Let ε' ≥ (2^(L+2) + qsig)/q. Here ε' = HL, where H0 = ε, Fi = (H_(i−1) − 1/q)/qFi, and Hi = Fi/qHi (1 ≤ i ≤ L). Suppose that the calculation of si from I1, ..., IL satisfying Ii = g^si mod p is (tIII(L), εIII(L))-secure. Then the proposed multi-signature scheme with the same parameter is (t, qsig, qF1, qH1, ..., qFL, qHL, ε)-secure, where

  tIII(L) = (2^(2L+1) + 3L·2^(L+1) − 3·2^(L+1) + 1) · t'/(3ε'') + Φ3,
  εIII(L) = εII(L) · (1/2)^(L−1) · (1 − 1/e)^(2^L + L − 2) > (3/10)^(2^L + L − 1) · εII(L).

Here t' = t + Φ1 + Φ2 and ε'' = HL − qsig/q. Φ1 is the verification time of the identification protocol, Φ2 is the simulation time of qsig signatures, Φ3 is the calculation time of s in the final stage of the reduction, and q is the order of g ∈ (Z/pZ)*.
Remark 2. The multi-signature scheme of Type I is forgeable by a true signer; for example, signer L can make a multi-signature of an arbitrary message m without the cooperation of the other (L − 1) signers.
7 Conclusion
This paper presented a new key technique, “ID reduction”, to show the concrete security result of a class of practical signature schemes under the random oracle paradigm. We applied this technique to the Schnorr and modified ElGamal schemes, and showed the “concrete security” of these schemes. We also applied it to the multi-signature schemes. This technique should be useful in proving the concrete security of various types of signatures such as blind signatures, group signatures and undeniable signatures.
Acknowledgments
We would like to thank Adi Shamir for his talks at NTT in 1988, which inspired us to create the reduction technique introduced in this paper. We would also like to thank Moti Yung for his invaluable support in revising our manuscript. We wish to thank the anonymous reviewers for their useful comments.
References
1. M. Bellare and P. Rogaway, "Random Oracles are Practical: A Paradigm for Designing Efficient Protocols," Proc. of the First ACM Conference on Computer and Communications Security, pp. 62–73.
2. M. Bellare and P. Rogaway, "The Exact Security of Digital Signatures – How to Sign with RSA and Rabin," Advances in Cryptology – EUROCRYPT'96, Springer-Verlag, pp. 399–416.
3. T. ElGamal, "A Public Key Cryptosystem and a Signature Scheme Based on Discrete Logarithms," IEEE Transactions on Information Theory, IT-31, 4, pp. 469–472, 1985.
4. A. Fiat and A. Shamir, "How to Prove Yourself," Advances in Cryptology – CRYPTO'86, Springer-Verlag, pp. 186–194.
5. U. Feige, A. Fiat and A. Shamir, "Zero-Knowledge Proofs of Identity," J. of Cryptology, 1, pp. 77–94.
6. S. Goldwasser, S. Micali and C. Rackoff, "The Knowledge Complexity of Interactive Proof Systems," SIAM J. on Computing, 18, pp. 186–208, 1989.
7. S. Goldwasser, S. Micali and R. Rivest, "A Digital Signature Scheme Secure Against Adaptive Chosen-Message Attacks," SIAM J. on Computing, 17, pp. 281–308, 1988.
8. N. Koblitz, "Elliptic Curve Cryptosystems," Mathematics of Computation, 48, pp. 203–209, 1987.
9. M. Naor and M. Yung, "Universal One-Way Hash Functions and Their Cryptographic Applications," Proc. of STOC, pp. 33–43, 1989.
10. K. Ohta and T. Okamoto, "A Digital Multisignature Scheme Based on the Fiat-Shamir Scheme," Advances in Cryptology – ASIACRYPT'91, Springer-Verlag, pp. 139–148.
11. K. Ohta and T. Okamoto, "The Exact Security of Multi-Signature Schemes," Technical Report of IEICE, ISEC97-27 (July 1997), pp. 41–52.
12. D. Pointcheval and J. Stern, "Security Proofs for Signature Schemes," Advances in Cryptology – EUROCRYPT'96, Springer-Verlag, pp. 387–398.
13. J. Rompel, "One-Way Functions are Necessary and Sufficient for Secure Signatures," Proc. of STOC, pp. 387–394, 1990.
14. R. Rivest, A. Shamir and L. Adleman, "A Method for Obtaining Digital Signatures and Public Key Cryptosystems," Communications of the ACM, 21, 2, pp. 120–126, 1978.
15. C.P. Schnorr, "Efficient Identification and Signatures for Smart Cards," Advances in Cryptology – EUROCRYPT'89, Springer-Verlag, pp. 235–251.
Building PRFs from PRPs? Chris Hall1 , David Wagner2 , John Kelsey1 , and Bruce Schneier1 1
Counterpane Systems {hall,kelsey,schneier}@counterpane.com 2 U.C. Berkeley [email protected]
Abstract. We evaluate constructions for building pseudo-random functions (PRFs) from pseudo-random permutations (PRPs). We present two constructions: a slower construction which preserves the security of the PRP, and a faster construction which has less security. One application of our constructions is to build a wider block cipher given a block cipher as a building block. We do not require any additional constructions (e.g., pseudo-random generators) to create the wider block cipher, and the resulting cipher will be as strong as the original block cipher.
Keywords: pseudo-random permutations, pseudo-random functions, concrete security, block ciphers, cipher feedback mode.
1 Introduction and Background
In this paper we examine building pseudo-random functions from pseudo-random permutations. There are several well-known constructions for building pseudo-random permutations from pseudo-random functions, notably [LR88]. However, the only results we are aware of for going in the reverse direction are the recent results of Bellare et al. in [BKR98].¹

One primary justification for building pseudo-random functions is that it allows one to use the results of Bellare et al. [BDJR97] to produce an n-bit cipher that can be used to encrypt more than 2^{n/2} blocks. Due to birthday attacks, n-bit permutations will leak information about the plaintext after 2^{n/2} blocks. By closing the loop between pseudo-random functions and permutations, we can also accomplish a number of things: widening the block width of a cipher, creating a provably secure 1-bit cipher feedback mode, and building encryption functions secure for more than 2^{n/2} blocks. Given the plethora of existing practical block ciphers, it would be nice to be able to create pseudo-random functions from them directly, without having to resort to building new primitives from scratch.

Our work extends previous work on pseudo-random functions (PRFs) and permutations (PRPs). PRFs and PRPs were initially defined in [GGM86] as functions (resp. permutations) which a polynomially-bounded attacker cannot
* The full paper is available at http://www.counterpane.com/publish-1998.html.
¹ We were unaware of these results when we originally wrote our paper; they were pointed out to us by an anonymous referee.
H. Krawczyk (Ed.): CRYPTO'98, LNCS 1462, pp. 370–390, 1998. © Springer-Verlag Berlin Heidelberg 1998
to distinguish from truly random functions (resp. permutations) with more than negligible probability. A more recent paper by Bellare et al. [BDJR97] evaluates four different notions of security and applies those notions to the definitions of PRFs and PRPs. In addition, M. Luby has written a book on pseudorandomness which provides an excellent summary of the theoretical constructions leading up to PRFs [Lub96]. Some authors have made a distinction between PRPs and super PRPs. With a super PRP, an adversary is allowed to query for inverse evaluations of the permutation [LR88]. For our applications, we require the "super" variety of PRP. Therefore, for the remainder of this paper we shall consider only super PRPs; we usually omit the "super" prefix for conciseness.

Extensive research has been conducted on building PRPs from PRFs. Many of the constructions are based on Luby and Rackoff's original work [LR88]. Let F(l, r) = Ψ^m(f_1, ..., f_m)(l, r) denote an m-round Feistel network where f_i ∈ IF_{n:n}. Then F(l, r) ∈ IP_{2n}, where Ψ^i(f_1, ..., f_i) is defined by

    Ψ(f)(l, r) = (r, l ⊕ f(r))
    Ψ^k(f_1, ..., f_k) = Ψ(f_k) ◦ Ψ(f_{k−1}) ◦ ··· ◦ Ψ(f_1).

Luby and Rackoff [LR88] showed that an adversary has advantage at most m(m − 1)/2^n if they make m < Q(n) queries for some polynomial Q(x). Recall that the advantage is computed as

    Adv A = |P[A^p = 1] − P[A^f = 1]|,

where A is an adversary who returns 1 if they believe they are looking at a 2n-bit permutation from the Ψ^3(f_1, f_2, f_3) family and 0 otherwise. Then P[A^p = 1] denotes the probability that an attacker returns 1 when given p ∈ {Ψ^3(f_1, f_2, f_3) : f_i ∈ IF_{n:n}}, and P[A^f = 1] denotes the probability that an attacker returns 1 when given f ∈ IP_{2n}. The result was generalized for m < 2^{n/2} [AV96,M92,P91b,P92] to Adv A = O(m^2/2^n). Many different researchers have investigated variations of this construction [AV96,C97,Luc96,M92,P91b] [P92,P97,SP91,SP92,ZMI89a,ZMI89b] and even proposed different constructions [M92,P97]. The exact nature of these constructions is beyond the scope of this document; they investigate building PRPs from PRFs, and we are interested in going the other direction.

In addition to designing PRPs from PRFs, some researchers have studied designing PRFs from smaller PRFs. Aiello and Venkatesan built a 2n-bit to 2n-bit function from eight n-bit to n-bit functions using a Benes transform [AV96]. They achieved the notable bound that

    Adv A = |P[A^B = 1] − P[A^f = 1]| = O(m/2^n)

after m queries, where A^B is the result of executing the adversary A with the oracle instantiated by a function B from the Benes family, and A^f is the result of running the adversary with a random 2n-bit function.
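For concreteness, the Feistel operator Ψ and its inverse can be sketched as follows (a toy Python sketch; the function and variable names are ours, not from the cited papers):

```python
def feistel(round_funcs, l, r):
    # One application of Psi per round function: (l, r) -> (r, l XOR f(r)).
    # Composing m rounds gives Psi^m(f1, ..., fm)(l, r).
    for f in round_funcs:
        l, r = r, l ^ f(r)
    return l, r

def feistel_inverse(round_funcs, l, r):
    # Undo the rounds in reverse order: (l, r) came from (r XOR f(l), l).
    for f in reversed(round_funcs):
        l, r = r ^ f(l), l
    return l, r
```

Note that even when each round function f_i is non-invertible, the composed map is a permutation on 2n bits, which is the point of the construction.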
Aside from building wider functions, some researchers have examined building variable-length input PRFs (VI-PRFs). In [BCK96], Bellare et al. formalize the notion of VI-PRFs and analyze the security of several constructions. Their functions are constructed using simpler primitives: fixed-length input PRFs (FI-PRFs), which were first defined in [BKR94] in order to model the Data Encryption Algorithm (DES). One important feature of these papers is that they focus on concrete security analysis, which attempts to provide precise estimates of security rather than being satisfied with asymptotic results. Bellare et al. initiated this study in [BKR94,BGR95]. When it comes to building PRFs from PRPs, though several different people have noted that a PRP can be used as a PRF with advantage O(m^2/2^n) for m < 2^{n/2} (e.g., [AV96]), there has been a notable lack of research in this area. One recent exception is the excellent paper of Bellare et al. [BKR98], which uses the notion of data-dependent keying to build a PRF from a PRP. Their results present strong evidence for the security of their PRP→PRF construction, and they take some initial steps towards a more complete analysis of its strength against computationally-bounded adversaries. One of the most appealing features of their construction is its practicality: the construction is very simple, and performance is degraded by only a factor of two (or less, when in stream cipher modes). It should be possible to use their re-keying construction in the applications found in Section 3, as a drop-in replacement for our PRP→PRF construction. This would provide corresponding improvements in performance; the tradeoff is that the available security results are weaker for the re-keying construction.
One interesting motivation for building PRFs from PRPs is that we could build larger PRPs from smaller PRPs by first constructing PRFs from the PRPs, using the results of [AV96] to strengthen the function, and finally using one of the many results available for building PRPs from PRFs. This is in fact the approach we take in our paper, and it provides the first useful technique (which we are aware of) for securely increasing the block width of a trusted block cipher.

The format of the rest of our paper is as follows. In Section 2 we introduce our two constructions for producing PRFs from PRPs: ORDER(P)_i^{j:k} and TRUNCATE(P)_n^{n−m}. In Section 3 we apply our constructions to a few different problems and give descriptions of our solutions. Finally, in Section 4 we analyze the security of the constructions presented in Section 2.

1.1 Notation
In this section we introduce some of the notation we will use through the rest of the paper:

R_n       Z_2^n, the set of all n-bit blocks.
IP_n      The symmetric group on 2^n elements (specifically, the set of all permutations on R_n).
IF_{n:m}  The set of all functions with the signature R_n → R_m.
p(a, b)   Denotes the evaluation of a permutation p on the bit concatenation of a and b.
(t, q, e)-secure   Captures the notion of a cryptographic primitive which is secure, in the sense that any adversary who uses at most t units of offline work and issues at most q chosen-plaintext/ciphertext queries can gain advantage at most e in distinguishing the primitive from a random function (or permutation). This roughly means that an attacker must either do t work or make q chosen-text queries to have any chance of breaking the primitive; however, even then the attacker is not guaranteed a practically-useful attack. Therefore, this is a very strong measure of security, in the sense that if a primitive is proven strong under this model, it is likely to be very strong indeed.
2 The Constructions
We introduce two constructions: ORDER(P)_i^{j:k} and TRUNCATE(P)_n^{n−m}. The former uses a permutation p ∈ IP_i to produce a function f ∈ IF_{j:k}. The latter uses a permutation p ∈ IP_n to produce a function f ∈ IF_{n:n−m}. In this section we give a brief description of each of these constructions along with their security, but leave the analysis of their security until Section 4.

2.1 ORDER(P)_{n+1}^{n:1}
Our first construction, ORDER(P)_{n+1}^{n:1}, uses the order of entries in a table representing an (n+1)-bit permutation to encode a function f_p ∈ IF_{n:1}. More specifically, if p ∈ IP_{n+1} then we assign a function f_p ∈ IF_{n:1} where

    f_p(x) = 0 if p(0, x) < p(1, x), and 1 otherwise.

Here the expression p(0, x) stands for the result of applying p to the concatenation of the bit 0 and the n-bit value x. Note that this construction is really a special instance of a much more general construction given in the next two sections. However, for our analysis we found it much simpler to analyze this simple case (in a later section) and extrapolate to the more general case.
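The construction can be sketched directly in Python (the permutation is given as a lookup table; taking the prefix bit b as the top bit of the (n+1)-bit input is an indexing choice of this sketch):

```python
def order_bit(p, n, x):
    # f_p(x) = 0 iff p(0||x) < p(1||x), where 0||x and 1||x are the two
    # (n+1)-bit inputs whose low n bits are x and whose top bit differs.
    return 0 if p[x] < p[(1 << n) | x] else 1
```

For the identity permutation we always have p(0, x) < p(1, x), so f_p is the all-zero function; a random permutation orders each pair independently.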
2.2 Wider Outputs: ORDER(P)_{n+km}^{n:2^{k−1}m}
Of course, in practice a PRF with a 1-bit output is rarely useful. Fortunately, there are some simple techniques to build wide-output PRFs from the 1-bit-output construction above. One basic approach is to observe that j = 2^k (independent) PRFs from IF_{n:m} suffice to build a PRF in IF_{n:jm}. This yields the following construction. Given a PRF F ∈ IF_{n+k:m}, we build a wider PRF G ∈ IF_{n:2^k m} by

    G(x) = (F(0, x), F(1, x), ..., F(2^k − 1, x)).
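The prefix-indexing trick can be sketched as follows (names are ours; F takes a k-bit prefix and the n-bit input):

```python
def widen(F, k, x):
    # G(x) = (F(0, x), F(1, x), ..., F(2^k - 1, x)): spend k input bits to
    # select among 2^k effectively independent copies of F.
    return tuple(F(i, x) for i in range(1 << k))
```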
This construction has the disadvantage that it reduces the input size slightly, which can be a problem for some applications. It also does not produce an optimal construction. We leave the analysis to Section 4, but in fact the next construction is optimal. By optimal we mean that it divides the set of permutations IP_n into equally-sized equivalence classes such that each equivalence class contains an odd number of permutations. This implies that we cannot divide the equivalence classes any further and still expect to have equally-sized equivalence classes.

2.3 Wider Outputs, Efficiently: ORDER(P)_{n+m}^{n:2^m−1}
As one might expect, the construction of the previous section fails to extract all of the possible bits from a permutation. In some cases we can nearly double the number of bits that we obtain from a given permutation p by extracting more information about the sorted order of the p(x). Suppose that we have a permutation p ∈ IP_{n+m}; then we build a function f ∈ IF_{n:2^m−1} in the following fashion. First we determine 2^{m−1} bits of f(x) by using the construction outlined in the previous section. That is, if [f(x)]_i denotes bit i of f(x) and 0 ≤ i < 2^{m−1}, then

    [f(x)]_i = 0 if p(0, i, x) < p(1, i, x), and 1 otherwise.

Briefly, the remaining bits of information are obtained by comparing the minimum elements of pairs of values min{p(x_j), p(x'_j)} for j = 1, 2. To be more precise, we start by creating 2^n perfectly-balanced binary trees with 2^m − 1 nodes (i.e., each one has height m − 1) and uniquely assign each tree to a different n-bit value x. Hence tree x will correspond to f(x). For any given tree, each node has three values associated with it: an (n+m)-bit value X(x), a 1-bit value Y(x), and an m-bit value Z(x), which serves to identify the node. For ease of exposition, we assign Z(x) so that the root node has Z(x) = 0, the left child of a node has the value 2Z(x) + 1, and the right child of a node has the value 2Z(x) + 2 (implying Z is independent of x, so we can drop it). This assigns each node a unique m-bit value and allows us to associate bit i of f(x) with the Y(x) value of node i.² Using these particular Z-values, the leaf nodes will have the values 2^{m−1} − 1 to 2^m − 2. Let X_i(x) and Y_i(x) denote the X and Y values respectively of the leaf node with Z = i. Then for i ≥ 2^{m−1} − 1,

    Y_i(x) = 0 if p(0, Z_i − (2^{m−1} − 1), x) < p(1, Z_i − (2^{m−1} − 1), x), and 1 otherwise;
    X_i(x) = min{p(0, Z_i − (2^{m−1} − 1), x), p(1, Z_i − (2^{m−1} − 1), x)}.
² Some other ordering of the nodes will also work, such as one given by a postorder traversal of the tree.
This is precisely the information obtained by the construction of the previous section. For the remaining 2^{m−1} − 1 bits of information, we assign

    Y_i(x) = 0 if X_{2i+1}(x) < X_{2i+2}(x), and 1 otherwise;
    X_i(x) = min{X_{2i+1}(x), X_{2i+2}(x)}.

It should be clear that this tree partially encodes the order of p(0, x), ..., p(2^m − 1, x), but that more than one permutation may produce the same function. It requires 2^m invocations of the permutation p to produce the 2^m − 1 bits of f_p(x). For an example evaluation of f(x) for p ∈ IP_3, see Figure 1.

Fig. 1. Example function from p ∈ IP_3, with p(0, x), ..., p(7, x) = 2, 5, 1, 0, 6, 3, 4, 7; reading the Y values of the tree in postorder gives f(x) = 0111000.
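The tree evaluation can be sketched as follows (Python; the bit layout of p's input — prefix bit on top, then the (m−1)-bit leaf index, then the n-bit x — is an indexing assumption of this sketch):

```python
def order_tree(p, n, m, x):
    # Extract 2^m - 1 bits from the sort order of p(0,x), ..., p(2^m - 1,x).
    # Node i stores X[i] (a running minimum) and Y[i] (one output bit);
    # leaves are nodes 2^(m-1) - 1 .. 2^m - 2, children of i are 2i+1, 2i+2.
    size = (1 << m) - 1
    X, Y = [0] * size, [0] * size
    first_leaf = (1 << (m - 1)) - 1
    for i in range(size - 1, -1, -1):        # children before parents
        if i >= first_leaf:                  # leaf: compare p(0,j,x), p(1,j,x)
            j = i - first_leaf
            a = p[(j << n) | x]                        # prefix bit b = 0
            b = p[(1 << (n + m - 1)) | (j << n) | x]   # prefix bit b = 1
        else:                                # internal: compare children's X
            a, b = X[2 * i + 1], X[2 * i + 2]
        Y[i] = 0 if a < b else 1
        X[i] = min(a, b)
    return Y                                 # bit i of f(x) is Y[i]
```

Under this layout the identity permutation yields the all-zero function, since every b = 0 image is smaller than its b = 1 partner and leaf minima increase with the leaf index.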
We can use this latter technique to build a PRF in IF_{m:n} using a permutation in IP_a where a = m + ⌈log_2(n+1)⌉. It will require 2^{⌈log_2(n+1)⌉} invocations of E per invocation of the PRF. Note that if n ≠ 2^l − 1 for some l, we will actually obtain a wider output than n bits; in that case we can simply truncate the output to n bits and retain the security of the PRF. While this construction provably transfers (essentially) all of the security of the underlying block cipher to the PRF, the disadvantage is that it has poor performance: we can get a 57:127 PRF that is provably as strong as DES (and hence a 57:64 PRF), but it requires 128 queries to DES per PRF computation.
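A quick sketch of these parameter counts (the formulas restate the text; the DES figures above serve as the check):

```python
import math

def order_params(m, n):
    # Building an IF_{m:n} PRF via the tree construction: a tree producing
    # 2^t - 1 >= n output bits needs t = ceil(log2(n + 1)); the permutation
    # width is a = m + t and each PRF call costs 2^t permutation queries.
    t = math.ceil(math.log2(n + 1))
    return m + t, 1 << t          # (a, invocations per PRF call)
```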
2.4 TRUNCATE(P)_n^{n−m}
Our second construction has much better performance, but uses a very different idea: we merely truncate a few bits of the output of the underlying block cipher, so the PRF can be almost as fast as the block cipher. Formally, let p ∈ IP_n be a random permutation. We assign a function f_p ∈ IF_{n:n−m} by f_p = g ◦ p, where g : R_n → R_{n−m} denotes the function which truncates the high m bits from its input. For a PRP family {π_k}, the resulting PRF family is {f_{π_k}}.
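As code, the construction is essentially a one-liner (sketch; p is any callable on n-bit integers):

```python
def truncate_prf(p, n, m, x):
    # f_p(x) = the low n-m bits of p(x); the high m bits are discarded.
    return p(x) & ((1 << (n - m)) - 1)
```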
The disadvantage of this approach is that it does not preserve the security of the underlying block cipher nearly as well: our proofs only work when the attacker has at most O(min{2^{(n+m)/2}, 2^{2(n−m)/3}}) chosen texts available to him, where n is the block width of the underlying cipher and m is the number of truncated bits. In practice, this means that we can prove security up to O(2^{4n/7}) chosen texts by truncating m = n/7 bits, but our analysis degrades too much to provide better bounds for larger m.
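The m = n/7 figure comes from balancing the two exponents in the bound; a quick sketch using exact rational arithmetic:

```python
from fractions import Fraction

def bound_exponent(n, m):
    # Exponent of the proven chosen-text bound: min{(n+m)/2, 2(n-m)/3}.
    return min(Fraction(n + m, 2), Fraction(2 * (n - m), 3))
```

The first exponent grows with m and the second shrinks, so the minimum is maximized where they cross, at m = n/7, giving the 4n/7 exponent.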
3 Applications
There are several nice applications of our result. Probably one of the most interesting is that we "close the loop" between PRFs and PRPs. Luby-Rackoff [LR88] gave a nice PRF → PRP construction; we have now shown how to go the other direction.³ We explore two additional possibilities in the next sections. There are others listed below, but due to a lack of time and space we have omitted further analysis of these ideas; hence we pose them as open problems for further study.

1. Building MACs (or possibly hash functions) with provable security. The disadvantage is that they are likely to be very slow.
2. Building provably-strong PRNGs out of our constructions for provably-strong PRFs. Such a tool might be used for session key derivation, for example. The advantage is that in many cases (depending upon the application, of course) the PRNG isn't performance-critical, so slow techniques are still interesting if they have notable security advantages.
3. Building provably-strong stream ciphers from our constructions for provably-strong PRFs. By running a good PRF in counter mode, you can get security past the birthday bound. In contrast, 64-bit block ciphers typically run into security problems when used in a standard chaining mode to encrypt more than 2^32 known texts, no matter how strong the cipher is. Bellare et al. have explored this application further in [BKR98].

3.1 Building Wider Block Ciphers
Techniques for building wider block ciphers are especially relevant as the AES standards effort ramps up. The problem with most existing ciphers, such as Triple-DES, is that they offer only 64-bit blocks, and thus fall prey to certain birthday attacks that can work with only 2^32 texts or so. (The matching-ciphertext attack is one example.) As network communication speeds rise, this limit becomes increasingly concerning: for instance, on a 1 Gbit/sec encrypted link, we expect that information about two plaintext blocks will leak after only 6 minutes or so. The only solution is to move towards ciphers with wider block lengths, but if this involves a full cipher redesign, then we may forfeit the insights provided
We don’t consider treating a PRP as a PRF as an example because it won’t produce more general functions, and because its security level is limited.
by more than two decades of analysis on DES and Triple-DES. This motivates our search for a construction which can provably retain the time-tested security level of Triple-DES while providing a wider block length. This paper provides new results in this area. If we have a trusted cipher, then we can model it as a PRP family. Using one of our constructions we can construct a PRF family, use the Benes transform [AV96] to create a wider PRF family, and finally use Luby-Rackoff to create a PRP family again. The nice thing is that the resulting PRP family will be almost four times as wide as the original construction. Furthermore, we will be able to provide provable security reductions to show that the widened cipher is likely to be strong if the original cipher is secure.

We will focus on a particular example and consider Triple-DES. Hence let n = 64 and P = {E_k} be the Triple-DES family, where E_k(X) denotes encryption with key k and plaintext X. Also suppose that P is (t, q, e)-secure. Then using ORDER(P)_{64}^{58:63} we can construct a PRF family F = {f_{E_k}} ⊂ IF_{58:58} (truncating the 5 extra output bits). As later analysis will show, F is (t, q/64, e)-secure. In other words, F largely retains the security properties of Triple-DES. Using the modified Benes transform, we can form a second PRF family F² = {g_f} where f ∈ F and F² ⊂ IF_{110:110}. The reason we do not obtain a family F² ⊂ IF_{116:116} is that the modified Benes transform requires six independent functions; hence we use the first three bits of our functions in F to obtain eight independent function families for constructing F². The results of [AV96] show that F² is a (t, q/64, e + e′)-secure PRF, where e′ is negligible for q/64 < 2^{110}. Finally, using F² and Luby-Rackoff we can create a final PRP family P² = {p_{g_1,g_2,g_3}} where g_i ∈ F² and P² ⊂ IP_{220}. P² will have nearly identical security to F²: (t, q/64, e + e′ + e″), where e″ is negligible for q/64 < 2^{55}.
The primary disadvantage of this construction is that the resulting widened cipher P² will be almost 450 times slower than the original cipher P. With a normal 4-round Luby-Rackoff construction, we will use 4 invocations of functions from F² for each invocation of a permutation in P². Each function in F² uses 6 invocations of functions from F. Finally, for each function in F we will use 64 invocations of a permutation in P. Hence we will require 4 × 6 × 64 = 1536 invocations of Triple-DES per invocation of P²; of course, each invocation of P² encrypts 220/64 times as many bits as P, so the performance of the widened cipher P² will be 1536 · 64/220 ≈ 447 times worse than Triple-DES. This is definitely very slow, but it is provably secure!

An alternative to using Luby-Rackoff is to use a construction by M. Naor and O. Reingold [NR96].⁴ There we get a twofold speed-up in execution, requiring only 768 invocations of Triple-DES. This translates to performance roughly 223 times worse than Triple-DES. The advantage, of course, is that we have removed the 2^32-texts limitation on the security of Triple-DES.
⁴ Note: the security proof of the Luby-Rackoff construction given in [LR88] actually assumes that the g_i are independent, but the proofs in [NR96] remove that restriction.
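The cost accounting in this example can be checked with a few lines of arithmetic (a sketch; the numbers are those quoted in the text):

```python
# 4 Luby-Rackoff rounds, 6 Benes-transform functions per round function,
# and 64 Triple-DES calls per ORDER evaluation:
invocations = 4 * 6 * 64
# P^2 encrypts 220-bit blocks while Triple-DES encrypts 64-bit blocks,
# so the per-bit slowdown is:
slowdown = invocations * 64 / 220
# Naor-Reingold halves the invocation count:
nr_slowdown = (invocations // 2) * 64 / 220
```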
Our construction uses the Benes transform only for technical reasons. The reason we can't apply the Luby-Rackoff construction directly to our PRF family F is that a 4-round Luby-Rackoff cipher with m-bit blocks is only secure up to 2^{m/4} texts; with m = 2 · 58, the security level would be too low, so we build a double-width PRF family F² to increase m. However, eliminating the Benes transform could produce significant performance speedups, so this motivates the search for PRP constructions with better security. For example, by using Patarin's recent results on the 6-round Luby-Rackoff construction [P98], we can build a widened cipher P³ ⊂ IP_{116} that is secure with up to about min{q, 2^{43.5}} texts; by using single-DES instead of Triple-DES as our starting point, we can get a 116-bit cipher which is provably as secure as DES and has performance about 212 times worse than DES (or about 71 times worse than Triple-DES), though it has somewhat less security than our construction of P².

It would also be possible to use the TRUNCATE(P)_n^{n−m} construction to build a double-width block cipher, instead of ORDER(P)_{64}^{58:63} as above. This would provide significantly better performance (the widened cipher could run at as much as 1/3 the speed of the original cipher). However, at present the available proofs provide no guarantee of security past about 2^{36} texts, which is probably not a compelling advantage over the 2^32 birthday bound. As a third alternative, one could use the re-keying construction of Bellare et al. [BKR98] to build F out of Triple-DES (say). Applying the Benes transform and the Naor-Reingold construction would then provide a 256-bit cipher which is only 3 times slower than Triple-DES. The disadvantage is that the available security results are difficult to compare with the figures given above.

The examples we gave here usually resulted in a block cipher with a peculiar width.
It should be clear how to modify this example slightly to generate (say) a 192-bit block cipher, by truncating F or F² to the appropriate size.

3.2 Applications to 1-Bit CFB Mode
We note that our main construction provides a way to increase the robustness of 1-bit CFB mode by tweaking the mode slightly. The standard 1-bit CFB mode builds a function h : Z_2^n → Z_2 by letting h(x) be the least significant bit of the encryption E_k(x) of x under a block cipher E with key k. Then we build a stream cipher as C_j = P_j ⊕ h(C_{j−64}, ..., C_{j−1}). The problem is that we are not aware of any proof that CFB mode preserves the security of the underlying block cipher. Clearly all of the security of 1-bit CFB mode must come from the non-linear Boolean function h. Theorem 1 (in Section 4.1) guarantees the security of h against an adversary with access to q ≪ 2^{n/2} chosen-text queries, assuming the underlying block cipher is secure. However, for typical block ciphers 2^{n/2} = 2^{32}, which means that the "security warranty" provided by Theorem 1 is voided after 2^{32} chosen-text queries.⁵ As
⁵ Theorem 7 can guarantee security up to q = O(2^{4n/7}), but this is still significantly smaller than the O(2^n)-query security we would ideally hope to see. As we shall see, this hope is not unreasonable.
most typical block ciphers are built to resist much more powerful adversaries, we would prefer to have better reductions. We do not know of any better security proofs for 1-bit CFB in the literature, but we can improve the situation with a slight modification to the mode. Replace h by the function f_{E_k} defined in Section 2.1. (We will need to sacrifice one bit of feedback, so that C_j = P_j ⊕ f_{E_k}(C_{j−63}, ..., C_{j−1}), but 63 bits of feedback is more than sufficient for practical purposes.) This requires two encryptions per invocation of f_{E_k}, so our mode will be twice as slow as standard 1-bit CFB, but we do not expect this to be a serious problem, as implementors typically use 1-bit CFB mode for its resynchronization properties rather than for its performance characteristics. Of course, the primary advantage of our modified 1-bit cipher feedback mode is that we can provide provable security reductions for f_{E_k}. If E is a (t, q, e)-PRP, then f_{E_k} will be a (t, q/2, e)-PRF. In short, our construction of f_{E_k} preserves the security level of the underlying block cipher extremely effectively. Therefore, this modification to 1-bit CFB mode looks attractive for practical use.
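The modified mode can be sketched as follows (Python; f stands in for f_{E_k} and is treated as a black box mapping 63 bits to 1 bit, and the 63-bit IV handling is a simplification of this sketch):

```python
def cfb1_encrypt(f, plain_bits, iv_bits):
    # C_j = P_j XOR f(C_{j-63}, ..., C_{j-1}); state holds the last 63
    # ciphertext bits, seeded by the IV.
    state = list(iv_bits)
    cipher = []
    for p in plain_bits:
        c = p ^ f(tuple(state))
        cipher.append(c)
        state = state[1:] + [c]
    return cipher

def cfb1_decrypt(f, cipher_bits, iv_bits):
    # Decryption regenerates the same keystream from the ciphertext bits,
    # which also gives the mode its self-resynchronization property.
    state = list(iv_bits)
    plain = []
    for c in cipher_bits:
        plain.append(c ^ f(tuple(state)))
        state = state[1:] + [c]
    return plain
```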
4 Analysis
In this section we provide analysis of our ORDER(P)_i^{j:k} and TRUNCATE(P)_n^{n−m} constructions. In addition, we evaluate the security of a PRP family when viewed as a PRF family.

4.1 Previous Attempts
As we have mentioned earlier, every PRP can be viewed as a PRF with certain security parameters. We first analyze this trivial construction.

Theorem 1. Let p be a random permutation on n bits. Then p is a (t, q, e)-PRF for e = q^2/2^{n+1}.

Proof. Standard; omitted due to lack of space.

Furthermore, it is simple to show that this bound is tight. We can easily construct an adversary which distinguishes between p and a random function with advantage approximately q(q − 1)/2^{n+1} (for q = O(2^{n/2})): simply look for collisions and return 0 if you see a collision (where you are supposed to return 1 if you think it's not a random function). It is worth noting that this analysis also establishes the security of another related construction. One might naively propose a construction based (loosely) on the Davies-Meyer hashing mode [MMO85]: f_p(x) = p(x) ⊕ x. The final xor of x into the ciphertext destroys the bijective property of p, so at first glance f_p might look like a reasonable candidate for a better PRF. However,
we note that this construction has no better security than before. It can be distinguished from a random function with advantage q^2/2^{n+1}: merely apply the adversary of the previous paragraph to the function g defined by g(x) = f_p(x) ⊕ x. The security reduction we showed in Theorem 1 is sufficient to show that PRFs exist if PRPs do, from a complexity-theoretic point of view, since the security bound it shows is exponential in n. Therefore, complexity theorists interested only in asymptotics need read no further. However, practical applications are a bit more demanding: they require concrete security guarantees. We find this O(2^{n/2}) level of security inadequate for practical applications. Most block ciphers today offer 64-bit block widths, thus providing a convenient and efficient PRP with n = 64. For such ciphers, the above theorem provides no security assurances when adversaries are allowed to make q ≈ 2^{(n+1)/2} = 2^{32.5} chosen-text queries (or more). This is too weak for serious cryptologic use; we would prefer something that provides better resistance to chosen-text attacks. After all, the underlying block cipher typically provides better security than that, so it is natural to wonder whether we can do better. Is there a PRF construction that preserves the security of the underlying block cipher? We show below that the answer is yes.
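The tightness argument corresponds to the following simple adversary (sketch; it returns 1 for "permutation" and 0 for "random function", matching the convention used earlier):

```python
def collision_adversary(oracle, q):
    # Query q distinct points; a repeated output rules out a permutation.
    seen = set()
    for x in range(q):
        y = oracle(x)
        if y in seen:
            return 0      # collision seen: this cannot be a permutation
        seen.add(y)
    return 1              # no collision: guess "permutation"
```

A birthday argument shows a random function collides among q queries with probability about q(q − 1)/2^{n+1}, while a permutation never does, giving the stated advantage.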
Analysis of ORDER(P )n:1 n+1
We gave a description of ORDER(P)_{n+1}^{n:1} in Section 2.1. Let π be a (keyed) family of permutations {π_k : k ∈ K} ⊂ IP_{n+1} on R_{n+1}. Using this construction we obtain a family f_π = {f_{π_k} : k ∈ K} of functions in IF_{n:1}. We can (by a slight abuse of notation) view π as a random variable, taking values in IP_{n+1}, by taking k to be a random variable uniformly distributed over K. (We drop the subscript, writing π instead of π_k as a slight abuse of notation, to avoid an unwieldy morass of distractingly nested subscripts.) Similarly, f_π can be viewed as a random variable, too. We say that p is a random permutation (on R_{n+1}) to mean that it is a random variable which is uniformly distributed over all elements of IP_{n+1}. Similarly, we say that f is a random function (from IF_{n:m}) when we mean that it is a random variable which is uniformly distributed over IF_{n:m}. We wish to show that f_π preserves the security level of the underlying PRP π. Most of the work to be done is handled by a purely information-theoretic analysis, which ignores all issues of computational complexity. We tackle this in Theorem 2.

Theorem 2. If p is a random permutation on R_{n+1}, then f_p is a random function over IF_{n:1}.

Proof. Take any g ∈ IF_{n:1}. It is clear that there exists a p ∈ IP_{n+1} such that g = f_p: for example, take the p such that

    p(2x) = 2x + g(x),    p(2x + 1) = 2x + 1 − g(x)    for all x ∈ R_n.
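The existence argument is easy to check concretely (Python sketch; here the ORDER prefix bit is taken as the low-order bit, so p(0, x) is table entry 2x — an indexing choice of this sketch):

```python
def representative(g, n):
    # p(2x) = 2x + g(x), p(2x + 1) = 2x + 1 - g(x): swaps the pair
    # (2x, 2x + 1) exactly when g(x) = 1, so f_p = g.
    p = [0] * (1 << (n + 1))
    for x in range(1 << n):
        p[2 * x] = 2 * x + g[x]
        p[2 * x + 1] = 2 * x + 1 - g[x]
    return p

def order_bit_low(p, x):
    # f_p(x) under the same low-bit-prefix convention.
    return 0 if p[2 * x] < p[2 * x + 1] else 1
```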
Next we show that |{p : g = f_p}| is a constant that does not depend on g, i.e., that there are an equal number of representative permutations p for every g. First, suppose that g_1, g_2 ∈ IF_{n:1} are two functions that differ at exactly one point X (i.e., g_1(x) = g_2(x) for all x ≠ X and g_1(X) ≠ g_2(X)). Then we construct a bijective mapping φ : IP_{n+1} → IP_{n+1} which has the property that f_p = g_1 exactly when f_{φ(p)} = g_2. This will show that there are an equal number of representations for any two functions g_1, g_2 which differ at exactly one point. Then it will be easy to see that this implies the desired result, since for any two functions g, h ∈ IF_{n:1} one can construct a sequence g = g_0, g_1, g_2, ..., g_{k−1}, g_k = h such that all consecutive pairs g_i, g_{i+1} differ at exactly one point. The mapping φ is built as follows. Take any input p; we define φ(p) = p′ by

    p′(b, x) = p(b, x) if x ≠ X,   and   p′(b, x) = p(1 − b, x) if x = X.

Now it is clear that f_{p′} = g_2 if f_p = g_1, and vice versa. Furthermore, φ is an involution, so it is clearly a bijective mapping, as claimed. This completes the proof. □

Once we have this nice result, extending it to the setting of computationally-bounded adversaries is not so hard. It requires much unravelling of notation, but essentially no new ideas. We first introduce the notion of pseudo-randomness, to handle the most important case where the adversary is computationally bounded. Informally, saying that π is a pseudo-random permutation (PRP) on R_{n+1} is supposed to convey the idea that it is computationally infeasible for an adversary to distinguish π from a random permutation on R_{n+1}. (Some authors use the phrase "pseudo-random permutation generator" to refer to this object; for conciseness, we will omit the "generator" term throughout this paper.) We formalize this notion as follows.
An adversary is an oracle machine B^{p, p⁻¹, π, π⁻¹} which outputs a 0 or 1 (according to whether it thinks p is truly random or is drawn from the family {π_k : k ∈ K}). It takes four oracles as inputs: a test permutation p (which outputs p(x) on input x) along with its inverse p⁻¹, and an oracle for π (which outputs π_k(x) on input k, x) as well as an oracle for π⁻¹. Its advantage Adv B is

    Adv B = |Prob(B^{π_k, π_k⁻¹, π, π⁻¹} = 1) − Prob(B^{r, r⁻¹, π, π⁻¹} = 1)|,

where r is a random permutation and k is uniformly distributed over K. More formally, we say that π is a (t, q, e)-PRP if the advantage of any adversary which is allowed at most q queries (total) to the first two oracles and t offline work is at most e. This models a block cipher which is secure with up to q adaptive chosen-plaintext/ciphertext queries and t trial encryptions. See [BKR94,BGR95] for more information about (t, q, e) security.

We can define a (t, q, e)-PRF (pseudo-random function) in a similar fashion. In this definition, an adversary is an oracle machine A^{g, γ, π, π⁻¹} with access to four oracles: a function g which outputs g(x) on input x, an oracle γ which
382
Chris Hall et al.
outputs γ_k(x) on input k, x, and two oracles π, π^{-1} for the PRP class (as above). We define its advantage by

Adv A = |Prob(A^{γ_k, γ, π, π^{-1}} = 1) − Prob(A^{s, γ, π, π^{-1}} = 1)|,
where s is a random function and k is uniformly distributed over K. In the cases that we are most interested in, we have γ_k = f_{π_k}. We say that γ is a (t, q, e)-PRF if all adversaries A which make at most q oracle queries (total) to g, γ and perform at most t computations obey Adv A ≤ e. Note that it is important to include the oracles for π, π^{-1} in the definition of a (t, q, e)-PRF. In what follows, we will be interested in PRFs built from a PRP π. Here π models a block cipher; we assume the algorithm is publicly known (by Kerckhoffs's principle), so anyone trying to attack f_π can freely compute π_k(x) on any chosen inputs k, x. This required us to extend the standard definition of a PRF to model this situation. With those preliminaries out of the way, we may proceed to the rest of the analysis. We get the following pleasing consequence of Theorem 2, whose proof we leave to the appendices.

Theorem 3. If π is a (t, q, e)-PRP on R_{n+1}, then f_π is a (t, q/2, e)-PRF over IF_{n:1}.

Proof. See Appendix A. □

4.3 Analysis of ORDER(P)^{n+m}_{n:2^m−1}

In Section 2.3 we introduced the general ORDER(P)^{n+m}_{n:2^m−1} construction. There are two corresponding theorems whose full proofs we omit due to a lack of space.
Theorem 4. If π is a random permutation on R_{n+m}, then f_π is a random function over IF_{n:2^m−1}.

Proof. (sketch) The basic idea is to again consider two functions f_1, f_2 ∈ IF_{n:2^m−1} which differ in exactly one output (say f_1(0) ≠ f_2(0)). We can build a map φ : IP_{n+m} → IP_{n+m} such that f_p = f_1 exactly when f_{φ(p)} = f_2. Again this will show that all f ∈ IF_{n:2^m−1} have an equal number of representative permutations p (existence is trivial). To build the map φ, we need merely look at the binary tree we constructed for x = 0. (Actually we must consider a slightly expanded version of the tree in which the leaf nodes of our original tree are expanded into two children containing the values p(0, Z, x) and p(1, Z, x).) Starting with i = 0, we compare bit i of f_1(0) and f_2(0) and swap the left and right subtrees of node i if f_1(0) and f_2(0) differ in bit i. Note that this may destroy the original equality of bits j > i for f_1(0) and f_2(0), so in evaluating bit i we assume that f_1(0) has the value denoted by the most recent tree. The end result is a series of subtree swaps for evaluating f_1(0) which are clearly reversible. The subtree swaps specified will remap values of a permutation to values of another permutation, and we take φ to be that map. It is clearly onto and hence bijective. This completes the proof of the theorem. □
Building PRFs from PRPs
383
Theorem 5. If π is a (t, q, e)-PRP on R_{n+m}, then f_π is a (t, q/2^m, e)-PRF over IF_{n:2^m−1}.

Proof. The proof is nearly identical to that of Theorem 3. □
In Section 2.3 we made a claim that this construction was optimal. By that we meant that one could not create a map from IP_{n+m} to IF_{n:l} for l > 2^m − 1. We state this in the following lemma.

Lemma 1. There exists no map φ : IP_{n+m} → IF_{n:l} for l > 2^m − 1 such that N(f) = |{p ∈ IP_{n+m} : φ(p) = f}| is constant for all f ∈ IF_{n:l}.

Proof. Let φ be a map such that N(f) is constant for all f ∈ IF_{n:l}; we will show that l ≤ 2^m − 1. There are 2^{l·2^n} functions in IF_{n:l}, hence we are dividing IP_{n+m} into 2^{l·2^n} equivalence classes. For any j, |IP_j| = (2^j)!, and it is not hard to show that 2^{2^j − 1} exactly divides (2^j)!. Hence the power of 2 dividing the size of each equivalence class is 2^{2^{n+m} − 1}/2^{l·2^n} = 2^{2^n(2^m − l) − 1}. In order for N(f) to be constant, we must have that 2^n(2^m − l) − 1 ≥ 0, which implies 2^m − l ≥ 1, whence l ≤ 2^m − 1. This completes the proof of the lemma. □

Our analysis above used a strongly information-theoretic framework: first, we showed that the construction produces a random function when fed a random permutation (Theorem 2), and then all the desired pseudo-randomness results just fall out trivially from that. This framework is desirable because it makes the analysis relatively simple; however, we showed in Lemma 1 that it imposes serious limitations on the performance of the resulting constructions. The above bound essentially shows that, to achieve better performance, we'll need to abandon the information-theoretic framework and take another approach. This we do below.
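The divisibility fact used in the proof of Lemma 1 — that 2^{2^j − 1} exactly divides (2^j)! — is an instance of Legendre's formula v₂(N!) = N − s₂(N); a quick exhaustive check (a sketch, small j only):

```python
from math import factorial

def v2(n):
    # exponent of 2 in the prime factorization of n
    e = 0
    while n % 2 == 0:
        n //= 2
        e += 1
    return e

for j in range(1, 5):
    N = 2 ** j
    # Legendre: v2(N!) = N - s2(N), and s2(2^j) = 1
    assert v2(factorial(N)) == N - 1   # so 2^(2^j - 1) exactly divides (2^j)!
```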
4.4 Analysis of TRUNCATE(P)^{n−m}_n
The construction of Section 4.2 is probably most attractive because it is so amenable to theoretical analysis, and because it preserves the security of the underlying block cipher so efficiently no matter how many chosen-text queries are issued. However, it also has a severe disadvantage for practical use: it is quite slow. In Section 2.4 we defined a PRF family TRUNCATE(P)^{n−m}_n based on truncating bits of a permutation. The result trades off security for performance. Recall the construction: for any permutation π_k on R_n, we define a function f ∈ IF_{n:n−m} by f_{π_k} = g ∘ π_k, where g : R_n → R_{n−m} denotes the function which truncates the high m bits from its input. We could instead have taken g to be any fixed function g : R_n → R_{n−m} such that each y ∈ R_{n−m} has 2^m easily-computable pre-images g^{-1}(y), and the
results would still apply. However, bit-truncation is attractive because it is both fast and amenable to a simple mathematical description.^6 Therefore, for clarity of exposition we concentrate hereafter solely on bit-truncation. First we show that if π is a random permutation, then f_π is a pseudo-random function. The following theorem proves that, roughly speaking, Adv A is negligible while q ≪ min{2^{(n+m)/2}, 2^{2(n−m)/3}}.

Theorem 6. If π is a random permutation on R_n, then f_π is a (t, q, e)-PRF over IF_{n:n−m}, where e = 5(q²/2^{n+m})^{1/3} + q³/2^{2(n−m)+1}.

Proof. See Appendix B.
□
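The construction itself is nothing more than dropping bits (a sketch; the seeded shuffled table is a toy stand-in for a keyed permutation, and n, m are illustrative):

```python
import random
from collections import Counter

n, m = 8, 3                  # permutation on R_8; keep the low n - m = 5 bits

rng = random.Random(0)
table = list(range(2 ** n))
rng.shuffle(table)           # toy stand-in for a keyed permutation pi_k on R_n

def f(x):
    # TRUNCATE: g o pi_k, where g drops the high m bits of the output
    return table[x] & ((1 << (n - m)) - 1)

# f maps R_n onto R_{n-m}, and every output value has exactly 2^m preimages,
# because each residue class has 2^m members and pi_k is a bijection.
preimages = Counter(f(x) for x in range(2 ** n))
assert len(preimages) == 2 ** (n - m) and set(preimages.values()) == {2 ** m}
```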
This shows that truncating some bits from the output of π_k gives slightly better security. For m ≤ n/7, the theorem says that truncating m bits adds nearly m/2 bits of security against adaptive chosen-text attacks to the PRF π_k. However, for m > n/7, the second term in e dominates (in that it is largest and hence limits q), and our analysis does not provide better security reductions when increasing m past n/7. We believe that these limits are not inherent, but rather are a reflection of the inadequacy of our analysis. As an illustration, the best attack we can find needs q = O(2^{(n+m)/2}) texts to distinguish f_π from random with significant advantage (see Theorem 8), so this leaves a substantial gap between the upper and lower bounds. We suspect that a better analysis could provide a better security reduction. (However, we could be wrong.) The main idea of the proof is to show that the probability of getting any particular set of outputs Y to a given set of oracle queries X is roughly the same whether the oracle is instantiated "under the hood" by a PRF or by a truncated PRP. We use this to show that any oracle algorithm A must behave almost exactly the same regardless of which type of oracle it is given. That follows just because A's execution can depend only on the list of inputs and outputs to the oracle. This can then be used to show that Adv A is small. Of course, our bounds only hold when q is small enough. The first step in the analysis is to compute the probabilities that a random function F and a truncated PRP f_π will map X to Y. For F this is easy, but for f_π it is substantially harder. In the general case this gets quite messy, so we restrict ourselves to the special case where there are no three-way collisions in Y; this makes the calculation tractable. (This restriction adds an artificial contribution q³/2^{2(n−m)+1} to our bound on the advantage, so we have sacrificed tightness for tractability.)
After that, all that is left is relatively straightforward computations (albeit in large quantities).

Theorem 7. If π is a (t, q, e)-PRP on R_n, then f_π is a (t, q, e′)-PRF over IF_{n:n−m}, where e′ = e + 5(q²/2^{n+m})^{1/3} + q³/2^{2(n−m)+1}.

^6 The study of the properties of bit-truncation may also have some independent interest, as several authors have already suggested applying a bit-truncation output transform to existing MAC constructions in hopes of improving their security (see, e.g., [PO95]).
Proof. Omitted. Follows directly from Theorem 6 along lines analogous to the proof of Theorem 3. □

Theorem 8. Let π be a permutation family on R_n. Then for all q = O(2^{(n+m)/2}), there is an adversary which can distinguish f_π from a random function with q known texts, O(q) work, and advantage Ω(q²/2^{n+m}).

Proof. (sketch) Let r count the number of collisions in the outputs f_π(1), ..., f_π(q). Then our adversary outputs 1 (guessing that the oracle is f_π) if r < q(q − 1)/2^{n−m+1}, and 0 otherwise. Using the techniques found in the proof of Theorem 6, we find that this adversary operates with advantage Ω(q²/2^{n+m}). □
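The adversary of Theorem 8 is easy to state in code (a sketch; it only needs to count colliding output pairs and compare against the threshold q(q−1)/2^{n−m+1} expected of a truly random function):

```python
from collections import Counter

def count_collisions(outputs):
    # number of unordered colliding pairs among the outputs
    return sum(c * (c - 1) // 2 for c in Counter(outputs).values())

def guess_is_truncated_prp(outputs, n, m):
    # Output 1 ("this is f_pi") iff the collision count is below the
    # threshold q(q-1)/2^(n-m+1) for a random function into R_{n-m};
    # truncating a permutation yields strictly fewer collisions on average,
    # since colliding inputs must land in the same 2^m-element preimage class.
    q = len(outputs)
    return count_collisions(outputs) < q * (q - 1) / 2 ** (n - m + 1)

assert count_collisions([1, 2, 1, 3, 2, 2]) == 4      # one pair of 1s, three pairs of 2s
assert guess_is_truncated_prp(list(range(10)), 8, 3)  # no collisions at all
assert not guess_is_truncated_prp([0] * 10, 8, 3)     # far too many collisions
```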
5 Conclusion

We have presented two constructions for generating pseudo-random functions given pseudo-random permutations: ORDER(P)^j_{i:k} and TRUNCATE(P)^{n−m}_n. The former had the notable property that it preserved the security of the underlying pseudo-random permutation, whereas the latter had the property that it was much more efficient. Unfortunately, the gain in speed results in a trade-off in security, and the latter construction fails to preserve the strength of the underlying pseudo-random permutation. Using our constructions we were able to solve a few different problems, including stretching the width of a block cipher while preserving the security. We also examined a secure 1-bit cipher feedback mode using a pseudo-random permutation.
Acknowledgements Thanks to Dan Boneh, Ian Goldberg, and the anonymous referees for many helpful comments.
References

AV96. W. Aiello, R. Venkatesan, "Foiling birthday attacks in length doubling transformations," Advances in Cryptology—EUROCRYPT '96 Proceedings, Springer-Verlag, pp. 307–320.
BCK96. M. Bellare, R. Canetti, H. Krawczyk, "Pseudorandom Functions Revisited: The Cascade Construction and its Concrete Security," Proceedings of the 37th Symposium on Foundations of Computer Science, IEEE, 1996.
BDJR97. M. Bellare, A. Desai, E. Jokipii, P. Rogaway, "A Concrete Security Treatment of Symmetric Encryption: Analysis of the DES Modes of Operation," full version; extended abstract in Proceedings of the 38th Annual Symposium on Foundations of Computer Science (FOCS 97), IEEE, 1997.
BGR95. M. Bellare, R. Guérin, P. Rogaway, "XOR MACs: New methods for message authentication using finite pseudorandom functions," Advances in Cryptology—CRYPTO '95 Proceedings, Springer-Verlag, 1995, pp. 15–28.
BKR94. M. Bellare, J. Kilian, P. Rogaway, "The security of cipher block chaining," Advances in Cryptology—CRYPTO '94 Proceedings, Springer-Verlag, 1994.
BKR98. M. Bellare, T. Krovetz, P. Rogaway, "Luby-Rackoff Backwards: Increasing Security by Making Block Ciphers Non-Invertible (Extended Abstract)," Advances in Cryptology—EUROCRYPT '98 Proceedings, Springer-Verlag, 1998.
BM84. M. Blum, S. Micali, "How to Generate Cryptographically Strong Sequences of Pseudo-random Bits," SIAM J. Comput., 13 (Nov. 1984), pp. 850–864.
C97. D. Coppersmith, "Luby-Rackoff: Four rounds is not enough," IBM Research Report, RC 20674 (12/24/96), Mathematics.
GGM86. O. Goldreich, S. Goldwasser, S. Micali, "How to Construct Random Functions," Journal of the ACM, Vol. 33, No. 4, October 1986, pp. 792–807.
LR88. M. Luby, C. Rackoff, "How to Construct Pseudorandom Permutations from Pseudorandom Functions," SIAM J. Comput., Vol. 17, No. 2, April 1988, pp. 373–386.
Lub96. M. Luby, Pseudorandomness and Cryptographic Applications, Princeton University Press, 1996.
Luc96. S. Lucks, "Faster Luby-Rackoff Ciphers," Proceedings of the Third Fast Software Encryption Workshop, Springer-Verlag, pp. 189–203.
M92. U.M. Maurer, "A Simplified and Generalized Treatment of Luby-Rackoff Pseudorandom Permutation Generators," Advances in Cryptology—EUROCRYPT '92 Proceedings, Springer-Verlag, 1992, pp. 239–255.
MMO85. S.M. Matyas, C.H. Meyer, J. Oseas, "Generating strong one-way functions with cryptographic algorithm," IBM Technical Disclosure Bulletin, 27 (1985), 5658–5659.
NR96. M. Naor, O. Reingold, "On the construction of pseudo-random permutations: Luby-Rackoff revisited," preliminary version, http://www.wisdom.weizmann.ac.il/Papers/trs/CS96-10/abstract.html
P90. J. Pieprzyk, "How to Construct Pseudorandom Permutations from Single Pseudorandom Functions," Advances in Cryptology—EUROCRYPT '90, Springer-Verlag, pp. 140–150.
P91a. J. Patarin, "Étude des générateurs de permutations basés sur le schéma du D.E.S.," Ph.D. Thesis, INRIA, Domaine de Voluceau, Le Chesnay, France, 1991.
P91b. J. Patarin, "New Results on Pseudorandom Permutation Generators Based on the DES Scheme," Advances in Cryptology—CRYPTO '91 Proceedings, Springer-Verlag, pp. 301–312.
P92. J. Patarin, "How to Construct Pseudorandom and Super Pseudorandom Permutations from One Single Pseudorandom Function," Advances in Cryptology—EUROCRYPT '92 Proceedings, Springer-Verlag, pp. 256–266.
P97. J. Patarin, "Improved Security Bounds for Pseudorandom Permutations," Proceedings of the Fourth ACM Conference on Computer and Communications Security, April 1–4, 1997, pp. 142–150.
P98. J. Patarin, "About Feistel Schemes with Six (or More) Rounds," Proceedings of the Fifth Fast Software Encryption Workshop, LNCS 1372, Springer, 1998, pp. 103–121.
PO95. B. Preneel, P. van Oorschot, "MDx-MAC and building fast MACs from hash functions," Advances in Cryptology—CRYPTO '95 Proceedings, Springer-Verlag, 1995.
SP91. B. Sadeghiyan, J. Pieprzyk, "On Necessary and Sufficient Conditions for the Construction of Super Pseudorandom Permutations," Advances in Cryptology—ASIACRYPT '91, Springer-Verlag, pp. 194–209.
SP92. B. Sadeghiyan, J. Pieprzyk, "A Construction for Super Pseudorandom Permutations from A Single Pseudorandom Function," Advances in Cryptology—EUROCRYPT '92, Springer-Verlag, pp. 267–284.
Y82. A.C. Yao, "Theory and Applications of Trapdoor Functions," Proceedings of the 23rd IEEE Symposium on Foundations of Computer Science, IEEE, New York, 1982, pp. 80–91.
ZMI89a. Y. Zheng, T. Matsumoto, H. Imai, "On the Construction of Block Ciphers Provably Secure and Not Relying on Any Unproved Hypothesis," Advances in Cryptology—CRYPTO '89 Proceedings, Springer-Verlag, pp. 461–480.
ZMI89b. Y. Zheng, T. Matsumoto, H. Imai, "Impossibility and Optimality Results on Constructing Pseudorandom Permutations," Advances in Cryptology—EUROCRYPT '89, Springer-Verlag, pp. 412–421.
A Proof of Theorem 3
Proof. Our proof proceeds as follows. Suppose we have an adversary A which (t′, q′, e′)-breaks f_π.^7 We construct an adversary B which (t′, 2q′, e′)-breaks π. The result will follow. The construction for B requires very little creativity. B^{p, p^{-1}, π, π^{-1}} performs the same computations as A^{g, γ, π, π^{-1}}; anytime A makes an oracle query, we simulate the oracle and return the result to the computation in progress. The simulation of oracle queries goes like this. If A queries the g oracle with x, then B issues two queries to p with inputs (0, x) and (1, x) and compares the results; if p(0, x) < p(1, x), B uses the result 0, and otherwise uses 1. If A queries the γ oracle with k, x, then B issues the two queries k, (0, x) and k, (1, x) to its oracle for π, and constructs the result similarly. Finally, A's oracle queries for π and π^{-1} can be satisfied trivially. Let e′ = Adv A, and let t′, q′ count the time and number of oracle queries that A requires. Clearly t′, 2q′ count the time and number of oracle queries that B requires. It merely remains to bound Adv B, which we achieve with the following series of observations.

Lemma 2. Let r be a random permutation, and s be a random function. (Recall that according to our terminology both will be uniformly distributed on their respective spaces.)

^7 This means that with at most q′ queries and t′ offline encryptions, there is an adversary who has advantage greater than e′.
(i) For any permutation p, A^{f_p, f_π, π, π^{-1}} = B^{p, p^{-1}, π, π^{-1}}.
(ii) The random variable f_r has the same distribution as s; in other words, f_r is a random function.
(iii) With the random variables r, s as before, we have Prob(A^{s, f_π, π, π^{-1}} = 1) = Prob(A^{f_r, f_π, π, π^{-1}} = 1).
(iv) Prob(A^{s, f_π, π, π^{-1}} = 1) = Prob(B^{r, r^{-1}, π, π^{-1}} = 1).
(v) Prob(A^{f_{π_k}, f_π, π, π^{-1}} = 1) = Prob(B^{π_k, π_k^{-1}, π, π^{-1}} = 1).

Proof. (i) follows by construction of B. (ii) is exactly Theorem 2. (iii) follows immediately from (ii). (iv) follows from (i) and (iii). (v) is merely a special case of (i), substituting p = π_k.
□
Lemma 3. Adv B = Adv A.

Proof. Apply part (v) of Lemma 2 to the first term in Adv B, and apply part (iv) of Lemma 2 to the second term in Adv B. We get exactly the expression for Adv A. □

Now this suffices to prove the theorem. Given that π is a (t, q, e)-PRP, we show that f_π is a (t, q/2, e)-PRF by examining the contrapositive. Suppose there exists an adversary A who (t, q/2, e′)-breaks f_π with advantage e′ > e. We have shown how to construct an adversary B which breaks π. By Lemma 3, Adv B = Adv A = e′; also, B requires at most t time and q oracle queries. In other words, B (t, q, e′)-breaks π, for e′ > e. But this contradicts our assumption that π is a (t, q, e)-PRP. Therefore, such an A cannot exist, and the theorem follows. □
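The core of the reduction — B answering each simulated g-query with exactly two p-queries, which is where the q/2 loss in Theorem 3 comes from — is mechanical enough to write down (a sketch; the toy permutation and counter are illustrative, not part of the paper):

```python
# Sketch of adversary B simulating A's g-oracle: each g-query costs two p-queries.

def make_g(p_oracle, query_counter):
    def g(x):
        query_counter[0] += 2                       # two p-queries per g-query
        return 0 if p_oracle((0, x)) < p_oracle((1, x)) else 1
    return g

# Toy instantiation: a fixed permutation of the 8 points (b, x) with n = 2.
points = [(b, x) for b in range(2) for x in range(4)]
perm = dict(zip(points, [5, 0, 3, 6, 1, 7, 2, 4]))  # arbitrary toy permutation
counter = [0]
g = make_g(lambda query: perm[query], counter)

outputs = [g(x) for x in range(4)]                  # A makes 4 g-queries...
assert counter[0] == 8                              # ...so B makes 8 p-queries
assert outputs == [1, 0, 1, 1]
```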
B Proof of Theorem 6

Proof. Let A^{f_{π_k}, π, π^{-1}} be any adversary that breaks f_{π_k}. Let X = (X_1, ..., X_q) be a q-vector over R_n of q different "inputs," and let Y = (Y_1, ..., Y_q) be a q-vector over R_{n−m} of "outputs." The random variables (X, Y) will be a "transcript" of a run of the adversary A: X_j records the j-th query to the f_π oracle, and Y_j records the corresponding output from the oracle. Without loss of generality, we may take X = (1, ..., q), since any adversary A which makes repeated queries X_i = X_j (i < j) can be easily converted to an adversary A′ with the same advantage such that A′ makes no repeated queries. (Our construction of A′ merely simulates the action of A; for each oracle query X_j made by A: if X_j is a new query, A′ behaves identically to A; but if X_j = X_i for some i < j, then A′ sets Y_j = Y_i and continues to simulate the action of A without querying the f_π oracle.) Furthermore, since π is presumed to be a truly random permutation, the values of the oracle queries don't matter, so long as they don't repeat.
Let F ∈ IF_{n:n−m} be a truly random function. Define p_F(X, Y) = Prob(F(X) = Y) to be the probability that F maps each X_j to Y_j; define p_f(X, Y) = Prob(f_π(X) = Y) in a similar fashion. We often leave off the (X, Y) and simply write p_f or p_F when the choice of (X, Y) is clear from context. Also, we sometimes write p_f(S) to mean the probability (with respect to f) of a set S, i.e. p_f(S) = Σ_{(X,Y)∈S} p_f(X, Y). We wish to show that f_π is roughly indistinguishable from F if q is not too large. The main idea is to show that p_f(X, Y) ≈ p_F(X, Y). Our argument proceeds as follows. We bound p_f/p_F, showing that it is close to 1. This bound doesn't hold uniformly for all choices of (X, Y), but it holds for nearly all of them—or more precisely, the set S where the bound holds has probability very close to 1. Formally, we prove that |p_f/p_F − 1| ≤ δ for all (X, Y) ∈ S; we also show that both p_f(¬S) and p_F(¬S) are small. This can be informally viewed as a sort of "probabilistic bound." We prove, in another crucial lemma, that Adv A ≤ max{p_f(¬S), p_F(¬S)} + δ. This is a generic result that relies only on the bound on p_f/p_F; no domain-specific knowledge is required. Therefore, it suffices to bound p_f/p_F tightly enough that p_f(¬S), p_F(¬S), and δ are small. We move to the details of the proof. We take S to be the set of all q-vectors Y over R_{n−m} which have r repeated values, no triply-repeated values, and for which |r − q(q − 1)/2^{n−m+1}| ≤ cq/2^{(n−m+1)/2}, where c ≥ 1 is a small constant left free for the moment. Lemma 5 helps us show that |p_f/p_F − 1| ≤ δ for (X, Y) ∈ S. Lemma 6 bounds p_F(¬S), and Lemma 7 bounds p_f(¬S). Finally, Lemma 4 proves that Adv A ≤ max{p_f(¬S), p_F(¬S)} + δ. Combining these four lemmas, we get the big result

Adv A ≤ 1/c² + 4cq/2^{(n+m+1)/2} + q³/2^{2(n−m)+1} + q²/2^{n+m}

for all c ≥ 1.
Finally, we optimize over c to obtain the best bound; taking c = (2^{(n+m−1)/2}/q)^{1/3} yields Adv A ≤ 4(q²/2^{n+m})^{1/3} + q³/2^{2(n−m)+1} + q²/2^{n+m} ≤ 5(q²/2^{n+m})^{1/3} + q³/2^{2(n−m)+1}, as claimed. Due to lack of space, the proofs of the lemmas are omitted; full details are available at http://www.counterpane.com/publish-1998.html.

Lemma 4. With S defined as above, we have Adv A ≤ max{p_f(¬S), p_F(¬S)} + δ when δ ≥ max_{(X,Y)∈S} |p_f(X, Y)/p_F(X, Y) − 1|.
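One can check numerically that the stated choice of c does balance the terms and lands under the claimed bound (a sketch; the parameters n, m, q are illustrative and this merely evaluates the expressions above):

```python
n, m, q = 20, 4, 2 ** 8      # illustrative parameters, small enough that c >= 1

c = (2 ** ((n + m - 1) / 2) / q) ** (1 / 3)
bound = (1 / c ** 2 + 4 * c * q / 2 ** ((n + m + 1) / 2)
         + q ** 3 / 2 ** (2 * (n - m) + 1) + q ** 2 / 2 ** (n + m))
claimed = 5 * (q ** 2 / 2 ** (n + m)) ** (1 / 3) + q ** 3 / 2 ** (2 * (n - m) + 1)

assert c >= 1
assert bound <= claimed      # the combined bound sits under the stated one
```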
Lemma 5. Let Y have r repeated values and no triply-repeated values, with X = (1, 2, ..., q). Then p_F = 2^{−q(n−m)}, and

p_f = (1 − 2^{−m})^r ∏_{i=0}^{q−1} 2^m/(2^n − i).

We have p_f/p_F ≈ exp{q(q−1)/2^{n+1} − r/2^m} for large q, r. Finally, if (X, Y) ∈ S and q ≤ 2^{(n+m)/2}/c and q ≤ 2^{2n/3}, then |p_f/p_F − 1| ≤ δ for δ = 2cq/2^{(n+m+1)/2} + (2q³/3)·2^{−2n} + q²/2^{n+m+1}.

Lemma 6. We have p_F(¬S) ≤ 1/c² + (q³/6)·2^{−2(n−m)}.

Lemma 7. We have p_f(¬S) ≤ 1/c² + (q³/6)·2^{−2(n−m)} + δ.

This completes the proof of Theorem 6.
□
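The exact expression for p_f in Lemma 5 can be verified by brute force at a toy size (a sketch; it assumes n = 3, m = 1, q = 3 with Y = (0, 0, 1), which has r = 1 repeated value and no triply-repeated value):

```python
from itertools import permutations
from math import factorial, prod

n, m, q = 3, 1, 3
X = (0, 1, 2)
Y = (0, 0, 1)          # r = 1 repeated value, no triply-repeated value
r = 1

mask = (1 << (n - m)) - 1          # truncating the high m bits keeps the low n-m
hits = sum(1 for p in permutations(range(2 ** n))
           if all(p[x] & mask == y for x, y in zip(X, Y)))
empirical = hits / factorial(2 ** n)

# Lemma 5: p_f = (1 - 2^-m)^r * prod_{i=0}^{q-1} 2^m / (2^n - i)
formula = (1 - 2 ** -m) ** r * prod(2 ** m / (2 ** n - i) for i in range(q))

assert hits == 480                 # 2 * 1 * 2 * 5! permutations are consistent with Y
assert abs(empirical - formula) < 1e-12
```

The (1 − 2^{−m}) factor shows up exactly once here: the second query must hit the one remaining preimage of the repeated output value.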
Security Amplification by Composition: The Case of Doubly-Iterated, Ideal Ciphers

W. Aiello¹, M. Bellare², G. Di Crescenzo¹, and R. Venkatesan³

¹ Bellcore, 445 South St., Morristown, NJ 07960, USA. [email protected]
² Dept. of Computer Science & Engineering, University of California at San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA. {mihir,giovanni}@cs.ucsd.edu. URL: http://www-cse.ucsd.edu/users/{mihir,giovanni}
³ Microsoft Research, One Microsoft Way, Redmond, WA 98052, USA. [email protected]
Abstract. We investigate, in the Shannon model, the security of constructions corresponding to double and (two-key) triple DES. That is, we consider F_{k1}(F_{k2}(·)) and F_{k1}(F_{k2}^{-1}(F_{k1}(·))) with the component functions being ideal ciphers. This models the resistance of these constructions to "generic" attacks like meet-in-the-middle attacks. We obtain the first proof that composition actually increases the security in some meaningful sense. We compute a bound on the probability of breaking the double cipher as a function of the number of computations of the base cipher made, and the number of examples of the composed cipher seen, and show that the success probability is the square of that for a single-key cipher. The same bound holds for the two-key triple cipher. The first bound is tight and shows that meet in the middle is the best possible generic attack against the double cipher.

Keywords: Ciphers, cascaded ciphers, Shannon model, information theory, DES, Double DES, meet in the middle attacks.
1 Introduction
A block cipher is a map F : {0,1}^κ × {0,1}^n → {0,1}^n. Here κ is the key size and n is the block size. Each κ-bit key k induces a map F_k(·) = F(k, ·) : {0,1}^n → {0,1}^n which is a permutation on {0,1}^n. Let F^{-1} denote the inverse cipher, meaning F^{-1}(k, ·) = F_k^{-1} is the inverse map of F_k(·). For example, DES is such a cipher with κ = 56 and n = 64. It is common practice to compose ciphers in attempts to increase security. The result of composition is a new cipher, with a larger key size but the same block size. Here are the two most popular mechanisms, corresponding, respectively, to double DES and (two-key) triple DES:
– Double F, or the 2-cascade cipher: Dbl-F : {0,1}^{2κ} × {0,1}^n → {0,1}^n is defined by Dbl-F_{k1,k2}(x) = F_{k1}(F_{k2}(x)).

H. Krawczyk (Ed.): CRYPTO'98, LNCS 1462, pp. 390–407, 1998. © Springer-Verlag Berlin Heidelberg 1998
– Two-key triple F: Trp2-F : {0,1}^{2κ} × {0,1}^n → {0,1}^n is defined by Trp2-F_{k1,k2}(x) = F_{k1}(F_{k2}^{-1}(F_{k1}(x))).
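The two compositions above are straightforward to write down (a sketch over a toy keyed-permutation family; `F` and `F_inv`, and the tiny parameters, are illustrative stand-ins for the ideal cipher and its inverse):

```python
import random

KAPPA, N = 4, 4   # toy key and block sizes

def make_ideal_cipher(seed=0):
    rng = random.Random(seed)
    tables = {}
    for k in range(2 ** KAPPA):           # one random permutation per key
        t = list(range(2 ** N))
        rng.shuffle(t)
        tables[k] = t
    inv = {k: {y: x for x, y in enumerate(t)} for k, t in tables.items()}
    return (lambda k, x: tables[k][x]), (lambda k, y: inv[k][y])

F, F_inv = make_ideal_cipher()

def dbl(k1, k2, x):                       # Dbl-F_{k1,k2}(x) = F_{k1}(F_{k2}(x))
    return F(k1, F(k2, x))

def trp2(k1, k2, x):                      # Trp2-F_{k1,k2}(x) = F_{k1}(F^{-1}_{k2}(F_{k1}(x)))
    return F(k1, F_inv(k2, F(k1, x)))

# Both composed ciphers are again permutations of the block space.
assert sorted(dbl(3, 7, x) for x in range(2 ** N)) == list(range(2 ** N))
assert sorted(trp2(3, 7, x) for x in range(2 ** N)) == list(range(2 ** N))
```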
Let Op-F : {0,1}^{κ*} × {0,1}^n → {0,1}^n denote one of these, where κ* = 2κ and Op ∈ {Dbl, Trp2}. What we want to know is: How good a cipher is Op-F? Has the composition and the increased key length actually bought us anything?

Generic versus cryptanalytic attacks. There are several possible approaches to this question, depending on what kinds of attacks one wants to take into account. There are two main classes of attacks:
• Cryptanalytic attacks: like differential [3,4] and linear [9] cryptanalysis.
• Generic attacks: like exhaustive key search and meet-in-the-middle attacks.
Generic attacks are, roughly, those that don't exploit the structure of the cipher, but work against any cipher, even an ideal one. More precisely, we define generic attacks as those that succeed in the Shannon model of an ideal cipher discussed below. The strength of specific composed ciphers like double DES against cryptanalytic attacks is not known; certainly, one does not expect a proof of such strength. The strength of the composed cipher against generic attacks, in contrast, can at least in principle be determined, by an analysis in the Shannon model, since it is a purely information-theoretic question. However, the technical problems here are quite challenging; in particular, it is not even known that composition increases the strength of a cipher at all in this model. In this paper we tackle this question, analyzing, in the Shannon model, two-key based compositions such as the above. We will prove upper bounds on the probability of "breaking" the composed cipher as a function of the "effort" invested by the adversary, with both terms in quotes to be properly defined. Our results are the first to show that cipher composition in the Shannon model actually increases security: the success probability of an adversary, as a function of her resources, is significantly lower than in the case of a single-key cipher.
For the double cipher our results are actually tight (optimal) and show that meet in the middle is the best possible generic attack on this cipher. We now define the model, and state our results, more precisely.

1.1 The Model
We model F as an ideal block cipher in the sense of Shannon. This means F(k, ·) is a random permutation on {0,1}^n, for each k. More precisely, let PERM(n) be the set of all permutations on {0,1}^n. Then, for each κ-bit key k, select, uniformly and independently, a map from PERM(n), and assign F_k this value. So F consists of 2^κ maps, each a random permutation. Now, we want to ask how good Op is as a composition operator. How can we measure this? We do so in a strong adversarial model, which allows the adversary chosen-plaintext attacks on Op-F. Furthermore, success for the adversary A does not mean she has to find the key: it suffices that A identify some "weakness" in
the cipher. This means A should be able to detect any deviation of Op-F_{k*}(·) from a truly random permutation, when k* is a random and hidden key for Op-F. Formally, give the adversary oracles for F, F^{-1}. (This models her ability to compute the original cipher at any points she likes.) Also give her an oracle we call E : {0,1}^n → {0,1}^n, which can take one of two forms:
• World 1: Set E = Op-F_{k*}(·) where k* ∈ {0,1}^{κ*} is a randomly chosen key for cipher Op-F.
• World 2: Set E = π where π is a permutation chosen randomly from PERM(n).
Put the adversary A in one of these worlds, and ask her which one she is in. If she can't tell, then Op-F_{k*}(·) is behaving like a random permutation, meaning it is good. Formally, define the advantage of A as P_1 − P_2, where P_i is the probability that A outputs 1 in world i ∈ {1, 2}. (The probability is over the choice of the oracles in each case.) Call A a (q, t)-adversary if it makes at most t queries to the F, F^{-1} oracles and at most q queries to the E oracle. (Note in practice t is likely to be much larger than q, since F, F^{-1} queries are just DES computations and E queries are plaintexts in a chosen-plaintext attack. We always assume q ≥ 1 since otherwise the advantage of the adversary is zero no matter what the construction.) Define Sec(Op, κ, n, q, t) as the maximum advantage attainable by any (q, t)-adversary. This is the key quantity; it is a function we call the security of the operator Op. The question is to determine this function as accurately as possible. In particular we want to upper bound it as a function of the adversary resources q, t and the block cipher parameters κ, n. Before stating the results we stress the power of the model. It allows chosen-plaintext attacks on the composite cipher Op-F. Note it certainly captures common attacks like birthday attacks and meet-in-the-middle attacks, but also more sophisticated attacks which could be adaptive.
Notice that the advantage of a (q, t)-adversary in attacking the single-key cipher F itself in this model (namely E = F_k for a random κ-bit string k in world 1) will be (at most) t/2^κ. This is the mark we have to beat if we want to show that the composed cipher is stronger than the original one.

1.2 The Results
It is known that the strength of the composed cipher is at least that of the first [10], but prior to this work it was not known whether the advantage of a (q, t)-adversary versus Dbl-F was any lower than its advantage versus the single-key cipher F itself. Here we are able to show that composition actually increases security, in the ideal cipher model described above.

The double-key cipher. Recall that the double F cipher Dbl-F has 2κ bits of key. Our main result is Theorem 1, which says that Sec(Op, κ, n, q, t) is at most t²/2^{2κ}. Namely, no (q, t)-adversary attacking the double cipher can achieve an advantage greater than t²/2^{2κ}.
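The matching generic attack is meet in the middle, which can be sketched on a toy ideal cipher (a sketch; the tiny parameters, the seeded random permutations, and the single known plaintext/ciphertext pair are all illustrative — a real attack would use a couple of pairs to prune false alarms):

```python
import random

KAPPA, N = 4, 6
rng = random.Random(1)
enc, dec = {}, {}
for k in range(2 ** KAPPA):              # ideal cipher: one random permutation per key
    t = list(range(2 ** N))
    rng.shuffle(t)
    enc[k] = t
    dec[k] = {y: x for x, y in enumerate(t)}

k1, k2 = 5, 12                           # hidden crucial key pair
x = 3
y = enc[k1][enc[k2][x]]                  # one known plaintext/ciphertext pair

# Meet in the middle: tabulate F_{k2}(x) for every k2, then look up F^{-1}_{k1}(y).
middle = {}
for cand2 in range(2 ** KAPPA):
    middle.setdefault(enc[cand2][x], []).append(cand2)

candidates = [(cand1, cand2)
              for cand1 in range(2 ** KAPPA)
              for cand2 in middle.get(dec[cand1][y], [])]

# The true key pair always survives; with about 2 * 2^kappa cipher computations
# we have searched a 2^(2*kappa) key space, which is why the advantage hits one
# already at t = 2^kappa.
assert (k1, k2) in candidates
```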
[Figure 1: two curves, Sec1(x) and Sec2(x), plotted for x from 46 to 56, with the vertical axis running from 0 to 1.]

Fig. 1. Sec1(x) (the upper curve) and Sec2(x) (the lower curve) are, respectively, the maximal possible advantage obtainable by an adversary in breaking the single- and double-key ideal ciphers, as a function of x = log₂(t), the logarithm of the number of cipher computations made. We are using a key length of κ = 56. We see that Sec2 lies below Sec1 but they meet at 1. The text provides the exact formulas for these quantities.
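The two curves are just Sec1(t) = t/2^κ and Sec2(t) = t²/2^{2κ}; the numerical examples quoted for κ = 56 check out directly (a sketch):

```python
kappa = 56

def sec1(t):  # single cipher: best advantage t / 2^kappa
    return t / 2 ** kappa

def sec2(t):  # double cipher: best advantage t^2 / 2^(2*kappa)
    return t ** 2 / 2 ** (2 * kappa)

assert sec1(2 ** 45) == 2 ** -11
assert sec2(2 ** 45) == 2 ** -22
# Reaching advantage 2^-11 against the double cipher needs t^2 >= 2^101,
# i.e. strictly more than 2^50 queries:
assert sec2(2 ** 50) < 2 ** -11 <= sec2(2 ** 51)
```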
We also show this bound is essentially tight, due to (a variant of) the meet-in-the-middle attack. Theorem 2 presents an adversary who runs this attack, and analyzes it to show that its advantage is within a small factor of t²/2^{2κ}. Note that the maximum possible advantage of an adversary attacking the double cipher is the square of the maximum possible advantage of an adversary of the same resources attacking the original single-key cipher. Thus, it is considerably smaller in most cases. (For example, if κ = 56 and t = 2^{45} then the former is 2^{−22} and the latter is 2^{−11}. Or, looking at it another way, to achieve an advantage of 2^{−11} against the double cipher you need at least 2^{50} queries, while to get the same advantage against the single cipher you need only 2^{45} queries.) To see the relation better, we plot in Figure 1 the maximal advantage t/2^κ of an adversary in breaking the original single-key cipher, and the maximal advantage t²/2^{2κ} of an adversary in breaking the double cipher, as a function of x = log₂(t). Notice that the upper bound on the advantage in the double-key case hits one (meaning, the scheme can be broken) when t = 2^κ. This is expected: that's the meet-in-the-middle attack. Of course, that's the same point at which the advantage hits one for the original single-key cipher. (In this case due to an exhaustive key search attack.) Thus, the "effective key length" of the double cipher is not more than that of the single one. That does not mean that security has not increased. Security is not a number, but a function of the resources invested, and our analysis and Figure 1 show that for values of t below 2^κ the
394
W. Aiello et al.
chance of breaking the double cipher is smaller than that of breaking the original one.

The two-key triple cipher. We show that the same bound holds for the two-key triple cipher, meaning the advantage of a (q, t) adversary is bounded by t^2/2^{2κ}. This shows that here too there is an improvement in the security curve as a function of t. In this case our bound is tight for the case t ≈ q but not tight in general. See [1] for this material.

The m-fold cascade. The m-fold composition of cipher F is the cipher with key k_1, ..., k_m defined by F_{k_1,...,k_m} = F_{k_1} ∘ F_{k_2} ∘ ··· ∘ F_{k_m}. The techniques above extend to show that the advantage of a (q, t) adversary is at most t^m/2^{mκ}. This shows that the advantage grows more and more slowly as m increases. However, for m ≥ 3 the result is not tight; we expect the 3-fold composed cipher to have an even greater strength than this indicates. Thus, we won't discuss this result any more in this paper.

The future. The analysis of the two-key ciphers we present here is a start on a problem that appears to be quite technically challenging. In the future we would like to see tight bounds on the advantage for the m-fold composition for m ≥ 3 and also for the two-key triple cipher in the case q

For i > 0 it is (x, E(x)), (k, x, F_k(x)), or (k, F_k^{-1}(y), y), corresponding, respectively, to the query q_{i-1}; for i = 0 it is the empty string.
Security Amplification by Composition
399
Also:

View_i(A^{E,F,F^{-1}}): For i ∈ Mvs, the view of the adversary after i moves; this is q_1 r_2 ... q_{i-1} r_i if i > 0 is even; q_1 r_2 ... r_{i-1} q_i if i is odd; and the empty string if i = 0.
View(A^{E,F,F^{-1}}): View_{2(q+t)}(A^{E,F,F^{-1}}).
Note the adversary's output bit is some deterministic function of the last view. We call the keys (k_1^*, k_2^*) chosen in the games the crucial key pair. Our analysis will focus on whether or not this key pair is "eliminated" by a current view, and what its distribution is from the point of view of A if not. So let v_i represent a possible view after i moves of the game. We consider two sets of key pairs, the "seen key pairs" (SKP) and the "remaining key pairs" (RKP):

SKP(v_i): A key pair (k_1, k_2) is in SKP(v_i) if there are two queries q and q' in v_i such that q is an F-query or F^{-1}-query with key k_1 (i.e., a query of the form (k_1, x, *) or (k_1, *, y), respectively), and q' is an F-query or F^{-1}-query with key k_2 (i.e., a query of the form (k_2, x, *) or (k_2, *, y), respectively).

RKP(v_i): ({0,1}^κ × {0,1}^κ) − SKP(v_i)

Note that SKP(v_i) depends only on the queries in v_i and not on the replies. That is, SKP(v_i) = SKP(v_{i+1}) for i ∈ OdMvs. If A knows that F_{k_2}(x) = y and F_{k_1}(y) = z and has also made the E-query x, then it can with high probability eliminate (k_1, k_2) as a candidate for the crucial key pair. Intuitively, we might think of the key pairs (k_1, k_2) ∈ SKP(v) as being "eliminated". (Of course, they might not be eliminated, but we can't be sure, so we count them out.) Thus RKP(v_i) captures the set of remaining key pairs associated to any view. These are the key pairs (k_1, k_2) such that at least one of them has not been in either an F or an F^{-1} query. Note the key pair is not considered "eliminated" if only one of its components has been in an F/F^{-1} query: both have to have been in such queries to "eliminate" the pair. The current view v_i contains some number of F or F^{-1} queries on a particular key k. This effectively "opens up" the corresponding spots in row k of the F table, in the sense that in the randomly chosen F table, these entries become known to the adversary. Similarly for E-queries.
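The SKP/RKP bookkeeping just defined can be mirrored in a few lines. This is our own sketch (plain Python); the tuple encoding of queries, and the inclusion of the diagonal pairs (k, k), are conventions we have chosen for illustration, not fixed by the paper:

```python
from itertools import product

# A view is modeled as a list of queries; we encode an E-query as
# ("E", x) and an F / F^-1 query under key k as ("F", k, x) / ("Finv", k, y).

def seen_keys(view):
    """Keys appearing in some F or F^-1 query of the view."""
    return {q[1] for q in view if q[0] in ("F", "Finv")}

def skp(view):
    """Seen key pairs: both components have been in F/F^-1 queries."""
    ks = seen_keys(view)
    return set(product(ks, ks))

def rkp_size(view, kappa):
    """|RKP(v)| = |{0,1}^kappa x {0,1}^kappa| - |SKP(v)|."""
    return (2 ** kappa) ** 2 - len(skp(view))

# Tiny example with kappa = 2 (so 4 keys and 16 key pairs):
view = [("E", 0), ("F", 1, 0), ("Finv", 3, 2)]
assert skp(view) == {(1, 1), (1, 3), (3, 1), (3, 3)}
assert rkp_size(view, 2) == 16 - 4
```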
We let

F-Qrs(v_i, k): The set of all y such that there are responses in v_i of the form (k, x, y).
E-Qrs(v_i): The set of all y such that there are responses in v_i of the form (x, y).

The random variables. Under the random choice of E, F, F^{-1} made in probability spaces 1 and 2, the above discussed quantities become random variables. Here are some random variables we will need to refer to explicitly:
400
W. Aiello et al.
Q_i: Takes value q_i, the i-th query, for i ∈ OdMvs.
R_i: Takes value r_i, the i-th reply, for i ∈ EvMvs.
T_i: Equals Q_i if i is odd and R_i if i is even.
View_i: Takes value View_i(A^{E,F,F^{-1}}), for i ∈ Mvs.
View: Takes value View(A^{E,F,F^{-1}}).
U_{i,j}: Equals T_i ... T_j.
The bad event. We also define a central event:

bad_i: For i ∈ Mvs, event bad_i is said to happen if the crucial key pair (k_1^*, k_2^*) is seen, that is, (k_1^*, k_2^*) ∈ SKP(View_i). In other words, the crucial key pair is "eliminated".

Whether a particular key pair has been seen depends only on the queries of A, and thus bad_i = bad_{i+1} for i ∈ OdMvs. We let bad be bad_{2(q+t)}, meaning it captures whether the bad event happened at the end of the game.

3.2
Proof Outline
A very rough cut at the idea of the analysis is that as long as bad has not happened in probability space 1, the answers coming back to oracle queries there "look random" and so probability space 1 looks like probability space 2. We can then bound the advantage by the probability of the bad event. This is overly simplistic. It is also incorrect. One should first note that even if the bad event fails to happen in game 1, that game will not look like game 2; there are events that have probability one in the latter and zero in the former. In fact, we need to condition on the bad event not happening in both probability spaces. We will show that the conditional probability of a particular view, given that bad has not occurred, is the same in the two games. To show this we will be forced to show something stronger, as stated in the lemma below.

Lemma 1. Let i ∈ Mvs and let v_i be a possible view of the adversary after the i-th move. Then for all 0 ≤ s ≤ 2(q+t) − i,

Pr_1[View_i = v_i | ¬bad_{i+s}] = Pr_2[View_i = v_i | ¬bad_{i+s}].

The proof of this lemma is postponed until later. Since the final decision of the adversary depends only on its view, the distribution of the adversary's decision is the same in the two games as long as the bad event has not happened. Thus, a corollary to the above lemma is

Pr_1[A^{E,F,F^{-1}} = 1 | ¬bad] = Pr_2[A^{E,F,F^{-1}} = 1 | ¬bad].    (1)

Less obvious is that Lemma 1 will also be needed to show that the probability of the bad event is the same in both games. To show this we need to prove something a bit stronger: we need to show that the equality holds at any stage. This is stated in the lemma below.
Security Amplification by Composition
401
Lemma 2. For all i = 0, ..., 2(q+t),

Pr_1[bad_i] = Pr_2[bad_i].    (2)

The proof of this lemma is also postponed until later. Lemmas 1 and 2 can be used to bound the advantage of the adversary by the probability of the bad event.

Lemma 3. Adv_A(Dbl, κ, n) ≤ Pr_2[bad].

Proof (Lemma 3). The lemma is shown using the following straightforward calculation. We suppress the superscripts of A^{E,F,F^{-1}} for clarity.

Pr_1[A = 1] − Pr_2[A = 1]
  = Pr_1[A = 1 | bad] · Pr_1[bad] − Pr_2[A = 1 | bad] · Pr_2[bad]
    + Pr_1[A = 1 | ¬bad] · Pr_1[¬bad] − Pr_2[A = 1 | ¬bad] · Pr_2[¬bad]
  = (Pr_1[A = 1 | bad] − Pr_2[A = 1 | bad]) · Pr_2[bad]
    + (Pr_1[A = 1 | ¬bad] − Pr_2[A = 1 | ¬bad]) · Pr_2[¬bad]
  = (Pr_1[A = 1 | bad] − Pr_2[A = 1 | bad]) · Pr_2[bad]
  ≤ Pr_2[bad].

The second equality is by Lemma 2. The last equality is by Equation (1). □
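The bound proved next (Lemma 4: Pr_2[bad] ≤ t^2/2^{2κ}) is easy to sanity-check by simulation for toy parameters. The sketch below is our own illustration, not part of the paper; κ is made tiny so the probability is measurable:

```python
import random

def prob_bad(kappa, t, trials=20000, seed=1):
    """Estimate Pr[bad] in Game 2: the crucial keys are uniform and
    independent of the adversary's t F/F^-1 queries, which in the
    worst case use t distinct keys."""
    n_keys = 2 ** kappa
    rng = random.Random(seed)
    queried = set(range(t))          # worst case: t distinct queried keys
    hits = 0
    for _ in range(trials):
        k1 = rng.randrange(n_keys)
        k2 = rng.randrange(n_keys)
        if k1 in queried and k2 in queried:   # both seen => bad
            hits += 1
    return hits / trials

kappa, t = 6, 8                      # toy parameters
est = prob_bad(kappa, t)
bound = t ** 2 / 2 ** (2 * kappa)    # Lemma 4; here exactly (t/2^kappa)^2
# The Monte Carlo estimate should sit close to the bound.
assert abs(est - bound) < 0.01
```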
Of course, since the probability of the bad event is the same in both probability spaces, we could have bounded the advantage by the probability of the bad event in probability space 1. However, calculating the probability of the bad event is very easy in probability space 2, as can be seen below.

Lemma 4. Pr_2[bad] ≤ t^2/2^{2κ}.

Proof (Lemma 4). This is straightforward, since in Game 2 no information about the keys (k_1^*, k_2^*) is given to the adversary. The bad event depends only on the F and F^{-1} queries, and in the worst case all t such queries are made with different keys. Then the chance that k_1^* is in some query is at most t/2^κ, and the same, independently, for k_2^*, so the bound holds. □

Clearly, Lemmas 3 and 4 imply Theorem 1. This completes the outline of the proof of Theorem 1. To complete the proof we must prove Lemmas 1 and 2. To do so we will first need a sequence of three lemmas, Lemmas 5, 6, and 7. The last of these will be used in the proof of Lemma 1. Lemma 5 will again be used to prove Lemma 8 on the conditional probability of the crucial key pair. Lemma 8 will then be used with Lemma 1 to prove Lemma 2.

3.3
Distribution of Replies in the Next Round
In Game 2, given the view v_i at any point, the distribution of the answer to the next oracle query is, clearly, uniform over the remaining range; for example, the answer to an E-query is uniform over {0,1}^n − E-Qrs(v_i). The first lemma will say this is true for Game 1 too, as long as the bad event does not happen. However, we will need to say this in a strong sense. Namely,
fix any key pair that has still not been "eliminated". Conditioned on this being the crucial key pair, as well as on the current view, the distribution of the answer to the next oracle query is still "as it should be," meaning uniform over whatever possibilities remain. Note we must show this for all types of queries: E, F and F^{-1}.

Lemma 5. Let j ∈ {1, 2} and i ∈ OdMvs. Let v_i = q_1 r_2 ... q_{i-2} r_{i-1} q_i be a possible view of the adversary just before the answer to query q_i is obtained. For any string r_{i+1} ∈ {0,1}^n and all (k_1, k_2) ∈ RKP(v_i || r_{i+1}),

Pr_j[R_{i+1} = r_{i+1} | (k_1^*, k_2^*) = (k_1, k_2) ∧ View_i = v_i]
  = 1/(2^n − |E-Qrs(v_i)|)     if q_i is an E-query and r_{i+1} ∉ E-Qrs(v_i);
  = 1/(2^n − |F-Qrs(v_i, k)|)  if q_i is an F or F^{-1} query with key k and r_{i+1} ∉ F-Qrs(v_i, k);
  = 0                          otherwise.

In particular, the value depends neither on j nor on (k_1, k_2).

Proof (Lemma 5). See [1]. □
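The "uniform over the remaining range" behaviour in Lemma 5 is exactly lazy sampling of a random permutation. A minimal sketch (our own Python illustration, not part of the proof; `LazyPermutation` is a hypothetical helper name):

```python
import random

class LazyPermutation:
    """Sample a random permutation of {0, ..., 2^n - 1} on demand:
    each fresh query is answered uniformly from the unused range,
    matching the reply distribution described in Lemma 5."""
    def __init__(self, n, rng=None):
        self.size = 2 ** n
        self.fwd, self.inv = {}, {}
        self.rng = rng or random.Random()

    def query(self, x):
        if x not in self.fwd:
            # choose uniformly among values not yet used as replies
            free = [y for y in range(self.size) if y not in self.inv]
            y = self.rng.choice(free)
            self.fwd[x], self.inv[y] = y, x
        return self.fwd[x]

p = LazyPermutation(3)
ys = [p.query(x) for x in range(8)]
assert sorted(ys) == list(range(8))   # the answers form a permutation
assert p.query(5) == ys[5]            # repeated queries are consistent
```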
The above lemma shows that for a fixed partial conversation v_i where i ∈ OdMvs, and a fixed pair of keys (k_1, k_2) for which bad_i does not hold (i.e., (k_1, k_2) ∈ RKP(v_i)), all the answers r_{i+1} which continue to keep the partial conversation from being "bad" (i.e., (k_1, k_2) ∈ RKP(v_i || r_{i+1})) have the same probability in each probability space. We will use this lemma to prove an extension of this. Namely, for a fixed partial conversation v_i and a fixed pair of keys (k_1, k_2) for which bad_i does not hold, all further move sequences which continue to keep the partial conversation from being "bad" have the same probability in each probability space. We state this formally below.

Lemma 6. Let j ∈ {1, 2}. Let v_i be a possible view of the adversary after move i ∈ Mvs, and let 1 ≤ ℓ ≤ 2(q+t) − i. For any possible extension u_{i+1,i+ℓ} of v_i by ℓ moves, and for any key pair (k_1, k_2) ∈ RKP(v_i || u_{i+1,i+ℓ}),

Pr_j[U_{i+1,i+ℓ} = u_{i+1,i+ℓ} | (k_1^*, k_2^*) = (k_1, k_2) ∧ View_i = v_i]

depends neither on j nor on (k_1, k_2). (That is, it depends only on v_i and u_{i+1,i+ℓ}.)

Proof (Lemma 6). See [1]. □
We now use the above lemma to prove a generalization of Lemma 5 which we will need subsequently.

Lemma 7. Let j ∈ {1, 2} and i ∈ OdMvs. Let v_i = q_1 r_2 ... q_{i-2} r_{i-1} q_i be a possible view of the adversary just before the answer to query q_i is obtained. For any string r_{i+1} ∈ {0,1}^n, all (k_1, k_2) ∈ RKP(v_i || r_{i+1}), and all 0 ≤ s ≤ 2(q+t) − i,

Pr_j[R_{i+1} = r_{i+1} | (k_1^*, k_2^*) = (k_1, k_2) ∧ View_i = v_i ∧ ¬bad_{i+s}]

depends neither on j nor on (k_1, k_2). (That is, it depends only on v_i, r_{i+1} and s.)
Proof (Lemma 7). See [1]. □
Proof (Lemma 1). The proof is by induction on i ∈ Mvs. The base case of the induction is i = 0, and in this case the lemma is trivially true because the view is by definition the empty string. So assume the statement of the lemma up to move i. We will prove it for i + 1. Fix an arbitrary s ≥ 0.

First consider the case where i ∈ EvMvs, meaning the last move in v_i is a reply. Let q_{i+1} be arbitrary. Then:

Pr_j[View_{i+1} = v_i q_{i+1} | ¬bad_{i+1+s}]
  = Pr_j[View_i = v_i | ¬bad_{i+1+s}] · Pr_j[Q_{i+1} = q_{i+1} | View_i = v_i ∧ ¬bad_{i+1+s}].

First, look at the first factor. Since s ≥ 0 by assumption, s + 1 ≥ 0 as well, and therefore the first factor is the same for j = 1 and 2 by induction. Next look at the second factor. A's query depends only on A and on v_i, the view so far. Thus this probability is the same for both j = 1 and j = 2. (And it is equal to 0 except possibly for one value of q_{i+1}.) Therefore, the product of the two probabilities is equal for j = 1 and j = 2, for all s ≥ 0.

Next consider the case where i ∈ OdMvs, meaning the last move in v_i is a query. Let r_{i+1} ∈ {0,1}^n be arbitrary and let v_{i+1} = v_i r_{i+1}. Then:

Pr_j[View_{i+1} = v_i r_{i+1} | ¬bad_{i+1+s}]
  = Pr_j[View_i = v_i | ¬bad_{i+1+s}] · Pr_j[R_{i+1} = r_{i+1} | View_i = v_i ∧ ¬bad_{i+1+s}].

Consider the first factor. Since s ≥ 0 by assumption, s + 1 ≥ 0 as well, and therefore, by induction, the first factor is the same for j = 1 and 2. The second factor is equal to

Σ_{(k_1,k_2)} p_j(k_1, k_2) · q_j(k_1, k_2),

where the sum is over all (k_1, k_2) ∈ {0,1}^κ × {0,1}^κ and we have set

p_j(k_1, k_2) = Pr_j[R_{i+1} = r_{i+1} | (k_1^*, k_2^*) = (k_1, k_2) ∧ View_i = v_i ∧ ¬bad_{i+1+s}]
q_j(k_1, k_2) = Pr_j[(k_1^*, k_2^*) = (k_1, k_2) | View_i = v_i ∧ ¬bad_{i+1+s}].

We start by examining the first factor, namely p_j(k_1, k_2). By Lemma 7, for all (k_1, k_2) ∉ SKP(v_{i+1}), this probability is the same for both j = 1 and 2, and independent of (k_1, k_2). Call this value p. On the other hand, for (k_1, k_2) ∈ SKP(v_{i+1}) we have p_j(k_1, k_2) = 0 because of the conditioning on ¬bad_{i+1+s}. Thus the above sum reduces to

p · Σ_{(k_1,k_2)} q_j(k_1, k_2),

where the sum is over all (k_1, k_2) ∈ RKP(v_{i+1}). We claim that this range covers all the nonzero values of the probability, and thus the sum is equal to 1. To see this, note that q_j(k_1, k_2) is equal to 0 for (k_1, k_2) ∈ SKP(v_{i+1}). This completes the induction and the proof of Lemma 1. □
The remaining task is to prove Lemma 2, which states that the probability that the bad event occurs is the same in both probability spaces. To do so we will first prove the following lemma about the distribution of keys. The proof of this lemma will use Lemma 1 which, recall, states that the probability of a given query and response (which are not bad), for a fixed partial view and a fixed pair of keys (which are not bad), is the same in both probability spaces.

3.4
Equi-Probability of Unseen Keys
A crucial lemma is that in Game 1, as long as the bad event has not happened, if the adversary has a particular view, then any "un-eliminated" key pair is equally likely to be the crucial key pair. Without this, it might be that the adversary's chance of hitting the crucial key is better in Game 1 (given the bad event fails) than in Game 2 (given the bad event fails). To simplify notation, for j ∈ {1, 2} and v_i let

Pr_{j,v_i}[·] = Pr_j[· | View_i = v_i ∧ ¬bad_i].

Lemma 8. Let j ∈ {1, 2}. Let v_i be a possible view of the adversary after move i ∈ Mvs. Let (k_1, k_2) ∈ RKP(v_i). Then

Pr_{j,v_i}[(k_1^*, k_2^*) = (k_1, k_2)] = 1/|RKP(v_i)|.

Proof (Lemma 8). See [1]. □
Using the above lemma we can now prove Lemma 2, which (recall) states that Pr_1[bad_i] = Pr_2[bad_i] for all i ∈ Mvs.

Proof (Lemma 2). The proof is by induction on i ∈ Mvs. The base case is i = 0. In this case the current view v of the adversary, in either game, is empty, so that SKP(v) = ∅. Thus both probabilities are zero. So assume the lemma statement is true up to move i ∈ Mvs where i < 2(q+t). We will prove it for i + 1, namely we will show that

Pr_1[bad_{i+1}] = Pr_2[bad_{i+1}].    (3)

We first consider the case where i + 1 is even, meaning the last move in v_i is a query. We have

Pr_j[bad_{i+1}] = Pr_j[bad_i] + Pr_j[bad_{i+1} ∧ ¬bad_i].

The first term is equal for j = 1 and 2 by induction, and Pr_j[bad_{i+1} ∧ ¬bad_i] = 0 because i + 1 is even (recall bad_i = bad_{i+1} for i ∈ OdMvs).

To complete the induction we need to prove Equation (3) for the case where i + 1 is odd, meaning the last move in v_i is a reply. Let j ∈ {1, 2}. We can write

Pr_j[bad_{i+1}] = Pr_j[bad_i] + Pr_j[bad_{i+1} | ¬bad_i] · Pr_j[¬bad_i].

The first term, and hence also Pr_j[¬bad_i], is independent of j by the induction hypothesis. We will now argue that Pr_j[bad_{i+1} | ¬bad_i] is also independent of j. By conditioning we can write this term as
Pr_j[bad_{i+1} | ¬bad_i]
  = Σ_{v_i ∈ V_j} Pr_j[bad_{i+1} | ¬bad_i ∧ View_i = v_i] · Pr_j[View_i = v_i | ¬bad_i]
  = Σ_{v_i ∈ V_j} Pr_{j,v_i}[bad_{i+1}] · Pr_j[View_i = v_i | ¬bad_i],

where V_j = { v_i : Pr_j[View_i = v_i | ¬bad_i] > 0 } is the set of possible views after move i in Game j. We call the summand the "product term associated to v_i," with first factor Pr_{j,v_i}[bad_{i+1}] and second factor Pr_j[View_i = v_i | ¬bad_i].

Let us first observe that V_1 = V_2, namely the set of views v_i for which the second factor of the product term associated to v_i is positive is the same in both games. This is true by Lemma 1, which tells us that Pr_j[View_i = v_i | ¬bad_i] does not depend on j, and hence in particular the values of v_i for which it is zero are the same for j = 1 and j = 2. Now let us set V = V_1 = V_2 and compare the sums, term by term, in the cases j = 1 and j = 2. Fix a particular string v_i ∈ V and focus on the product term associated to v_i. The second factor in it is independent of j by Lemma 1. We will show the same is true for the first factor, which will complete the proof. (One needs to be a little careful. The first factor is not well defined for just any v_i, only for v_i ∈ V_j. That's why it was important, first, to restrict attention to these v_i values, and, second, to make sure that V_1 = V_2, since otherwise we would not be sure that we have shown equality for every term in the two sums.)

So the remaining task is to consider Pr_j[bad_{i+1} | ¬bad_i ∧ View_i = v_i] for v_i ∈ V and show it does not depend on j. First note that RKP(v_i) ≠ ∅, because RKP(v_i) = ∅ would imply Pr_j[View_i = v_i | ¬bad_i] = 0, and we have assumed this is not the case. Since the view v_i and the adversary are fixed, the next query q_{i+1} is uniquely determined. Let

NKP(v_i, q_{i+1}) = RKP(v_i) − RKP(v_i || q_{i+1})

be the set of "new key pairs" that are "seen" by the (i+1)-th query. (This set is empty if the latter is an E-query. It is also empty if it is an F or F^{-1} query with a key with which A has already queried. If it is an F or F^{-1} query with a key k with which A has not queried, then the set consists of the pairs (k, k') and (k', k) where k' is any other key with which A has queried F or F^{-1}.) We claim that

Pr_j[bad_{i+1} | ¬bad_i ∧ View_i = v_i] = |NKP(v_i, q_{i+1})| / |RKP(v_i)|,    (4)

for both j = 1 and j = 2.
Note the fraction is well defined, in that the denominator is not zero, because RKP(v_i) is non-empty. Equation (4) follows from Lemma 8, which says that from the point of view of the adversary, all remaining key pairs remain equally likely, in either game. □
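The counting behind Equation (4) is simple enough to mirror in code. The sketch below is our own illustration (key-pair sets are represented implicitly via the set of already queried keys, and we follow the parenthetical enumeration in the text, which counts the pairs (k, k') and (k', k)):

```python
def nkp_size(queried_keys, new_query_key):
    """|NKP(v_i, q_{i+1})|: key pairs newly 'seen' by the next query.
    queried_keys:  distinct keys already used in F/F^-1 queries.
    new_query_key: key of the next F/F^-1 query, or None for an E-query."""
    if new_query_key is None or new_query_key in queried_keys:
        return 0
    # pairs (k, k') and (k', k) for each previously queried key k'
    return 2 * len(queried_keys)

# After queries under keys {5, 9}, a fresh F-query under key 3 newly
# "sees" (3,5), (5,3), (3,9) and (9,3):
assert nkp_size({5, 9}, 3) == 4
assert nkp_size({5, 9}, 9) == 0      # key already queried
assert nkp_size({5, 9}, None) == 0   # an E-query sees no key pairs
```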
Acknowledgments. The second author was supported by a 1996 Packard Foundation Fellowship in Science and Engineering, and by NSF CAREER Award CCR-9624439. The third author was supported in part by the above-mentioned grants of the second author. Thanks to the members of the Crypto 98 program committee for their comments on the paper.
References

1. W. Aiello, M. Bellare, G. Di Crescenzo and R. Venkatesan, "Security amplification by composition: The case of doubly-iterated, ideal ciphers." Full version of this paper, available via http://www-cse.ucsd.edu/users/mihir.
2. M. Bellare, J. Kilian and P. Rogaway, "The security of cipher block chaining," Advances in Cryptology – Crypto 94 Proceedings, Lecture Notes in Computer Science Vol. 839, Y. Desmedt ed., Springer-Verlag, 1994.
3. E. Biham and A. Shamir, "Differential cryptanalysis of DES-like cryptosystems," J. of Cryptology, Vol. 4, No. 1, pp. 3–72, 1991.
4. E. Biham and A. Shamir, "Differential cryptanalysis of the full 16-round DES," Advances in Cryptology – Crypto 92 Proceedings, Lecture Notes in Computer Science Vol. 740, E. Brickell ed., Springer-Verlag, 1992.
5. W. Diffie and M. Hellman, "Exhaustive cryptanalysis of the NBS Data Encryption Standard," Computer, Vol. 10, No. 6, pp. 74–84, June 1977.
6. S. Even and O. Goldreich, "On the power of cascade ciphers," ACM Transactions on Computer Systems, Vol. 3, No. 2, May 1985, pp. 108–116.
7. S. Even and Y. Mansour, "A construction of a cipher from a single pseudorandom permutation," Advances in Cryptology – ASIACRYPT 91 Proceedings, Lecture Notes in Computer Science Vol. 739, H. Imai, R. Rivest and T. Matsumoto eds., Springer-Verlag, 1991.
8. J. Kilian and P. Rogaway, "How to protect DES against exhaustive key search," Advances in Cryptology – Crypto 96 Proceedings, Lecture Notes in Computer Science Vol. 1109, N. Koblitz ed., Springer-Verlag, 1996.
9. M. Matsui, "Linear cryptanalysis method for DES cipher," Advances in Cryptology – Eurocrypt 93 Proceedings, Lecture Notes in Computer Science Vol. 765, T. Helleseth ed., Springer-Verlag, 1993.
10. U. Maurer and J. Massey, "Cascade ciphers: The importance of being first," Journal of Cryptology, Vol. 6, No. 1, 1993, pp. 55–61.
11. R. Merkle, "Secrecy, Authentication, and Public Key Systems," UMI Research Press, Ann Arbor, Michigan, 1979.
12. R. Merkle and M. Hellman, "On the security of multiple encryption," Communications of the ACM, Vol. 24, No. 7, pp. 465–467, July 1981.
13. C. Shannon, "Communication theory of secrecy systems," Bell Systems Technical Journal, Vol. 28, No. 4, 1949, pp. 656–715.
14. P. van Oorschot and M. Wiener, "Improving meet in the middle attacks by orders of magnitude," Advances in Cryptology – Crypto 96 Proceedings, Lecture Notes in Computer Science Vol. 1109, N. Koblitz ed., Springer-Verlag, 1996.
15. P. van Oorschot and M. Wiener, "A known plaintext attack on two-key triple encryption," Advances in Cryptology – Eurocrypt 90 Proceedings, Lecture Notes in Computer Science Vol. 473, I. Damgård ed., Springer-Verlag, 1990.
A  Best Attack: Meet in the Middle
In this section we will show the following:

Lemma 9. For any κ, n ≥ 1, any 1 ≤ s ≤ q ≤ 2^{n-1}, and any t ≥ 2s, there is an adversary A such that

Adv_A(Dbl, κ, n) ≥ (t^2 / 4s^2) · (1/2^{2κ} − 1/2^{s(n-1)}).

We can now optimize the value of s and obtain the following theorem, which says that the bound of Theorem 1 is essentially tight:

Theorem 2. For any κ, n ≥ 1, let s = ⌈(2κ+1)/(n−1)⌉. Then for any t ≥ 2s and s ≤ q ≤ 2^{n-1} it is the case that

Sec(Dbl, κ, n, q, t) ≥ (1/8s^2) · (t^2/2^{2κ}).

Proof. The choice of s guarantees that 2^{2κ+1} ≤ 2^{s(n-1)}. This means that

1/2^{2κ} − 1/2^{s(n-1)} ≥ (1/2) · (1/2^{2κ}).

Now apply Lemma 9. □
Notice that for typical block cipher parameters κ, n, the value of s is very small. For example, for the DES parameters κ = 56 and n = 64 we have s = ⌈113/63⌉ = 2. Thus the lower bound of Theorem 2 is in practice close to the upper bound of Theorem 1.

Proof (Lemma 9). The proof is by presenting an adversary A who achieves the claimed advantage. The adversary A plays a version of the meet-in-the-middle attack, but we need to adapt it slightly and then analyze it in our framework. It is convenient to let [N] = {1, 2, ..., N} for any integer N ≥ 1. The adversary proceeds as follows:

For j = 1, ..., s do
  Let x_j ∈ {0,1}^n be the j-th string in lexicographic order
  Compute y_j = E(x_j)
Choose two disjoint sets K_1 = { k_{1,i} : i ∈ [t/2s] } and K_2 = { k_{2,i} : i ∈ [t/2s] } of κ-bit keys, each set being of size t/2s. (These might be chosen at random, but not necessarily.)
For i = 1, ..., t/2s do
  For j = 1, ..., s do
    Compute u_{i,j} = F(k_{1,i}, x_j) and v_{i,j} = F^{-1}(k_{2,i}, y_j)
  Let u_i = (u_{i,1}, ..., u_{i,s}) and v_i = (v_{i,1}, ..., v_{i,s})
Let C = { (a, b) ∈ [t/2s] × [t/2s] : u_a = v_b }
If C ≠ ∅ then return 1, else return 0

An analysis of this attack, showing that the advantage of the adversary is as claimed, is in [1]. □
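The attack above is easy to exercise against a toy double cipher. The sketch below is our own illustration (plain Python; toy parameters of our choosing, the ideal cipher modeled by lazily sampled random permutations, and s = 1 known plaintext plus a few confirmation plaintexts for simplicity):

```python
import random
from collections import defaultdict

KAPPA, N = 6, 8                       # toy key and block sizes
rng = random.Random(42)

perms = {}                            # "ideal cipher": a fresh random
def F(k, x):                          # permutation for every key k
    if k not in perms:
        p = list(range(2 ** N))
        rng.shuffle(p)
        perms[k] = p
    return perms[k][x]

def F_inv(k, y):
    F(k, 0)                           # ensure the permutation is sampled
    return perms[k].index(y)

# Double encryption under a secret crucial key pair (k1 outer, k2 inner).
k1, k2 = rng.randrange(2 ** KAPPA), rng.randrange(2 ** KAPPA)
E = lambda x: F(k1, F(k2, x))

x, y = 0, None
y = E(x)
fwd = defaultdict(list)               # middle value -> inner-key candidates
for a in range(2 ** KAPPA):
    fwd[F(a, x)].append(a)

found = None
for b in range(2 ** KAPPA):           # outer-key candidates
    for a in fwd[F_inv(b, y)]:        # candidates meeting in the middle
        # confirm the candidate pair on a few extra plaintexts
        if all(F(b, F(a, z)) == E(z) for z in range(1, 5)):
            found = (b, a)
            break
    if found:
        break

assert found is not None
b, a = found
assert all(F(b, F(a, z)) == E(z) for z in range(8))   # equivalent key pair
```

Note the matching step stores all inner keys per middle value, since with toy parameters distinct keys frequently collide on F(a, x); the extra confirmation plaintexts then weed out the false matches.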
On the Existence of 3-Round Zero-Knowledge Protocols Satoshi Hada and Toshiaki Tanaka KDD R & D Laboratories 2-1-15 Ohara, Kamifukuoka, Saitama 356-8502, Japan {sa-hada,tl-tanaka}@kdd.co.jp
Abstract. In this paper, we construct a 3-round zero-knowledge protocol for any NP language. Goldreich and Krawczyk proved that a 3-round black-box simulation zero-knowledge protocol exists only for BPP languages. However, there is no contradiction here. That is, our proposed protocol achieves a weaker notion of zero-knowledge: auxiliary-input nonuniform zero-knowledge. Since this notion has not been investigated in the literature, we classify several zero-knowledge notions including it and discuss the relationships among them. Our main contribution is to provide a non-black-box simulation technique. It is based on a novel computational assumption related to the Diffie-Hellman problem. Although this assumption is strong and nonstandard, its non-standard nature seems essential for our simulation technique. Keywords: Zero-knowledge, interactive proof, Diffie-Hellman problem.
1
Introduction
The fundamental notion of zero-knowledge (ZK) introduced by Goldwasser, Micali and Rackoff plays a central role in modern cryptography [GMR85]. In this paper, we investigate the methodology underlying ZK in order to construct a 3-round ZK protocol for NP. 1.1
Background on Zero-Knowledge Protocol
Consider an interactive protocol in which a prover convinces a verifier that some common input x belongs to some underlying language L (in this paper, L is in NP). The length of x is denoted by n and one measures complexity in terms of n. The verifier is always a probabilistic polynomial-time machine. We focus on two properties: "soundness" and "zero-knowledge." Each can be formalized in two ways depending on whether or not we restrict the adversary (the cheating prover and the cheating verifier) to a resource-bounded machine. Soundness asks that if x ∉ L, any cheating prover cannot convince the verifier to accept, except with negligible error probability. This notion is formalized in two ways: "proofs" and "arguments."

H. Krawczyk (Ed.): CRYPTO'98, LNCS 1462, pp. 408–423, 1998. © Springer-Verlag Berlin Heidelberg 1998

These provide the statistical
soundness and the computational soundness, respectively. The former requires that even a computationally unrestricted cheating prover should be unable to make the verifier accept x ∉ L, except with negligible probability [GMR85]. On the other hand, the latter requires that any cheating prover restricted to probabilistic polynomial time should be unable to make the verifier accept x ∉ L, except with negligible probability [BrCr86][BCC88]. Although the notion of arguments is weaker than the notion of proofs, it is good enough for cryptographic applications. The soundness of arguments will typically depend on complexity assumptions such as the discrete logarithm assumption. Whenever we talk of proofs or arguments, we always mean ones with negligible error probability. Zero-knowledge asks that when x ∈ L, an interaction with the prover yields no information (other than the fact x ∈ L) to any cheating verifier. Again, this notion is formalized in two ways: "statistical ZK" (SZK) and "computational ZK" (CZK). The former requires that even a computationally unrestricted cheating verifier will not gain useful information, except with negligible probability. On the other hand, the latter requires that any resource-bounded cheating verifier (probabilistic polynomial-time machine or polynomial-size circuit family) will not gain useful information, except with negligible probability. Clearly, SZK is a special case of CZK. In this paper, unless stated explicitly, ZK protocols mean CZK arguments. Our proposed protocol is a CZK argument. 1.2
Classification of Zero-Knowledge
Our proposed protocol achieves the notion of auxiliary-input non-uniform ZK. Since this notion has not been investigated in detail so far, we classify several relevant ZK notions and discuss the relationships among them. ZK was originally formalized in [GMR85] as follows: for any probabilistic polynomial-time machine V̂ (the cheating verifier), there exists a probabilistic polynomial-time machine S_V̂ (the simulator) which produces a probability distribution that is computationally indistinguishable from the distribution of conversations of V̂ with the prover P. This original definition (GMRZK) is not suitable for cryptographic applications since it is not closed under sequential composition [GoKr96]. In cryptographic applications, the verifier can have some additional a-priori information. In order to overcome this problem, auxiliary-input zero-knowledge (AIZK) was introduced in [GoOr94]. AIZK is defined by augmenting GMRZK with an auxiliary input; that is, the simulation requirement is extended to deal with non-uniform verifiers with an auxiliary input, where the simulator takes the same auxiliary input used by the verifier. It was shown that AIZK is closed under sequential composition [GoOr94]. Black-box simulation zero-knowledge (BSZK) requires the existence of a universal simulator that, using any non-uniform verifier V̂ as a black box, succeeds in simulating the interaction of V̂ with the prover P. It was shown that BSZK implies AIZK [GoOr94]. Although BSZK is the most restrictive among the above definitions, almost all known ZK protocols are BSZK.
All the above definitions are "semi-uniform" in the sense that they use uniform machines but quantify over all common inputs x ∈ L. The non-uniform formalization of ZK appeared in [Go93], where all machines are modeled by a family of polynomial-size circuits. We consider two non-uniform formalizations here: non-uniform zero-knowledge and auxiliary-input non-uniform zero-knowledge. Non-uniform zero-knowledge (NUZK) is a non-uniform variant of GMRZK. That is, it requires that for any family of polynomial-size circuits V̂, there exists a family of (probabilistic) polynomial-size circuits S_V̂ which produces a probability distribution that is computationally indistinguishable from the distribution of conversations of V̂ with the prover P. It is important to note that NUZK does not imply GMRZK [Go98-2]. In fact, one can devise artificial protocols for sparse languages that achieve the notion of NUZK but not GMRZK. For example, consider the following interactive proof for the sparse language L_sp = {1^n}_{n∈N}. The prover sends the verifier a hard function K(·) of the common input x ∈ L_sp (e.g., K is a non-recursive function indicating whether the n-th Turing machine accepts every input). The verifier accepts iff x is of the form 1^n. Certainly, this is an interactive proof for L_sp. It is not GMRZK since there is no way to simulate in probabilistic polynomial time the interaction in which the prover sends the value of K(x). On the other hand, it is still NUZK since the simulator may just incorporate the hard bit (i.e., the n-th circuit will incorporate the bit indicating whether the n-th Turing machine accepts every input). This shows that NUZK is a very weak notion of ZK and does not satisfy the intuitive requirement of ZK. Also, the result of [GoKr96] can be extended to show that NUZK is not closed under sequential composition. Auxiliary-input non-uniform zero-knowledge (AINUZK) is defined by augmenting the notion of NUZK with an auxiliary input.
The above interactive proof for a sparse language achieves not only NUZK but also AINUZK. That is, AINUZK also does not satisfy the intuitive requirement of ZK. However, AINUZK has an advantage over NUZK since it is closed under sequential composition [GoOr94][Go93]. Our proposed protocol achieves this notion. Let Cl(def) denote the class of all interactive proofs and arguments satisfying the requirements of definition def. In light of the above, it holds that

Cl(BSZK) ⊆ Cl(AIZK) ⊂ Cl(GMRZK) ⊂ Cl(NUZK) and Cl(AIZK) ⊂ Cl(AINUZK) ⊂ Cl(NUZK).

It is an open problem whether Cl(BSZK) equals Cl(AIZK) [GoOr94]. 1.3
Motivation and Contribution
The round complexity, the number of messages exchanged, is a standard complexity measure for the efficiency of ZK protocols. Several researchers constructed constant round ZK protocols for NP [BCY89][FeSh89] [GoKa96] 1 [BJY97]. The 1
¹ ZK protocols constructed in [GoKa96] are proofs rather than arguments.
On the Existence of 3-Round Zero-Knowledge Protocols
The lower bounds on the round complexity have been investigated from both practical and theoretical viewpoints. Goldreich and Oren proved that only languages in BPP have 2-round AIZK protocols [GoOr94]. Their result can be extended to prove that only languages in P/poly have 2-round AINUZK protocols. Furthermore, Goldreich and Krawczyk proved that only languages in BPP have 3-round BSZK protocols [GoKr96]². Since the argument in [GoKr96] uses the notion of black-box simulation in an essential way, their result does not apply to weaker notions such as AIZK and AINUZK. Therefore, with respect to AIZK and AINUZK, it is an interesting open problem whether there exists a 3-round ZK protocol for a non-trivial language, i.e., a language not known to be in BPP or P/poly, respectively. As mentioned above, almost all known ZK protocols are BSZK; that is, the zero-knowledge property has been demonstrated using black-box simulation of the verifier. In fact, it seems hard to conceive of an alternative way to demonstrate the ZK property. Therefore, it seems hard to construct a 3-round ZK protocol for a non-trivial language. In other words, in order to construct such protocols, a new simulation technique is needed. In this paper, we construct a 3-round AINUZK protocol for any NP language. Our result does not contradict the result of [GoKr96] since our proposed protocol does not achieve the notion of BSZK. Our main contribution is to provide a non-black-box simulation technique. It is based on a novel computational assumption related to the Diffie-Hellman problem. We call it the strong Diffie-Hellman assumption (SDHA). Although this assumption is strong and non-standard, its non-standard nature seems essential for our simulation technique. Organization. In Section 2, we give the definitions of AINUZK arguments and some standard complexity assumptions. Section 3 describes our proposed protocol. In Section 4, we formalize SDHA. In Section 5, we prove the correctness of our proposed protocol.
In Section 6, we conclude with some remarks.
2 Preliminaries
In this section, we give the definitions of AINUZK arguments and some standard complexity assumptions. Most of this section follows [BJY97] and [Go98-1].
2.1 Auxiliary-Input Non-Uniform Zero-Knowledge Arguments
We deal with NP languages and let WL(x) denote the witness set of x with respect to an NP language L. We say that a function ν(·) : N → R is negligible if for every polynomial poly(·) and all sufficiently large n, it holds that ν(n) < 1/poly(n). Also, we say that a function g(·) : N → R is overwhelming if g(·) = 1 − ν(·) for some negligible function ν(·).
² The proofs in [GoKr96] are for CZK proofs. However, their result extends to CZK arguments. See Remarks 6.3 and 6.5 in that paper.
Satoshi Hada and Toshiaki Tanaka
We consider two probabilistic polynomial-time interactive machines called the prover and the verifier. Initially both machines have access to a common input tape which includes x of length n. The prover and the verifier send messages to one another through two communication tapes. After exchanging a polynomial number of messages, the verifier stops in an accept state or in a reject state. Each machine, denoted by A, only sees its own tapes, namely, the common input tape, the random tape, the auxiliary-input tape and the communication tapes. In particular, the prover's auxiliary-input tape includes a witness w ∈ WL(x). Let A(x, m, r) denote A's next message, where x is the common input, r the random coins and m the messages so far. We let A_x(·, ·) = A(x, ·, ·) and A_{x,r}(·) = A(x, ·, r). When A takes an auxiliary input y, we write A_x^y and A_{x,r}^y for A_x and A_{x,r}, respectively. Let Acc(P_x, V_x) denote the probability that V accepts when interacting with P on the common input x. The probability is taken over the random tapes of both machines. Definition 1. Let P, V be two probabilistic polynomial-time interactive machines. We say that (P, V) is an argument for L if the following two conditions are satisfied: - Completeness: For every x ∈ L and every w ∈ WL(x), Acc(P_x^w, V_x) = 1. - Soundness: For every probabilistic polynomial-time machine P̂ (the cheating prover), every polynomial poly(·), all sufficiently long x ∉ L and all y, Acc(P̂_x^y, V_x)
a ≠ a' and y^a f(r) = y^{a'} f(r'). Then y^{a−a'} = f(r' r^{−1}), which means we can find a preimage of y by definition of q-one-wayness. This in turn contradicts the assumption that G is one-way, if P* runs in polynomial time. Next, we describe an unconditionally binding scheme:
– Set-up Phase: P runs the unconditionally binding q-homomorphism generator G on input 1^l to obtain f : H → G. He chooses an element y ∈ G according to Definition 23. Then f, G, H, y are sent to V. For some generators V can verify himself that y indeed has the property requested in Definition 23. If this is not the case, P must give a zero-knowledge proof that y ∉ Im(f). This can be done by a straightforward modification of the classical quadratic non-residuosity protocol from [20].
– Commitment to an integer 0 ≤ a < q: P chooses a random r ∈ H, and sends commit(r, a) = y^a f(r) to V.
– Opening commitment C: P sends a, r to V, who accepts if and only if C = commit(r, a) and 0 ≤ a < q.
– Hiding Property: follows immediately from the assumption in Definition 23.
– Binding Property: Definition 23 guarantees that if V accepts the set-up phase, commitments to different values will be in distinct cosets of Im(f).
We will write [r, a]_y for y^a f(r), and sometimes, when no misunderstanding is possible, only [r, a] or [a]. It should be clear from the definition of these commitments that both types have a linear homomorphic property: given commitments [r, a] and [s, b], P can open [r, a] · [s, b] to reveal (a + b) mod q. Indeed, let j
Ronald Cramer and Ivan Damgård
be such that a + b = ((a + b) mod q) + jq, and let t be such that f(t) = y^{jq}. Note that by q-one-wayness, t is easy for P to compute. We have [r, a] · [s, b] = [rst, (a + b) mod q]. In a similar way, it follows that [r, a]^c = [r', ca mod q] and y^c · [r, a] = [r'', (c + a) mod q] for a constant c and easily computable (by P) values r', r'' ∈ H.
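As a concrete illustration of the linear homomorphic property, the following toy sketch instantiates commitments in Pedersen style via the Discrete Log Generator of Section 3 (H = Z_q, f(x) = g^x mod p), where the correction term f(t) vanishes because y also has order q. All parameters are tiny hypothetical values, far too small for security:

```python
# Toy Pedersen-style commitment [r, a]_y = y^a * f(r) with f(x) = g^x mod p.
# Hypothetical, insecure parameters for illustration only.
import random

p, q = 23, 11          # q | p - 1; the order-q subgroup of Z_p* is used
g = 2                  # 2^11 = 2048 = 1 (mod 23), so g has order 11
y = pow(g, 7, p)       # some y in <g>; chosen by the set-up phase in the scheme

def commit(r, a):
    """[r, a]_y = y^a * g^r mod p, with r in Z_q and 0 <= a < q."""
    return (pow(y, a, p) * pow(g, r, p)) % p

# Linear homomorphic property: [r, a] * [s, b] opens to (a + b) mod q.
r, a = random.randrange(q), 3
s, b = random.randrange(q), 9
C = (commit(r, a) * commit(s, b)) % p
assert C == commit((r + s) % q, (a + b) % q)
```

In this instantiation the randomness simply adds modulo q, matching [r, a] · [s, b] = [rst, (a + b) mod q] with t trivial.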
2.3 Auxiliary Protocols
All protocols in this section are proofs of knowledge and 3-move Arthur-Merlin games, with a random challenge from V as the second message. We say that such a protocol has the special soundness property if from any pair of accepting conversations (m, e, z), (m, e', z'), where e ≠ e', one can efficiently compute the information the prover claims to know. In [3], a definition of proofs of knowledge is given, part of which is the soundness error. Loosely speaking, this is the maximal probability with which the prover can convince the verifier without having the claimed knowledge: the definition requires that any prover with success probability larger than the soundness error should be able to compute the relevant knowledge in expected time inversely proportional to his success probability. We have the following, which can be found, e.g., in the coming journal version of [13]. It is hardly surprising, but less trivial to prove than one might expect:
Lemma 24 If a protocol has special soundness, it has soundness error 1/c, where c is the number of possible challenges the verifier chooses from.
A protocol is special honest verifier zero-knowledge (SHVZK) if it has a simulator which on input e produces a correctly distributed conversation (m, e, z). This is a stronger condition than ordinary honest verifier zero-knowledge, which just calls for a simulator producing a conversation with a random e. We first give a protocol for showing that a commitment contains a 0/1 value. For this, it turns out to be sufficient to be able to prove knowledge of a preimage under f. The following protocol can be used for any f generated by a q-one-way generator, and is a generalization of Schnorr's discrete log protocol [25]:
f-PREIMAGE PROTOCOL
Input: f and u ∈ G. P knows v such that f(v) = u.
1. P chooses r ∈ H at random and sends m = f(r) to V.
2. V chooses a random challenge e, so that 0 ≤ e < q, and sends it to P.
3. P sends z = r v^e to V, who checks that f(z) = m u^e.
Lemma 25 If P, V follow the protocol, V always accepts. The protocol has the special soundness property and is SHVZK.
Proof The first claim is trivial. The second follows directly from the definition of q-one-wayness. Finally, on input e, one simulates by choosing z at random and outputting (f(z) u^{−e}, e, z). □
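For the Discrete Log Generator instantiation f(x) = g^x mod p, the f-PREIMAGE PROTOCOL is exactly Schnorr's protocol. The sketch below plays both parties with toy hypothetical parameters to check the verification equation; note that the multiplicatively written response z = r v^e in H becomes z = r + ev mod q when H = Z_q:

```python
# Sketch of the f-PREIMAGE PROTOCOL for f(x) = g^x mod p (Schnorr's protocol),
# run non-interactively by playing both parties. Toy, insecure parameters.
import random

p, q, g = 23, 11, 2              # g has order q in Z_p*
v = 5                            # P's secret preimage
u = pow(g, v, p)                 # public: u = f(v)

# Step 1: P commits to randomness r
r = random.randrange(q)
m = pow(g, r, p)                 # m = f(r)
# Step 2: V sends a random challenge
e = random.randrange(q)
# Step 3: P responds; z = r * v^e in H is r + e*v mod q for H = Z_q
z = (r + e * v) % q
# V's check: f(z) = m * u^e
assert pow(g, z, p) == (m * pow(u, e, p)) % p
```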
Zero-Knowledge Proofs for Finite Field Arithmetic
It is clear that this protocol can be used to show that a commitment C contains 0, by using u = C, and that it contains 1 by using u = C y^{−1}. We may now use the proof of partial knowledge technique from [10][12] to make a protocol in which P proves that C contains 0 or 1, without revealing which is the case. The resulting protocol is referred to as a bit commitment proof. It is still SHVZK, and has special soundness. Its communication complexity is 4l + 2 log q bits.
The final auxiliary protocol we have is a multiplication protocol, an interactive proof showing that the prover can open commitments A, B, C to reveal values a, b, c for which c = ab mod q. As a side effect, we also obtain a protocol for showing that the prover can open a commitment. Assume P knows how to write the commitments in the form A = [r, a]_y, B = [u, b]_y, C = [s, ab mod q]_y. Now observe that if we choose j such that ab = ((ab) mod q) + jq and set t = f^{−1}(y^{−jq}) s u^{−a}, then t is easily computable by P, and C = [t, a]_B. Conversely, assuming that one can open A and B to reveal a, b, knowledge of such a t implies one can open C to reveal ab mod q. With this rewriting of C we see that, loosely speaking, we need a protocol for showing that A contains the same value w.r.t. y as C does w.r.t. B. This leads to:
MULTIPLICATION PROTOCOL
Input: f and commitments A, B, C. P knows a, r, t, b, u such that A = [r, a]_y, C = [t, a]_B and B = [u, b]_y. The protocol proceeds by executing the following two 3-step protocols in parallel, using the same challenge e in both instances. The first is intended to verify that A, C have the correct form, while the second verifies that the prover can open B³:
1. First protocol: (a) P chooses x ∈ Z_q and s1, s2 ∈ H at random and sends M1 = [s1, x]_y, M2 = [s2, x]_B to V. (b) V chooses a random number e, so that 0 ≤ e < q, and sends it to P. (c) P sets z = (x + ea) mod q and chooses i such that z = x + ea + iq. He then computes w1 = s1 r^e f^{−1}(y^{−iq}) and w2 = s2 t^e f^{−1}(B^{−iq}). He sends z, w1, w2 to V, who verifies that [w1, z]_y = M1 A^e and [w2, z]_B = M2 C^e.
2. Second protocol: (a) P chooses d ∈ Z_q and s ∈ H at random and sends M = [s, d]_y to V. (b) V chooses a random number e, so that 0 ≤ e < q, and sends it to P. (c) P sets v = (d + eb) mod q and chooses j such that v = d + eb + jq. He then computes w = s u^e f^{−1}(y^{−jq}). He sends v, w to V, who verifies that [w, v]_y = M B^e.
The properties of this protocol are the following:
³ In some cases, the context may imply that P knows how to open B, in which case the second subprotocol can be omitted.
Lemma 26 If P, V follow the protocol, V always accepts. The protocol has special soundness: from two accepting conversations with challenges e, e', e ≠ e', one can efficiently compute a, r, b, u, s such that A = y^a f(r), B = y^b f(u), C = y^{ab mod q} f(s). Finally, the protocol is SHVZK.
Proof The first claim is trivial by inspection. For the second, let two accepting conversations (M, M1, M2, e, v, w, z, w1, w2), (M, M1, M2, e', v', w', z', w1', w2'), where e ≠ e', be given. We immediately obtain 3 equations from each conversation. By dividing them pairwise, we get: y^{z−z'} f(w1 w1'^{−1}) = A^{e−e'}, B^{z−z'} f(w2 w2'^{−1}) = C^{e−e'} and y^{v−v'} f(w w'^{−1}) = B^{e−e'}. Define ω = (e − e')^{−1} mod q, and i such that (e − e')ω = 1 + iq. Let α = f^{−1}((B^i)^q), which is easy to compute by q-one-wayness. Then by raising the last equation to the power ω, we get B = y^{(v−v')ω} f((w w'^{−1})^ω α^{−1}), which is the desired form. The other two equations can be treated similarly. For honest verifier simulation on input e, choose v, w, z, w1, w2 uniformly at random, and compute the rest of the conversation by: M = y^v f(w) B^{−e}, M1 = y^z f(w1) A^{−e}, M2 = B^z f(w2) C^{−e}. □
The communication complexity of the multiplication protocol is 6l + 3 log q bits. Both our auxiliary protocols have soundness error 1/q by construction. For our main protocols, we will need error 2^{−k}. For this, we iterate the auxiliary protocols in parallel ⌈k/log q⌉ times. This works, since SHVZK and special soundness are trivially seen to be preserved under parallel composition.
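The algebra of the multiplication protocol's first subprotocol is easiest to see in the Discrete Log instantiation (H = Z_q, f(x) = g^x mod p), where the f^{−1}(y^{−iq}) corrections vanish because every element involved has order q. A toy sketch with hypothetical, insecure parameters:

```python
# Sketch of the multiplication protocol's first subprotocol, Pedersen-style
# instantiation (H = Z_q, f(x) = g^x mod p). Toy, insecure parameters.
import random

p, q, g = 23, 11, 2
y = pow(g, 7, p)

a, b = 4, 6
r, u = random.randrange(q), random.randrange(q)
A = pow(y, a, p) * pow(g, r, p) % p            # A = [r, a]_y
B = pow(y, b, p) * pow(g, u, p) % p            # B = [u, b]_y
s = random.randrange(q)
C = pow(y, a * b % q, p) * pow(g, s, p) % p    # C = [s, ab mod q]_y
t = (s - a * u) % q                            # rewriting: C = [t, a]_B = B^a g^t

# Prove that A and C commit to the same a (w.r.t. y resp. B)
x = random.randrange(q)
s1, s2 = random.randrange(q), random.randrange(q)
M1 = pow(y, x, p) * pow(g, s1, p) % p          # M1 = [s1, x]_y
M2 = pow(B, x, p) * pow(g, s2, p) % p          # M2 = [s2, x]_B
e = random.randrange(q)                        # V's challenge
z = (x + e * a) % q
w1, w2 = (s1 + e * r) % q, (s2 + e * t) % q    # responses (exponents add mod q)
assert pow(y, z, p) * pow(g, w1, p) % p == M1 * pow(A, e, p) % p
assert pow(B, z, p) * pow(g, w2, p) % p == M2 * pow(C, e, p) % p
```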
3 Examples of Group Homomorphism Generators
All of our generators take 1^l and a prime q as input parameters. Generators with bounded q include as part of their definition a constant δ. Proofs in this section are left to the reader.
RSA GENERATOR
The generator selects an RSA modulus N = p1 p2 of bit length l, for primes p1, p2, such that gcd(q, (p1 − 1)(p2 − 1)) = 1. The output is N. For this generator, we define H = G = Z_N^*, and f(x) = x^q mod N.
Lemma 31 Under the RSA assumption, the RSA generator is a q-one-way generator, with unbounded q.
One can also base an unconditionally binding generator on an RSA-like function. The resulting commitment/encryption scheme was first discovered by Benaloh [7] in the context of verifiable secret sharing.
q-RESIDUOSITY GENERATOR
The generator selects an RSA modulus N = p1 p2 of bit length l, for primes
p1, p2, subject to q | (p1 − 1)(p2 − 1) and δ = log q / log N. The output is N. For this generator, we define H = G = Z_N^*, and f(x) = x^q mod N. By the q-th residuosity assumption, we mean the assumption that random elements in distinct cosets of Im(f) as defined here are polynomially indistinguishable. This is a natural generalization of the well-known quadratic residuosity assumption.
Lemma 32 Under the q-th residuosity assumption, the q-residuosity generator is an unconditionally binding q-homomorphism generator.
We now show a generator based on the discrete log problem modulo a prime number. The commitment scheme resulting from this generator was first discovered by Pedersen [24] in the context of verifiable secret sharing.
DISCRETE LOG GENERATOR
The generator selects randomly a prime p of bit length l, subject to δ = log q / log p and q | p − 1, where 0 < δ < 1 is a constant. It also selects g ∈ Z_p^*, such that g generates the (unique) subgroup of Z_p^* of order q. The output is p, g. For this generator, we define H = Z_q, G = ⟨g⟩, and f(x) = g^x mod p. When using this generator as a basis for our protocols, we will assume that a party receiving an element u supposedly in G always verifies that u^q = 1 and stops the protocol if not.
Lemma 33 Assume that any probabilistic polynomial time algorithm solves the discrete log problem modulo prime numbers as selected by the Discrete Log Generator with negligible probability. Then the Discrete Log Generator is a q-one-way generator with bounded q.
We remark that nothing prevents us from using other groups of prime order, such as, for example, the group of points on an appropriately chosen elliptic curve. Finally, we show an example of an unconditionally binding generator, based on the Diffie-Hellman problem [11]:
DIFFIE-HELLMAN GENERATOR
The generator selects randomly a prime p of bit length l/2, subject to δ = log q/l and q | p − 1, where 0 < δ < 1/2 is a constant.
It also selects g ∈ Z_p^*, such that g generates the (unique) subgroup of Z_p^* of order q, and finally a random h ∈ ⟨g⟩. The output is p, g, h. For this generator, we define H = Z_q, G = ⟨g⟩ × ⟨g⟩, and f(x) = (g^x mod p, h^x mod p)⁴. Recall that (p, q, g, h) can be used as a public key to encrypt an element m ∈ ⟨g⟩ by choosing r at random and letting the ciphertext be (g^r mod p, m h^r mod p) [14]. Recall also the notion of polynomial security, defined by Goldwasser and Micali [18], which says that random encryptions of distinct messages are polynomially indistinguishable.
⁴ The remark on verification of membership in G for the Discrete Log Generator also applies here.
Lemma 34 If Diffie-Hellman encryption is polynomially secure, then the Diffie-Hellman generator is an unconditionally binding q-homomorphism generator.
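As a concrete instance of the generators above, the RSA generator's homomorphism f(x) = x^q mod N can be sketched with tiny hypothetical parameters. The inversion at the end uses the factorization as a trapdoor purely as a sanity check; q-one-wayness asserts that inversion is infeasible without it:

```python
# Toy sketch of the RSA generator: f(x) = x^q mod N on H = G = Z_N^*, with
# gcd(q, (p1-1)(p2-1)) = 1 so every element has a unique q-th root.
# Tiny hypothetical parameters; a real instance uses an l-bit RSA modulus.
from math import gcd

p1, p2, q = 19, 23, 7
N = p1 * p2
phi = (p1 - 1) * (p2 - 1)
assert gcd(q, phi) == 1

f = lambda x: pow(x, q, N)

# f is a homomorphism of Z_N^*: f(x*y) = f(x)*f(y)
x, y = 2, 5
assert f(x * y % N) == f(x) * f(y) % N
# q-th roots are easy to compute *given* the factorization (sanity check only):
d = pow(q, -1, phi)
assert pow(f(x), d, N) == x
```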
4 Protocol Descriptions
This section describes our protocols in a way that is independent of any particular implementation of the commitment scheme. We will describe how to build honest verifier zero-knowledge protocols. Well-known techniques may then be used to make protocols that are zero-knowledge in general. Common to all our protocols is an initial step in which the prover and verifier go through the set-up phase for the commitment scheme, as described in Section 2. This can be done once and for all, and the instance of the commitment scheme generated can be reused in several protocol executions. Therefore, we do not mention the initial step explicitly in the descriptions below. The linear homomorphic property of commitments can be used to show relations on committed bits. Concretely, suppose we want to show for two sets of bit commitments D0, ..., Dn and C0, ..., Cn, where n < log q, that the same bit bi is contained in Ci and Di, for i = 0, ..., n. This can be done much more efficiently than by comparing each Ci, Di individually. For this, we have the following protocol:
EQUALITY PROTOCOL
V computes the commitments C = C_n^{2^n} · C_{n−1}^{2^{n−1}} · .. · C_0 and D = D_n^{2^n} · D_{n−1}^{2^{n−1}} · .. · D_0, which should both be commitments to the number whose binary representation is b_n b_{n−1} ... b_0. P opens C D^{−1} to reveal 0. It is easy to see that this game reveals nothing about the values of b_0, ..., b_n, and that assuming P can open each of the commitments to reveal a one-bit value, either all pairs Ci, Di contain the same bit, or he can break the commitment scheme.
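The EQUALITY PROTOCOL check can be sketched concretely with the toy Pedersen-style commitments used earlier (hypothetical, insecure parameters); the randomness of the combined commitments also combines linearly, so P knows how to open C D^{−1} as 0:

```python
# Sketch of the EQUALITY PROTOCOL: both parties combine bit commitments into
# C = prod C_i^(2^i) and D = prod D_i^(2^i); P opens C * D^{-1} to reveal 0.
import random

p, q, g = 23, 11, 2
y = pow(g, 7, p)
def commit(r, a): return pow(y, a, p) * pow(g, r, p) % p

bits = [1, 0, 1]                       # b_0..b_n with n < log q
rs = [random.randrange(q) for _ in bits]
us = [random.randrange(q) for _ in bits]
Cs = [commit(r, b) for r, b in zip(rs, bits)]
Ds = [commit(u, b) for u, b in zip(us, bits)]

C = D = 1
for i, (Ci, Di) in enumerate(zip(Cs, Ds)):
    C = C * pow(Ci, 2 ** i, p) % p
    D = D * pow(Di, 2 ** i, p) % p

# P's opening of C * D^{-1} as a commitment to 0 with combined randomness rho
rho = sum((r - u) * 2 ** i for i, (r, u) in enumerate(zip(rs, us))) % q
assert C * pow(D, -1, p) % p == commit(rho, 0)
```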
4.1 Protocols for Arithmetic Circuits over GF(q)
In this section, we are given an arithmetic circuit Ψ over GF(q), where q is an m-bit prime, with u inputs, t multiplication gates, and any number of linear operations. All gates have arbitrary fan-out. We assume for simplicity that there is only one output value, computed by gate G0; we are given a value y for this output, and the prover's goal is to demonstrate that inputs can be selected that lead to output y.
STEP 1
The prover makes u commitments I1, .., Iu, such that Ij contains input value xj ∈ GF(q). The input values are selected such that the circuit computes y as output. The prover also makes t commitments T1, ..., Tt, such that Ti contains the value output by the i-th multiplication gate in the circuit, given that the inputs are x1, ..., xu. All commitments produced are sent to V, and P proves that he knows how to open all of them.
STEP 2
Both P and V compute, based on I1, .., Iu, T1, .., Tt and using linearity of commitments, for each gate the commitment(s) representing its input value(s) and a commitment representing its output value.
PROOF, Step 3
For each multiplication gate: let A, B be the commitments representing the input values a, b, and let C be the commitment representing the output value c. P uses the multiplication protocol to convince V that ab mod q = c.
PROOF, Step 4
P opens the commitment representing the output value of G0. V accepts if and only if all proofs in Steps 1 and 3 are accepted, and P correctly opens the commitment in Step 4 to reveal y.
For clarity, we have separated the invocation of subprotocols into Steps 1 and 3. However, they can all be executed in parallel, using the same random challenge from V for all of them. By SHVZK of the subprotocols, this can still be simulated against an honest verifier. We get the following, which we state without proof:
Lemma 41 The above protocol is perfect honest verifier zero-knowledge when using commitments constructed from a q-one-way generator, and honest verifier zero-knowledge when using commitments constructed from an unconditionally binding q-homomorphism generator. The communication complexity is O((u + t)(l + m)⌈k/m⌉) bits in either case.
A Non-Interactive with Preprocessing Variant
We sketch here a variant of the arithmetic circuit protocol that is non-interactive with preprocessing. The asymptotic complexity of the preprocessing is the same as that of the original protocol, whereas the proof phase has complexity O((u + t)(l + m)) bits. The variant is based on a technique borrowed from Beaver et al. [1]. In the preprocessing, the prover will produce commitments J1, ..., Ju containing random values (which will later represent input values), and t random triples of commitments ([d], [e], [f]) such that de = f mod q. The prover shows that he can open all commitments and that the multiplicative relations hold.
In the proof phase, a circuit with input values is known to the prover. Consider a fixed multiplication gate. It is first assigned a distinct triple ([d], [e], [f]) from the preprocessing. Let a, b, c, where ab = c mod q, be the values actually occurring at the gate. The prover can now send to the verifier ε = a − d and δ = b − e. Now the verifier can on his own compute a triple [a], [b], [c] by letting [a] = y^ε [d], [b] = y^δ [e] and [c] = y^{εδ} [f] · [d]^δ · [e]^ε. In the same way, the prover tells the verifier how to modify the Ji's to get commitments containing the correct inputs to the circuit, by giving the differences between the random values in the Ji's and the actual input values. All that remains is for the prover to show that “gates connect correctly”, i.e. that if, e.g., A' represents the output from one gate, which is connected to the
input of another gate, represented by A, the prover shows that A and A' contain the same value by opening A' A^{−1} to reveal 0 (where, however, V can handle linear operations on his own).
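The triple-adjustment arithmetic of this preprocessing variant can be checked concretely. The sketch below uses the toy Pedersen-style commitments from before (hypothetical, insecure parameters) and verifies that the commitments the verifier derives from ε and δ indeed commit to a, b and ab mod q:

```python
# Sketch of the Beaver-triple adjustment: from a preprocessed triple
# ([d], [e], [f]) with de = f mod q, the verifier derives [a], [b], [c]
# for the actual gate values using only eps = a - d and delta = b - e.
import random

p, q, g = 23, 11, 2
y = pow(g, 7, p)
def commit(r, v): return pow(y, v % q, p) * pow(g, r % q, p) % p

d, e = 3, 5
f = d * e % q
rd, re, rf = (random.randrange(q) for _ in range(3))
D, E, F = commit(rd, d), commit(re, e), commit(rf, f)   # from preprocessing

a, b = 7, 4                               # actual gate inputs; c = ab mod q
eps, delta = (a - d) % q, (b - e) % q     # sent by the prover
# Verifier computes commitments to a, b, c on his own:
A = pow(y, eps, p) * D % p
B = pow(y, delta, p) * E % p
C = pow(y, eps * delta % q, p) * F * pow(D, delta, p) * pow(E, eps, p) % p
# Prover-side sanity check: C commits to ab mod q with known randomness
rc = (rf + delta * rd + eps * re) % q
assert A == commit(rd, a) and B == commit(re, b)
assert C == commit(rc, a * b % q)
```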
4.2 Non-Interactive Protocols with Preprocessing for SAT
For the protocol description, we first need some notation and definitions: We will assume (without loss of generality) that the circuit to be proved satisfiable later is given with at most n NAND gates with fan-in 2 and arbitrary fan-out.
Definition 42 A NAND-table is a matrix with 4 rows and 3 columns containing commitments. A NAND-table is correct if it contains only bit commitments and each of its rows ([a], [b], [c]) satisfies a ∧ b = ¬c. A NAND-table is useful if it is correct, and if one obtains, by opening all its commitments and permuting the rows, the truth table of the NAND function.
In the following the honest prover will make only useful NAND-tables, but to keep the prover from cheating it will be enough to force him to generate at least correct NAND-tables. To show correctness of a NAND-table, P can first show that the 8 commitments in the two first positions of each row are bit commitments. Then for each row [a], [b], [c], P shows that 1 − c = ab mod q. Assuming that a and b are 0/1 values, this ensures that so is c, and that ¬c = a ∧ b.
PREPROCESSING
The prover makes n useful NAND-tables, using for each table an independently and uniformly chosen permutation of the rows. He proves that all NAND-tables are correct, as described above.
For the proof phase, we are given the concrete circuit Φ that should be shown to be satisfiable, containing gates G1, .., Gn, where we assume that Gn is the gate computing the final output of the circuit. The proof string to be sent to V is constructed by P as follows:
PROOF, Step 1
For i = 1..n, take the first unused NAND-table Ti from the preprocessing and assign it to gate Gi. Fix a set of input bits that satisfy the circuit. A computation with these input bits selects in a natural way a row in each Ti. For i = 1..n, P includes 2 bits in the proof string indicating which row is selected. Having selected rows in all truth tables, P has defined commitments representing the inputs and output of each gate.
He must now demonstrate that “gates connect correctly”:
PROOF, Step 2
We make a list of pairs of commitments as follows: Let w be a wire in the circuit. If it connects from Ti to Tj, append to the list the pair of commitments representing the output from Ti resp. the relevant input to Tj. For each circuit input bit b, let Tk be the first gate receiving b. Append to the list a set of pairs,
each of which has the input commitment from Tk as first component, and as second component an input commitment from each distinct gate also receiving b. P must now show that each pair of commitments contains the same bit. Clearly, this gives at most 2n pairs of commitments that must be checked for equality. For commitments with unbounded q, or bounded commitments where δl ≥ 2n, P completes these equality proofs by opening only one commitment, by running the Equality Protocol shown above. Otherwise, the bits to be compared are distributed over several commitments holding δl bits each, so P will need to open 2n/(δl) commitments.
PROOF, Step 3
P opens the last commitment in the selected row of Tn (to reveal 1, in order to convince V about the final result of the computation in the circuit).
VERIFICATION OF PROOF
If V rejected any of the proofs in the preprocessing, V rejects immediately. V selects the rows designated by the information from Step 1 of the proof. V computes the pairs of commitments used by P in Step 2, and verifies that P has proved that all pairs contain equal bits (this amounts to verifying that P has correctly opened one or more commitments to reveal 0). Finally V verifies that the commitment opened in Step 3 was correctly opened to reveal 1.
As for the arithmetic circuit protocol, the subprotocols in the preprocessing can be done in parallel. This, and SHVZK of the subprotocols, leads to:
Lemma 43 The above protocol using commitments constructed from a q-one-way generator is perfect honest verifier zero-knowledge. If the generator has unbounded q, the communication complexity of the preprocessing is O(nl + k) bits, and O(n) · max(k, l) bits otherwise. When using commitments constructed from an unconditionally binding q-homomorphism generator, the protocol is honest verifier zero-knowledge, and the communication complexity of the preprocessing is O(nl + k) bits. The proof stage has size O(n + l) in all cases.
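The structure of a useful NAND-table from Definition 42 can be sketched as follows; the assertion in the loop is exactly the per-row relation 1 − c = ab mod q that P proves for each row (q = 11 is an arbitrary toy choice):

```python
# Sketch: a "useful" NAND-table holds the NAND truth table with its rows
# permuted uniformly at random; each row (a, b, c) satisfies c = NAND(a, b).
import itertools
import random

q = 11  # toy field size; any prime larger than the values involved works
rows = [(a, b, 1 - (a & b)) for a, b in itertools.product((0, 1), repeat=2)]
random.shuffle(rows)           # the honest prover permutes the rows uniformly
for a, b, c in rows:
    assert a * b % q == (1 - c) % q   # the relation P proves per row
```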
4.3 Zero-Knowledge Proof for QBF
In [26], Shamir gave the first proof that IP = PSPACE, by exhibiting an interactive proof system for the PSPACE-complete QBF problem. A little later, Shen [27], building on Shamir's ideas, gave a somewhat more efficient proof system for QBF, which appears to be the most efficient proof system known for QBF. In this section, we sketch how our techniques may be applied to transform Shen's proof system into a zero-knowledge proof system with essentially the same communication and round complexity. By examining Shen's protocol, one finds that all the work done takes place in a finite field GF(q) for some prime q. If, for a QBF instance of length n, we want error probability 2^{−n}, the analysis of the protocol shows that this can be done by using a q of bit length O(n). By further inspection of the protocol, one finds that in each round, the prover sends the coefficients of some
polynomial, the verifier checks this polynomial, and returns a random element in the field. The operations done by the verifier in order to check the polynomials received all fall in one of the following categories:
1. Evaluate a polynomial received from the prover in a point chosen by the verifier, or in a constant point.
2. Add or multiply a constant number of values computed as in 1).
3. Compare values computed as in 1) or 2).
4. The final step: insert all random values chosen by the verifier into a multivariate polynomial efficiently computable from the input QBF instance. Compare the result to a value obtained from the previous rounds.
We now modify the protocol by having the prover communicate his polynomials by instead sending commitments to each of the coefficients. This affects the number of bits needed to send a polynomial by at most a constant factor, and furthermore the verifier can on his own compute commitments to the results of operations of type 1). For the multiplications in 2), the prover supplies a commitment containing the result of each such multiplication. Therefore, at the end of the interaction, the verifier has, for each multiplication in the original protocol, a triple of commitments ([a], [b], [c]); he also has one commitment D together with a value d that can be computed efficiently from the QBF instance. The verifier now only needs to be convinced that for each triple, it holds that ab mod q = c, and that D contains d. The multiplication protocol allows the prover to convince the verifier of these facts in honest verifier zero-knowledge. Since it is constant round and communicates a constant number of commitments, we get a protocol with the same round and communication complexity, up to a constant factor.
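A type-1 operation on committed polynomials can be checked concretely: from commitments to the coefficients, the verifier derives a commitment to the polynomial's value at his chosen point using only the linear homomorphic property. Toy Pedersen-style commitments with hypothetical, insecure parameters:

```python
# Sketch of a type-1 verifier operation: given commitments to coefficients
# of P(X) over GF(q), compute a commitment to P(x0) as prod C_i^(x0^i).
import random

p, q, g = 23, 11, 2
y = pow(g, 7, p)
def commit(r, v): return pow(y, v % q, p) * pow(g, r % q, p) % p

coeffs = [3, 0, 5]                     # P(X) = 3 + 5*X^2 over GF(11)
rands = [random.randrange(q) for _ in coeffs]
Cs = [commit(r, c) for r, c in zip(rands, coeffs)]

x0 = 4                                 # verifier's random evaluation point
C_eval = 1
for i, Ci in enumerate(Cs):            # exponents may be reduced mod q
    C_eval = C_eval * pow(Ci, pow(x0, i, q), p) % p

# Prover-side sanity check: C_eval commits to P(x0) mod q
r_eval = sum(r * pow(x0, i, q) for i, r in enumerate(rands)) % q
val = sum(c * pow(x0, i, q) for i, c in enumerate(coeffs)) % q
assert C_eval == commit(r_eval, val)
```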
5 Results for the Main Protocols
The results below use the same notation as the corresponding protocol descriptions, and all protocols are designed for an error of 2^{−k}. For formal definitions of proof systems, completeness, soundness and zero-knowledge, please refer to [20]. In the case of arguments, completeness and zero-knowledge are as for proof systems. For computational soundness, we use the so-called relative soundness definition of [13] (with one change, see below) and show that our protocol, given an instance of the commitment scheme, has soundness error 2^{−k} relative to the problem of breaking the commitment scheme. Concretely, this means that if a cheating prover has success probability ε > 2^{−k}, then he can break the commitment scheme instance in expected time polynomial in l and linear in 1/(ε − 2^{−k}). In [13], the circuit to prove is given as input initially. This cannot be assumed to be true for a protocol with preprocessing. So for this case, we define the success probability of a cheating prover to be the probability with which he can successfully complete the preprocessing, and then compute a non-satisfiable circuit together with a proof that the verifier will accept.
We note that all our communication complexity results are computed without including the complexity of setting up the commitment schemes, since the same commitment scheme instance can be reused in many protocol executions⁵.
Theorem 51 If there exists a q-one-way generator with unbounded q, then there exists a non-interactive perfect zero-knowledge argument with preprocessing for Boolean Circuit Satisfiability. The communication complexity of the preprocessing is O(nl + k) bits, while the proof phase has size O(n + l). If the generator has bounded q, the conclusion is the same, but the communication complexity of the preprocessing becomes O(n) · max(k, l) bits.
Theorem 52 If there exists an unconditionally binding q-homomorphism generator (with bounded q), then there exists a non-interactive zero-knowledge proof with preprocessing for Boolean Formula Satisfiability, such that the communication complexity of the preprocessing is O(n) · max(k, l) bits, while the proof phase has size O(n + l).
Theorem 53 If there exists a q-one-way generator, resp. an unconditionally binding q-homomorphism generator, then there exists a perfect zero-knowledge argument, resp. a computational zero-knowledge proof, for the arithmetic circuit problem (ACP). The communication complexity is O((u + t)(l + m)⌈k/m⌉) bits in either case.
A sketch of the proofs for these theorems: From Lemmas 41 and 43, we have honest verifier zero-knowledge protocols which, except for the initial set-up of commitment schemes, are 3-move Arthur-Merlin games with k-bit challenges, and have communication complexities as required in the theorems. To establish soundness, we observe that from correct answers to 2 different challenges, one can compute either a satisfying assignment or two different ways to open some commitment, the latter case being of course impossible with unconditionally binding commitments. This immediately implies soundness for the interactive proof case and, using Lemma 24, also for the argument case.
To show zero-knowledge in general, we observe that the interactive arguments we have from the lemmas are already zero-knowledge in general, since the verifier shows knowledge of a trapdoor for the commitments in the initial stage. By adjusting the error probability of this proof appropriately, we can ensure that by rewinding the verifier, the simulator can, in expected polynomial time, either extract this trapdoor or exhaustively find a satisfying assignment. Simulation is then trivial in either case. For the interactive proof case, we use the well-known idea that the honest-verifier simulator can be used as a subroutine in a real simulation, provided that the verifier commits to his challenge in advance. For a solution of the subtle technical problems with this, see [17]. If we use our unconditionally hiding commitments for this part, both soundness and asymptotic communication complexity will be unaffected.

⁵ However, in several cases, including the setup step makes no difference. This is true in general for Theorem 51, and for Theorems 52 and 53 when based on the Diffie-Hellman generator.
Ronald Cramer and Ivan Damgård
Theorem 54. If there exists an unconditionally binding q-homomorphism generator (with bounded q), then there exists a zero-knowledge interactive proof system for the QBF problem with the same asymptotic round and communication complexity as Shen's interactive proof system when designed to have error probability 2^{-n} for a length-n QBF instance.

Proof sketch. The zero-knowledge protocol described in Subsection 4.3 consists first of a stage where the prover and verifier go through “the same” interaction as in the original proof system, except that the prover sends commitments to his messages; then of a stage where the prover convinces the verifier that a set of relations holds between the committed values. This second stage is only honest-verifier zero-knowledge, but can be made zero-knowledge with no essential loss of efficiency as above, using the method from [17]. Hence the proof that our modified protocol is a zero-knowledge proof system for QBF is a straightforward modification of the proof from [6] since, like ours, the protocol built in [6] is a modification of an Arthur-Merlin interactive proof system with one-sided error. Also, the transformation from [6] results in a two-stage protocol of the same form as ours. □

Acknowledgement. We thank the anonymous referees for comments that substantially improved our presentation.
References

1. D. Beaver: Efficient Multiparty Protocols Using Circuit Randomization, Proceedings of Crypto '91, Springer-Verlag LNCS, 1992, pp. 420–432.
2. L. Babai, L. Fortnow, L. Levin and M. Szegedy: Checking Computations in Polylogarithmic Time, Proceedings of STOC '91.
3. M. Bellare and O. Goldreich: On Defining Proofs of Knowledge, Proceedings of Crypto '92, Springer-Verlag LNCS, vol. 740, pp. 390–420.
4. J. Boyar, G. Brassard and R. Peralta: Subquadratic Zero-Knowledge, Journal of the ACM, November 1995.
5. G. Brassard, D. Chaum and C. Crépeau: Minimum Disclosure Proofs of Knowledge, JCSS, vol. 37, pp. 156–189, 1988.
6. M. Ben-Or, O. Goldreich, S. Goldwasser, J. Håstad, J. Kilian, S. Micali and P. Rogaway: Everything Provable is Provable in Zero-Knowledge, Proceedings of Crypto '88, Springer-Verlag LNCS, pp. 37–56.
7. J. Benaloh: Secret Sharing Homomorphisms: Keeping Shares of a Secret Secret, Proceedings of Crypto '86, Springer-Verlag LNCS, pp. 251–260.
8. R. Cramer and I. Damgård: Linear Zero-Knowledge, Proceedings of STOC '97.
9. R. Cramer, I. Damgård and U. Maurer: Span Programs and General Secure Multiparty Computations, BRICS Report Series RS-97-27, available from http://www.brics.dk.
10. R. Cramer, I. Damgård and B. Schoenmakers: Proofs of Partial Knowledge and Simplified Design of Witness Hiding Protocols, Proceedings of Crypto '94, Springer-Verlag LNCS, vol. 839, pp. 174–187.
11. W. Diffie and M. Hellman: New Directions in Cryptography, IEEE Transactions on Information Theory, IT-22 (6): 644–654, 1976.
12. A. De Santis, G. Di Crescenzo, G. Persiano and M. Yung, Proceedings of FOCS 1994.
13. I. Damgård and B. Pfitzmann: Sequential Iteration of Interactive Arguments, Proceedings of ICALP '98, Springer-Verlag LNCS.
14. T. ElGamal: A Public-Key Cryptosystem and a Signature Scheme Based on Discrete Logarithms, IEEE Transactions on Information Theory, IT-31 (4): 469–472, 1985.
15. L. Fortnow: The Complexity of Perfect Zero-Knowledge, Advances in Computing Research, vol. 5, 1989, pp. 327–344.
16. E. Fujisaki and T. Okamoto: Statistical Zero-Knowledge Protocols to Prove Modular Polynomial Relations, Proceedings of Crypto '97, Springer-Verlag LNCS.
17. O. Goldreich and A. Kahan: How to Construct Constant-Round Zero-Knowledge Proof Systems for NP, Journal of Cryptology, (1996) 9: 167–189.
18. S. Goldwasser and S. Micali: Probabilistic Encryption, JCSS, vol. 28, 1984.
19. O. Goldreich, S. Micali and A. Wigderson: Proofs that Yield Nothing but their Validity and a Methodology of Cryptographic Protocol Design, Proceedings of FOCS '86, pp. 174–187.
20. S. Goldwasser, S. Micali and C. Rackoff: The Knowledge Complexity of Interactive Proof Systems, SIAM J. Computing, vol. 18, pp. 186–208, 1989.
21. R. Gennaro, T. Rabin and M. Rabin: Simplified VSS and Fast-Track Multiparty Computations, Proceedings of PODC '98.
22. J. Kilian: A Note on Efficient Proofs and Arguments, Proceedings of STOC '92.
23. J. Kilian: Efficient Interactive Arguments, Proceedings of Crypto '95, Springer-Verlag LNCS, vol. 963, pp. 311–324.
24. T. Pedersen: Non-Interactive and Information Theoretic Secure Verifiable Secret Sharing, Proceedings of Crypto '91, Springer-Verlag LNCS, vol. 576, pp. 129–140.
25. C. P. Schnorr: Efficient Signature Generation by Smart Cards, Journal of Cryptology, 4 (3): 161–174, 1991.
26. A. Shamir: IP = PSPACE, Journal of the ACM, vol. 39 (1992), pp. 869–877.
27. A. Shen: IP = PSPACE, Simplified Proof, Journal of the ACM, vol. 39 (1992), pp. 878–880.
28. A. De Santis, S. Micali and G. Persiano: Non-Interactive Zero-Knowledge with Preprocessing, Advances in Cryptology: Proceedings of Crypto '88 (1989), Springer-Verlag LNCS, pp. 269–282.
Concurrent Zero-Knowledge: Reducing the Need for Timing Constraints*

Cynthia Dwork¹ and Amit Sahai²

¹ IBM Almaden Research Center, 650 Harry Road, San Jose, CA 95120, USA. [email protected]
² MIT Laboratory for Computer Science, 545 Technology Square, Cambridge, MA 02139, USA. [email protected]
Abstract. An interactive proof system (or argument) (P, V) is concurrent zero-knowledge if, whenever the prover engages in polynomially many concurrent executions of (P, V) with (possibly distinct) colluding polynomial-time-bounded verifiers V1, ..., Vpoly(n), the entire undertaking is zero-knowledge. Dwork, Naor, and Sahai recently showed the existence of a large class of concurrent zero-knowledge arguments, including arguments for all of NP, under a reasonable assumption on the behavior of clocks of nonfaulty processors. In this paper, we continue the study of concurrent zero-knowledge arguments. After observing that, without recourse to timing, the existence of a trusted center considerably simplifies the design and proof of many concurrent zero-knowledge arguments (again including arguments for all of NP), we design a preprocessing protocol, making use of timing, that simulates the trusted center for the purposes of achieving concurrent zero-knowledge. Once a particular prover and verifier have executed the preprocessing protocol, any polynomial number of subsequent executions of a rich class of protocols will be concurrent zero-knowledge.
1 Introduction
In order to be useful in the real world, cryptographic primitives and protocols must remain secure even when executed concurrently with other arbitrarily chosen protocols, run by arbitrarily chosen parties, whose identities, goals, or even existence may not be known. Indeed, this setting, characterized in [13] as a distributed computing aggregate, describes the Internet. Electronic interactions over an aggregate, such as economic transactions, transmission of medical data, data storage, and telecommuting, pose security risks inadequately addressed in computer science research. In particular, the issue of the security of concurrent executions is often¹ ignored.

* Most of this work was performed while at the IBM Almaden Research Center. Also supported by a DOD NDSEG doctoral fellowship, and DARPA grant DABT-96-C0018.
¹ But not always; e.g., [1] in a different setting.
H. Krawczyk (Ed.): CRYPTO '98, LNCS 1462, pp. 442–458, 1998. © Springer-Verlag Berlin Heidelberg 1998
A zero-knowledge protocol is supposed to ensure that no information is leaked during its execution. However, when zero-knowledge interactions are executed concurrently, both parties can be at risk. Consider the case of zero-knowledge proofs: the verifier faces the possibility that the prover with which it is interacting is actually using some concurrently running second interaction as an “oracle” to help answer the verifier's queries; this is the classical chess master's problem. In the case of a proof of knowledge, the interaction may not actually yield a proof. This is an issue of potential malleability of the interactive proof system, and is addressed in [13]. In contrast, the prover faces the risk that concurrent executions of a protocol with many verifiers may leak information and may fail to be zero-knowledge in toto. In this case the interaction remains a proof but may fail to remain zero-knowledge. This issue was first addressed in [16]. To overcome this difficulty, [16] introduces the notion of an (α, β) constraint for some α ≤ β: for any two (possibly the same) non-faulty processors P1 and P2, if P1 measures α elapsed time on its local clock and P2 measures β elapsed time on its local clock, and P2 begins its measurement in real time no sooner than P1 begins, then P2 will finish after P1 does. As [16] points out, an (α, β) constraint is implicit in most reasonable assumptions on the behavior of clocks in a distributed system (e.g., the linear drift assumption). According to the (standard) view that process clocks are under the control of an adversarial scheduler, the (α, β) constraint limits the choices of the adversary to schedules that satisfy the constraint. Under an (α, β) constraint, [16] shows that there exist constant-round concurrent zero-knowledge protocols of various kinds, for example, arguments for any language in NP². In the protocols of [16], processors make explicit use of their local clocks in order to achieve concurrent zero-knowledge.
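To see concretely why the linear drift assumption implies an (α, β) constraint, here is a small sketch (our own illustration, not from [16]): model each local clock as running at a fixed rate within a drift bound ρ of real time, so that measuring t local units takes t/rate real time; choosing β ≥ α(1 + ρ)/(1 − ρ) then guarantees that a β-measurement starting no sooner than an α-measurement also finishes no sooner.

```python
# Toy model of the (alpha, beta) constraint under linear clock drift.
# Each processor's clock runs at a constant rate in [1 - rho, 1 + rho]
# relative to real time, so measuring `local_units` on that clock takes
# local_units / rate units of real time.
def real_duration(local_units, rate):
    return local_units / rate

alpha, rho = 10.0, 0.5            # rho exaggerated for the toy example
beta = alpha * (1 + rho) / (1 - rho)   # smallest safe choice of beta

# P1 measures alpha, P2 measures beta; whatever rates the adversarial
# scheduler assigns within the drift bound, P2's measurement takes at
# least as much real time as P1's, so starting no sooner than P1
# implies finishing no sooner.
for rate1 in (1 - rho, 1.0, 1 + rho):
    for rate2 in (1 - rho, 1.0, 1 + rho):
        assert real_duration(beta, rate2) >= real_duration(alpha, rate1)
```

The bound β ≥ α(1 + ρ)/(1 − ρ) is the standard worst-case calculation: α local units take at most α/(1 − ρ) real time, while β local units take at least β/(1 + ρ).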
The protocols require that certain timing constraints be met, which limits the kinds of protocol interleavings that can occur.

Our Contribution. In this work, we reduce the need for timing in achieving concurrent zero-knowledge. Specifically, for a rich class of interactive protocols, we are able to push all use of timing into a constant-round preprocessing phase; furthermore, the real time at which the preprocessing phase between a prover P and verifier V1 occurs need not have any relation to the real time when P and a different verifier V2 execute the preprocessing. After this preprocessing phase, the prover and the verifier can execute any polynomial number of protocols from a rich class without any further timing constraints, and the whole interaction will be concurrent zero-knowledge. We require the existence of a semantically secure public-key encryption scheme. By limiting the use of timing to a single initial phase for each (P, V) pair, our methods can reduce the real execution time of protocols. This is because, once preprocessing completes, the parties never deliberately introduce timing delays in executing steps of future protocols. In contrast, in the protocols of [16] such deliberate delays play a critical role.

² Under various standard computational assumptions.

For many applications, where two parties
will be executing many zero-knowledge protocols, such as authentication with a system, these repeated delays may be expensive. Moreover, as we will see, our approach frequently yields simpler protocols that are easier to prove concurrent zero-knowledge.

[Diagram 1. A troublesome interleaving for concurrent zero-knowledge: colluding verifiers V1, V2, ..., Vn interact with the prover, with Steps 1 and 2 of each verifier's execution nested inside the previous one, the innermost verifier running Steps 1 through 4, and Steps 3 and 4 of the outer interactions unwinding in reverse order.]

Interleavings of Protocols. The difficulty in achieving concurrent zero-knowledge is due to the existence of certain “bad” interleavings of concurrently executing protocols. The bad interleavings revolve around the difficulty of simulating a transcript of multiple concurrent interactions (recall that the ability to simulate an interaction is the core of the definition of zero-knowledge). Consider the standard (computational) zero-knowledge protocol for 3-colorability³ [22], which can be based on any information-theoretic commitment scheme.

³ This is the “parallelized” version, which has negligible error while remaining zero-knowledge.

Generic Zero-Knowledge Argument for 3-Colorability:
1) V → P: Information-theoretic commitment to queries.
2) P → V: Commitment to graphs and colorings.
3) V → P: Open queries.
4) P → V: Open queried graphs or colorings, which V then checks are valid.

The standard simulator, having access only to V, produces transcripts of this protocol as follows. First, it receives V's commitment in Step 1. Then, supplying V initially with “garbage” in Step 2, the simulator discovers the queries V committed to through V's Step 3 response. The simulator uses this knowledge to construct graphs and colorings which would fool these particular queries. Then
the simulator “rewinds” the interaction to just after Step 1, and supplies V with a commitment to these new graphs and colorings in Step 2. Since V is already committed by Step 1, its Step 3 response cannot change. Thus, the simulator can open the graphs and colorings according to the queries, and V will accept.

This simulator fails in the context of concurrent interactions because of the rewinding. Consider the following interleaving of n colluding verifiers following the generic four-round protocol described above. An adversary controlling the verifiers can arrange that the Step 1 commitments to queries made by verifiers Vi+1, ..., Vn depend on messages sent by the prover in Step 2 of its interaction with Vi. It is a well-known open problem how to simulate transcripts with this interleaving in polynomial time; the difficulty with the straightforward approach is that once the queries in the interaction with Vi are opened (in Step 3), it becomes necessary to re-simulate Step 2 of the interaction with Vi, and therefore the entire interaction with verifiers Vi+1, ..., Vn must be re-simulated. The most deeply nested interaction, with Vn, is simulated roughly 2^n times.

Remark on Commitment Schemes. The literature discusses two types of bit or string commitment: computational and information-theoretic. In a computational string commitment there is only one possible way of opening the commitment. Such a scheme is designed to be secure against a probabilistic polynomial-time receiver and an arbitrarily powerful sender. In an information-theoretic commitment it is possible to open the commitment in two ways, but the assumed computational boundedness of the sender prevents him from finding a second way. Such a scheme is designed to be secure against an arbitrarily powerful receiver and a probabilistic polynomial-time sender. See [13] for a formal definition of computational commitment.
The commitments in Step 1 of the generic zero-knowledge argument must be information-theoretic, meaning that information-theoretically nothing is leaked about the committed values. This is for soundness, rather than for zero-knowledge. Our techniques require that the verifier only use computational commitments (for example, as in the 6-round zero-knowledge argument for NP of Feige and Shamir [19], which we modify for technical reasons).

The Trusted Center Model. Consider a model in which a trusted center gives out signed public key, private key pairs (E, D) of some public-key cryptosystem to every user over a secure private channel. As we now explain, in this model arguments such as the one given in [19] can be simulated without rewinding, provided that the commitments by V are performed using the public key E given to it by the trusted center. This is significant because, if there is no rewinding, then interleavings such as the one described above are not problematic. The simulator for V simulates its interaction with the trusted center as well as with P. So, the simulator knows the private key D corresponding to the public key E used in V's commitments. Hence, the simulator never has to rewind to learn a committed value. We call such simulations, in which rewinding is
avoided, straight-line⁴. In the case of straight-line zero-knowledge protocols, it is clear that concurrent interactions pose no threat to simulability, since the simulator can simulate each interaction independently in a straight-line fashion. This trusted center model is extremely powerful, and a great many standard protocols become straight-line zero-knowledge, and hence concurrent zero-knowledge, in the trusted center model with only minor modifications. For example, aside from arguments for NP, we also exhibit natural protocols for deniable message authentication and coin flipping. Although straight-line zero-knowledge implies concurrent zero-knowledge without any timing constraints in the trusted center model, the notion of a trusted center that knows everyone's private keys and communicates over secure channels is undesirable or infeasible in many contexts. The preprocessing protocol of Section 4 uses timing to permit P and V to agree on a key EV for V to use for commitments in their future interactions. Intuitively, the interaction ensures (with overwhelming probability) that V (perhaps with the collusion of other verifiers, but with absolutely no help from P) “knows” the corresponding decryption key DV. Formally, the preprocessing protocol will ensure that subsequent interactions between P and V that would have been straight-line zero-knowledge in the trusted center model are actually straight-line zero-knowledge in the conventional model.
2 Model and Definitions
Timing. We assume that all parties in any interaction have access to local clocks. Furthermore, as proposed in [16], we assume that there are known constants α and β ≥ α for which the following (α, β) constraint holds: for any two (possibly the same) non-faulty processors P1 and P2, if P1 measures α elapsed time on its local clock and P2 measures β elapsed time on its local clock, and P2 begins its measurement in real time no sooner than P1 begins, then P2 will finish after P1 does.

Zero-Knowledge and Concurrent Zero-Knowledge. In the original “black box” formulation of zero-knowledge proof systems [24], an interactive proof system (P, V) for a language L is computational (or perfect) zero-knowledge if there exists a probabilistic, expected polynomial-time oracle machine S, called the simulator, such that for every probabilistic polynomial-time verifier strategy V*, the distributions (P, V*)(x) and S^{V*}(x) are computationally indistinguishable (or identical) whenever x ∈ L.

⁴ Note that without a trusted center or some other extra source of power, straight-line zero-knowledge is not an interesting concept, since any language that permits a straight-line zero-knowledge proof in the conventional sense must be in BPP: the simulator could act as the prover.

Here, formally, the machine V* is assumed to take as input a partial conversation transcript, along with a random tape, and
output the verifier's next response. This definition also holds in the case of arguments [7], or computationally sound proofs, where the prover and verifier are both probabilistic polynomial-time machines. Following [16], to investigate preservation of zero-knowledge in a distributed setting, we consider a probabilistic polynomial-time adversary that controls many verifiers simultaneously. Here, we consider an adversary A that takes as input a partial conversation transcript of a prover interacting with several verifiers concurrently, where the transcript includes the local times on the prover's clock when each message was sent or received by the prover. The output of A will either be a tuple (receive, V, α, t), indicating that P receives message α from V at time t on P's local clock, or (send, V, t), indicating that P must send a message to V at time t on P's local clock. The adversary must output a local time for P that is greater than all the times given in the transcript that was input to A (the adversary cannot rewind P), and standard well-formedness conditions must apply. If these conditions are not met, this corresponds to a non-real situation, so such transcripts are simply discarded. Note that we assume that if the adversary specifies a response time t for the prover that violates a timing constraint of the protocol with V, the prover answers with a special null response, which invalidates the remainder of the conversation with verifier V. The distribution of transcripts generated by an adversary A interacting with a prover P on common input x is denoted (P ↔ A)(x). We say an argument or proof system (P, V) for a language L is computational (or perfect) concurrent zero-knowledge if there exists a probabilistic, expected polynomial-time oracle machine S such that for every probabilistic polynomial-time adversary A, the distributions (P ↔ A)(x) and S^A(x) are computationally indistinguishable (or identical) whenever x ∈ L.
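The monotone-time part of this well-formedness condition can be sketched mechanically (our own illustration, not from the paper), with events encoded as the tuples above:

```python
# Sketch of the no-rewinding condition on adversary outputs: every
# event appended to the transcript must carry a prover-local time
# strictly greater than all times already present. Events are encoded
# as in the text, ("receive", V, msg, t) or ("send", V, t), with the
# local time t as the last component.
def well_formed(transcript):
    last = float("-inf")
    for event in transcript:
        t = event[-1]
        if t <= last:       # time did not advance: the adversary rewound P
            return False
        last = t
    return True

assert well_formed([("receive", "V1", "m", 1), ("send", "V1", 2),
                    ("receive", "V2", "m2", 5)])
assert not well_formed([("send", "V1", 3), ("receive", "V2", "m", 2)])
```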
Note that since we assume that the prover acts honestly and follows the protocol, it does not matter whether there is a single entity acting as the prover for all verifiers, or many entities acting as provers for subsets of the verifiers, since the actions of the provers would be the same and, in our model, the timing of events is controlled by the adversary.

NIZK. Some of our protocols make use of non-interactive zero-knowledge (NIZK) proof constructions [5,20,2,4] for languages in NP. Note, however, that although typically one considers NIZK in a model where all parties share a public random string, we do not make any such assumption in any model we consider. In a NIZK proof, the prover P and verifier V have a common input x and also share a random string σ, called the reference string, of length polynomial in the length of x. The prover wishes to convince the verifier of the membership of x in some fixed NP language L. To this end, the prover is allowed to send the verifier a single message m = P(x, σ), computed (probabilistically) as a function of x and σ. The probabilistic polynomial-time verifier must decide whether or not to accept as a function of x, σ, and m. Such an interaction (P, V) is a NIZK proof system for L if: (1) if x ∈ L, then for all σ, (P, V)(x, σ) accepts; (2) if x ∉ L, then for all P*, the probability over σ and the random coins of P and V that (P*, V)(x, σ) accepts is negligible; (3) there exists a probabilistic polynomial-time simulator S such that, if x ∈ L, then the distributions S(x) and (σ, P(x, σ)), where in the latter distribution σ is chosen uniformly, are computationally indistinguishable. We further ask that the prover be probabilistic polynomial time, but also allow that in the case when x ∈ L, the prover is given a witness w for the membership of x in L. We require, however, that the distribution (σ, P(x, σ, w)) be computationally indistinguishable from S(x) no matter how the witness w is chosen. [20,2] show that such NIZK proof systems with efficient provers exist for every language in NP, assuming trapdoor permutations exist. Note that the definition above gives “bounded” NIZK proof systems, i.e., a given reference string σ can be used to prove only one NP statement. We also require unbounded NIZK proof systems, in which any polynomial number of NP statements can be proven in zero-knowledge using the same reference string σ. [12,20] have shown that the existence of a bounded NIZK proof system for an NP-complete language L with an efficient prover implies the existence of unbounded NIZK proof systems with efficient provers for any language in NP. A precise definition of unbounded NIZK can be found in [2,4,20]. Note that NIZK proofs are truly non-interactive only if the prover and the verifier already agree on a random string σ, which we do not assume. Furthermore, if the distribution of σ is far from uniform, then the zero-knowledge condition fails to hold. This issue motivates our concurrently simulable random selection protocol, described below.

Assumptions. We require a probabilistic public-key cryptosystem that is semantically secure [23]. The encryptions must be uniquely decodable (so the scheme must be undeniable [9]). We will use the public-key cryptosystem for computationally secure string commitment as follows: V uses an encryption key E to commit to a string s by simply sending an encryption e of s using a random string r.
To open s, V sends s, r; the receiver checks that e = E(s, r).
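This commit/open interface can be sketched as follows (our own illustration; the function E below is a toy deterministic stand-in for the encryption algorithm, not a semantically secure scheme — a real instantiation would use an actual public-key cryptosystem):

```python
import hashlib
import os

# Toy stand-in for encryption under V's public key: deterministic in
# (s, r), which is all the commit/open check below relies on.
# NOT semantically secure; for illustration only.
def E(s: bytes, r: bytes) -> bytes:
    return hashlib.sha256(b"toy-public-key|" + r + b"|" + s).digest()

def commit(s: bytes):
    r = os.urandom(16)      # V's random coins
    return E(s, r), r       # e = E(s, r) is sent; r is kept for opening

def open_ok(e: bytes, s: bytes, r: bytes) -> bool:
    return e == E(s, r)     # the receiver recomputes the encryption of s

e, r = commit(b"committed string")
assert open_ok(e, b"committed string", r)       # honest opening passes
assert not open_ok(e, b"different string", r)   # a wrong s is rejected
```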
3 Straight-Line Zero-Knowledge in the Trusted Center Model
In order to define the class of protocols for which our construction applies, we consider a trusted center model, in which the trusted center communicates to each participant a public key, private key pair (E, D) for a public key cryptosystem, over a secure private channel. More formally, we assume the existence of a trusted center with which any party can interact using a secure private channel. In our model, before any protocol (P, V ) begins, first V receives a public key and private key pair (E, D) from the trusted center over a secure channel, whereas P receives only E from the trusted center. Then the interaction takes place as in the normal model. Also in our trusted center model, we modify the definition of zero-knowledge to require the simulator to also simulate the interaction with the trusted center, which in particular means that the simulator produces the public key E and private key D given to any verifier.
We use the trusted center model only for definitional purposes; our protocols do not assume a trusted center. In particular, we will define the class of protocols that are straight-line zero-knowledge in the trusted center model and argue that any protocol in this class is concurrent zero-knowledge. We will show that this class is rich, and, in the next section, show how to use timing to replace the role of the trusted center by means of a preprocessing phase. As noted above, the trusted center model is extremely powerful; in particular, in this model Rackoff and Simon were able to define and construct noninteractive zero-knowledge proofs of knowledge [27]. These noninteractive proofs could then be used to prove “plaintext awareness” of encrypted messages (intuitively, that the sender “knows” what he is sending), resulting in a cryptosystem secure against the most general chosen ciphertext attacks (called chosen ciphertext in the post-processing mode in [13]; the construction of the cryptosystem in [27] also requires the trusted center to establish a digital signature scheme). This is equivalent to non-malleable security against a post-processing chosen ciphertext attack [13]. The trusted center model also proves useful in the context of concurrent zero-knowledge. In particular, if the protocol requires that commitments be made using the key E chosen by the trusted center, then the construction of concurrent zero-knowledge protocols becomes a simpler task: the simulator simulates each party’s interaction with the trusted center, and hence knows the secret key D for each public key E later used in the protocol. We will prove concurrent zero-knowledge in the trusted center model by establishing an even stronger property, straight-line zero-knowledge. Intuitively, a protocol is straight-line zero-knowledge if there exists a simulator that does no “re-winding” in order to produce its transcript. 
Formally, a zero-knowledge interactive protocol (P, V) is straight-line zero-knowledge if the simulator S for the protocol is of a special form. Recall that S is in general an expected polynomial-time machine that uses a verifier strategy V* as an oracle, giving it partial transcripts and obtaining the verifier's next message as its response. Define O(S^{V*}(x; r)) = (q1, q2, ..., qm) to be the ordered sequence of oracle queries (partial transcripts) given by S to V*, on input x and using random bits r. We require, for every V*, x, and r, letting O(S^{V*}(x; r)) = (q1, q2, ..., qm), that the transcript qi is a prefix of qi+1 for 1 ≤ i ≤ m − 1. Such a simulator S is called a straight-line simulator. It is immediate that a straight-line zero-knowledge protocol is also concurrent zero-knowledge, since interaction with many verifiers simultaneously can be simulated by simply simulating the interaction with each verifier separately using the straight-line simulator. Note that, in the conventional framework for zero-knowledge proofs, straight-line zero-knowledge proofs can only exist for languages in BPP, since the polynomial-time simulator can act as the prover. In the trusted center model, this need not concern us, since the real prover cannot impersonate the trusted center, whereas the simulator can.
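The prefix condition on the simulator's oracle queries can be checked mechanically; the following sketch (our own illustration) encodes partial transcripts as tuples of messages:

```python
# A straight-line simulator's oracle queries must form a chain of
# extensions: each partial transcript q_{i+1} has q_i as a prefix,
# i.e., the simulator never rewinds V*.
def is_straight_line(queries):
    return all(len(p) <= len(q) and q[:len(p)] == p
               for p, q in zip(queries, queries[1:]))

# A monotone sequence of growing transcripts is straight-line ...
assert is_straight_line([("m1",), ("m1", "m2"), ("m1", "m2", "m3")])
# ... while replacing an earlier message (i.e., rewinding) is not.
assert not is_straight_line([("m1",), ("m1", "m2"), ("m1", "m2x")])
```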
3.1 Examples
The class of protocols with straight-line simulators in the trusted center model is rich.

Generic NP. The generic argument for membership in an NP language, described in Section 1, requires information-theoretic commitments on the part of the verifier, and therefore does not fit our model. The 6-round protocol of Feige and Shamir [19], which can be based on any one-way function, can be modified to be straight-line zero-knowledge in the trusted center model provided that the verifier's commitments are made using the key received from the trusted center. The modification involves using an information-theoretic commitment scheme for an additional initial commitment by the prover (details are omitted for lack of space). Thus, there is a constant-round straight-line zero-knowledge argument in the trusted center model for every language in NP, based only on the existence of semantically secure public-key encryption and information-theoretic commitment schemes. However, there is a simpler, four-round scheme, based on the existence of trapdoor permutations, which we present below.

Random String Selection. We next describe a random string selection (coin-flipping) protocol by which two parties P and V can select a string which will be random as long as one of the parties is honest. The random selection protocol has the following extremely useful property: in the trusted center model, the simulator can force any desired string as the outcome of the protocol; moreover, the distribution of simulated transcripts is identical to the distribution on actual transcripts, conditioned on any given output. In the sequel, let E denote the public key assigned to the verifier by the trusted center. The natural cryptographic protocol for random selection of strings of length k, due to [3], can be made straight-line simulable as follows⁵:

1) V → P: E(r_V), where r_V ∈_R {0,1}^k
2) P → V: r_P, where r_P ∈_R {0,1}^k
3) V → P: Reveal coins used to generate E(r_V).
The output of the protocol is rV ⊕ rP. Since E(rV) is a computationally secret commitment to any party that does not know D, it is clear that this protocol achieves the desired random selection properties. Here, the simulator, on input x ∈ {0,1}^k (after having simulated the trusted center, i.e., having supplied V with a public key/private key pair (E, D)), receives the verifier's Step 1 message. Using D, it recovers rV. The simulator then supplies V* with the Step 2 message x ⊕ rV. In Step 3, V* must decommit to rV, and so the output of the protocol is x. If the input x is chosen uniformly at random, then the simulator's Step 2 message, x ⊕ rV, will also be uniformly random. Hence, the simulator's distribution will actually be identical to the distribution of actual transcripts.
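The simulator's trick of decrypting rV and answering with rP = x ⊕ rV can be sketched in code. The sketch below uses a toy one-time-pad "encryption" as a hypothetical stand-in for E; it only models the one property that matters here, namely that the simulator holds the private key D:

```python
import secrets

K = 16  # string length in bytes (k = 128 bits, an illustrative choice)

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

# Toy key pair: "encryption" is XOR with D. This is NOT a real commitment
# scheme; it merely models that the simulator knows the private key D.
D = secrets.token_bytes(K)
E = lambda m: xor(m, D)        # verifier's Step 1 commitment to rV
decrypt = lambda c: xor(c, D)  # simulator's use of the private key

def simulate(x: bytes, verifier_rV: bytes) -> bytes:
    """Straight-line simulation forcing the protocol output to be x."""
    c = E(verifier_rV)         # Step 1: V -> P : E(rV)
    rV = decrypt(c)            # simulator recovers rV using D
    rP = xor(x, rV)            # Step 2: P -> V : rP = x XOR rV
    # Step 3: V reveals rV; the protocol output is rV XOR rP = x.
    return xor(rV, rP)

x = secrets.token_bytes(K)
assert simulate(x, secrets.token_bytes(K)) == x  # output forced to x
```

Note that when x is uniform, rP = x ⊕ rV is uniform as well, which is exactly why the simulated transcript distribution matches the real one.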
Footnote: It is interesting that if the roles of P and V are reversed, P's new role in the protocol is no longer known to be simulable.
Concurrent Zero-Knowledge: Reducing the Need for Timing Constraints
Alternative NP. We can use the random string selection protocol to give a 4-round alternative to the straight-line zero-knowledge argument system for NP that is also straight-line zero-knowledge in the trusted center model, if we make the stronger assumption that trapdoor permutations exist. By [20,2], assuming trapdoor permutations exist, there are efficient prover NIZK proofs for NP. Recall that NIZK proofs require a random string to be shared by the parties P and V. We will use the random selection protocol above to select this shared randomness. The following protocol is therefore an argument proving x ∈ L for any language L ∈ NP:

1) V −→ P : E(rV), where rV ∈R {0,1}^poly(|x|)
2) P −→ V : rP, where rP ∈R {0,1}^poly(|x|)
3) V −→ P : Reveal coins used to generate E(rV).
4) P −→ V : NIZK proof that x ∈ L using reference string σ = rV ⊕ rP.
Note that Step 4 can later be repeated any (polynomial) number of times to prove different statements in NP once the reference string has been established (using an unbounded NIZK proof system such as the one given in [20]).

A straight-line simulator proceeds as follows. First, it calls the simulator S_NIZK(x) of the NIZK proof system, which produces a reference string σ and an NIZK proof p. The straight-line simulator given above for the random selection protocol is then invoked to produce a transcript for Steps 1-3 that forces rV ⊕ rP = σ. The simulator then outputs p as the prover's Step 4 message, and terminates the simulation. Since the distribution S_NIZK(x) is computationally indistinguishable from the distribution (σ, p), where σ is truly random and p is generated by the NIZK prover's algorithm, the distribution of our straight-line simulator will be computationally indistinguishable from the distribution of actual conversation transcripts of this protocol.

Deniable Message Authentication. NIZK proofs can also be useful for constructing straight-line zero-knowledge protocols for other applications. Consider the problem of deniable message authentication [13,15,16]. Here, the prover wishes to authenticate a message m to the verifier, in such a way that no other party can verify the authentication. In particular, we require that verifiers cannot prove to anyone else that the prover authenticated m. It suffices that the protocol be concurrent zero-knowledge, since if the verifiers could generate the transcripts of their conversations with the prover on their own, then certainly it will not be possible to prove to any other party that the prover authenticated m. We exhibit a very natural protocol for this task, with a slight modification to make it straight-line zero-knowledge. For this protocol, we will require a non-malleable public-key encryption scheme.
Note that the encryption scheme must be non-malleable in the conventional model, not just in the trusted center model! Let the prover's public non-malleable encryption key be EP, for which it alone knows the private key.

1) V −→ P : E(rV), where rV ∈R {0,1}^poly(|x|)
2) P −→ V : rP, where rP ∈R {0,1}^poly(|x|)
3) V −→ P : Reveal coins used to generate E(rV). Choose r ∈R {0,1}^k. Send EP(m ◦ r), y = E(r), and an NIZK proof that the two encryptions sent are consistent with some r ∈ {0,1}^k, using reference string σ = rV ⊕ rP.
4) P −→ V : r

Note that the first two steps can be omitted if there is some other source of a random string to use as the reference string for the NIZK proof. Such a string could, for example, be found as part of the public key for the prover's encryption scheme, as it is in the construction of [13]. The straight-line simulator simulates the first two steps trivially. In Step 3, after checking the NIZK proof, the simulator uses D to decrypt y, yielding rS. Note that if the NIZK proof was accepted, then the decryption will correctly give rS = r with all but negligible probability. Hence, the simulator simply outputs rS as the prover's final message, and terminates. The simulator will thus fail with no more than negligible probability.

These examples illustrate not only that the class of straight-line zero-knowledge protocols in the trusted center model is rich, but also that it is not difficult to construct proofs that fit this definition. In the next section, we show how to eliminate the trusted center for the purpose of concurrent zero-knowledge using a preprocessing protocol based on timing constraints.
4 The Preprocessing Protocol
In this section, we show how to achieve concurrent zero-knowledge without the trusted center for all protocols that are straight-line zero-knowledge in the trusted center model. This is accomplished by replacing the trusted center with a preprocessing protocol that employs timing constraints. This eliminates the trusted center for the purpose of maintaining concurrent zero-knowledge.

Let G be the generator for a public-key cryptosystem which requires ℓ(n) random bits. We will write G(1^n, σ) = (E, D) to mean that G, when given security parameter n and random bits σ ∈ {0,1}^ℓ(n), produces the public encryption algorithm E and private decryption algorithm D. Let C be a secure commitment scheme, such as the elegant scheme of [26]. The protocol uses the Basic Commit with Knowledge protocol (BCK) of [13].

Preprocessing Protocol:
0. V : Generates random strings σ, r_0^1, r_0^2, ..., r_0^n, r_1^1, r_1^2, ..., r_1^n ∈ {0,1}^ℓ(n). V runs G(1^n, σ) to produce E and D, and sets up the scheme C.
1. V −→ P : E, C(σ), C(r_0^1), C(r_0^2), ..., C(r_0^n), C(r_1^1), C(r_1^2), ..., C(r_1^n).
2. P −→ V : Random bits b_1, b_2, ..., b_n.
3. V −→ P : For each i, opens C(r_{b_i}^i) and sends r_{(1−b_i)}^i ⊕ σ.
4. V ←→ P : Verifier gives a ZK argument (e.g. [19]) of consistency of the Step 1-3 messages above with some σ such that G(1^n, σ) produces E.
5. P −→ V : If Step 4 is accepted, send "READY."
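The XOR-share structure of Steps 0-3 can be sketched as follows. The sketch uses a hypothetical hash-based commitment as a stand-in for the scheme C of [26], and illustrative parameter sizes:

```python
import hashlib
import secrets

n, L = 8, 16  # security parameter and share length in bytes (illustrative)

def commit(m: bytes) -> bytes:
    # Hypothetical hash-based commitment, standing in for the scheme C.
    return hashlib.sha256(m).digest()

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

# Step 0: V generates sigma and the 2n shares r_0^i, r_1^i.
sigma = secrets.token_bytes(L)
r = [[secrets.token_bytes(L) for _ in range(n)] for _ in (0, 1)]

# Step 1: V sends E (omitted here) plus commitments to sigma and all shares.
step1 = [commit(sigma)] + [commit(r[b][i]) for b in (0, 1) for i in range(n)]

# Step 2: P sends random challenge bits b_1, ..., b_n.
bits = [secrets.randbelow(2) for _ in range(n)]

# Step 3: for each i, V opens r_{b_i}^i and sends r_{(1-b_i)}^i XOR sigma.
opened = [r[bits[i]][i] for i in range(n)]
masked = [xor(r[1 - bits[i]][i], sigma) for i in range(n)]

# Sanity check: each masked value together with the *unopened* share
# determines sigma -- exactly what the simulator exploits once it sees
# Step 3 responses for two different challenge vectors.
for i in range(n):
    assert xor(masked[i], r[1 - bits[i]][i]) == sigma
```

A single transcript reveals, for each index, one share in the clear and the other only masked by σ, so σ itself stays hidden; two transcripts with differing challenge bits expose both shares at some index.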
Timing Constraints: (1) P requires the Step 3 message to be received before time α has elapsed on its local clock since the Step 1 message was sent. If a verifier V fails to respond in this allotted time, we say V has timed out. (2) P does not send the Step 5 signal until time β has elapsed on its local clock since Step 1.

For zero-knowledge, we assume that the adversary is constrained by an (α, β)-constraint. For completeness we must also assume that V can send its Step 3 message in the real time required for time α to elapse on P's local clock. The following theorem shows that these timing constraints effectively eliminate all problematic interleavings.

Theorem 1. Assuming that a semantically secure public-key cryptosystem exists, the Preprocessing Protocol, followed by any polynomial number of protocols that are straight-line zero-knowledge in the trusted center model, is (computational) concurrent zero-knowledge.

Furthermore, using the Preprocessing Protocol does not give the Prover any advantage it would not have in the original protocol. In particular, using the Preprocessing Protocol does not endanger the soundness of any protocol that was sound in the trusted center model.

Theorem 2. Let π = (P, V) be any protocol in the trusted center model, and let π′ = (P′, V′) be the Preprocessing Protocol followed by π in the normal model. Then, for any probabilistic polynomial-time prover P*, there exists another prover P** such that the distribution of transcripts from (P*, V′) (not including the transcript of the Preprocessing Protocol) is computationally indistinguishable from the distribution of transcripts from (P**, V) (not including the initial interaction with the trusted center).

Proof (of Theorem 2). Theorem 2 follows from the fact that the verifier's role in the Preprocessing Protocol is simulable without knowing D or a σ such that G(1^n, σ) = (E, D).
P** behaves as follows: first it simulates the Preprocessing Protocol with P*, and then it simply behaves as P* does afterwards. From the trusted center, it first receives E. Then it generates random strings σ, r_0^1, r_0^2, ..., r_0^n, r_1^1, r_1^2, ..., r_1^n ∈ {0,1}^ℓ(n), and sets up the commitment scheme C with P*. It sends E, C(σ), C(r_0^1), C(r_0^2), ..., C(r_0^n), C(r_1^1), C(r_1^2), ..., C(r_1^n) to P*, who responds with some random bits b_1, b_2, ..., b_n. Then, for each i, P** opens C(r_{b_i}^i) and sends r_{(1−b_i)}^i ⊕ σ. Note that all this time σ had absolutely nothing to do with E. However, by the zero-knowledge property of the argument of Step 4, P** can simulate Step 4 as if G(1^n, σ) = (E, D). By this zero-knowledge condition, the transcript of all steps so far is computationally indistinguishable from a partial transcript of a real interaction of V′ and P*. Since the state of P* at the end of Step 4 (i.e., the beginning of π) is computationally indistinguishable from its state at this point in a real interaction with V′, the theorem follows.
Proof (of Theorem 1, using the proof techniques of [16]). First, we observe that the timing constraints yield the following interleaving constraint:

Interleaving Constraint: While any verifier is in Steps 1-3 and has not timed out, no new interaction can be started and complete Step 5.

Note that this remains true by the (α, β)-constraint even if there are many provers with different local clocks, since the time between Steps 1-3 must always be less in real time than the minimum delay in real time between Steps 1-5. Note also that in all steps the prover needs no special information to carry out her side of the protocol. Hence, for any particular verifier, all steps in the Preprocessing Protocol are trivial to simulate (perfectly) in a straight-line fashion. To be able to simulate any subsequent protocol that is straight-line zero-knowledge in the trusted center model, we appeal to the following lemma:

Lemma 1. For any verifier V, if the simulator has Step 3 responses for two different Step 2 queries (for the same Step 1 message), then the simulator can simulate all subsequent executions of protocols with V that are straight-line zero-knowledge in the trusted center model (computationally) in a straight-line fashion, with all but negligible probability.

Proof. Since the Step 2 queries were different, there exists an i such that V opened both r_0^i and r_1^i, and (supposedly) supplied both a = r_1^i ⊕ σ and b = r_0^i ⊕ σ. The simulator can test both a ⊕ r_1^i and b ⊕ r_0^i by running G with these inputs. If G produces V's encryption key E, then we know that the decryption key D produced must be V's secret key. If this is the case, the simulator has V's secret key and can simulate all future protocols that are straight-line zero-knowledge in the trusted center model (computationally) in a straight-line fashion, by assumption.
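Concretely, this extraction test can be sketched as follows, with a deterministic toy generator G (a hypothetical hash-based stand-in for the cryptosystem's key generator):

```python
import hashlib
import secrets

L = 16  # share length in bytes (illustrative)

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

def G(sigma: bytes):
    # Hypothetical deterministic key generator: derives (E, D) from sigma.
    E = hashlib.sha256(b"pk" + sigma).digest()
    D = hashlib.sha256(b"sk" + sigma).digest()
    return E, D

# Honest verifier setup (Steps 0/1 of the Preprocessing Protocol), at
# the index i where both challenge bits occurred.
sigma = secrets.token_bytes(L)
E, D = G(sigma)
r0, r1 = secrets.token_bytes(L), secrets.token_bytes(L)

# The two Step 3 responses for the same Step 1 message:
# b_i = 0 opens r0 and sends a = r1 XOR sigma;
# b_i = 1 opens r1 and sends b = r0 XOR sigma.
a = xor(r1, sigma)
b = xor(r0, sigma)

# Simulator's extraction: test both candidates by re-running G.
for candidate in (xor(a, r1), xor(b, r0)):
    if G(candidate)[0] == E:
        recovered_D = G(candidate)[1]
        assert recovered_D == D  # simulator now holds V's secret key
```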
If not, for any future simulation of V, when the simulator obtains a Step 3 response γ from V, it again checks against the responses it already has to see whether this new response yields a valid secret key for V. If this is not the case, then the simulator has proof that γ is inconsistent with the commitments in V's Step 1 message. Hence, with all but negligible probability, V will not pass Step 4 of the protocol, by the soundness of the ZK argument. Note that the soundness condition for ZK arguments requires that no PPT prover can succeed in proving a false statement with more than negligible probability. The interaction of the PPT adversary A and the PPT simulator together certainly still comprises a PPT system, and hence cannot prove the false statement with more than negligible probability. Thus, with all but negligible probability, V never makes it past Step 4, and the Lemma follows.

We now describe a subroutine of the simulator called Extract. The subroutine takes two arguments, the name of a verifier Vi and a partial starting transcript T that includes the Step 1 message of Vi. Extract(Vi, T) is only called if the simulator has already obtained one Step 3 response from Vi. The
purpose of calling Extract on Vi is to create for Vi the situation required by the Lemma. In Extract(Vi, T) the simulator repeats the following procedure as many times as needed to obtain another Step 3 response from Vi:

– Starting with partial transcript T, begin a simulation until either Vi gives a Step 3 response or more than time α has passed since Step 1 of Vi. During this simulation:
  • For any verifiers introduced after the Step 1 message of Vi, by the Interleaving Constraint, we know these verifiers will never proceed past Step 5 in the time allotted, so simulate the interaction with them perfectly.
  • If any verifier Vj which was introduced before the Step 1 message of Vi gives a Step 3 response:
    ∗ If the simulator has already obtained two Step 3 responses from Vj, by the Lemma all interaction with Vj can be simulated in a straight-line fashion.
    ∗ If not, the simulator executes Extract(Vj, T).

Thus, we are guaranteed that after executing Extract(Vi, T) we have received two Step 3 responses from Vi. If the two responses received are the same, the simulator fails. This can only happen if the random bits chosen by the simulator in Step 2 were identical, an event with exponentially small probability. We will later argue that the simulator runs in expected polynomial time. Hence the simulator only gets an expected polynomial number of chances to fail in this manner, and so its probability of failure in this way is negligible. If the two Step 3 responses are different, such a verifier, having satisfied the conditions of the Lemma, is called neutralized.

Now, to generate its output transcript, the simulator begins a straight-line simulation with the adversary. Whenever a verifier V that has not already been neutralized gives a Step 3 response, the simulator calls Extract(V, T), where T is the partial transcript up to and including the Step 1 message of V.
When Extract terminates, V has been neutralized and thus, by the Lemma, all future interaction with V can be simulated in a straight-line fashion. This continues until the simulation is concluded. By construction, the distribution of transcripts produced by such a simulation is computationally indistinguishable from those arising from an actual interaction with the adversary, since the transcript is produced in a straight-line manner.

We must confirm that the expected running time of the simulator is polynomially bounded. Each trial within Extract, not counting time taken by recursive function calls, certainly takes polynomial time, say O(n^c). Let us analyze the expected running time of a function call to Extract. Conditioned on some partial transcript T, let Xi denote the random variable for the time to complete Extract(Vi, T), and let pi denote the probability, over simulations starting at T, that Vi will give its Step 3 response during a simulation trial. If V1 is the first verifier appearing in T, then during Extract(V1, T) no recursive calls can be made for other verifiers. Hence, X1 ≤ O(n^c)(1 + p1 + (1 − p1)(1 + X1)), and so by linearity
of expectation, E(X1) ≤ O(n^c) · 2/p1. More generally, if Vi is the i-th verifier appearing in the transcript T, then

Xi ≤ O(n^c)(1 + Σ_{j=1}^{i−1} pj Xj + pi + (1 − pi)(1 + Xi)),

and a simple induction shows that E(Xi) ≤ O(n^c) · 2i/pi. Now, in the simulation, conditioned on a partial transcript T, the probability that Extract(Vi, T) will be called (from outside Extract) is exactly pi. Thus, the expected amount of time the simulation will spend on Extract(Vi, T) is O(n^c) · 2i. Since this does not depend on T, we can remove the conditioning and conclude that the expected amount of time the simulation will spend in Extract for Vi is O(n^c) · 2i. We note that the total number of verifiers that can be present in the final transcript is bounded by the running time of the adversary. Hence, if the adversary's expected running time is t(n), the expected amount of time the simulation will spend in Extract over all of the verifiers is O(n^c · t(n)^2). The rest of the simulation certainly takes no more than expected O(n^c) · t(n) time, and so we conclude that the expected running time is polynomial in n.
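The bound E(X1) ≤ O(n^c) · 2/p1 can be sanity-checked numerically. The sketch below (parameters hypothetical, per-trial cost O(n^c) normalized to 1) simulates the retry process for the first verifier and compares the empirical mean against the bound:

```python
import random

def extract_cost(p: float, rng: random.Random) -> int:
    """Trial count for one call to Extract on the first verifier: each
    attempt succeeds with probability p, and each failure costs one
    extra unit before a fresh attempt, consistent with the recursion
    X1 <= 1 + p1 + (1 - p1)(1 + X1) from the text."""
    cost = 1                    # the first trial
    while rng.random() >= p:    # trial failed (probability 1 - p)
        cost += 2               # failed continuation plus a new trial
    return cost

rng = random.Random(0)          # fixed seed for reproducibility
p1 = 0.25
mean = sum(extract_cost(p1, rng) for _ in range(200_000)) / 200_000
# Closed form of this process: E[X1] = (2 - p1)/p1 = 7.0,
# which sits below the paper's bound 2/p1 = 8.
assert 6.5 < mean < 7.5
assert mean <= 2 / p1
```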
5 Additional Remarks and Future Research
Recently, Kilian, Petrank and Rackoff [25] have shown that any 4-round nontrivial zero-knowledge interactive proof is not black-box simulatable under concurrent executions. In a companion paper, Richardson and Kilian [28] have shown that for any ε > 0 and any upper bound k on the amount of concurrency to be tolerated, there exists a zero-knowledge proof for any language in NP requiring k^ε messages. We believe that the Kilian-Richardson protocol can be used (with some modifications) as a preprocessing protocol to allow subsequent constant-round concurrent zero-knowledge arguments for any statement in NP; we are in the process of checking the details.

The difficulties in achieving non-malleability and concurrent zero-knowledge both stem from potentially bad protocol interleavings. In [16] and in the preprocessing protocol described in this paper, timing is used explicitly to proscribe certain interleavings in order to achieve concurrent zero-knowledge. Can timing be used in a natural way to achieve non-malleability? For concreteness, consider non-malleable string commitment. Since non-malleable string commitment protocols exist without recourse to timing [13], any timing-based solution would be interesting only if it is efficient or simple. Can timing be used to achieve other cryptographic objectives?
References

1. M. Bellare and P. Rogaway, Provably Secure Session Key Distribution: The Three Party Case, Proc. 27th STOC, 1995, pp. 57-64.
2. M. Bellare and M. Yung, Certifying Permutations: Noninteractive Zero-Knowledge Based on Any Trapdoor Permutation, Journal of Cryptology, 9(3):149-166, 1996.
3. M. Blum, Coin Flipping by Telephone: A Protocol for Solving Impossible Problems. In Allen Gersho, editor, Advances in Cryptology: A Report on CRYPTO 81, pages 11-15, 24-26 August 1981. Department of Electrical and Computer Engineering, U.C. Santa Barbara, ECE Report 82-04, 1982.
4. M. Blum, A. De Santis, S. Micali, and G. Persiano, Noninteractive Zero-Knowledge, SIAM Journal on Computing, 20(6):1084-1118, 1991.
5. M. Blum, P. Feldman and S. Micali, Non-Interactive Zero-Knowledge Proof Systems, Proc. 20th ACM Symposium on the Theory of Computing, Chicago, 1988, pp. 103-112.
6. G. Brassard, C. Crepeau and M. Yung, Constant-Round Perfect Zero-Knowledge Computationally Convincing Protocols, Theoretical Computer Science 84, 1991.
7. G. Brassard, D. Chaum, C. Crepeau, Minimum Disclosure Proofs of Knowledge, JCSS, Vol. 37, 1988, pp. 156-189.
8. S. Brands and D. Chaum, Distance-Bounding Protocols, Advances in Cryptology - EUROCRYPT '93, 1993.
9. R. Canetti, C. Dwork, M. Naor, R. Ostrovsky, Deniable Encryption, "Security in Communication Networks" workshop, Amalfi, Italy, 1996, and CRYPTO '97.
10. D. Chaum and H. van Antwerpen, Undeniable Signatures, Advances in Cryptology - CRYPTO '89, G. Brassard (Ed.), Springer-Verlag, pp. 212-216.
11. R. Cramer and I. Damgard, New Generation of Secure and Practical RSA-Based Signatures, Advances in Cryptology - CRYPTO '96, Springer-Verlag, 1996.
12. A. De Santis and M. Yung, Cryptographic Applications of the Metaproof and Many-Prover Systems, Proc. CRYPTO '90, Springer-Verlag, 1990.
13. D. Dolev, C. Dwork and M. Naor, Non-Malleable Cryptography. Preliminary version: Proc. 21st STOC, 1991. Full version: submitted for publication (available from the authors).
14. C. Dwork and M. Naor, Pricing via Processing -or- Combatting Junk Mail, Advances in Cryptology - CRYPTO '92, Lecture Notes in Computer Science.
15. C. Dwork and M. Naor, Method for Message Authentication from Non-Malleable Cryptosystems, US Patent No. 05539826, issued Aug. 29, 1996.
16. C. Dwork, M. Naor, and A. Sahai, Concurrent Zero Knowledge, to appear, STOC '98.
17. U. Feige, A. Fiat and A. Shamir, Zero Knowledge Proofs of Identity, J. of Cryptology 1(2), pp. 77-94. (Preliminary version in STOC '87.)
18. U. Feige and A. Shamir, Witness Indistinguishable and Witness Hiding Protocols, Proc. 22nd STOC, 1990, pp. 416-426.
19. U. Feige and A. Shamir, Zero Knowledge Proofs of Knowledge in Two Rounds, Advances in Cryptology - CRYPTO '89 Proceedings, Lecture Notes in Computer Science Vol. 435, G. Brassard, ed., Springer-Verlag, 1989.
20. U. Feige, D. Lapidot and A. Shamir, Multiple Non-Interactive Zero-Knowledge Proofs Based on a Single Random String, Proc. 31st Symposium on Foundations of Computer Science, 1990, pp. 308-317.
21. O. Goldreich, Foundations of Cryptography (Fragments of a Book), 1995. Electronic publication: http://www.eccc.uni-trier.de/eccc/info/ECCC-Books/ecccbooks.html (Electronic Colloquium on Computational Complexity).
22. O. Goldreich and H. Krawczyk, On the Composition of Zero Knowledge Proof Systems, SIAM J. on Computing, Vol. 25, No. 1, pp. 169-192, 1996.
23. S. Goldwasser and S. Micali, Probabilistic Encryption, Journal of Computer and System Sciences, Vol. 28, April 1984, pp. 270-299.
24. S. Goldwasser, S. Micali, and C. Rackoff, The Knowledge Complexity of Interactive Proof Systems, SIAM Journal on Computing, Vol. 18, No. 1 (1989), pp. 186-208.
25. J. Kilian, E. Petrank, and C. Rackoff, Zero Knowledge on the Internet, manuscript, 1998.
26. M. Naor, Bit Commitment Using Pseudo-Randomness, Journal of Cryptology, Vol. 4, 1991, pp. 151-158.
27. C. Rackoff and D. Simon, Non-Interactive Zero-Knowledge Proof of Knowledge and Chosen Ciphertext Attack, Proc. CRYPTO '91, Springer-Verlag, 1992, pp. 433-444.
28. R. Richardson and J. Kilian, Non-Synchronized Composition of Zero-Knowledge Proofs, manuscript, 1998.
The Solution of McCurley's Discrete Log Challenge

Damian Weber (Institut für Techno- und Wirtschaftsmathematik, Erwin-Schrödinger-Str. 49, D-67663 Kaiserslautern, [email protected]) and Thomas Denny (debis IT Security Services, Rabinstraße 8, D-53111 Bonn, [email protected])
Abstract. We provide the secret Diffie-Hellman-Key which is requested by Kevin McCurley’s challenge of 1989. The DH-protocol in question has been carried out in (ZZ/pZZ)∗ where p is a 129-digit prime of special form. Our method employed the Number Field Sieve. The linear algebra computation was done by the Lanczos algorithm. Keywords: Discrete Logarithms, Number Field Sieve, Index Calculus, Lanczos
1 Introduction
When discrete log cryptosystems are designed, the groups (ZZ/pZZ)* serve as a standard choice, as for example in [3,4,14]. In view of the Diffie-Hellman key exchange protocol introduced in [3], McCurley stated a challenge using the following setup [11]:

bA = 12740218011997394682426924433432284974938204258693
     16216545577352903229146790959986818609788130465951
     66455458144280588076766033781
bB = 18016228528745310244478283483679989501596704669534
     66973130251217340599537720584759581769106253806921
     01651848662362137934026803049
p = (739 · 7^149 − 736)/3
q = (p − 1)/(2 · 739)

The order of the multiplicative group, which is generated by the element 7, splits as follows: |(ZZ/pZZ)*| = 2 · 739 · q.

– Alice computes (using her secret key xA): 7^xA ≡ bA (mod p)
– Bob computes (using his secret key xB): 7^xB ≡ bB (mod p)

H. Krawczyk (Ed.): CRYPTO'98, LNCS 1462, pp. 458-471, 1998. © Springer-Verlag Berlin Heidelberg 1998
Kevin McCurley asked for the common secret key K ≡ 7^(xA·xB) (mod p), which we computed at 03:50 pm Middle European Time on Jan 25, 1998, as

K = 38127280411190014138078391507929634193998643551018670285056375615
    045523966929403922102172514053270928872639426370063532797740808,   (1)

by first calculating

xA = 6185869085965188327359333165203790426798764306952171345914622218
     49525998156144877820757492182909777408338791850457946749734,      (2)
the secret key of Alice. Since p is of a special form, it is very convenient to use the number field sieve discrete log algorithm [17,5] for the precomputation. This step has already been carried out in [21]. It was unclear whether one could keep the attractive number field chosen there for computing the individual logarithms as well [18]. Readers not familiar with the number field sieve are recommended to look up these references. We note that for "general" p, the record for discrete logarithms is 85 decimal digits [20].

In contrast to factoring with the number field sieve, some additional computational difficulties have to be dealt with. For example, it is costly to transform the original congruence to another form such that the number field sieve is actually applicable. Another constraint is the smaller size of the factor bases; otherwise the linear algebra step would be infeasible.

After the introduction of notation in Section 2, we describe in Section 3 how we transformed the original problem into one where only logarithms of "small" elements have to be computed. Section 4 deals with the choice of the polynomial defining the number field. Section 5 is devoted to lattice sieving techniques for discrete log computations. In Section 6 we pay attention to computing the solution of the resulting sparse linear system mod q, which eventually yielded the solution to the original discrete log problem.
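The special form of p and the stated splitting of the group order can be checked directly with arbitrary-precision integer arithmetic; a small sketch:

```python
# Verify the structure of McCurley's prime p = (739 * 7^149 - 736) / 3
# and of q = (p - 1) / (2 * 739), as stated in the challenge setup.
N = 739 * 7**149 - 736
assert N % 3 == 0              # the quotient defining p is an integer
p = N // 3

q = (p - 1) // (2 * 739)
assert p - 1 == 2 * 739 * q    # |(Z/pZ)*| = p - 1 = 2 * 739 * q
assert len(str(p)) == 129      # p has 129 decimal digits
```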
2 Notation of the Number Field Sieve Parameters
Let the discrete logarithm problem be a^x ≡ b (mod p), p prime. Let f be an irreducible polynomial of degree n, and let h denote the coefficient of X^n. Furthermore, let α ∈ C be a root of f. There should be at least one root m of f(X) modulo p. Set ω = h · α. In this case there exists a ring homomorphism

ϕ : ZZ[ω] −→ ZZ/pZZ,  ω ↦ h · m.

We set g(X) := X − m. We denote by FB_alg (the algebraic factor base) a finite set of prime ideals in the ring of integers of Q(ω), and by FB_rat (the rational factor base) a finite set of prime numbers in ZZ.
3 Reduction of the Original Problem
With the number field sieving step, logarithms of elements s ∈ ZZ/pZZ can be obtained if a smooth σ ∈ ϕ^(−1)(s) ⊂ ZZ[ω] is known. In particular, this is the case if s lies in the rational factor base. Having planned to use a rational factor base of size 30000, this would mean to aim at bounding s by the 30000-th prime, that is 350381. With the currently available methods this is not feasible. With the following relation, however, we are reduced to s's which are not bigger than 5.05 · 10^12:

7^141266132 · bA ≡ t/v (mod p),   (3)

where

t = 23 · 31 · s1 · s3 · s6 · s8 · s10 · s11
v = 353 · s2 · s4 · s5 · s7 · s9 · s12,

with

s1 = 603623          s7 = 13166825869
s2 = 165073039       s8 = 265542836371
s3 = 1571562367      s9 = 371303006453
s4 = 1601141623      s10 = 4145488613977
s5 = 1715568391      s11 = 4338202139093
s6 = 7575446399      s12 = 5041332876473.
We proceed with how equation (3) was found. Each b ∈ (ZZ/pZZ)* can be expressed as a quotient b ≡ t/v (mod p) with |t|, |v| < √p + 1 [19, Th. 67]. Such a representation can be found by applying the extended Euclidean algorithm. In the challenge computation we computed bl :≡ 7^l · b (mod p) for many l's, found the quotient bl ≡ tl/vl, and tried to split tl and vl. The ρ-function [6] tells us how many pairs we need to test until a successful decomposition occurs. The term ρi(α) denotes the probability that the i-th largest prime factor of the number n is at most n^(1/α). We set ρ(α) := ρ1(α).

We started looking for factors of t and v with at most 15 decimal digits simultaneously. The probability that a 65-digit number has only factors of at most 15 digits is ρ(13/3) ≈ 2.12 · 10^(−3). Assuming t and v behave as random integers of this size, we expect to find a successful l after ≈ 1/(2.12 · 10^(−3))^2 ≈ 222500 trials. We distributed the interval [141200001, 141422500] among 40 Sparc ELC and 20 Sparc 4 machines. As a Sparc 4 is about four times faster than a Sparc ELC, we chose the interval length for the ELC as 1850, in contrast to 7500 for the Sparc 4 stations. Each machine carried out four different stages; we give the average running times for each stage (Sparc 4 workstation) per l:
1. trial division up to 10^6 (1.35 sec),
2. ECM for factors up to 10^9 (7.11 sec),
3. ECM for factors up to 10^12 (30.02 sec),
4. ECM for factors up to 10^15 (128.80 sec),
where ECM is an acronym for Lenstra's elliptic curve factoring method [10]. We label these stages by TDIV, ECM9, ECM12 and ECM15, respectively. A pair (t, v) is useless if either t or v contains a prime factor above our smoothness bound 10^15. We can estimate beforehand how many pairs (t, v) will be recognized as useless by each stage if we evaluate the ρ2 function.

Table 1. Probability of a number of 65 digits being recognized as useless.

  useless after stage   TDIV    ECM9    ECM12   ECM15
  with probability      0.184   0.238   0.243   0.190
Using trial division in combination with ECM is a very tedious smoothness test, so we experimented with an early abort strategy. Such a technique has been used in several algorithms, originating from [16]. In our case, after each stage we removed from our list the third of the pairs (t, v) with the biggest unfactored part.

The following behaviour is to be expected when applying this strategy on each Sparc 4 processor. On such a machine we start with 7500 pairs (t, v). From Table 1 we estimate that approximately (1 − 0.184)^2 ≈ 66.6%, that is 4994 pairs, will survive the trial division step. Our early abort condition removes a third of the pairs, which leaves 3329. Of these, approximately (1 − 0.238)^2 ≈ 58.2% survive ECM9 (1933). Another cut by early aborting leaves 1288. ECM12 leaves 738 pairs; after removing a third of these, 492 are left for ECM15. In case no successful pair is found, we can now estimate the time of the smoothness tests per processor as

7500 · 1.35 + 3329 · 7.11 + 1288 · 30.02 + 492 · 128.8 = 135829.55 sec ≈ 37.7 h.

Table 2. Reduction sieve: number of pairs per stage.

        TDIV           ECM9           ECM12          ECM15          run-time (h)
  proc  start useless  start useless  start useless  start useless  total
  1     7500  2535     3307  1139     1444  574      579   364      41.0
  2     7500  2446     3366  1192     1448  603      563   339      41.3
  3     7500  2531     3309  1232     1383  565      545   309      41.5
  4     7500  2408     3391  1181     1472  592      586   359      42.7
  5     7500  2636     3239  1102     1423  590      555   341      40.3
  6     7500  2497     3332  1190     1426  548      585   352      41.5
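The survivor counts and the 37.7 h estimate above can be reproduced from the Table 1 probabilities; a sketch (rounding conventions chosen here to match the counts in the text):

```python
# Per-stage probability (Table 1) that a single 65-digit number is
# recognized as useless, and per-l stage times on a Sparc 4 (seconds).
p_useless = {"TDIV": 0.184, "ECM9": 0.238, "ECM12": 0.243}
secs = {"TDIV": 1.35, "ECM9": 7.11, "ECM12": 30.02, "ECM15": 128.80}

pairs = 7500                       # pairs (t, v) per Sparc 4 processor
counts = [pairs]
for stage in ("TDIV", "ECM9", "ECM12"):
    # both t and v must survive the stage ...
    survive = round(counts[-1] * (1 - p_useless[stage]) ** 2)
    counts.append(survive * 2 // 3)  # ... then early abort drops a third

total = sum(c * secs[s] for c, s in zip(counts, secs))
assert counts == [7500, 3329, 1288, 492]       # inputs to each stage
assert abs(total - 135829.55) < 0.01           # 135829.55 sec = 37.7 h
```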
From Table 2, which summarizes the output of six Sparc 4 processors, we see that the theoretical prediction of the useless pairs gives a good estimate of the actual total running time per processor.
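The quotient representation b ≡ t/v (mod p) with |t|, |v| < √p + 1, used throughout this section, is obtained by running the extended Euclidean algorithm on (p, b) and stopping as soon as the remainder drops below √p. A minimal sketch (with small illustrative parameters, not the challenge numbers):

```python
from math import isqrt

def rational_reconstruct(b: int, p: int):
    """Find (t, v) with b * v == t (mod p) and |t|, |v| <= isqrt(p),
    via the extended Euclidean algorithm on (p, b), stopping once the
    remainder drops below sqrt(p)."""
    r0, r1 = p, b % p
    v0, v1 = 0, 1               # invariant: r_i == v_i * b (mod p)
    bound = isqrt(p)
    while r1 > bound:
        q = r0 // r1
        r0, r1 = r1, r0 - q * r1
        v0, v1 = v1, v0 - q * v1
    return r1, v1               # b == t / v (mod p) with t = r1, v = v1

p = 1000003                     # toy prime, not McCurley's p
b = 123456
t, v = rational_reconstruct(b, p)
assert (b * v - t) % p == 0
assert abs(t) <= isqrt(p) and abs(v) <= isqrt(p)
```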
4 Choice of the Polynomial
This section is devoted to finding an appropriate polynomial f for the number field sieve (see section 2). According to practical experience we should consider number fields of degree n = 3, 4, 5, 6. In order to make the best choice, i.e. to find a maximum number of relations, we examine the probability of finding relations over two factor bases of optimal size with respect to the expected norm of elements c + dα of that number ring. In order to construct suitable polynomials of such degrees we may use the identities

  21p = 739 · (7^50)^3 − 5152
   3p = 5173 · (7^37)^4 − 736
  21p = 739 · (7^30)^5 − 5152
  21p = 739 · (7^25)^6 − 5152.

We therefore have to choose among the following pairs of polynomials:

  g(X) = X − 7^50,   f(X) = 739 · X^3 − 5152
  g(X) = X − 7^37,   f(X) = 5173 · X^4 − 736
  g(X) = X − 7^30,   f(X) = 739 · X^5 − 5152
  g(X) = X − 7^25,   f(X) = 739 · X^6 − 5152
In order to compare the four different possible choices of the degree, we look at the values c + dm and N(c + dα) to be decomposed over the factor bases. On the rational side we get c + dm ≈ dm; on the algebraic side, we obtain −739 · c^n − 5152 d^n ≈ −739 · c^n. As a sieving rectangle of 10^6 × 10^6 was expected to be needed, we got table 3 with the aid of the ρ–function.

Table 3. Comparing different degrees.

  degree  m          dm         h · c^n  |FB1|  |FB2|  # trials per full
  3       1.8·10^42  1.8·10^48  10^21    19800  20200  3.7·10^11
  4       1.9·10^31  1.9·10^37  10^28    19900  20100  6.2·10^9
  5       2.3·10^25  2.3·10^31  10^33    19600  20400  3.7·10^9
  6       1.3·10^21  1.3·10^27  10^39    16400  23600  1.1·10^10
We see from this table that degrees 4 and 5 were competing, with a slight advantage for degree 5. The only way to find out which is the better one is to give both
The Solution of McCurley’s Discrete Log Challenge
a try and continue with the one which produces more relations. Test sieving has been carried out for x + ym and N(x + yα) in the rectangle [−10^6, 10^6] × [1, 5000]. From the following amounts of relations it is evident that degree 5 is indeed the better choice (running time in sec on a Sparc 4).
Table 4. Relations after test sieving.

  type          degree 4  degree 5
  full          16        127
  1 LP (alg)    49        411
  1 LP (rat)    114       631
  2 LP          344       2056
  running time  5653      9535

5 Sieving
After fixing the polynomials f, g we had to choose appropriate sieving parameters. The precomputation in 1995 was carried out with the quadruple large prime variant. We repeated the precomputations for two reasons: firstly, several improvements in the linear algebra step allowed us to use a bigger factor base; secondly, we wanted to find out how effectively the double large prime variant would perform in this setting. Table 5 depicts the parameters and the sieving results of 1995 (quadruple large prime variation) and 1997 (double large prime variation). Both setups produced the result we wished for: a linear system whose solution yields the logarithms of particular elements of ZZ/pZZ. We comment on the figures in table 5. We raised the large prime bound in order to increase the likelihood of encountering a partial relation, which now contained at most one large prime for each of f and g. We shrank the sieving interval for two reasons. In the polynomial N(X + Y α) = −739X^5 − 5152Y^5 both variables contribute to the same extent to the absolute value of the norm. In our first run, however, we observed that after covering the first half of the square we already had enough relations. So we attempted to end up with a square sieving range instead of a rectangle, which was roughly achieved. The running time of the first run is a very crude estimate in terms of mips years (mega instructions per second). Although this measurement is outdated, we want to stick to it in order to keep the running times comparable to many of the previous publications concerning discrete log and factoring computations. Our run–time measurement is based on the observation that the UltraSparc we used was eight times faster than a Sparc ELC workstation, which is rated at 21 mips.
Table 5. Parameters and sieving results.

                     quadruple large prime  double large prime
  parameters
  FB rat             20000                  30000
  FB alg             20000                  30000
  LP rat             10^6                   5·10^7
  LP alg             5·10^5                 5·10^7
  x–range            15·10^6                5·10^6
  y–range            2·10^6                 4·10^6
  run–time           110 mips y             ≈ 180 mips y
  relations
  full               3199                   10797
  1 LP               42407                  196734
  2 LP               211888                 895415
  3 LP               478543                 –
  4 LP               388685                 –
  combined partials  306717                 75592
By utilizing the amount of relations depicted in table 5, logarithms of specific factor base elements can be computed. Now let us turn to the computation of logarithms of elements not lying in the factor base. One of the open questions in [18] was: is it possible to compute the logarithm of an arbitrary element of (ZZ/pZZ)^* without abandoning the especially comfortable polynomial f? Changing f is required by the theory, but its analysis is for p → ∞. In our computation, however, we pursued another route, which has already been indicated in section 3. This is the attempt to transform the original problem such that the remaining elements, whose logarithms are unknown, can be treated like factor base elements. Let s be one of the primes of the right hand side of (3). A relation of the following form is required for each s:

  c + dm  = s · ∏_r r^{e_r},   product over r ∈ FB_rat,
  [c + dα] =    ∏_r r^{e_r},   product over r ∈ FB_alg.     (4)
The lattice sieve satisfies this request by defining Ms := {(c, d) | c + dm ≡ 0 (mod s)} and searching for smooth elements among (c + dm)/s and c + dα with (c, d) taken from a subset of Ms . This will be called the lattice sieve for s. Analogously, one may introduce lattice sieving for prime ideals. This is a standard technique in number field sieve implementations introduced by [15].
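The set M_s can be enumerated without any trial division: for each fixed d, the admissible c form an arithmetic progression with common difference s. A minimal sketch of this enumeration, with made-up toy values of m and s rather than those of the actual computation:

```python
def lattice_points(m, s, c_bound, d_max):
    """Yield (c, d) with -c_bound <= c <= c_bound, 1 <= d <= d_max,
    and c + d*m ≡ 0 (mod s), i.e. the points of M_s in the sieving rectangle."""
    for d in range(1, d_max + 1):
        c = (-d * m) % s               # smallest non-negative solution for c
        c -= s * ((c + c_bound) // s)  # shift down to the smallest c >= -c_bound
        while c <= c_bound:
            yield (c, d)
            c += s

# Toy example (hypothetical values, for illustration only):
pts = list(lattice_points(1000003, 97, 500, 20))
```

In the real sieve one would then test (c + dm)/s and c + dα for smoothness at exactly these points.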
In general it is difficult to find a relation like (4) directly. What may be expected is to find relations of the form

  c + dm  = s · R1 · R2 · ∏_r r^{e_r},   product over r ∈ FB_rat,
  [c + dα] =    R̄1 · R̄2 · ∏_r r^{e_r},   product over r ∈ FB_alg,     (5)

with large primes (ideals) R1, R2, R̄1, R̄2. As with the large prime variation of the classical number field sieve, additional relations each containing one of R1, R2, R̄1, R̄2 allow these to be turned into a relation containing only one large prime, namely s. The heuristic algorithm which eventually led to the solution of the challenge was as follows:

1. By lattice sieving for s, find a quadruple large prime relation of the form (5).
2. Let R run through the list R1, R2, R̄1, R̄2; with the lattice sieve for R find either
   – a single large prime relation containing only the large prime R, and proceed with the next R from the list, or
   – a double large prime relation containing only the large prime R and another large prime which we call R'; in this case repeat the second step of this algorithm with R replaced by R' (called iteration in table 7).

Step 1 from above (finding (5) for s = s_i, 1 ≤ i ≤ 12) has been performed with the following sieving ranges:

Table 6. Lattice sieve step 1, sieving rectangle.

  primes     x–range          y–range
  s1 to s5   [-35000;35000]   [1;12000]
  s6         [-55000;55000]   [1;25000]
  s7 to s11  [-30000;30000]   [1;5000]
  s12        [-50000;50000]   [1;50000]
After performing step one – the time for which is shown in table 7 – the lattice sieve iterations had to be carried out for primes lying in the interval [10^5; 10^10]. The lower bound is due to the maximal factor base element, which has norm 349949 (algebraic) and 350377 (rational); the upper bound is due to the difficulty of finding appropriate relations for the s's above s5 (s6 already has 10 decimal digits). Each lattice sieve iteration for the R' was carried out with a sieving rectangle of [−35000, 35000] × [1, 20000], spending about 5000 sec on each on a Sparc 20. Summing this up, about 211 h in total on one Sparc 20 were needed. Clearly, it is trivial to distribute each s and each iteration to different workstations. This concludes the description of all sieving tasks.
Table 7. Running times of lattice sieve steps.

  prime  time (sec)  # LP in (5)  # iterations of step 2
  s1     5680        1            1
  s2     3089        2            4
  s3     3691        3            8
  s4     4696        2            7
  s5     5003        1            5
  s6     50941       4            13
  s7     1300        2            9
  s8     1314        2            16
  s9     1389        2            11
  s10    1343        4            21
  s11    1394        4            13
  s12    96810       4            8
6 Linear Algebra
In this section we describe how we solved the linear system, which consists of the exponents of the free and full relations, the combined partials (table 5), and the special relations produced by the heuristic algorithm of section 5. There are five more columns containing additive characters to ensure that q–th powers in Q(α) can be constructed [17]. Hence, we are left with a matrix of the form shown in figure 1.
[Figure 1 shows the structure of the relation matrix: 60000 columns, grouped into the rational FB, the algebraic FB, the 12 s_i, the 5 additive character columns, and exponents of individual primes; the rows consist of full relations (14.28%), free relations (1.98%), combined partials (83.72%) and special relations (0.02%).]

Fig. 1. Relation matrix.
The subsequent computation has been divided into two steps:
1. a preprocessing step (a refinement of structured Gaussian elimination),
2. the Lanczos algorithm (described in [7], [8]).
A possible alternative method, a combination of structured Gaussian and ordinary Gaussian elimination, suffers from enormous space requirements (in our example about 2 GB of main memory). We define n to be the number of unknowns and ω to be the total number of non–zero entries in the linear system. The running time of the Lanczos algorithm is known to be O(n^2 + nω). The goal of step 1 is to iteratively decrease n while increasing ω as long as this running time is decreasing. To make a firm decision at this point, we need to predict the actual running time of step 2 on the machine we are using. Starting from the basic operations in the Lanczos algorithm, the following sections develop the model which we apply for that purpose.

6.1 Operations
The basic operations over ZZ/qZZ that are performed during an iteration of the Lanczos algorithm are
– computation of inner products,
– matrix–vector multiplication,
– vector updates (adding a multiple of a vector to another vector).
In order to speed up the computations over ZZ/qZZ we used the Montgomery representation [12]. Due to the fact that the linear system consists of exponents from decomposing integers, almost 95% of the non-zero entries are equal to ±1. The remaining entries c_i are relatively small (−40 ≤ c_i ≤ 40). Table 8 classifies the non-zero entries of our linear system before and after step 1.

Table 8. Compactification of relation matrix.

                        original system  after preprocessing
  unknowns              60 001           35 666
  equations             75 592           35 688
  avg. weight/equation  49.8             164.6
  1–entries             1 785 588        2 657 470
  (−1)–entries          1 761 865        2 711 635
  c_i–entries           219 483          332 644
We could greatly reduce the time to perform a matrix–vector multiplication in ZZ/qZZ by computing all intermediate results in ZZ (but in Montgomery representation) and doing the reduction mod q only once, while creating the result
vector. By this technique (lazy reduction) we achieve a substantial gain of 29% in the running time of the matrix-vector multiplication (timing on Sparc 20, different linear system):

Table 9. Average running time of one matrix–vector multiplication.

  operation        original version  lazy reduction
  addition         14.05 s           10.47 s
  subtraction      13.22 s           11.08 s
  scalar mult      6.17 s            1.70 s
  final reduction  –                 0.58 s
  total            33.44 s           23.83 s
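The lazy-reduction idea can be sketched as follows (a simplified model of our own, standing in for the actual implementation): because almost all entries are ±1 and the rest are bounded by 40, a row–vector inner product can be accumulated exactly in ZZ and reduced mod q only once per output coordinate.

```python
def matvec_lazy(rows, v, q):
    """Sparse matrix-vector product over Z/qZ with a single final reduction.
    rows: list of rows, each a list of (column_index, coefficient) pairs,
    with small coefficients (here mostly +-1); v: vector of residues mod q."""
    result = []
    for row in rows:
        acc = 0                    # accumulate in Z, no intermediate mod q
        for j, coeff in row:
            acc += coeff * v[j]
        result.append(acc % q)     # one reduction per output entry
    return result
```

The naive variant would instead reduce after every addition of a product; deferring the reduction trades many modular operations for one.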
6.2 Running Time Model

An examination of all operations performed during the Lanczos algorithm to solve a linear system of dimension n with ω1 1-entries, ω2 (−1)-entries and ω3 c_i-entries leads to the following formula:

  T(n, ω, r) = n^2 · t2 + n · (2 · ω + t1)     (6)
where

  ω  = cache1 · (ω1 · T(Add) + ω2 · T(Sub)) + cache3 · ω3 · T(kmult)
  t1 = cache2 · (T(Inv) + (2 + r) · T(Mult))
  t2 = cache2 · (T(Square) + 2 · T(Sub_m) + (2 + r) · T(Add_m) + (3 + r) · T(Mult)) + 2 · cache3 · T(Red).

In this formula r is the number of solutions that need to be calculated and T(operation) is the time needed to perform a single arithmetic operation on integers of the magnitude of q. The variables cache1, cache2 and cache3 represent the time needed to access main memory, first level cache and second level cache, respectively. For a Sparc 20 workstation we may take cache1 = 2.0, cache2 = 1.0, cache3 = 1.3. These values are strongly machine dependent and have to be determined by experiment. By using (6), the time needed to solve a linear system of equations can be accurately predicted. This is crucial to decide whether an iteration of step 1 from above improves the running time. We now proceed by deriving a bound ∆ (> 0) for the maximal increase of ω while decrementing n. This depends on the number and type of entries in our linear system as well as on the time of arithmetic and memory operations. To achieve a speed up and save memory, we have the condition T(n, ω, r) − T(n − 1, ω + ∆, r) ≥ 0.
Solving this for ∆, we obtain

  ∆ ≤ ((2 · n − 1) · t2 + 2 · ω + t1) / (2 · (n − 1)).
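The bound follows by expanding T(n, ω, r) − T(n − 1, ω + ∆, r) with formula (6). A quick numerical check of the algebra, with hypothetical values for n, ω, t1 and t2 (the real ones are machine dependent and must be measured):

```python
def T(n, omega, t1, t2):
    # simplified Lanczos cost model, formula (6)
    return n * n * t2 + n * (2 * omega + t1)

def delta_bound(n, omega, t1, t2):
    # maximal increase of omega when n is decremented, from
    # T(n, omega) - T(n - 1, omega + delta) >= 0
    return ((2 * n - 1) * t2 + 2 * omega + t1) / (2 * (n - 1))

# hypothetical parameters, roughly the size of the compactified system
n, omega, t1, t2 = 35666, 5_700_000, 50.0, 3.0
d = delta_bound(n, omega, t1, t2)
# at the bound the two running times coincide; just above it, they do not
assert abs(T(n, omega, t1, t2) - T(n - 1, omega + d, t1, t2)) < 1.0
assert T(n, omega, t1, t2) < T(n - 1, omega + d + 1, t1, t2)
```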
In practice, this upper bound lies between 100 and 1200.

6.3 Results
Using this formula and the ideas of structured Gaussian elimination, the running time of the Lanczos algorithm to compute 22 solutions of the original system was reduced by more than 50%. The reduction in the dimension also led to an enormous reduction in the main memory requirements (approx. 30 MB), as 25 vectors have to be stored (see table 8 above). Table 10 depicts the running times for the basic operations and the total running time for the original and the compactified system of linear equations (timings on Sparc 20).

Table 10. Speed up by compactification.

  operation                     original system  after preprocessing
  matrix–vector multiplication  24.1 s           34.9 s
  update vector                 7.4 s            4.4 s
  update solutions              85.0 s           48.3 s
  inner product computation     3.7 s            2.2 s
  single iteration              120.1 s          92.0 s
  complete computation          2002.3 h         911.0 h
The solutions computed by the Lanczos algorithm finally yielded the logarithms of 2, 353, s1, . . . , s12. The logarithm of 31 has already been known from [21]. By using the identities of (3), Alice's key was easy to obtain (2). The final task was to compute the common key K of Bob and Alice from K ≡ b_B^{x_A} (mod p) as shown in (1).
Acknowledgements

The authors are particularly grateful to Kevin McCurley for offering that challenge. For providing the computing power we thank Ulrich Gräf (Sun Microsystems Benchmark Center/Germany) and Raimund Seidel (University of Saarland/Germany). The integer computations have been performed with two reliable and efficient multi–precision libraries, LiDIA and FREELIP. Accordingly, many
thanks go to the LiDIA Group (University of Darmstadt/Germany) and Arjen Lenstra (Citibank/USA). Additionally, we wish to thank Johannes Buchmann, Oliver Schirokauer, Thomas Setz and Jörg Zayer.
References

1. I. Biehl, J. Buchmann, and Th. Papanikolaou. LiDIA – a library for computational number theory. Technical report, Universität des Saarlandes/Germany, 1995. http://www.informatik.th-darmstadt.de/TI/LiDIA
2. Th. F. Denny. Lösen grosser dünnbesetzter Gleichungssysteme über endlichen Primkörpern. PhD thesis, Universität des Saarlandes/Germany, 1997.
3. W. Diffie and M. Hellman. New directions in cryptography. IEEE Trans. Information Theory, 22, pages 472–492, 1976.
4. T. ElGamal. A public key cryptosystem and a signature scheme based on discrete logarithms. IEEE Trans. Information Theory, 31:469–472, 1985.
5. D. Gordon. Discrete logarithms in GF(p) using the number field sieve. SIAM J. Discrete Math., 6:124–138, 1993.
6. D. E. Knuth and L. Trabb Pardo. Analysis of a simple factorization algorithm. Theoretical Computer Science, 3:321–348, 1976.
7. B. A. LaMacchia and A. M. Odlyzko. Solving large sparse linear systems over finite fields. In Advances in Cryptology – Crypto '90, number 537 in Lecture Notes in Computer Science, pages 109–133, 1990.
8. B. A. LaMacchia and A. M. Odlyzko. Computation of discrete logarithms in prime fields. Designs, Codes and Cryptography, 1:46–62, 1991.
9. A. K. Lenstra and H. W. Lenstra, Jr. (eds.). The development of the number field sieve. Number 1554 in Lecture Notes in Mathematics. Springer, 1993.
10. H. W. Lenstra, Jr. Factoring integers with elliptic curves. Ann. of Math., 126:649–673, 1987.
11. K. S. McCurley. The discrete logarithm problem. In Cryptology and Computational Number Theory, number 42 in Proc. Symp. in Applied Mathematics, pages 49–74. American Mathematical Society, 1990.
12. P. L. Montgomery. Modular multiplication without trial division. Math. Comp., 44:519–521, 1985.
13. V. Müller and Th. F. Denny. On the reduction of composed relations from the number field sieve. In H. Cohen, editor, Algorithmic Number Theory – ANTS II, number 1122 in Lecture Notes in Computer Science, 1996.
14. National Bureau of Standards. Digital signature standard, 1994. FIPS Publication 186.
15. J. M. Pollard. The lattice sieve. Number 1554 in Lecture Notes in Mathematics. Springer, 1993.
16. C. Pomerance and S. S. Wagstaff. Implementation of the continued fraction integer factoring algorithm. In Proc. 12th Manitoba Conf., Winnipeg/Manitoba 1982, Congr. Numerantium, volume 37 of Numerical mathematics and computing, pages 99–118, 1983.
17. O. Schirokauer. Discrete logarithms and local units. Phil. Trans. R. Soc. Lond. A, 345, pages 409–423, 1993.
18. O. Schirokauer, D. Weber, and Th. F. Denny. Discrete logarithms: the effectiveness of the index calculus method. In H. Cohen, editor, Algorithmic Number Theory – ANTS II, number 1122 in Lecture Notes in Computer Science, 1996.
19. D. Shanks. Solved and unsolved problems in number theory (3rd ed.). Chelsea Publishing Company, 1985.
20. D. Weber. Computing discrete logarithms with quadratic number rings. In Eurocrypt '98, Lecture Notes in Computer Science, 1998. To appear.
21. D. Weber. Computing discrete logarithms with the number field sieve. In H. Cohen, editor, Algorithmic Number Theory – ANTS II, number 1122 in Lecture Notes in Computer Science, 1996.
22. D. Weber. On the computation of discrete logarithms in finite prime fields. PhD thesis, Universität des Saarlandes/Germany, 1997.
23. D. Weber. An implementation of the number field sieve to compute discrete logarithms mod p. In Advances in Cryptology – Eurocrypt '95, number 921 in Lecture Notes in Computer Science, 1995.
24. J. Zayer. Faktorisieren mit dem Number Field Sieve. PhD thesis, Universität des Saarlandes/Germany, 1995.
Optimal Extension Fields for Fast Arithmetic in Public-Key Algorithms

Daniel V. Bailey (1) and Christof Paar (2)

(1) Computer Science Department, Worcester Polytechnic Institute, Worcester, MA 01609 USA, [email protected]
(2) ECE Department, Worcester Polytechnic Institute, Worcester, MA 01609 USA, [email protected]
Abstract. This contribution introduces a class of Galois fields used to achieve fast finite field arithmetic, which we call Optimal Extension Fields (OEFs). This approach is well suited for implementation of public-key cryptosystems based on elliptic and hyperelliptic curves. Whereas previously reported optimizations focus on finite fields of the form GF(p) and GF(2^m), an OEF is a field GF(p^m), for p a prime of special form and m a positive integer. Modern RISC workstation processors are optimized to perform integer arithmetic on integers of size up to the word size of the processor. Our construction employs well-known techniques for fast finite field arithmetic which fully exploit the fast integer arithmetic found on these processors. In this paper, we describe our methods to perform the arithmetic in an OEF and the methods to construct OEFs. We provide a list of OEFs tailored for processors with 8, 16, 32, and 64 bit word sizes. We report on our application of this approach to the construction of elliptic curve cryptosystems and demonstrate a substantial performance improvement over all previously reported software implementations of Galois field arithmetic for elliptic curves.

Keywords: finite fields, fast arithmetic, pseudo-Mersenne primes, Optimal Extension Fields, OEF, binomials, modular reduction, hyperelliptic curves, elliptic curves, cryptographic implementation
1 Introduction and Motivation
H. Krawczyk (Ed.): CRYPTO'98, LNCS 1462, pp. 472–485, 1998. © Springer-Verlag Berlin Heidelberg 1998

Arithmetic in finite fields is an integral part of many public-key algorithms, including those based on the discrete logarithm problem in finite fields, elliptic curve based schemes, and emerging applications of hyperelliptic curves. Our ability to quickly perform arithmetic in the underlying finite field determines the performance of these schemes. Finite fields are identified with the notation GF(p^m), where p is a prime and m is a positive integer. Essentially all previous work in this area has focused on two types of finite fields: GF(p^m) with m = 1, p a prime; and p = 2, m some positive integer. In this paper, we consider the
use of extension fields of large characteristic, with the characteristic p a prime of special form and m some positive integer. The case of p = 2 is especially attractive for hardware circuit design of finite field multipliers, since the elements of the subfield GF(2) can conveniently be represented by the logical signals "0" and "1." However, p = 2 does not offer the same computational advantages in a software implementation, since modern workstation microprocessors are designed to calculate results in units of data known as words. Traditional software algorithms for multiplication in GF(2^m) have a complexity of c · m · ⌈m/w⌉ steps, where w is the processor's word length and c is some constant greater than one. For the large values of m required for practical public-key algorithms, multiplication in GF(2^m) can be very slow. Similarly, prime fields GF(p) also present computational difficulties on standard computers. For example, practical elliptic curve schemes fix p to be greater than 2^150. Multiple machine words are required to represent elements from these fields on general-purpose workstation microprocessors, since typical word sizes are simply not large enough. This representation presents two computational difficulties: carries between words must be accommodated, and reduction modulo p must be performed with operands that span multiple machine words. In this paper we define a special class of choices of p and m and show that they can yield considerable computational advantages. Our primary motivation in what follows is to exploit the very high performance that modern RISC processors offer for integer arithmetic on single words, which alleviates many of the difficulties found with GF(p) and GF(2^m). Our focus in the present paper is on elliptic curve cryptosystems as introduced in [7] and [13]. However, the arithmetic introduced here can also be applied to hyperelliptic curve public-key systems as introduced in [8].
2 Our New Approach
Our new approach is based on the observation that several well-known optimizations exist for software implementation of finite field arithmetic and that, when used in conjunction, they yield significant performance gains for implementation of elliptic and hyperelliptic curve cryptosystems. To optimize arithmetic in GF(p^m) we stipulate the following properties on the choice of p and m:

1. Choose p to be less than but close to the word size of the processor so that all subfield operations take advantage of the processor's fast integer arithmetic.
2. Choose p to be a pseudo-Mersenne prime, that is, of the form 2^n ± c with log2(c) ≤ (1/2)n, to allow for efficient subfield modular reduction.
3. Choose m so that we have an irreducible binomial x^m − ω for efficient extension field modular reduction. The extension degree m can be small if the processor word size allows for large values of p.

A field that offers these arithmetic optimizations we call an Optimal Extension Field (OEF). For a formal definition of OEF, see Section 7. We demonstrate
that these optimizations can yield a substantial performance improvement over previous results as in [4,16,17,3]. As an example, when a modern RISC workstation with a 64-bit architecture such as the DEC Alpha family is our target platform, we would choose a p near 2^64. This approach has the advantage of fully exploiting the RISC CPU's ability to quickly perform 64 bit × 64 bit integer multiplication, thus performing a subfield multiplication with a single multiply instruction followed by a modular reduction. Due to the special form of p, we may perform this reduction without executing a traditional division algorithm. In order to gain this sort of computational advantage for public-key algorithms with field orders of more than 2^64, we use a field extension of moderate degree m. For example, the choice of p = 2^61 − 1 together with an extension degree of m = 3 results in an OEF with order approximately 2^183. Such a field is desirable in the construction of cryptosystems based on the discrete logarithm problem in elliptic curve groups. In this paper we demonstrate efficient methods to construct such fields, strategies for fast arithmetic in an OEF, and implementation results for an application of this work to elliptic curve cryptosystems.
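For the example p = 2^61 − 1, the division-free reduction can be made concrete: writing the 122-bit product as x1 · 2^61 + x0 and using 2^61 ≡ 1 (mod p), shifts, masks and adds suffice. The following is our own illustration of the idea, with Python integers standing in for machine words:

```python
P61 = (1 << 61) - 1  # p = 2^61 - 1, a Mersenne prime

def mul_mod_p61(a, b):
    """Multiply a, b in GF(2^61 - 1) without a division instruction."""
    x = a * b                   # up to 122-bit intermediate product
    x = (x & P61) + (x >> 61)   # fold the high part down: 2^61 ≡ 1 (mod p)
    x = (x & P61) + (x >> 61)   # a second fold absorbs the carry
    if x >= P61:
        x -= P61                # final conditional subtraction
    return x
```

For a general pseudo-Mersenne prime 2^n − c, the fold step becomes `(x & mask) + c * (x >> n)`, since 2^n ≡ c (mod p).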
3 Previous Work
Previous work on optimization of software implementations of finite field arithmetic has often focused on a single cryptographic application, such as designing a fast implementation for one particular finite field. One popular optimization involves the use of subfields of characteristic two. A paper due to De Win et al. [17] analyzes the use of GF((2^n)^m), with a focus on n = 16, m = 11. This construction yields an extension field with 2^176 elements. The subfield GF(2^16) has a Cayley table of sufficiently small size to fit in the memory of a workstation. Optimizations for multiplication and inversion in such composite fields of characteristic two are described in [3]. Schroeppel et al. [16] report an implementation of an elliptic curve analogue of Diffie-Hellman key exchange over GF(2^155) with an irreducible trinomial as the field polynomial. The arithmetic is based on a polynomial basis representation of the field elements. Elements of the field are each stored in three 64-bit registers. Much optimization work has been done in the selection of Optimal Normal Bases (ONB) to speed computations in GF(2^m). Draft standards such as [18,19] and [9] suggest the use of ONB for elliptic curve systems. Others have investigated the use of pseudo-Mersenne primes to construct Galois fields GF(p) in connection with elliptic curve cryptography, as found in [2,14], and some patents have been issued on their use. Unlike the methods in [17,3], which use Cayley tables to implement subfield arithmetic, our approach requires no additional memory and is therefore attractive in memory-constrained applications. In addition, our system is faster in real-world tests, as described in Section 8.
4 Optimal Extension Field Arithmetic
This section describes the basic construction for arithmetic in fields GF(p^m), of which an OEF is a special case. The subfield is GF(p) and the extension degree is denoted by m, so that the field can be denoted by GF(p^m). This field is isomorphic to GF(p)[x]/(P(x)), where P(x) = x^m + Σ_{i=0}^{m−1} p_i x^i, p_i ∈ GF(p), is a monic irreducible polynomial of degree m over GF(p). In the following, a residue class will be identified with the polynomial of least degree in this class. We consider a standard (or polynomial or canonical) basis representation of a field element A ∈ GF(p^m):

  A(x) = a_{m−1} x^{m−1} + ... + a_1 x + a_0,     (1)
where a_i ∈ GF(p). Since we choose p to be less than the processor's word size, we can represent A(x) with m registers. All arithmetic operations are performed modulo the field polynomial. The choice of field polynomial determines the complexity of the operations required to perform the modular reduction. In this paper, we will only be concerned with the operations of addition, multiplication, and squaring.

4.1 Addition and Subtraction
Addition and subtraction of two field elements are implemented in a straightforward manner by adding or subtracting the coefficients of their polynomial representations and, if necessary, performing a modular reduction by subtracting p once from the intermediate result. Previous implementations in GF(2^n) offer a slight computational advantage, since addition or subtraction is simply an XOR that does not require modular reduction. When compared to the addition operation in GF(p) for large p, we observe that an OEF does not require carries between computer words in computing a sum, while GF(p) does. This property results in a modest performance gain over GF(p).

Algorithm 1 Optimal Extension Field Addition
Require: A(x) = a_{m−1} x^{m−1} + ... + a_1 x + a_0, B(x) = b_{m−1} x^{m−1} + ... + b_1 x + b_0, A(x), B(x) ∈ GF(p^m).
Ensure: A(x) + B(x) ≡ C(x) ∈ GF(p^m)
  for i ← 0 to m − 1 do
    c_i ← a_i + b_i
    if c_i ≥ p then
      c_i ← c_i − p
    end if
  end for
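Algorithm 1 translates directly into code. A minimal sketch, using a Python list for the coefficient vector (the small prime p = 2^13 − 1 is an illustrative choice, not one of the fields proposed in the paper):

```python
def oef_add(a, b, p):
    """Coefficient-wise addition in GF(p^m), following Algorithm 1.
    a, b: lists of m coefficients, each in [0, p)."""
    c = []
    for ai, bi in zip(a, b):
        ci = ai + bi
        if ci >= p:        # at most one subtraction, never a division
            ci -= p
        c.append(ci)
    return c

p = (1 << 13) - 1          # illustrative small pseudo-Mersenne prime
print(oef_add([p - 1, 5, 0], [2, p - 3, 7], p))  # → [1, 2, 7]
```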
4.2 Multiplication
Multiplication is performed in two stages. First, we perform an ordinary polynomial multiplication of two field elements A(x) and B(x), resulting in an intermediate product C'(x) of degree less than or equal to 2m − 2:

  C'(x) = A(x) × B(x) = c'_{2m−2} x^{2m−2} + ... + c'_1 x + c'_0;  c'_i ∈ GF(p).     (2)

The schoolbook method to calculate the coefficients c'_i, i = 0, 1, ..., 2m − 2, requires m^2 multiplications and (m − 1)^2 additions in the subfield GF(p). Since field multiplication is the time critical task in many public-key algorithms, this paper deals extensively with fast multiplication methods, and later sections are devoted to aspects of this operation. In Section 4.4 we present an efficient method to calculate the residue C(x) ≡ C'(x) mod P(x), C(x) ∈ GF(p^m). Section 5 gives a method to quickly perform the coefficient multiplication in GF(p).

4.3 Squaring
Squaring may be implemented using the method for general multiplication outlined above. However, we observe that squaring a field element affords some additional computational efficiencies. For example, consider the field element A(x) = a_2 x^2 + a_1 x + a_0, A(x) ∈ GF(p^3). We compute the square of A(x) and obtain:

  (a_2 x^2 + a_1 x + a_0)^2 = a_2^2 x^4 + 2 a_2 a_1 x^3 + [2 a_2 a_0 + a_1^2] x^2 + 2 a_1 a_0 x + a_0^2     (3)

Multiplication by two may be implemented in a computer as a left shift by one bit. On many computer architectures, a left shift is faster than an explicit integer multiplication. Thus instead of requiring m^2 multiplications, we need only m(m + 1)/2 explicit multiplications; the remainder may be performed as shifts.

4.4 Extension Field Modular Reduction
After performing a multiplication of field elements in a polynomial representation, we obtain the intermediate result C'(x). In general the degree of C'(x) will be greater than or equal to m, in which case we need to perform a modular reduction. The canonical method to carry out this calculation is long polynomial division with remainder by the field polynomial. We observe that the number of subfield multiplications needed to implement the reduction is proportional to the number of terms in the field polynomial. However, if we construct a field polynomial with low coefficient weight, the modular reduction will require fewer subfield multiplications. Since monomials x^m, m > 1, are obviously always reducible, we turn our attention to irreducible binomials. An OEF has by definition a field polynomial of the form:

  P(x) = x^m − ω     (4)
The use of irreducible binomials as field polynomials yields major computational advantages, as will be shown below. Observe that irreducible binomials do not exist over GF(2). In Section 6 we will demonstrate that such irreducible binomials can be constructed. Once such a binomial has been determined, modular reduction can be performed with the following complexity:

Theorem 1. Given a polynomial C′(x) over GF(p) of degree less than or equal to 2m − 2, C′(x) can be reduced modulo P(x) = x^m − ω requiring m − 1 multiplications by ω and m − 1 additions, where both of these operations are performed in GF(p).

Proof. By assumption, C′(x) has the form:

C′(x) = c′_{2m−2} x^{2m−2} + · · · + c′_m x^m + c′_{m−1} x^{m−1} + · · · + c′_1 x + c′_0    (5)
Only the terms c′_{m+i} x^{m+i}, i ≥ 0, must be reduced modulo P(x). We observe that:

c′_{m+i} x^{m+i} ≡ ω c′_{m+i} x^i mod P(x),  i = 0, 1, …, m − 2    (6)
Since the degree of C′(x) is at most 2m − 2, we require at most m − 1 multiplications by ω and m − 1 additions to combine the reduced terms. □

A general expression for the reduced polynomial is given by:

C(x) ≡ c′_{m−1} x^{m−1} + [ω c′_{2m−2} + c′_{m−2}] x^{m−2} + · · · + [ω c′_{m+1} + c′_1] x + [ω c′_m + c′_0] mod P(x)    (7)
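Equation (7) translates directly into code. A sketch in Python (the helper name is ours; coefficients are stored low-to-high as a list):

```python
# Reduction of C'(x) modulo P(x) = x^m - w over GF(p), per Theorem 1:
# m - 1 multiplications by w and m - 1 additions in GF(p).
def reduce_mod_binomial(c, m, w, p):
    """c: coefficients [c'_0, ..., c'_{2m-2}] of C'(x); returns coeffs of C(x)."""
    r = list(c[:m])
    for i in range(len(c) - m):            # i = 0 .. m-2
        r[i] = (r[i] + w * c[m + i]) % p   # one mult by w, one addition
    return r
```

For example, over GF(7) with P(x) = x^3 − 2 (irreducible by Theorem 2), x^4 + x^3 + x^2 + x + 1 reduces to x^2 + 3x + 3.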
As an optimization, when possible we choose those fields with an irreducible binomial x^m − 2, allowing us to implement the multiplications as shifts. OEFs that offer this optimization are known as Type II. A method to search for these Type II OEFs is given in Section 7.
5 Fast Subfield Multiplication
As shown above, fast subfield multiplication is essential for fast multiplication in GF(p^m). Subfield arithmetic in GF(p) is implemented with standard modular integer techniques, which have previously been reported in the literature; see for example [12]. For an actual implementation of OEF arithmetic, optimization of subfield arithmetic is critical to performance, so we include these remarks for completeness. We recall that multiplication of two elements a, b ∈ GF(p) is performed as a × b ≡ c mod p. Modern workstation CPUs are optimized to perform integer arithmetic on operands of size up to the width of their registers. An OEF takes advantage of this fact by constructing subfields whose elements may be represented by integers in a single register. For example, on a workstation with 64-bit registers, the largest prime we may represent is 2^64 − 59. So we choose a prime
478
Daniel V. Bailey and Christof Paar
p ≤ 2^64 − 59 as the field characteristic on this computer. To this end, we recommend the use of Galois fields with subfields as large as possible while still within the single-precision limits of the host CPU. We perform a multiplication of two single-word integers and in general obtain a double-word integer result. In order to finish the calculation, we must perform a modular reduction. Obtaining a remainder after division of two integers is a well-studied problem [12]. Many methods, such as Barrett reduction, exist which offer computational advantages over traditional long division of integers. These methods, however, are still slow when compared to multiplication of single-word integers. Our choice of p allows a far less complex modular reduction operation. It is well known that fast modular reduction is possible with moduli of the form 2^n ± c, where c is a “small” integer. Moduli of this form allow modular reduction without division. We present a form of such a modular reduction algorithm, adapted from [12]. In this paper we consider only primes of the form 2^n − c, although a trivial change to the following algorithm allows the use of primes 2^n + c. The operators << and >> are taken to mean “left shift” and “right shift”, respectively.
Algorithm 2 Fast Subfield Modular Reduction
Require: p = 2^n − c, log_2 c ≤ n/2; x < p^2 is the integer to reduce
Ensure: r ≡ x mod p
  q_0 ← x >> n
  r_0 ← x − (q_0 << n)
  r ← r_0
  i ← 0
  while q_i > 0 do
    q_{i+1} ← (q_i c) >> n
    r_{i+1} ← q_i c − (q_{i+1} << n)
    i ← i + 1
    r ← r + r_i
  end while
  while r ≥ p do
    r ← r − p
  end while
Under these conditions, the algorithm terminates after at most two iterations of the first while loop, so we require at most two multiplications by c, six shifts by n, and six additions and subtractions. In practice, this leads to a dramatic performance increase over performing an explicit division with remainder. For example, when p = 2^32 − 5, m = 5, and we implement subfield reduction by performing an explicit division with remainder on a 500 MHz DEC Alpha CPU, we require 7.74 µsec for a multiplication in GF(p^m). When we perform modular reduction using this algorithm, we require only 1.35 µsec, a fivefold savings.
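A direct Python transcription of Algorithm 2 (the function name is ours; Python's big integers stand in for the multi-word arithmetic a C implementation would use):

```python
# Reduction modulo a pseudo-Mersenne prime p = 2^n - c, per Algorithm 2.
# Assumes log2(c) <= n/2 and x < p**2; 2^n ≡ c (mod p) drives the folding.
def reduce_pseudo_mersenne(x, n, c):
    p = (1 << n) - c
    q = x >> n
    r = x - (q << n)
    while q > 0:
        t = q * c              # one multiplication by the small constant c
        q = t >> n
        r += t - (q << n)      # accumulate the low part of q*c
    while r >= p:
        r -= p
    return r
```

The loop maintains the invariant x ≡ r + q·2^n (mod p); when q reaches 0, only the final conditional subtractions remain.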
If c = 1, this algorithm executes the first while loop only once. In addition, no multiplications are required for the modular reduction, and the entire operation may be performed with 2 shifts and 2 adds if the intermediate result is contained in a single word, a substantial improvement over the c > 1 case. An OEF that offers this optimization is known as Type I. In our implementation as reported in Section 8, we have included p = 2^61 − 1 for this reason. Our implementation takes advantage of its special form, making p = 2^61 − 1 the best-performing choice of p we consider.
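For the Type I case p = 2^61 − 1, the reduction collapses to folding the high bits onto the low bits, as the text describes. A sketch (assuming x < p^2; function name is ours):

```python
# Special case c = 1: p = 2^61 - 1, so 2^61 ≡ 1 (mod p) and reduction needs
# only shifts, masks, and adds -- no multiplications at all.
def reduce_mersenne61(x):
    p = (1 << 61) - 1
    r = (x & p) + (x >> 61)   # fold the high part down
    r = (r & p) + (r >> 61)   # a second fold absorbs any carry
    return r if r < p else r - p
```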
6 Irreducible Binomials
In Section 4.4 we showed that irreducible binomials allow modular reduction with low complexity. The following theorem from [11] describes the cases in which an irreducible binomial exists:

Theorem 2. Let m ≥ 2 be an integer and ω ∈ GF(p). Then the binomial x^m − ω is irreducible in GF(p) if and only if the following two conditions are satisfied: (i) each prime factor of m divides the order e of ω in GF(p), but not (p − 1)/e; (ii) p ≡ 1 mod 4 if m ≡ 0 mod 4.

An important corollary is given in [5]:

Corollary 1. Let ω be a primitive element for GF(p) and let m be a divisor of p − 1. Then x^m − ω is an irreducible polynomial of order (p − 1)m over GF(p).

We present the following new corollary, which follows directly from the above since p − 1 is always an even number:

Corollary 2. Let ω be a primitive element for GF(p). Then x^2 − ω is irreducible over GF(p).

An extension degree of 2 is especially attractive for the implementation of cryptosystems based on hyperelliptic curves, since the field orders required are in the range of 40–120 bits [15]. On a 32-bit or 64-bit architecture, the use of an OEF with m = 2 can form the basis for a very fast hyperelliptic curve implementation. Irreducible binomials do not exist over GF(2). Thus, previous approaches to this problem focusing on GF(2^m) have been unable to use binomials. For an OEF, however, we require p and m such that an irreducible binomial can be constructed. An algorithm to find such choices of p and m is described in Section 7.
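Theorem 2 can be checked mechanically once the order of ω is known. A Python sketch (helper names are ours; p is assumed prime and small enough for trial-division factoring):

```python
# Irreducibility test for x^m - w over GF(p) via the two conditions of
# Theorem 2, assuming p prime.
def prime_factors(k):
    fs, d = set(), 2
    while d * d <= k:
        while k % d == 0:
            fs.add(d); k //= d
        d += 1
    if k > 1:
        fs.add(k)
    return fs

def order(w, p):
    """Multiplicative order of w modulo the prime p."""
    e = p - 1
    for q in prime_factors(p - 1):
        while e % q == 0 and pow(w, e // q, p) == 1:
            e //= q
    return e

def binomial_irreducible(m, w, p):
    e = order(w % p, p)
    cond1 = all(e % q == 0 and ((p - 1) // e) % q != 0 for q in prime_factors(m))
    cond2 = (m % 4 != 0) or (p % 4 == 1)   # p ≡ 1 mod 4 whenever 4 | m
    return cond1 and cond2
```

With ω primitive, condition (i) holds automatically for any m dividing p − 1, which is exactly Corollary 1; m = 2 always qualifies, which is Corollary 2.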
7 Optimal Extension Fields
In the following, we define a new class of finite field, which we call an Optimal Extension Field (OEF). To simplify matters, we introduce a new name for a class of prime numbers:
Definition 1. A pseudo-Mersenne prime is a prime number of the form 2^n ± c, log_2 c ≤ n/2.

We now define an OEF:

Definition 2. An Optimal Extension Field is a finite field GF(p^m) such that: 1. p is a pseudo-Mersenne prime; 2. an irreducible binomial P(x) = x^m − ω exists over GF(p).

We observe that there are two special cases of OEF which yield additional arithmetic advantages, which we call Type I and Type II.

Definition 3. A Type I OEF has p = 2^n ± 1.

A Type I OEF allows for subfield modular reduction with very low complexity, as described in Section 5.

Definition 4. A Type II OEF has an irreducible binomial x^m − 2.

A Type II OEF allows for speedups in extension field modular reduction, since the multiplications by ω in Theorem 1 can be implemented using shifts instead of explicit multiplications. The choice of m depends on the factorization of p − 1, due to Theorem 2 and Corollary 1. In the following we describe an efficient construction method for OEFs. From a very high level, this method consists of three main steps: we choose a pseudo-Mersenne prime p first, then factor p − 1, and finally select an extension degree m. Since p ≤ 2^64 due to current common processor word lengths, it is sufficient to use trial division to quickly factor p − 1. This procedure does not exhaustively list all OEFs; rather, it is designed to quickly locate a Type II OEF for a desired field order and machine word size. Further, this procedure considers only primes 2^n − c, although a prime 2^n + c is a valid choice for an OEF. A high-level outline of our field construction algorithm, which is based on Corollary 1, is given as Algorithm 3. There are other possible values for the order of ω that would lead to a greater number of fields that meet our criteria according to Theorem 2. However, the inclusion of these additional fields comes at the expense of an increase in the complexity of our algorithm.
We found that even with the restriction that ω be a primitive element in our search for fields, there are still enough Type II OEFs to construct fields for any application. Our computational experiments indicate that for n = 32 and n = 64 there are hundreds of fields that satisfy these criteria. Tables of OEFs for all 7 ≤ n ≤ 63 are found in [1]. For example, suppose we wish to construct a field for use on a modern workstation with 64-bit integer arithmetic, for use in an elliptic curve key exchange algorithm. We set n ← 63, c ← 1, low ← 120, high ← 260. Then we apply a probabilistic primality test to the integers 2^n − c, incrementing c by 2 until we locate a prime. Using this method, we discover that p = 2^63 − 259 is
prime. At this point, we factor p − 1 using trial division to obtain the factorization 2^2 × 3^2 × 7 × 107 × 342062455008707 = 9223372036854775548. Given this factorization we can easily perform a primitivity check and find that 2 is a primitive element. Algorithms to compute the order of a group element are well known; see [12]. It remains only to select an extension degree. By trial division, we observe that 2, 3, and 4 all divide p − 1, and thus x^2 − 2, x^3 − 2, and x^4 − 2 are all irreducible binomials over GF(p). These binomials yield the fields GF((2^63 − 259)^2), GF((2^63 − 259)^3), and GF((2^63 − 259)^4), respectively. The approximate orders of these fields are 2^126, 2^189, and 2^252, respectively.
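The worked example can be checked directly. Note that verifying the primitivity of 2 against the stated factorization is, by the Lucas test, also a primality proof for p (assuming the listed factors are themselves prime, as the paper's trial division found):

```python
# Checking the worked example: p = 2^63 - 259, the stated factorization of
# p - 1, and the primitivity of 2 in GF(p).
p = 2**63 - 259
prime_factors_of_p_minus_1 = {2, 3, 7, 107, 342062455008707}

assert 2**2 * 3**2 * 7 * 107 * 342062455008707 == p - 1
assert pow(2, p - 1, p) == 1
assert all(pow(2, (p - 1) // q, p) != 1 for q in prime_factors_of_p_minus_1)
# By Corollary 1, x^m - 2 is irreducible for each m dividing p - 1:
assert all((p - 1) % m == 0 for m in (2, 3, 4))
```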
Algorithm 3 Fast Type II Optimal Extension Field Construction Procedure
Require: n, bit length of the desired p; low, high, bounds on the bit length of the field order
Ensure: p, m define a Type II Optimal Extension Field with field order between 2^low and 2^high
  for c ← 1 to 2^(n/2) do
    p ← 2^n − c
    if p is prime then
      factor p − 1
      if 2 is primitive in GF(p) then
        for m ← ⌈low/n⌉ to ⌊high/n⌋ do
          if m | (p − 1) then
            return p, m
          end if
        end for
      end if
    end if
  end for
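A Python sketch of this search procedure (naive trial-division primality testing, adequate only for small demonstration values of n; helper names are ours):

```python
# Sketch of the Type II OEF search of Algorithm 3: find p = 2^n - c prime
# with 2 primitive in GF(p), and an extension degree m dividing p - 1 such
# that the field order bit length n*m lies in [low, high].
import math

def is_prime(k):
    if k < 2:
        return False
    return all(k % d for d in range(2, math.isqrt(k) + 1))

def prime_factors(k):
    fs, d = set(), 2
    while d * d <= k:
        while k % d == 0:
            fs.add(d); k //= d
        d += 1
    if k > 1:
        fs.add(k)
    return fs

def find_type2_oef(n, low, high):
    for c in range(1, 2 ** (n // 2) + 1):
        p = 2**n - c
        if not is_prime(p):
            continue
        qs = prime_factors(p - 1)
        if any(pow(2, (p - 1) // q, p) == 1 for q in qs):
            continue                      # 2 is not primitive in GF(p)
        for m in range(-(-low // n), high // n + 1):   # ceil(low/n)..floor(high/n)
            if (p - 1) % m == 0:
                return p, m
    return None
```

For n = 5 the search rejects p = 31 (the order of 2 is only 5) and settles on p = 29, m = 2, i.e. GF(29^2) with field polynomial x^2 − 2.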
8 Implementation Results

8.1 Application to Elliptic Curve Cryptography
One of the most important applications of our technique is in elliptic curve cryptosystems, where Galois field arithmetic performance is critical to the performance of the entire system. We show that an OEF yields substantially faster software finite field arithmetic than that previously reported in the literature. We implemented our algorithms on a 500 MHz DEC Alpha workstation in optimized C, resorting to assembly only to perform 64-bit × 64-bit multiplications, since these operations are not directly supported by Digital's C compiler. We executed the Type II OEF construction procedure to find Type II OEFs for the word sizes 8, 16, 32, and 63. These word sizes are representative of the CPUs found in typical applications, although OEFs may be constructed for any
arbitrary word size. For each word size we attempted to construct OEFs with approximately 160, 190, and 240 bit lengths, as such fields are suggested for the implementation of practical elliptic curve systems [18,19]. The OEF construction algorithm from Section 7 found the fields shown in Table 1, with the exception of the fields for an 8-bit word size and the field with p = 2^61 − 1; in both cases, ω = 2 is not primitive in GF(p), and we constructed these cases using Theorem 2. In order to obtain accurate timings, we executed field multiplication in GF(p^m) one million times, observed the execution time, and computed the average. Table 1 shows the results of our field construction and the subsequent timing measurements. For each of our example OEFs, Table 1 lists nm, the approximate bit length of the field order; the prime p; the irreducible binomial; and the time in microseconds to perform a GF(p^m) multiplication. In addition, we provide the estimated time for a single elliptic curve group operation (point addition), for elliptic curve point doubling, and the estimated time for a full point multiplication, using the following assumptions. The elliptic curve addition operation in projective coordinates may be performed with 15 multiplications in GF(p^m), while doubling requires 12 multiplications [10]. We then estimate the time required for an elliptic curve point multiplication, as required in the elliptic curve analogue of Diffie-Hellman key exchange, assuming an implementation using the k-ary window method [6] with k = 4 to speed the repeated doubling and addition operations. Note that in these estimates we ignored the time required to perform additions in the finite field, but also did not employ better point multiplication algorithms such as signed-digit methods [10] and addition chains. Most fields included here are Type II, with the exception of the 8-bit fields and the field GF((2^61 − 1)^3), which is Type I.
This accounts for its very high performance: a field multiplication is performed in 0.52 microseconds. When applied to elliptic curve cryptosystems, this field results in a very fast implementation, requiring only 1.58 milliseconds for a full point multiplication.

8.2 Comparison
We also compared our implementation with three previously reported approaches. For ease of comparison, we report our timing results as measured on a 150 MHz DEC Alpha. The results are found in Table 2. For each implementation, we give the timing for a field multiplication. It can be seen that our OEF GF((2^61 − 1)^3) yields field multiplication speeds more than twice as fast as the best previously reported approach. This is true even though our field has an order of 2^183, whereas the field in [16] has an order of 2^155 and their workstation has a slightly higher clock rate.
Table 1. OEF arithmetic timings on a 500 MHz DEC Alpha

nm    p = 2^n − c    binomial    GF mult   EC add         EC double      point mult.
                     x^m − ω     (µsec)    (µsec, est.)   (µsec, est.)   (msec, est.)
160   2^8 − 15       x^20 − 7    48.3      725            580            130
200   2^8 − 5        x^25 − 6    70.1      1050           841            231
240   2^8 − 15       x^30 − 7    100       1500           1200           392
160   2^16 − 165     x^10 − 2    13.8      207            166            37.1
192   2^16 − 243     x^12 − 2    16.9      253            203            53.7
240   2^16 − 165     x^15 − 2    28.0      420            336            110
160   2^32 − 5       x^5 − 2     1.35      20             16.2           3.62
192   2^32 − 387     x^6 − 2     2.13      32             26             6.85
224   2^32 − 1053    x^7 − 2     3.00      45             36             11.0
183   2^61 − 1       x^3 − 37    0.52      7.8            6.24           1.58
189   2^63 − 259     x^3 − 2     0.87      13             10             2.64
252   2^63 − 259     x^4 − 2     1.49      22             18             6.12
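The estimated EC columns follow from the stated cost model: 15 field multiplications per projective point addition, 12 per doubling, and a k-ary window point multiplication costing roughly l doublings plus l/4 additions for an l-bit multiplier (our approximation of the paper's estimate, which additionally counts window precomputation). Checking the p = 2^32 − 5 row:

```python
# Cost-model check for the p = 2^32 - 5 row of Table 1.
gf_mult_us = 1.35                     # measured field multiplication, usec
ec_add_us = 15 * gf_mult_us           # 20.25, tabulated (rounded) as 20
ec_double_us = 12 * gf_mult_us        # 16.2, matching the table
l = 160                               # approximate bit length of the order
point_mult_ms = (l * ec_double_us + (l // 4) * ec_add_us) / 1000
# point_mult_ms is about 3.4 ms; the tabulated 3.62 ms additionally
# accounts for window precomputation.
```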
Table 2. Comparison of arithmetic performance

Method              Field Size   Field Type     Platform             GF mult (µsec)
De Win [17]         176 bits     GF((2^n)^m)    133 MHz Pentium      62.7
Guajardo-Paar [3]   176 bits     GF((2^n)^m)    175 MHz DEC Alpha    38.6
Schroeppel [16]     155 bits     GF(2^m)        175 MHz DEC Alpha    7.1
OEF                 183 bits     GF(p^m)        150 MHz DEC Alpha    3.3

9 Conclusion
In this paper we have introduced a class of finite fields, known as Optimal Extension Fields, which take advantage of well-known optimizations for finite field arithmetic on the microprocessors commonly found in workstations. OEFs are especially attractive for use in elliptic curve and hyperelliptic curve systems. The arithmetic speedups are due to the inherent properties of an OEF. An OEF may be constructed with a subfield close to the word size of the host CPU. The field
characteristic of an OEF is a pseudo-Mersenne prime, that is, of the form 2^n ± c for small c, allowing fast subfield modular reduction. The extension degree of an OEF always allows for an irreducible binomial. Finally, where possible, the field polynomial of an OEF is chosen to be the binomial x^m − 2. In real-world demonstrations, we have shown that an OEF yields a considerable speed advantage over previous software implementations of Galois field arithmetic for elliptic curve cryptography.
References

1. Daniel V. Bailey. Optimal extension fields. Major Qualifying Project (Senior Thesis), Computer Science Department, Worcester Polytechnic Institute, Worcester, MA, USA, 1998.
2. Richard E. Crandall. Method and apparatus for public key exchange in a cryptographic system. US Patent 5463690, 1995.
3. Jorge Guajardo and Christof Paar. Efficient algorithms for elliptic curve cryptosystems. In Advances in Cryptology — CRYPTO '97, pages 342–356. Springer Lecture Notes in Computer Science, August 1997.
4. G. Harper, A. Menezes, and S. Vanstone. Public-key cryptosystems with very small key lengths. In Advances in Cryptology — EUROCRYPT '92, pages 163–173, May 1992.
5. D. Jungnickel. Finite Fields. B.I.-Wissenschaftsverlag, Mannheim, Leipzig, Wien, Zürich, 1993.
6. D. E. Knuth. The Art of Computer Programming. Volume 2: Seminumerical Algorithms. Addison-Wesley, Reading, Massachusetts, 2nd edition, 1981.
7. N. Koblitz. Elliptic curve cryptosystems. Mathematics of Computation, 48:203–209, 1987.
8. N. Koblitz. Hyperelliptic cryptosystems. Journal of Cryptology, 1(3):129–150, 1989.
9. J. Koeller, A. Menezes, M. Qu, and S. Vanstone. Elliptic Curve Systems. Draft 8, IEEE P1363 Standard for RSA, Diffie-Hellman and Related Public-Key Cryptography, May 1996. Working document.
10. Kenji Koyama and Yukio Tsuruoka. Speeding up elliptic cryptosystems by using a signed binary window method. In CRYPTO '92. Springer Lecture Notes in Computer Science, 1992.
11. R. Lidl and H. Niederreiter. Finite Fields, volume 20 of Encyclopedia of Mathematics and its Applications. Addison-Wesley, Reading, Massachusetts, 1983.
12. A. J. Menezes, P. C. van Oorschot, and S. A. Vanstone. Handbook of Applied Cryptography. CRC Press, 1997.
13. V. Miller. Uses of elliptic curves in cryptography. In Lecture Notes in Computer Science 218: Advances in Cryptology — CRYPTO '85, pages 417–426. Springer-Verlag, Berlin, 1986.
14. Atsuko Miyaji and Makoto Tatebayashi. Method for generating and verifying electronic signatures and privacy communication using elliptic curves. US Patent 5442707, 1995.
15. S. Paulus. Ein Algorithmus zur Berechnung der Klassengruppe quadratischer Ordnungen über Hauptidealringen. PhD thesis, Institute for Experimental Mathematics, University of Essen, Essen, Germany, June 1996.
16. R. Schroeppel, H. Orman, S. O'Malley, and O. Spatscheck. Fast key exchange with elliptic curve systems. In Advances in Cryptology — CRYPTO '95, pages 43–56, 1995.
17. E. De Win, A. Bosselaers, S. Vandenberghe, P. De Gersem, and J. Vandewalle. A fast software implementation for arithmetic operations in GF(2^n). In Asiacrypt '96. Springer Lecture Notes in Computer Science, 1996.
18. ANSI X9.62-199x. The Elliptic Curve Digital Signature Algorithm. Draft, January 1998. Working document.
19. ANSI X9.63-199x. Elliptic Curve Key Agreement and Key Transport Protocols. Draft, January 1998. Working document.
Time-Stamping with Binary Linking Schemes

Ahto Buldas¹, Peeter Laud², Helger Lipmaa¹, and Jan Villemson²

¹ Cybernetica, Akadeemia 21, EE0026 Tallinn, Estonia
² Cybernetica Tartu Lab, Lai 36, EE2400 Tartu, Estonia
{ahtbu,peeter,helger,jan}@cyber.ee

Abstract. We state the basic requirements for time-stamping systems applicable as the necessary support to the legal use of electronic documents. We analyze the main drawbacks of the time-stamping systems proposed to date and present a new system that meets all the stated requirements. We prove that these requirements cannot be significantly tightened.
1 Introduction
Time-stamping ([HS91], [BdM91], [BHS92]) is a set of techniques enabling us to ascertain whether an electronic document was created or signed at a certain time. The real importance of time-stamping becomes clear when there is a need for legal use of electronic documents with a long lifetime. Without time-stamping we can neither trust signed documents when the cryptographic primitives used for signing have become unreliable, nor solve the cases when the signer himself repudiates the signing, claiming that he accidentally lost his signature key. During recent years, especially in the context of the legal regulation of using digital signatures, the organizational and legal aspects of time-stamping itself have become the subject of world-wide attention. In addition to defining the responsibilities of the owner of the signature, the duties and responsibilities of the third party (Time-Stamping Service, TSS) must be stated as well. Hence, there is an increasing interest in time-stamping systems where the need to trust the TSS is minimized. In order to make users liable only for their own mistakes, there has to be a possibility to ascertain the offender. Unlike physical objects, digital documents do not comprise the seal of time. Thus, associating an electronic document uniquely with a certain moment of time is very complicated, if not impossible. Even by the theory of relativity, no absolute time exists. The best we can achieve with time-stamping is relative temporal authentication (RTA), based on the complexity-theoretic assumption of the existence of collision-resistant one-way hash functions. RTA enables a verifier given two time-stamped documents to determine which of the two was created earlier. The main drawbacks of the time-stamping systems proposed to date concern (1) the need to unconditionally trust the TSS and (2) the time-complexity of RTA, which is linear in the number of issued time-stamps.

H. Krawczyk (Ed.): CRYPTO'98, LNCS 1462, pp. 486–501, 1998.
© Springer-Verlag Berlin Heidelberg 1998
In the current paper theoretical and practical requirements are discussed and a new time-stamping system is presented (1) in which the need to trust the TSS is significantly diminished and (2) which offers RTA with the complexity proportional to the logarithm of the number of issued time-stamps. In Sect. 2 the time-stamping solutions proposed to date are analyzed. Sect. 3 clarifies the security objectives of time-stamping by giving essential requirements to the time-stamping systems. In Sect. 4 the protocols of the new time-stamping system are described using the linear linking scheme. In Sect. 5 binary linking schemes are introduced and a scheme with logarithmic verifying time is presented. In Sect. 6 we prove that the requirements stated in Sect. 3 cannot be tightened.
2 Existing Time-Stamping Systems
By a simple time-stamping protocol ([HS91], Sect. 4), the TSS appends the current time t to the submitted document X, signs the composite document (t, X), and returns the two values t and s = sig_TSS(t, X) to the client. The weaknesses of this scheme are the unreliability of old time-stamps after a possible leakage of the signature key of the TSS, and the impossibility of verifying whether s was actually issued at the time t stated in the time-stamp, implying that the TSS has to be unconditionally trusted. Because of these drawbacks it has been widely accepted that a secure time-stamping system cannot rely solely on keys or on any other secret information. An overview of the existing time-stamping solutions is given in [MQ97].

2.1 Linear Linking Scheme (LLS)
In order to diminish the need for trust, the users may demand that the TSS link all time-stamps together into a chain using a collision-resistant hash function H, as proposed in [HS91], Sect. 5.1 (variant 1). In this case the time-stamp for the n-th submitted document X_n is s = sig_TSS(n, t_n, ID_n, X_n, L_n), where t_n is the current time, ID_n is the identifier of the submitter, and L_n is the linking information defined by the recursive equation

L_n := (t_{n−1}, ID_{n−1}, X_{n−1}, H(L_{n−1})).

There are several complications with the practical implementation of this scheme. First, the number of steps needed to verify the one-way relationship between two time-stamps is linear in the number of time-stamps between them. Hence, a single verification may be as costly as creating the whole chain. This solution also has impractical trust and broadcast requirements, as was pointed out already in [BdM91]. A modification was proposed in [HS91] (Sect. 5.1, variant 2) where every time-stamp is linked with k > 1 time-stamps directly preceding it. This variation decreases the broadcast requirements by increasing the space needed to store individual time-stamps.
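The linear linking scheme can be sketched in a few lines (SHA-256 standing in for H; the TSS's signature and the exact encoding are simplified placeholders, not the paper's protocol messages):

```python
# Sketch of the linear linking scheme: each stamp hashes in the previous
# linking information, so later stamps one-way depend on earlier ones.
import hashlib

def H(*parts):
    h = hashlib.sha256()
    for p in parts:
        h.update(str(p).encode())
    return h.hexdigest()

class LinearLinkTSS:
    def __init__(self):
        self.prev = ("-", "-", "-", H("genesis"))  # seed (t, ID, X, H(L))
        self.n = 0

    def stamp(self, t, client_id, doc):
        self.n += 1
        L = self.prev                        # L_n built from record n-1
        record = (self.n, t, client_id, doc, L)
        self.prev = (t, client_id, doc, H(L))
        return record                        # the TSS would sign this tuple
```

Verifying the one-way relationship between two stamps means re-hashing every intermediate link, which is exactly the linear cost the text criticizes.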
2.2 Tree-Like Schemes
Two similar tree-like schemes have been proposed [BdM91,BHS92]. In the Haber–Stornetta scheme [BHS92,HS97], the time-stamping procedure is divided into rounds. The time-stamp R_r for round r is a cumulative hash of the time-stamp R_{r−1} for round r − 1 and of all the documents submitted to the TSS during round r. After the end of the r-th round a binary tree T_r is built. Every participant P_i who wants to time-stamp at least one document in this round submits to the TSS a hash y_{r,i}, which is a hash of R_{r−1} and of all the documents he wants to time-stamp in this round. The leaves of T_r are labeled by the different y_{r,i}. Each inner node k of T_r is recursively labeled by H_k := H(H_{k_L}, H_{k_R}), where k_L and k_R are, respectively, the left and right child nodes of k, and H is a collision-resistant hash function. The TSS has to store only the time-stamps R_r for the rounds (Fig. 1). All the remaining information required to verify whether a certain document was time-stamped during a fixed round is included in the individual time-stamp of the document.
Fig. 1. An example of the time-stamp for round r by the schemes presented in [BdM91] (left) and [BHS92] (right).

For example, the individual time-stamp for y_{r,3} is [r; (y_{r,4}, L), (H_4, R)]. The verifying procedure for the time-stamp of y_{r,3} consists of verifying the equality R_r = H(H(H_4, H(y_{r,3}, y_{r,4})), R_{r−1}). Here, the size of a single time-stamp is logarithmic with respect to the number of participants submitting their documents to the TSS during the current round. The Haber–Stornetta linking scheme [BHS92,HS97] differs slightly from the Benaloh–de Mare scheme [BdM91]. Here, the time-stamp R_n for the n-th round is linked directly to R_{n−1}, enabling the verifier to check one-way dependencies between the R_i without examining the individual time-stamps of the submitted documents. This is impossible in the Benaloh–de Mare scheme. However, in the Haber–Stornetta scheme the individual time-stamps in the n-th round are not linked to the time-stamp R_{n−1} for the previous round. These schemes are feasible but provide RTA for documents issued during the same round only if we unconditionally trust the TSS to maintain the order of time-stamps in T_r. Therefore, this method either increases the need for trust or otherwise limits the maximum temporal duration of rounds to the
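The verification equality above is an authentication-path check against the published round values. A sketch (SHA-256 for H; the side marker records the position of the running hash, as in the stamp for y_{r,3}):

```python
# Round-hash construction and individual-stamp verification for the
# tree-like scheme (power-of-two number of leaves assumed for simplicity).
import hashlib

def H(a, b):
    return hashlib.sha256((a + b).encode()).hexdigest()

def round_hash(leaves, prev_round):
    """R_r: hash the binary tree over the leaves, then chain in R_{r-1}."""
    level = leaves[:]
    while len(level) > 1:
        level = [H(level[i], level[i + 1]) for i in range(0, len(level), 2)]
    return H(level[0], prev_round)

def verify(leaf, path, prev_round, expected_round_hash):
    """path entries are (sibling, side); side is where the running hash goes."""
    h = leaf
    for sibling, side in path:
        h = H(h, sibling) if side == "L" else H(sibling, h)
    return H(h, prev_round) == expected_round_hash
```

With leaves y_{r,1..4}, the stamp for y_{r,3} carries only its sibling y_{r,4} and the node H_4, mirroring the logarithmic stamp size noted in the text.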
insignificant units of time (one second in the Digital Notary system). However, if the number of documents submitted during a round is too small, the expenses of time-stamping a single document may become unreasonably large (Sect. 3.3).
3 Security Objectives
In the following we give a definition of time-stamping systems applicable in legal situations. Later we will justify our approach and compare it to older systems. A time-stamping system consists of a set of principals including the time-stamping server (TSS), together with a triple (S, V, A) of protocols. The stamping protocol S allows each participant to post a message. The verification protocol V is used by a principal holding two time-stamps to verify the temporal order between them. The audit protocol A is used by a principal to verify whether the TSS carries out his duties. Additionally, no principal (in particular, the TSS) should be able to produce fake time-stamps without being caught. A time-stamping system has to be able to handle time-stamps which are anonymous and do not reveal any information about the content of the stamped data. The TSS is not required to identify the initiators of time-stamping requests. Our notion of a time-stamping system differs from the one given in, e.g., [BdM91] in several important aspects. Below we motivate the differences.

3.1 Relative Temporal Authentication
The main security objective of time-stamping is temporal authentication [Jus98]: the ability to prove that a certain document was created at a certain moment of time. Although the creation of a digital data item is an observable event in the physical world, the moment of its creation cannot be ascertained by observing the data itself (moreover, no such thing as absolute time exists). The best one can do is to check the relative temporal order of the created data items (i.e., prove the RTA) using one-way dependencies defining the arrow of time, analogous to the way in which the growth of entropy defines the arrow of time in the physical world ([Haw88], Chap. 9). For example, if H is a collision-resistant one-way hash function, one can reliably use the following "rough" derivation rule: if H(X) and X are known to a principal P at a moment t, then someone (possibly P himself) used X to compute H(X) at a moment prior to t. To date, the existence of one-way functions has not been proved. Therefore, the proposed time-stamping systems make sense only under the hypothesis of the existence of collision-resistant one-way hash functions.

Definition 1. A collision-resistant one-way hash function ([MOV96], Sect. 9.2) is a function H which has the properties of compression, ease of computation, preimage resistance, 2nd-preimage resistance, and collision resistance.

Definition 2. Let ρ be a binary relation on ℕ, such that x ρ y implies x < y, and let H be a collision-resistant one-way hash function. A (ρ, H)-linking scheme is
a procedure to link a family (H_n) of data items together using auxiliary linking items L_n satisfying the recursive formula

L_n := H(H_n, L_{n_1}, …, L_{n_{#ρ⁻¹(n)}}) ,    (1)
where n_1 ≥ · · · ≥ n_{#ρ⁻¹(n)} are exactly the elements of ρ⁻¹(n) := {m | m ρ n} (the preimage of n under ρ). A sequence (m_i)_{i=1}^ℓ, where m_i ρ m_{i+1}, is called a verifying chain between m_1 and m_ℓ of length ℓ. In the context of time-stamping, H_n = H(n, X_n), where X_n denotes the n-th time-stamped document. The linking item L_n is also referred to as the time-stamp of X_n. Note that a one-way relationship between L_n and L_m (n < m) does not prove that at the moment of creating X_n the bit-string X_m did not exist. All we know is that X_n existed at the moment of creating L_m. We have omitted t_n from the formula for H_n, because it should not be taken for granted that the value t_n indeed represents the submission time of X_n. The only way for a principal to associate a time-stamp with a certain moment of time is to time-stamp a nonce at that moment. By a nonce we mean a sufficiently long random bit-string, such that the probability that it has already been time-stamped is negligible. In order to verify the absolute creation time of a document time-stamped by another principal, the verifier has to compare the time-stamp with the time-stamps of nonces generated by the verifier herself. In this solution there are no supplementary duties for either the TSS or the other principals. The use of nonces illustrates the similarity between time-stamping and ordinary authentication protocols, where nonces are used to prevent the possible reuse of old messages from previous communications. By using RTA it is possible to determine not only the submission time of a signature but also the time of signing the document. Before signing a document X, the principal P generates a nonce N and time-stamps it. He then includes the time-stamp L(N) of N in the document, signs it, and obtains the time-stamp L(σ) of the signature σ = sig_P(L(N), X). From the viewpoint of the TSS these stamping events are identical (he need not be aware whether he is time-stamping a nonce or meaningful data).
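The sign-with-nonce procedure can be sketched as follows (tss_stamp and sign are hypothetical stand-ins for the TSS protocol and the signer's signature algorithm, not real implementations):

```python
# Sketch: the signature is sandwiched between two time-stamps, bounding
# the signing time from both sides.
import hashlib, secrets

def tss_stamp(data):                       # placeholder for the TSS protocol
    return hashlib.sha256(b"stamp:" + data).hexdigest()

def sign(key, data):                       # placeholder for sig_P
    return hashlib.sha256(key + data).hexdigest()

def sign_with_time_bounds(key, doc):
    N = secrets.token_bytes(32)            # fresh nonce
    L_N = tss_stamp(N)                     # lower bound on signing time
    sigma = sign(key, L_N.encode() + doc)  # sigma = sig_P(L(N), X)
    L_sigma = tss_stamp(sigma.encode())    # upper bound on signing time
    return N, L_N, sigma, L_sigma
```

The one-way chain L(N) → σ → L(σ) is what lets a verifier place the signing event between the issuance moments of the two stamps.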
For the verification of the document X, the verifier has to compare both these time-stamps with the time-stamps trusted by her. As there are one-way dependencies between L(N), σ and L(σ), the verifier may conclude that the signature was created in the time-frame between the moments of issuance of L(N) and of L(σ), respectively. If these moments are close enough, the signing time can be ascertained with the necessary precision.

3.2 Detection of Forgeries
A time-stamping system must have properties enabling users to verify whether an arbitrary time-stamp is correct or not. Possession of two documents with corresponding time-stamps is not enough to prove the RTA between the documents, because anyone is able to produce fake chains of time-stamps.
Time-Stamping with Binary Linking Schemes
491
A time-stamping system should allow (1) to determine whether the time-stamps possessed by an individual have been tampered with; and (2) in the case of tampering, to determine whether the time-stamps were tampered with by the TSS or tampered with after the issuing (generally by unknown means). In the second case, there is no one to bring an action against. The principals interested in legal use of time-stamps should themselves verify their correctness immediately after the issuing (using signatures and other techniques discussed later), because if the signature of the TSS becomes unreliable, the signed time-stamps cannot be used as evidence. In order to increase the trustworthiness of the time-stamping services, it should be possible for the clients to periodically inspect the TSS. Also, in the case when the TSS is not guilty, he should have a mechanism to prove his innocence, i.e., that he has not issued a certain time-stamp during a certain round. Additionally, the TSS must publish regularly, in an authenticated manner, the time-stamps for rounds [BdM91] in mass media. If the time-stamping protocol includes (by using collision-resistant one-way hash functions) (1) the message digest of any time-stamp issued during the r-th round into the time-stamp for the r-th round, and (2) the message digest of the time-stamp for round r − 1 into any time-stamp issued during the r-th round, it will be intractable for anyone to undetectably forge a time-stamp. The forgery detection procedures should be simple. Forgeries should be determinable either during the stamping protocol (when the time-stamp, signed by the TSS, fails to be correct) or later, when it is impossible to establish the temporal order between two otherwise correct time-stamps (see Sect. 4 for details).

3.3 Feasibility Requirements
The time-stamping systems of [BdM91] and [HS97] use nonlinear partial ordering of time-stamps and therefore do not support RTA. Sect. 4 shows how to modify the linear linking scheme ([HS91], Sect. 5.1) to fulfill the security objectives (RTA and detection of forgeries). On the other hand, in practice the detection of forgeries in this scheme would take too many steps. As noted in [Jus98], it is easy to forge time-stamps when we can assume that the verifier has limited computational power. This leads us to the question of feasibility. In order to make RTA feasible in the case when time-stamps belong to different rounds, it is reasonable to define an additional layer of links between the time-stamps for rounds.

Definition 3. Assume we are given (ρ, H) and (δ, H) linking schemes and a monotonically increasing function ξ : ℕ → ℕ. By a (ρ, ξ, δ, H)-linking scheme we mean a procedure for linking a family (H_n) of data items together using auxiliary linking items L_n and L_r satisfying the recursive formulae

L_n := H(H_n, L_{n_1}, . . . , L_{n_{#ρ⁻¹(n)}})   if n ∉ ξ(ℕ) ,
L_r := L_{ξ(r)} = H(H_r, L_{r_1}, . . . , L_{r_{#δ⁻¹(r)}}) ,
H_r := H(H_m, L_{m_1}, . . . , L_{m_{#ρ⁻¹(m)}}) ,
Ahto Buldas et al.
where m = ξ(r), ρ⁻¹(n) = {m_1, . . . , m_{#ρ⁻¹(n)}} (m_1 ≥ . . . ≥ m_{#ρ⁻¹(n)}) and δ⁻¹(r) = {r_1, . . . , r_{#δ⁻¹(r)}} (r_1 ≥ . . . ≥ r_{#δ⁻¹(r)}). The values L_r are also referred to as the time-stamps for rounds. Note that the time-stamps requested from the TSS during the verification protocol should belong to the set of time-stamps for rounds, because only these time-stamps are available in the time-stamping server.

Definition 4. A (ρ, ξ, δ, H)-linking scheme is said to be an Accumulated Linking Scheme (ALS) with rank m if
1. ξ(r) < n ≤ ξ(r + 1) implies ρ⁻¹(n) ⊂ [ξ(r), ξ(r + 1)] ∪ ξ(ℕ);
2. ξ(r + 1) − ξ(r) ≥ m.

We say that a (ρ, H)-linking scheme enables accumulated time-stamping if for arbitrary positive m there exists ξ such that the (ρ, ξ, ρ, H)-scheme is an ALS with rank m. If the linking scheme used enables accumulated time-stamping, the duration of the rounds can be flexibly enlarged in order to guarantee that only a negligible fraction of the time-stamps is kept in the memory of the time-stamping server. Let n be the total number of time-stamps issued up to the moment of the current run of the stamping/verification protocol. The feasibility requirements can be summarized as follows:
1. The number of evaluations of the hash function during the verification protocol should be O(log n). In particular, the number of time-stamps examined during a single run of the verification protocol should be O(log n).
2. There should be a conveniently small upper bound on the length of rounds, since the clients want to get their time-stamps in reasonable time. It seems sensible to require that the stamping protocol of the n-th document must terminate before the TSS has received an additional O(log n) time-stamp requests. In real applications it is desirable for the average length of rounds to be constant (this would guarantee that for an arbitrary constant c there would be only a negligible fraction of rounds with length greater than c).
3. The size of an individual time-stamp should be small.

As we will show later (Thm. 2), there is a trade-off between these quantities. In Sect. 5 and the following sections we present an improvement of the scheme of Sect. 4.
4 First Version of Our System: Linear Linking
For pedagogical reasons, we outline the protocols and the basic organizational principles of our system using the linear linking scheme. This scheme fulfills all the trust requirements but is impractical. Later, the described scheme is significantly improved by replacing the linear scheme with a binary linking scheme. Let the number M of time-stamps per round be a constant known to the participants (clients) and let all the data items X_n be of fixed size. Therefore, in the case of the linear linking scheme, the time-stamp for the r-th round has number ξ_r = M · r.
4.1 Role of the TSS
The TSS maintains the following three databases:
1. the database Dc of the time-stamps of the current round;
2. the database Dp of the time-stamps of the previous round;
3. the database Dr of the time-stamps for rounds.

These databases are considered to be on-line in the sense that any client can make requests to them at any moment. The fourth database (the complete database of time-stamps) is also stored but not on-line (it may be stored in an archive of CDs). Requests to this database are possible, but costly (e.g., requiring human interaction). After the end of each round, the time-stamps in Dp are stored to a separate CD (this process may be audited). Thereafter, Dp is emptied. The time-stamp R_r for the current round is computed, added to Dr and published in a newspaper (two processes which should be audited). The database Dc is copied into Dp and a new database Dc is created.

4.2 Stamping Protocol
Suppose the current round number is r.
1. The client sends X_n to the TSS.
2. The TSS finds H_n = H(n, X_n) and L_n = H(H_n, L_{n−1}), and adds the pair (H_n, L_n) to Dc.
3. The TSS signs the pair (n, L_n) and sends (n, L_n, sig_TSS(n, L_n)) back to the client.
4. The TSS sends the tuple head(n) = (H_{n−1}, H_{n−2}, . . . , H_{ξ_{r−1}+1}) to the client.
5. The client verifies the signature of the TSS and checks whether

H(H_n, H(H_{n−1}, . . . H(H_{ξ_{r−1}+1}, L_{ξ_{r−1}}) . . . )) = L_n ,   (2)

where the true values L_{ξ_i} can be found either from the newspaper or by requesting their values from the on-line database Dr of the TSS.

After the M requests have been answered, the TSS finishes the round by finding L_{ξ_r} = H(H′_{ξ_r}, L_{ξ_{r−1}}) (where H′_{ξ_r} = H(H_{ξ_r}, L_{ξ_r − 1})) and publishing L_{ξ_r} and his public key K_TSS in the newspaper. The client may now continue the protocol, during a limited period, in order to get the complete individual time-stamp for X_n.

6. The client sends a request to the TSS.
7. Let tail(n) = (H_{ξ_r − 1}, H_{ξ_r − 2}, . . . , H_{n+2}, H_{n+1}). The TSS answers by sending (tail(n), sig_TSS(tail(n))) to the client.
8. The client checks whether

L_{ξ_r} = H(H_{ξ_r − 1}, H(H_{ξ_r − 2}, . . . H(H_{n+2}, H(H_{n+1}, L_n)) . . . )) .   (3)
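A sketch of the linear chain and the client-side check (2), with SHA-256 standing in for H, byte strings for the documents, and the previous round boundary fixed at 0 (all names and parameters here are illustrative assumptions):

```python
import hashlib

def H(*parts: bytes) -> bytes:
    h = hashlib.sha256()
    for p in parts:
        h.update(p)
    return h.digest()

# Build a linear chain L_n = H(H_n, L_{n-1}) over a few documents.
docs = [b"doc%d" % i for i in range(1, 8)]
Hs = {n: H(str(n).encode(), docs[n - 1]) for n in range(1, 8)}  # H_n = H(n, X_n)
L = {0: b"\x00" * 32}                                           # plays L_{xi_{r-1}}
for n in range(1, 8):
    L[n] = H(Hs[n], L[n - 1])

def check_head(n, head, L_prev):
    """Eq. (2): fold head(n) = (H_{n-1}, ..., H_{xi_{r-1}+1}) onto L_{xi_{r-1}}
    and compare the result with L_n."""
    acc = L_prev
    for h in reversed(head):      # head is listed from H_{n-1} downwards
        acc = H(h, acc)
    return H(Hs[n], acc) == L[n]

head5 = [Hs[4], Hs[3], Hs[2], Hs[1]]   # head(5) when xi_{r-1} = 0
```

Check (3) is the same fold continued from L_n up to the round stamp, so a tampered element anywhere in head(n) or tail(n) makes the comparison fail.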
Definition 5. The complete individual time-stamp s_n for the n-th document is

s_n := (tail(n), head(n), n, L_n, sig_TSS(n, L_n)) .

Every client who is interested in the legal use of a time-stamp should validate it during the stamping protocol. In the relatively short periods between the 1st and the 3rd step and between the 4th and the 6th step, the signature key of the TSS is trusted to authenticate him, and therefore his signature on an invalid head(n) or tail(n) can be used as evidence in court. The client is responsible for doing this while the signature key of the TSS can still be trusted. Later, the signature of the TSS may become unreliable and therefore only the one-way properties can be used.

4.3 Verification Protocol
Let r(n) denote the round in which s_n was issued. Assume the verifier has two time-stamped documents (X_m, s_m) and (X_n, s_n), where m < n.
1. The verifier checks the validity of equations (2) and (3) for both time-stamps.
2. If r(m) = r(n), then the data held in tail(m) and head(n) are enough to check whether L_n = H(H_n, H(H_{n−1}, . . . H(H_{m+1}, L_m) . . . )).
3. If r(m) < r(n), the verifier sends a request to the TSS.
4. The TSS answers by sending the tuple v_mn = (H′_{ξ_{r(n)−1}}, H′_{ξ_{r(n)−2}}, . . . , H′_{ξ_{r(m)}}) and the signature sig_TSS(v_mn) to the verifier.
5. The verifier validates the signature, finds L_{ξ_{r(m)}} using (3), finds L_{r(n)−1} using the formula

L_{r(n)−1} = H(H′_{ξ_{r(n)−1}}, H(H′_{ξ_{r(n)−2}}, . . . H(H′_{ξ_{r(m)}}, L_{ξ_{r(m)}}) . . . ))

and finally compares the value of L_n in s_n with the value given by (2).

4.4 Audit Protocol
Because of the possible legal importance of the time-stamps issued by the TSS, there should be some mechanism to audit the TSS. One easy way to do so is to periodically request time-stamps from the TSS and verify them. If these time-stamps are linked inconsistently (i.e., Eqs. (2) and (3) hold for both time-stamps but the verification protocol fails), the TSS can be proven to be guilty. Also, there has to be a mechanism for the TSS to prove that he has not issued a certain time-stamp S in a certain round r. This can be done if the TSS presents all the time-stamps issued during the r-th round, shows that S is not among them, and shows that the time-stamp for the r-th round, found by using these time-stamps and the linking rules, coincides with the published time-stamp.
5 Binary Linking Schemes
In the current section we give a construction of a practical linking scheme with a logarithmic upper bound on the length of the shortest verifying chain between any two time-stamps.

Definition 6. Let f and g be functions from ℕ to ℕ satisfying the condition f(n) ≤ g(n) < n for any n. An (f, g, H)-binary linking scheme (BLS) is a (ρ, H)-linking scheme where for any n, ρ⁻¹(n) = {f(n), g(n)}.

In order to guarantee the existence of a verifying chain between arbitrary x and y, we have to take g(n) := n − 1. In those cases we omit n − 1 and talk just about an (f, H)-BLS. A binary linking scheme can alternatively be defined as a directed countable graph which is connected, contains no cycles and where all the vertices have two outgoing edges (links). Let us construct an infinite family of such graphs T_k in the following way:
1. T_1 consists of a single vertex which is labeled with the number 1. This vertex is both the source and the sink of the graph T_1.
2. Let T_k be already constructed. Its sink is labeled by 2^k − 1. The graph T_{k+1} consists of two copies of T_k, where the sink of the second copy is linked to the source of the first copy, and an additional vertex labeled by 2^{k+1} − 1 which is linked to the source of the second copy. Labels of the second copy are increased by 2^k − 1. The sink of T_{k+1} is equal to the sink of the first copy; the source of T_{k+1} is equal to the vertex labeled by 2^{k+1} − 1. Thereafter, link all the vertices of the second copy which have less than two outgoing links to the source of the first copy. Note that there is now a double link from the sink of the second copy to the source of the first copy.
[Figure: the graph T_{k+1}, with source 2^{k+1} − 1, built from two copies of T_k.]
The sequence (T_k) defines a binary linking scheme with the vertices labeled by natural numbers which contains each scheme T_k as its initial segment. After the construction of this binary linking scheme, add links from the sources of all such initial segments to a special vertex labeled by 0 (Fig. 2). Here (see also Rem. 1), f(n) = n − 2^{h(n)} + 1, where h(n) is given recursively by the equation

h(n) = { k ,                    if n = 2^k − 1 ,
       { h(n + 1 − 2^{k−1}) ,   if 2^{k−1} ≤ n < 2^k − 1 .

Theorem 1. Let ℓ(a, b) be the length of the shortest verifying chain from b to a. If k > 2 and 0 < a ≤ b < 2^k, then ℓ(a, b) ≤ 3k − 5. (See Appendix A.)
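The recursion for h(n) and the link function f(n) = n − 2^{h(n)} + 1 translate directly into code; the asserted values agree with the links visible in the figures below (e.g. vertex 10 links to 7 and 9, and the sources of T_2, T_3, T_4 link to 0):

```python
def h(n: int) -> int:
    """h(n) = k if n = 2^k - 1, else h(n + 1 - 2^(k-1)) for 2^(k-1) <= n < 2^k - 1."""
    k = n.bit_length()              # the unique k with 2^(k-1) <= n < 2^k
    if n == (1 << k) - 1:
        return k
    return h(n + 1 - (1 << (k - 1)))

def f(n: int) -> int:
    # First link of vertex n in the binary linking scheme (the second link is n - 1).
    return n - (1 << h(n)) + 1
```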
In Sect. 4 we presented an outline of a time-stamping system that fulfills our trust requirements. In the following we show how to make this system feasible by using a BLS.
[Figure: the vertices 0–31 of T_5 with round boundaries ξ_0 = 0, ξ_1 = 7, ξ_2 = 15, ξ_3 = 22, ξ_4 = 31.]
Fig. 2. The ALS structure built on T_5 with m = 7.

In order to issue the individual time-stamp for the n-th document, the TSS has to find the shortest verifying chains between ξ_{r(n)−1} and n and between n and ξ_{r(n)}. The n-th individual time-stamp consists of the minimal amount of data (Sect. 4.2) necessary to verify the mutual one-way dependencies between all the L_j which lie on these chains. It can be shown that if f satisfies the implication

m > n ⇒ (f(m) ≤ f(n) ∨ f(m) ≥ n)   (4)

then (f, H) enables accumulated time-stamping (the proof has been omitted because of its technicality). In particular, the binary linking scheme described in Sect. 5 enables accumulated time-stamping. For a fixed m let k := ⌈log₂ m⌉, ξ_0 := 0, ξ_1 := 2^k − 1 (the source of T_k) and, for arbitrary i > 1,

ξ(i) := { ξ_{2^j} + ξ_{i−2^j} ,   if i ≠ 2^j ,
        { 2 · ξ_{i/2} + 1 ,       if i = 2^j ,

where j := ⌊log₂ i⌋. The length of the n-th time-stamp in this scheme does not exceed 2 · 3 · log(n) · χ bits, where χ is the output size of the hash function H. The maximum length of rounds grows proportionally to O(log n). However, the average length of rounds is constant, and therefore it is practical to publish the time-stamps for rounds after constant units of time. This can be achieved easily with the following procedure: if the "deadline" for a round is approaching and there are still q time-stamps not yet issued, assign random values to the remaining data items H_n.
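The ξ recursion above can be sketched as follows; for m = 7 (so k = 3) it reproduces the round boundaries ξ_1 = 7, ξ_2 = 15, ξ_3 = 22, ξ_4 = 31 of Fig. 2, and the assertions also check the rank property ξ(i + 1) − ξ(i) ≥ m of Definition 4:

```python
def make_xi(m: int):
    """Return the round-boundary function xi for rank m (k = ceil(log2 m))."""
    k = (m - 1).bit_length()            # ceil(log2 m) for m >= 2
    cache = {0: 0, 1: (1 << k) - 1}     # xi_0 = 0, xi_1 = 2^k - 1 (source of T_k)

    def xi(i: int) -> int:
        if i not in cache:
            j = i.bit_length() - 1      # j = floor(log2 i)
            if i == 1 << j:
                cache[i] = 2 * xi(i // 2) + 1
            else:
                cache[i] = xi(1 << j) + xi(i - (1 << j))
        return cache[i]
    return xi

xi = make_xi(7)   # the setting of Fig. 2
```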
Remark 1. Denote by ord n the greatest power of 2 dividing n. In the ALS presented above, it is reasonable to label time-stamps in the lexicographical order with pairs (n, p), where 0 ≤ p ≤ ord n and n > 0. Then

f(n, p) := { (0, p) ,                       if n = 2^p ,
           { (n − 2^p, ord(n − 2^p)) ,      otherwise

and g(n, p) := (n, p − 1) if p > 0 and g(n, 0) := (n − 1, ord(n − 1)). Also, the formulas for ξ_i simplify: in this case, ξ(i) := (2^{k−1} i, k − 1 + ord i) for i ≥ 1.

It is easy to show that for each n and m the shortest verifying chain between n and m is uniquely defined. The data v_mn necessary to verify the one-way dependence is computed by the procedure TSData(m, n):

proc TSData(m, n) ≡
  Data := nil
  while n > m do
    Data := append(Data, H_n)
    if f(n) ≠ n − 1 ∧ f(n) ≥ m then
      Data := append(Data, L_{n−1}); n := f(n)
    else
      Data := append(Data, L_{f(n)}); n := n − 1
    fi
  od .

Here, head(n) := TSData(ξ_{r(n)−1}, n) and tail(n) := TSData(n, ξ_{r(n)}).

Example 1. Let ξ_0 = 0 and ξ_1 = 15 (Fig. 2). In order to compute the fourth and the tenth time-stamps we need

tail(10) := (H_15, L_0, H_14, L_7, H_13, L_12) ,
head(10) := (H_10, L_9, H_7, L_6) ,
tail(4) := (H_15, L_0, H_14, L_13, H_7, L_0, H_6, L_3, H_5, L_4) ,
head(4) := (H_4, L_3, H_3, L_2) .

Let (f, H) be a BLS satisfying the implication (4). Let x < y < z < w and let C_1, C_2 be verifying chains from z to x and from w to y, respectively. It is obvious that C_1 and C_2 have a common element. Thus, if m < n, then the verifying chains tail(m) and head(n) have a common element c, which implies the existence of a verifying chain

(m = n_0, n_1, . . . , n_{i−1}, n_i = c, n_{i+1}, . . . , n_{ℓ−1}, n_ℓ = n) .

This chain can be found by a simple algorithm and is of logarithmic length. Let r(m) denote the round to which m belongs. The proof of the last claim for the case r(m) = r(n) is given in Appendix A. If m and n belong to different rounds,
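The procedure TSData can be transcribed directly into Python, using the link function f of Sect. 5 and symbolic labels for the H_n and L_n; the assertions reproduce the four tuples of Example 1:

```python
def h(n: int) -> int:
    k = n.bit_length()
    if n == (1 << k) - 1:
        return k
    return h(n + 1 - (1 << (k - 1)))

def f(n: int) -> int:
    # First link of vertex n; the second link is always n - 1.
    return n - (1 << h(n)) + 1

def ts_data(m: int, n: int):
    """Data needed to verify the one-way dependence of L_n on L_m (proc TSData)."""
    data = []
    while n > m:
        data.append("H%d" % n)
        if f(n) != n - 1 and f(n) >= m:
            data.append("L%d" % (n - 1))
            n = f(n)
        else:
            data.append("L%d" % f(n))
            n = n - 1
    return data

tail10 = ts_data(10, 15)   # tail(10) with xi_1 = 15
head10 = ts_data(0, 10)    # head(10) with xi_0 = 0
```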
the verification is straightforward, because of the similar structure of the second layer of links. The verifying chain from n to m is of the form

(m, . . . , m′, ξ_{r(m)}, . . . , ξ_{r(n)−1}, n′, . . . , n) ,

where the number of the ξ_j's is logarithmic due to the fact that the time-stamps for rounds are linked together in a way similar to the linking of all time-stamps (Fig. 2). The length of the sequences (m, . . . , m′) and (n′, . . . , n) is also logarithmic (Appendix A).

Fig. 3. The time-stamp of X_10 in the proposed system. [Figure: the links on the verifying chain, the other links used for verification, and the links not used for verification are distinguished.]

Example 2. For the chains given in Example 1, the common element is 7 and the verifying chain between 4 and 10 is (4, 5, 6, 7, 10).

Corollary 1. Due to the similarity between the verification and the stamping procedure, for an arbitrary pair of time-stamped documents the number of steps executed (and therefore also the number of time-stamps examined) during a single run of the verification protocol is O(log n).
6 Optimality

Our solution meets the feasibility requirements asymptotically, but could these requirements be refined? Mostly not; an insight into this is given below. Namely, we show that for any linking scheme there does not exist a time-stamping solution where (1) the length of the time-stamps is O(log n), (2) for any m and n there exists a verifying chain between m and n with length O(log n) that is completely contained in the union S(m) ∪ S(n) of the corresponding individual time-stamps, and (3) the stamping protocol ends in logarithmic time. We prove this under the assumptions (1) that an individual time-stamp is a subset of ℕ and (2) that the size of a time-stamp is proportional to #S(n) + #ρ⁻¹(S(n)) = O(#ρ⁻¹(S(n))) (which holds if the transitive closure ρ* of ρ coincides with the natural order […]

[…] Observe the following cases:
– If 0 < a ≤ e_{k−1}, then ℓ(a, e_k) = ℓ(a, e_{k−1}) + ℓ(e_{k−1}, e_k) ≤ 2(k − 2) + 2 = 2(k − 1) by the induction assumption.
– If e_{k−1} < a ≤ e_k, then observe the following cases:
  • a = e_k. Then ℓ(a, e_k) = 0 ≤ 2(k − 1).
  • a < e_k. Then ℓ(a, e_k) = ℓ(a, e_k − 1) + ℓ(e_k − 1, e_k) = ℓ(a − e_{k−1}, e_{k−1}) + 1 by Lemma 2. The induction assumption now gives ℓ(a, e_k) = ℓ(a − e_{k−1}, e_{k−1}) + 1 ≤ 2(k − 2) + 1 < 2(k − 1). ⊓⊔

Proof (Theorem 1). Induction on k. Base: k = 3. In this case one can directly verify that ℓ(a, b) ≤ 4. Step: k > 3. Observe the following cases:
– If 0 < a ≤ b ≤ e_{k−1}, then the induction assumption gives us ℓ(a, b) ≤ 3(k − 1) − 5 < 3k − 5.
– If 0 < a ≤ e_{k−1} < b ≤ e_k, then ℓ(a, b) = ℓ(a, e_{k−1}) + ℓ(e_{k−1}, b) ≤ 2(k − 2) + ℓ(e_{k−1}, b) by Lemma 4. The following cases are possible:
  • b = e_k. Then ℓ(e_{k−1}, b) = 2 < k − 1.
  • b = e_k − 1. Then ℓ(e_{k−1}, b) = 1 < k − 1.
  • b < e_k − 1. Then Lemmas 2 and 3 give ℓ(e_{k−1}, b) = ℓ(0, b − e_{k−1}) ≤ k − 1.
  Thus ℓ(a, b) ≤ 2(k − 2) + k − 1 = 3k − 5.
– If e_{k−1} < a ≤ b ≤ e_k, then observe the following cases:
  • b = e_k. Then ℓ(a, b) = ℓ(a, e_k) ≤ 2(k − 1) < 3k − 5 by Lemma 4.
  • b < e_k.
Then ℓ(a, b) = ℓ(a − e_{k−1}, b − e_{k−1}) ≤ 3(k − 1) − 5 < 3k − 5 by Lemma 2 and the induction assumption. ⊓⊔

As ⌈log b⌉ = k iff e_{k−1} + 1 < b ≤ e_k + 1, we get k ≤ ⌈log b⌉ + 1 and thus

ℓ(a, b) ≤ 3⌈log b⌉ − 2 .
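The bound of Theorem 1 can also be checked exhaustively for small k by breadth-first search over the two links n → f(n) and n → n − 1. This is a numerical sanity check under the construction of Sect. 5, not part of the proof:

```python
from collections import deque

def h(n: int) -> int:
    k = n.bit_length()
    return k if n == (1 << k) - 1 else h(n + 1 - (1 << (k - 1)))

def f(n: int) -> int:
    return n - (1 << h(n)) + 1

def chain_len(a: int, b: int) -> int:
    """Length l(a, b) of the shortest verifying chain from b down to a (0 < a <= b)."""
    dist, queue = {b: 0}, deque([b])
    while queue:
        v = queue.popleft()
        if v == a:
            return dist[v]
        for w in (f(v), v - 1):            # the two outgoing links of v
            if w >= a and w not in dist:   # links only go downwards, never below a
                dist[w] = dist[v] + 1
                queue.append(w)
    raise ValueError("no chain")
```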
Threshold Traitor Tracing

Moni Naor⋆ and Benny Pinkas⋆⋆

Dept. of Applied Mathematics and Computer Science, Weizmann Institute of Science, Rehovot 76100, Israel
{naor,bennyp}@wisdom.weizmann.ac.il
Abstract. This work presents threshold tracing schemes. Tracing schemes trace the source of keys which are used in pirate decoders for sensitive or proprietary data (such as pay-TV programs). Previous tracing schemes were designed to operate against any decoder which decrypts with a non-negligible success probability. We introduce threshold tracing schemes, which are only designed to trace the source of keys of decoders which decrypt with probability greater than some threshold q (which is a parameter). These schemes offer a dramatic reduction in overhead compared to the previous constructions of tracing schemes. We argue that in many applications it is only required to protect against pirate decoders which have a decryption probability very close to 1 (for example, TV decoders). In such applications it is therefore very favorable to use threshold tracing schemes.
1
Introduction
We present very efficient tracing systems: systems which allow data providers to identify the sources of leakage of their keys to illegitimate receivers. Consider for example a pay-TV provider which finds out that someone is selling pirate decoders which enable the decoding of transmissions without paying the required fees. A tracing system enables the provider to identify which legitimate receivers assisted in constructing the pirate decoders. Tracing systems were first presented by Chor, Fiat and Naor [8]. They used the following security requirement, which in our view is too stern for many applications: they required full resiliency, i.e., that the schemes should trace the source of any decoder which decodes with non-negligible probability. We claim that for many very relevant applications a decoder with a success probability which is non-negligible, but not very close to 1, is useless. Assume for example that a TV program is divided into one-minute segments which are separately encrypted. A decoder which decrypts with probability 90% is expected to fail in the decoding of one out of every ten minutes. Very few customers will be willing to pay for such a decoder.

⋆ Research supported by BSF Grant 32-00032.
⋆⋆ Supported by an Eshkol Fellowship from the Israeli Ministry of Science.
H. Krawczyk (Ed.): CRYPTO '98, LNCS 1462, pp. 502–517, 1998. © Springer-Verlag Berlin Heidelberg 1998
We present threshold tracing schemes, which depend on a parameter q. They trace the source of keys of any decoder which decodes with success probability not smaller than q, but there is no guarantee of success against decoders with success probability smaller than q. The efficiency of our threshold tracing schemes is superior to that of the tracing schemes of [8] (see Section 4.3 for a numerical comparison for constructions of typical size). We therefore claim that applications which do not require fully resilient tracing should use threshold tracing schemes. In order to use threshold tracing schemes, the communicated content should be divided into blocks which are independently encrypted. A legitimate decoder contains keys which enable it to decrypt every block. These keys identify that decoder. If a (pirate) decoder contains enough keys (taken from the legitimate decoders of traitors) to enable it to decrypt more than a q fraction of the blocks, these keys are sufficient to identify at least one of the traitors. It is assumed that a pirate decoder which decrypts less than a q fraction of the blocks is not useful, and therefore it is not important to trace the source of its keys. In general, it is always useful to recognize what constitutes a "success" of the adversary, and to design schemes which prevent such a success. This process may lead to very efficient constructions, with an overhead that is proportional to the severity of the "attack" to which they are immune (this is the case with the threshold tracing schemes we present, whose overhead is an inverse function of q). Such constructions can also serve to price the security, by presenting the overhead incurred by requiring a certain amount of security. Let us first consider the scenario in which the schemes operate. A data provider is distributing some content to legitimate receivers (e.g., paying subscribers). The content is typically distributed encrypted, and each legitimate receiver has a decryption key.
A traitor is a legitimate receiver who attempts to enable unauthorized users to access the content. A traitor can distribute a copy of the cleartext of the content to other illegitimate receivers. We do not attempt to protect against such pirate distribution, but claim that in many cases the economy of scale makes such a distribution unprofitable or too dangerous. Typical cases where this is true include:
– Pay-per-view or subscription television broadcasts. It is an expensive and risky business to start a pirate broadcast station. (A similar application is the distribution of content over the Internet using "push" technology.)
– Online services or databases, publicly accessible (say, on the Internet), where a charge may be levied for access to all or certain records. The pirate must copy the entire information provided by the online service and maintain an updated copy. This process is inefficient and can be easily detected.
As piracy in these cases is a criminal commercial enterprise, the risk/benefit ratio in distributing illegal copies of the content becomes unattractive. A pirate can sell illegal access to the content by providing its customers with much shorter data: the decryption keys. We therefore concentrate in this paper on preventing
traitors from distributing their decryption keys to other users¹. We construct (k, q)-threshold tracing schemes. If an illegitimate user uses a pirate decoder² which was built using the keys of at most k legitimate users (who are therefore traitors), and if the decoder can decrypt with probability at least q, then our schemes will identify (with high probability) at least one traitor given the pirate decoder (it cannot be promised that the schemes identify more traitors, since it is possible that all the keys used in constructing the pirate decoder were taken from a single traitor). We note that our schemes in fact have the very desirable property that the identity of the traitor can be established by considering the pirate decryption process as a black box. It suffices to capture one pirate decoder; its behavior will identify the traitor, and there is no need to "break it open" or read any data stored inside. The schemes can be based on any symmetric encryption system. The security parameter is the length of the key of that system. We measure the efficiency of the solutions in terms of several performance parameters. The memory and communication parameters are measured in multiples of the size of the security parameter. The efficiency parameters are: (a) the memory and computation requirements for an authorized user; these parameters are of special importance if the user has limited computation and storage capabilities, as is the case with smartcards. (b) the memory and computation requirements for the data supplier; these parameters are typically less important, since the data supplier can perform its computations off-line and can use large storage space. (c) the data redundancy overhead, i.e., the increase in data size that is needed in order to enable the tracing; this refers to the communication overhead (in broadcast or online systems) or the additional "wasted" storage in CD-ROM type systems.

1.1 Our Results
Consider a tracing scheme for n users, which should be secure with probability 1 − p against coalitions of up to k users.

¹ In practice today it is often considered sufficient to prevent piracy by supplying the authorized parties with so-called secure hardware solutions (smartcards and their like) that are designed to prevent interference and access to enclosed cryptographic keys. The assumptions about the security of these hardware mechanisms are not always correct. There are several methods that use hardware faults in the "secure hardware solutions" in order to find the keys that are enclosed inside [3,5,4]. Our schemes obtain their claimed security without any secure hardware requirements. Should such devices be used to store the keys, they will undoubtedly make the attack even more expensive, but this is not a requirement.
² We use the term pirate decoder to represent the pirate decryption process; this may or may not be a physical box, and may simply be some code on a computer.
Our schemes compare very well to the best tracing scheme of [8] (see also Table 1). That scheme required each user to store a personal key of length O(log(1/p) log(n/p)). This was also the running time required from the user. The communication overhead was O(k log(1/p) log(n/p)). We remark that the "O" notation hides considerable coefficients. For a threshold 0 < q < 1, our one-level scheme has personal keys of length 4k log(n/p), and a communication overhead of only 4k/(3q). The user is required to perform only a single decryption operation. The length of the personal keys of the simplest two-level threshold scheme is O(log(k/p) log(n/p)), and its communication overhead is O(k log((k/q) log(k/p))). A user should perform very few decryption operations. We remark that in this case the coefficients of the "O" notation are very moderate. Table 1 contains a comparison for a reasonable-size system, in which all the parameters and coefficients are plugged in. From now on we describe the exact complexity of the schemes we present: we do not use the "O" notation, but rather present all the constant coefficients.

1.2 Content Distribution Schemes
The schemes which are used to distribute the content from the data provider to the legitimate receivers are of the following general form: the data supplier generates a meta-key which contains a base set A of random keys and assigns subsets of these keys to users, m keys per user (the parameters will be specified later). These m keys jointly form the user's personal key. Different personal keys may have a nonempty intersection. We denote the personal key for user u by P(u), which is a subset of the base set A. A message in a traitor tracing scheme consists of pairs of the form ⟨enabling block, cipher block⟩. The cipher block is the symmetric encryption of the actual data (say, part of a video clip) under some secret random key s. Alternately, it could be the exclusive-or of the message with s, and we would get an information-theoretically secure version of the scheme (although a very inefficient one, since as with any one-time pad the size of the key should be as long as the encrypted data). The enabling block allows authorized users to obtain s. The enabling block consists of encrypted values under some or all of the keys of the base set A. Every authorized user will be able to compute s by decrypting the values for which he has keys and then computing the actual key from these values. For all the schemes we present, the computation on the user end is simply taking the exclusive-or of the values that the user is able to decrypt. A very simple scheme is to give each user a different key. Then the enabling block includes an encryption of s under each of the users' keys. However, the length of the enabling block is then linear in the number of legitimate users and might be too large for many applications. Traitors may conspire and give an unauthorized user (or users) a subset of their keys so that the unauthorized user will also be able to compute the key s from the values he has been able to decrypt.
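A minimal sketch of this enabling-block structure, under illustrative assumptions that are not the particular key assignment of the paper: the base set A is arranged as a small table, each user holds one key per row, s is split into XOR shares (one per row), and XOR with the key stands in for a real symmetric cipher. All sizes and names here are hypothetical.

```python
import os

ROWS, COLS, KEYLEN = 4, 8, 16          # illustrative sizes, not from the paper

def xor(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

# The base set A of random keys, as a ROWS x COLS table.
base = [[os.urandom(KEYLEN) for _ in range(COLS)] for _ in range(ROWS)]

def personal_key(cols):
    """P(u): one key per row, chosen by a column index per row."""
    return [(i, j, base[i][j]) for i, j in enumerate(cols)]

def enabling_block(s: bytes):
    """Split s into ROWS shares XOR-ing to s; 'encrypt' share i under every
    key of row i (XOR is a stand-in for a symmetric cipher)."""
    shares = [os.urandom(len(s)) for _ in range(ROWS - 1)]
    last = s
    for sh in shares:
        last = xor(last, sh)
    shares.append(last)                 # now the XOR of all shares equals s
    return [[xor(shares[i], base[i][j]) for j in range(COLS)] for i in range(ROWS)]

def recover(block, pkey):
    """Authorized decryption: XOR the values the user can decrypt (one per row)."""
    s = bytes(KEYLEN)
    for i, j, key in pkey:
        s = xor(s, xor(block[i][j], key))
    return s
```

Every user recovers the same s from different key subsets; the tracing schemes in the text choose the assignment of columns to users so that the keys found in a captured decoder expose a traitor.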
Moni Naor and Benny Pinkas

The goal of the system designer is to assign keys to the users such that, when a pirate decoder is captured, it would be possible to detect at least one traitor, subject to the limitation that the number of traitors is at most k. We remark that the overhead of both the schemes of [8] and of our threshold schemes depends on the parameter k. Since the overhead of our schemes is a considerably smaller function of k, it is possible to set this parameter to a higher value and protect against larger coalitions.

1.3 Eliminating Piracy
Traitor tracing schemes help in three aspects of piracy prevention: they deter users from cooperating with pirates, they identify the pirates and enable taking legal action against them, and they can be used to disable active pirate users.

The usage of traitor tracing schemes discourages users from helping pirates and especially from submitting their keys to be used in pirate decoders. In particular, if the process of a user obtaining a personal key requires some sort of registration and physical identification, then it should be hard for pirates to obtain a large number of personal keys. Consequently, the traitor tracing scheme can identify the source of keys which are used in pirate decoders, and this mere fact should deter users from helping pirates. When a pirate decoder is found and a source of its keys is identified, legal action should be taken against this source. Indeed, as was pointed out by Pfitzmann in [17], a corrupt data provider that wishes to incriminate an honest user might construct a "dummy" pirate decoder containing this user's keys, "reveal" it and claim that the user is a pirate. Similar misbehavior is possible, though, with many current types of services, and yet there is little evidence that service providers have performed such illegal activities.

The broadcast encryption schemes of Fiat and Naor [13] deal very efficiently with disabling active pirate users, i.e. preventing them from further decryption. These schemes allow one to broadcast messages to any dynamic subset of the user set and are specifically suitable for pay-per-view TV applications. The schemes require a single short transmission to disable all pirate decoders if they were manufactured via a collaborative effort of no more than k traitors. Another broadcast encryption scheme was suggested by Wallner et al. [18], and is secure against any number of corrupt users. It has better performance than [13] if the number of deletions is small.
In particular, personal keys are of length O(log n) and there is no data redundancy in regular operation. A combination of a traitor tracing scheme and a broadcast encryption scheme is a very powerful tool: when a traitor is traced, the dynamic subset of users authorized to receive the broadcast should be changed by simply excluding the traced traitor. This procedure should be repeated until the pirate box is rendered useless. In [9] it is described how to combine a traitor tracing scheme and a broadcast encryption scheme in order to achieve this capability. Both the data redundancy overhead and the key length of the resulting scheme are the product of the corresponding overheads of the tracing and broadcast encryption schemes (but when used with the scheme of [18] this does not increase the total overhead too much).
Threshold Traitor Tracing

1.4 Related Work
The work of Chor, Fiat, and Naor [8] introduced the concept of traitor tracing and presented several tracing schemes. We survey their results in Section 3. A more complete and formal treatment of the problem is presented in [9], which is the full version of [8] and of our paper.

Boneh and Shaw [6] suggested a scheme for marking different copies of an electronic document by inserting a different fingerprint into each copy. The fingerprint is composed of a sequence of marks, with each mark having one of two values (the fingerprint therefore corresponds to a binary string)³. The scheme is based on a marking assumption which states that a coalition of users who each have a copy with the same value for a certain mark cannot generate a copy with a different value for that mark. The scheme has the property that using up to k copies it is impossible to generate a new copy whose fingerprint does not reveal at least one of the k copies that were used. It offers better security in the sense that it enables tracing the leaked content itself (and not just the key which enables its decryption). It can also be used as a traitor tracing scheme, but it is much less efficient than the schemes of [8]: the number of keys that each user should have is k⁴ times greater than in the most efficient scheme of [8].

Another solution for copyright protection is through self-enforcement schemes, which were suggested by Dwork, Lotspiech and Naor [11]. In these schemes the content is encrypted and each legitimate user receives a different decryption key which includes some sensitive information related to the user (e.g. his credit card number). Users will be reluctant to hand their keys to others since the keys contain this sensitive information. The self-enforcement schemes suggested in [11] use the same type of security as was used in [8,6]: namely, the system is secure against coalitions of less than k corrupt users, and the system's complexity depends on k.
Pfitzmann [17] suggested a traitor tracing method which yields a proof of the liability of the traced traitors. In this scheme the issuing of keys from the center to the users is performed by an interactive protocol. At the end of the protocol the center is not able to construct a "pirate decoder" that frames a user, but if a real pirate decoder is found the center is able to trace the source of the keys the decoder contains. However, as this construction uses a relatively complex primitive (general secure multi-party protocols) which is rather inefficient (e.g. it operates on the circuit which evaluates the function), its overall complexity is high.
2 Definitions
A traitor tracing scheme consists of three components:

– A user initialization scheme, used by the data supplier to add new users. The data supplier has a meta-key α that defines a mapping Pα : U → {0, 1}^s, where U is the set of possible users and s is the number of bits in the personal key of each user. When user ui ∈ U joins, he receives his personal key Pα(ui). In all of our constructions Pα(ui) consists of a subset of m decryption keys out of a larger set A of keys.

– An encryption scheme Eα : {0, 1}* → {0, 1}* used by the data supplier to encrypt messages, and a decryption scheme Dβ : {0, 1}* → {0, 1}* used by every user to decrypt those messages. Let the personal key of user ui be β = Pα(ui); then for any message M ∈ {0, 1}* we have M = Dβ(Eα(M)). In our schemes the messages are encrypted block by block, where every encrypted block contains an enabling block and a cipher block. The decryption process consists of a preliminary decryption of encrypted keys in the enabling block, a process which combines the results to obtain a common key, and finally a decryption of the cipher block.

– A traitor tracing algorithm, used upon confiscation of a pirate decoder to determine the identity of a traitor. We do not assume that the contents of a pirate decoder can be viewed by the traitor tracing algorithm, but rather that the tracing algorithm can access it as a black box and test how (if at all) it decrypts an input ciphertext. (We do assume, however, that the pirate decoder can be reset to its original state, i.e. that there is no self-destruction mechanism when it detects a traitor tracing algorithm.)

The encryption of plaintext blocks in our schemes results in a message which consists of an enabling block and a cipher block. The cipher block contains the plaintext block encrypted by some encryption algorithm using some random block key s which is unique to this block. The enabling block contains encryptions of "shares" of the block key such that every legitimate user can use his personal key to decrypt enough shares to reconstruct the block key.

³ See for instance [10] for a method for inserting marks into a document.
An adversary who wants to decrypt the message can either break the encryption scheme that was used in the cipher block without using any information from the enabling block, or try to learn some information from the enabling block that might help in the decryption process. In this paper we assume that it is hard to break the underlying encryption scheme, so we are only interested in preventing attacks of the latter kind. Assume that an adversary has the cooperation of a coalition of at most k legitimate users, and uses their keys to construct a decoder. We would like to trace at least one of the coalition members.

Intuitively, a scheme is called fully resilient if it is possible to trace (with high certainty) at least one of the traitors that helped build a decoder which does not break the underlying encryption algorithms. More accurately, a system is fully resilient if for every pirate decoder which runs in time t it either holds that it is possible to trace at least one of the traitors which helped its construction, or that the decoder can break one of the underlying encryption algorithms in time t. Fully resilient tracing schemes were suggested and constructed in [8].

There are many applications for which the pirate decoder must decrypt with probability close to 1, like the TV broadcast example we presented in Section 1. In such scenarios we can concentrate on tracing the source of keys which were used to build decoders which decrypt with probability greater than some threshold. A scheme is called a q-threshold scheme if for every decoder which does not break the underlying encryption algorithms and decrypts with probability greater than q it is possible to trace at least one of the traitors that helped building it.

An obvious and preliminary requirement from traitor tracing schemes is that they supply secure encryption; that is, an adversary who has no information on the keys that are used should not be able to decrypt the encrypted content. Intuitively, our security definitions state that if an adversary (who might have some of the keys) is able to decrypt and escape being traced, then the scheme is insecure as an encryption scheme even against an adversary who has no keys. In the following we present an exact definition of fully-resilient and threshold tracing schemes.

Definition 1. Let T be a coalition of at most k users. Let A be an adversary who has a subset F of the values of the keys of the users in T, and who is able to decrypt in time t and with probability greater than q′ the content sent in the traitor tracing scheme. The security assumption is that one of the following two statements holds:

– Given F, the data supplier is able to trace with probability at least 1 − p at least one of the users in T.
– There exists an adversary A′ which uses A as a black box and whose input is only an enabling block and a cipher block of the traitor tracing scheme. A′ can reveal the content that is encrypted in the cipher block in time which is linear in the length of its input and in t, and with probability at least q″ (q″ is defined in the next paragraph).

The probability is taken over the random choices of the data supplier, and when appropriate over the random choices of the adversary or of the tracing algorithm. The scheme is called fully (p, k)-resilient if the security assumption holds for q′ = q″.
If the scheme further achieves p = 0 then it is called fully k-resilient. The scheme is called q-threshold (p, k)-resilient if the security assumption holds for q′ = q + q″. Since we assume the underlying encryption algorithms to be secure, we can assume that the probability q″ with which an adversary A′ who knows nothing but the ciphertext can break the encryption is negligible. Therefore in a fully resilient scheme the data supplier can trace at least one traitor if it finds a pirate decoder (adversary A) which decrypts with non-negligible probability. In a threshold scheme the data supplier is able to do so if it finds a decoder which decrypts with probability greater than q by a non-negligible difference (but to simplify the exposition we often take the liberty of referring to threshold schemes as secure against any pirate decoder which decrypts with probability greater than q).
3 Fully-Resilient Tracing Schemes
The fully-resilient tracing schemes of [8] are based on applying hash functions combined with any private-key cryptosystem, and do not require any public-key operations. Our threshold schemes will be based on the same operations. The hash functions are used to assign decryption keys (from a base set of decryption keys) to authorized users. The assignment guarantees that any combination of keys, taken from the personal keys of any coalition of traitors, has the following property: if this combination enables decryption then it is "far" from the personal key of any innocent (non-traitor) user. (For more information on hash functions and their applications see [15,7,19,14].)

There are two types of traceability schemes defined in [8]. Open schemes assume that the mapping is public and the indices of the keys which are mapped to any user are publicly known (the only secret information is the content of the keys). Secret schemes are defined to operate in cases where the mapping of keys is secret and it is unknown which keys are used by every user. The constructions of secret schemes can be more efficient than those of open schemes and are therefore recommended to be used in practice. The reason for the gain in efficiency is that traitors do not know which keys the other users received. Therefore even if the set of keys of a coalition of traitors includes a large part of the keys of an innocent user, the traitors do not know which keys these are and cannot construct a pirate decoder which incriminates a specific user. Our threshold tracing schemes are secret schemes.

Secret fully-resilient schemes were constructed for n users and at most k traitors. Two types of secret schemes were presented in [8]:

– Secret fully (p, k)-resilient one-level schemes required the personal key of each user to consist of m = (4/3)k log(n/p) decryption keys, and the enabling block to include (16/3)k² log(n/p) key encryptions. Each user should perform (4/3)k log(n/p) decryptions in order to reveal the broadcasted secret.

– Secret fully (p, k)-resilient two-level schemes required the personal key of each user to consist of m = (4/3)b log(2n/p) decryption keys, and the enabling block to include (32/3)ekb log(2n/p)(1 + ln(ek/b)/(b − 1 − ln(ek/b))) key encryptions, where b = log(4/p). Each user should perform (4/3)b log(2n/p) decryptions in order to reveal the broadcasted secret.

Two-level schemes are more efficient than one-level schemes if k ≫ log(1/p).
4 Threshold Tracing Schemes
Threshold tracing schemes are designed to trace the source of keys of any pirate decoder whose advantage in decrypting the content (compared to an adversary who does not have any of the keys) is at least q. The complexity of q-threshold schemes depends on q; these schemes are more efficient for larger values of q. They are secret schemes in the sense that the set of keys that each user receives is unknown to other users. The design concept of these schemes is as follows: either the pirate decoder holds enough keys to enable the tracing of at least one traitor, or it does not contain enough keys to ensure a decryption probability greater than q. The security of the tracing schemes is reduced to the assumption that the encryption scheme that is used is secure, and therefore any adversary who does not have the decryption keys cannot decrypt with a non-negligible success probability.

The benefit of using threshold tracing schemes is a dramatic reduction in the data redundancy overhead and in the number of operations needed for decryption, whereas the length of the personal key is almost as short as in secret fully resilient schemes. We also present a threshold scheme which improves over fully-resilient schemes in all complexity parameters.

Next we define one-level and two-level threshold tracing schemes. The data redundancy overhead and the personal key length are parameterized and there is a tradeoff between them. It is possible to set the parameter to a value which obtains the best tradeoff between the two complexity measures (for instance the last entry of Table 1 demonstrates a reasonable such tradeoff).

4.1 A One-Level Threshold Scheme
The scheme uses a threshold parameter q, against k traitors and for a total of n users, each with a unique identity u ∈ {1, . . . , n}.

Initialization: A set of ℓ hash functions h1, h2, . . . , hℓ is chosen independently at random. Each hash function hi maps {1, . . . , n} into a set of 4k random keys Ai = {ai,1, ai,2, . . . , ai,4k}. The hash functions are kept secret. User u receives, upon initialization, the indices and values of the ℓ keys {h1(u), h2(u), . . . , hℓ(u)}. The keys can be imagined as organized in a matrix of size ℓ × 4k, where each user receives a single key from each row.

Distributing a secret: Let s be the secret to be distributed. Let q ≤ w < 1 and 0 < t ≤ ℓ be two parameters which will be defined later (the scheme divides the secret into t shares and ensures that only a decoder which contains keys from a fraction of at least w of the rows would be able to decrypt the secret with probability greater than q). The data provider chooses random values {si}_{i=1}^t subject to the constraint ⊕_{i=1}^t si = s, and chooses t random rows r1, . . . , rt. For every i (i = 1, 2, . . . , t) the data provider encrypts si under each of the 4k keys in row A_{r_i}.

Decryption: Each authorized user has one key from every row A_{r_i} and is therefore always able to decrypt every si and compute s.

Parameters: The memory required per user is m = ℓ keys. The amount of work that each user should perform in order to reveal a key is O(t). The data redundancy overhead used in distributing the key is r = 4kt. The parameter t should be set so that for t random rows it holds with probability at least 1 − q that a pirate decoder which contains keys from less than a fraction w of the rows does not have a key from at least one of the t rows (and therefore a decoder which does not have keys from a fraction w of the rows cannot decrypt with probability better than q). First observe that w ≥ q, since otherwise the probability is less than q even for t = 1. The probability of the decoder having
keys from all t rows is at most w^t, and therefore setting t = log_w q = log(1/q)/log(1/w) suffices to make the probability of correct decryption at most q. For example, it is possible to set w = q and t = 1. The broadcast center would then only have to broadcast the secret s encrypted by the keys of a single row which it chooses randomly. The data redundancy overhead is then only 4k.

Tracing: We are only concerned with decoders which include keys from at least wℓ rows⁴. Using the methods of [9] it is possible to reveal the set of keys F that a pirate decoder uses while treating the decoder as a black box. Assume w.l.o.g. that F contains one key from each of wℓ rows. Denote these rows as r_1, . . . , r_{wℓ}, and denote the key in F ∩ A_{r_i} as f_{r_i}. The body that performs the traitor tracing knows the functions h_{r_i}(·) and can therefore identify and mark the users in h_{r_i}^{−1}(f_{r_i}) for every i. The user with the largest number of marks is exposed as the traitor.

Analysis: Since there were at most k traitors, it is obvious that one of them contributed at least wℓ/k keys to F. Consider the probability that an innocent user, say user 1, contributed wℓ/k keys to F. Since the hash functions h_{r_i} are random and secret, the mapping h_{r_i}(1) is random and independent of the mapping of the traitors by h_{r_i}. The probability that f_{r_i} equals the key mapped to user 1 is 1/4k. An immediate application of the Chernoff bound shows that the probability that at least wℓ/k of the keys of user 1 are in F is at most 2^{−3wℓ/4k}. Choosing ℓ such that n · 2^{−3wℓ/4k} < p ensures that a traitor is revealed with probability at least 1 − p. The data provider should therefore set ℓ = (4k/3w) log(n/p).

For any practical purpose the parameter q can be set to be a constant. However, one-level schemes are used in the next subsection as building blocks for two-level schemes, and there q should be a function of other parameters. The results regarding one-level threshold schemes are summed up in the following theorem.
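The initialization, distribution, decryption, and mark-counting steps above can be put together in a short runnable sketch. This is a toy model, not the construction as deployed: a one-time pad stands in for the symmetric cipher, the parameters are far smaller than the analysis requires, and the black-box extraction of the key set F (done via the methods of [9]) is replaced by handing the tracer the traitor's key indices directly.

```python
import os
import random
from collections import Counter
from functools import reduce

KEYLEN = 16
xor = lambda a, b: bytes(x ^ y for x, y in zip(a, b))
enc = dec = xor            # one-time-pad stand-in for the symmetric cipher

n, k, ell, t = 20, 2, 8, 3  # users, traitor bound, rows, shares (toy sizes)

# Initialization: ell secret random hash functions, each mapping a user to
# one of the 4k keys in its row.  Row i holds keys A[i][0..4k-1].
A = [[os.urandom(KEYLEN) for _ in range(4 * k)] for _ in range(ell)]
h = [[random.randrange(4 * k) for _ in range(n)] for _ in range(ell)]

def personal_key(u):
    """User u receives one (row, index, key) triple from every row."""
    return [(i, h[i][u], A[i][h[i][u]]) for i in range(ell)]

def distribute(s):
    """Split s into t XOR shares; encrypt share j under all 4k keys of a
    randomly chosen row r_j."""
    shares = [os.urandom(KEYLEN) for _ in range(t - 1)]
    shares.append(reduce(xor, shares, s))
    rows = random.sample(range(ell), t)
    return [(r, [enc(key, sh) for key in A[r]]) for r, sh in zip(rows, shares)]

def decrypt(u, enabling_block):
    """An authorized user decrypts his one ciphertext per chosen row and XORs."""
    pk = {i: (idx, key) for i, idx, key in personal_key(u)}
    shares = [dec(pk[r][1], cts[pk[r][0]]) for r, cts in enabling_block]
    return reduce(xor, shares)

def trace(found_keys):
    """found_keys: (row, key index) pairs extracted from a pirate decoder.
    Mark every user mapped to that index by the secret hash of that row and
    accuse the user with the most marks."""
    marks = Counter(u for (r, idx) in found_keys
                      for u in range(n) if h[r][u] == idx)
    return marks.most_common(1)[0][0]

s = os.urandom(KEYLEN)
eb = distribute(s)
assert decrypt(7, eb) == s   # any of the n authorized users recovers s
```

With real parameters the hashes h would be keyed functions rather than stored tables, and the tracer would first run the decoder as a black box to learn which keys it holds.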
We first state the results for a parameterized w. As w increases the key length decreases and the data redundancy overhead increases. Then we state the results for w = q.
Theorem 2. There is a q-threshold (p, k)-resilient scheme, with a parameter w taking values in [q, 1), in which a personal key consists of (4k/3w) log(n/p) keys and the data redundancy overhead is of 4k · log(1/q)/log(1/w) keys. A user should perform log(1/q)/log(1/w) decryptions in order to reveal the broadcasted secret.
When w = q, a personal key consists of (4k/3q) log(n/p) keys and the data redundancy overhead is of only 4k keys. A user should only perform a single decryption in order to decrypt the broadcasted secret.

The scheme we presented displays a tremendous improvement in the data redundancy overhead, but the length of the personal key is a little larger than in the one-level fully resilient scheme (it is (4k/3q) log(n/p), compared to (4k/3) log(n/p) in the one-level fully resilient scheme). The next subsection presents two-level threshold schemes which balance the two complexity parameters through a tradeoff between the key length and the data redundancy overhead.

⁴ It is possible to prove, as is done in [9], that if a decoder has keys from fewer rows and can decrypt with probability better than the threshold, then it can be used to break the underlying encryption scheme.

4.2 Two-Level Threshold Schemes
Two-level threshold schemes are constructed from one-level threshold schemes by using many one-level schemes and applying a hash function to map users to schemes. We first present a basic construction which displays a tradeoff between the personal key length and the data redundancy overhead, and which can obtain a shorter key length than the one-level threshold scheme. Then we change the parameters of the construction to obtain schemes with an even shorter key length, at the price of increasing the data redundancy a little. These schemes perform better than fully-resilient schemes in both the personal key length and the data redundancy overhead.

The basic construction. The construction uses a random mapping h from the domain {1, . . . , n} to a range of size 2ek/b. It is required that for any fixed set of k traitors the probability that b or more traitors are mapped together by h is less than p/2, i.e.

  C(k, b) · (b/2ek)^{b−1} ≤ (ek/b)^b · (b/2ek)^{b−1} = (ek/b) · (1/2)^{b−1} < p/2.

Setting b = log(4ek/(p log(1/p))) satisfies the inequality. Once such a mapping is chosen, we continue by constructing threshold one-level schemes for each set of preimages h^{−1}(i) for 1 ≤ i ≤ 2ek/b. In the initialization phase each user u receives his personal key for the subscheme h(u), and the secret s is distributed by each of the 2ek/b subschemes.

It is required that each subscheme has the following property against b traitors: either the success probability of the traitors in decrypting the secret is greater by at most q̃ = qb/2ek than the success probability of an adversary who does not have any of the keys, or the traitors can be traced with probability at least 1 − p/2. If in no subscheme the traitors have an advantage greater than q̃, then the pirate decoder cannot decrypt with an advantage better than q. The initialization and secret distribution stages are straightforward. The subschemes are built in the same way as the one-level schemes of the previous subsection.
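As a sanity check, the choice of b and the collision bound above can be evaluated numerically. Base-2 logarithms are assumed here (the text leaves the base implicit):

```python
import math

def subscheme_parameter(k, p):
    """Pick b so that, with 2ek/b subschemes, the probability that b or more
    of the k traitors are mapped to the same subscheme is below p/2, using
    C(k,b) * (b/2ek)^(b-1) <= (ek/b) * (1/2)^(b-1)."""
    e = math.e
    b = math.log2(4 * e * k / (p * math.log2(1 / p)))  # base-2 logs assumed
    bound = (e * k / b) * 2 ** (-(b - 1))              # (ek/b) * (1/2)^(b-1)
    return b, bound

b, bound = subscheme_parameter(k=1000, p=1e-3)
print(round(b, 2))        # b is about 20 for these parameters
assert bound < 1e-3 / 2   # the required inequality holds: bound < p/2
```

For the example parameters of Section 4.3 (k = 1000, p = 10⁻³) this gives b ≈ 20, the value used implicitly in Table 1.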
As before, w is a parameter that defines the minimal fraction of rows: with keys from less than wℓ rows in a certain subscheme, a decoder cannot decrypt with probability better than q̃. If a pirate decoder decrypts with probability greater than q, it must contain keys from a fraction w of the rows in one or more of the subschemes. The tracing process that was defined for the one-level scheme can then trace at least one of the traitors which contributed keys for this subscheme. We therefore obtain the following theorem:
Theorem 3. There is a q-threshold (p, k)-resilient scheme, with the parameter w taking values in [qb/2ek, 1), where b = log(4ek/(p log(1/p))), in which:

– The length of the personal key is m = (4/3w) · b · log(2n/p) basic keys.
– The data redundancy overhead is 8ek · log(2ek/qb)/log(1/w) basic encryptions.
– The receiver should perform log(2ek/qb)/log(1/w) decryptions in order to decrypt the secret.
The key is longer than the key in the fully resilient secret two-level scheme by a factor of only 1/w, and the data redundancy overhead is substantially shorter. Comparing with the one-level threshold scheme for the same value of the parameter w, the personal key changes by a factor of b/k, and the data redundancy overhead changes by a factor of 2e · (1 + log(2ek/b)/log(1/q)). Therefore the key is shorter and the data redundancy overhead is larger. However, the increase in the data redundancy overhead is relatively moderate: if we denote the ratio between the key length in this scheme and in the one-level scheme as 1/α, then the data redundancy overhead increases by a factor of only 2e(1 + log(2eα)/log(1/q)).

Note that the minimum value for w is q̃ = qb/2ek, which is smaller than the minimum value for w in the one-level scheme. Setting w to this value yields the minimum possible data redundancy overhead, 8ek encryptions, whereas the key length is maximal, m = (8ek/3q) log(2n/p). Both are longer than the values for the one-level scheme by a factor of exactly 2e.

The two-level scheme features a tradeoff between the length of the personal key and the data redundancy overhead. At one end of the curve there is a short key but a larger data redundancy overhead, and at the other end the key length is maximal and the data redundancy overhead is minimal, and both are equal up to a constant factor to the performance of the one-level threshold scheme for minimal data redundancy overhead. Note that, as with the two-level fully-resilient secret scheme, the expected number of users that are mapped to each subscheme is smaller than n by a factor of b/2ek. The subschemes can therefore be defined for a smaller set of users to achieve greater efficiency.
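The two endpoints of this comparison can be checked with a small calculation (base-2 logarithms assumed; the parameters n, k, p, q are those of the example in Section 4.3):

```python
import math

log2, e = math.log2, math.e
n, k, p, q = 10**6, 1000, 1e-3, 0.75

# Minimal-redundancy end of the two-level tradeoff (w at its minimum):
red_two_level = 8 * e * k                                # 8ek encryptions
key_two_level = (8 * e * k / (3 * q)) * log2(2 * n / p)  # maximal key length

# One-level scheme at w = q, for comparison:
red_one_level = 4 * k
key_one_level = (4 * k / (3 * q)) * log2(n / p)

print(red_two_level / red_one_level)   # exactly 2e
print(key_two_level / key_one_level)   # roughly 2e (up to log(2n/p)/log(n/p))
```

The redundancy ratio is exactly 2e; the key-length ratio carries an extra log(2n/p)/log(n/p) factor, which is close to 1 for these parameters.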
Shorter personal keys. The following variant of a threshold tracing scheme improves all the complexity parameters of the most efficient fully-resilient scheme (whereas the previous tracing scheme had a dramatic improvement in the data redundancy and decryption overheads, but increased the personal key a little). The decrease in the length of the personal keys is enabled as follows: the same construction as before is used, with 2ek/b1 subschemes, and it is required that the probability that more than b2 users are mapped together is at most p/2 (previously the values b1 and b2 were equal). The personal key is composed of (4/3w) · b2 · log(2n/p) keys, and the data redundancy overhead is of 8ek · (b2/b1) · log(2ek/qb1)/log(1/w) basic encryptions. The values b1, b2 should satisfy the following inequality:

  C(k, b2) · (b1/2ek)^{b2−1} ≤ (ek/b2)^{b2} · (b1/2ek)^{b2−1} = (ek/b2) · (b1/2b2)^{b2−1} < p/2
Assume b2 = b1^α (α > 1). The previous inequality is satisfied if b1 ≥ ((α/(α−1)) · log(k/p)/log log(k/p))^{1/α} = b. The following theorem is therefore obtained:

Theorem 4. For every α > 1 there is a q-threshold (p, k)-resilient scheme, with the parameter w taking values in [qb/2ek, 1), where b = ((α/(α−1)) · log(k/p)/log log(k/p))^{1/α}, in which:

– The length of the personal key is m = (4/3w) · b^α · log(2n/p) basic keys.
– The data redundancy overhead is 8ek · b^{α−1} · log(2ek/qb)/log(1/w) basic encryptions.
– A receiver should perform log(2ek/qb)/log(1/w) decryptions in order to decrypt the secret.

As α increases, the personal key length decreases and the data redundancy overhead increases. The limits of these values as α → ∞ are:

– The limit of the length of the personal key is m = (4/3w) · (log(k/p)/log log(k/p)) · log(2n/p) basic keys.
– The limit of the data redundancy overhead is 8ek · (log(k/p)/log log(k/p)) · log(2ek/q) · (1/log(1/w)) basic encryptions.
– A receiver should perform log(2ek/q)/log(1/w) decryptions in order to decrypt the secret.
This scheme has the shortest personal key among all the schemes we presented. The small penalty for this is a data redundancy overhead which is longer than in the other two-level threshold scheme. However, the data redundancy is still shorter than in the fully resilient schemes.

4.3 An Example
Let us consider the following example in order to demonstrate the performance of the different tracing schemes. Suppose that we would like to create a traitor tracing scheme for up to one million authorized users, so that for at most k = 1000 traitors the probability of false identification is at most 2^{−10}. We describe in Table 1 the length of the personal key of each user and the data redundancy overhead, both measured by the number of basic keys that they contain (i.e. the ratio between their size and the size of a key of the encryption scheme that is used to encrypt the content). The table also shows the number of decryption operations that should be performed by the receiver. We compare the performance of the threshold schemes to the performance of the best fully-resilient scheme – the two-level secret scheme described in Section 3. The table notes the section in which each of the schemes is described. The first result is for the most efficient two-level secret fully resilient scheme. The other results are for threshold schemes which were designed to trace only the source of keys of decoders which can decrypt with probability greater than 3/4. This type of scheme allows for a tradeoff between the length of the personal key and the data redundancy, as is demonstrated in the table.
Property                                    Section  Personal Key  Data Redun.  Decryption Ops
Secret two-level, best fully-resilient         3           496     21,270,000       496
Threshold one-level, min. data redundancy     4.1        53,000          4,000         1
Threshold two-level, min. data red. (w = 1/2) 4.2         1,660        185,000         9
Threshold two-level, min. key (α → ∞)         4.2           380      1,290,000        13
Threshold tradeoff (w = 1/8)                  4.2        10,000         64,500         3

Table 1. Examples of the complexity of different Tracing Traitors schemes, using n = 10^6, k = 1000, p = 10^{−3}, and q = 3/4.
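Assuming base-2 logarithms throughout (the text leaves the base implicit), the formulas of Sections 3–4 approximately reproduce these entries; the value of w for the α → ∞ row is not stated and is taken here to be 1/2, as in the minimal-redundancy two-level row:

```python
import math

log2, e = math.log2, math.e
n, k, p, q = 10**6, 1000, 1e-3, 0.75

# Secret two-level fully-resilient scheme (Section 3), with b' = log(4/p):
bp = log2(4 / p)
key_full = (4 / 3) * bp * log2(2 * n / p)                          # ~496
red_full = (32 / 3) * e * k * bp * log2(2 * n / p) * (
    1 + math.log(e * k / bp) / (bp - 1 - math.log(e * k / bp)))    # ~21,270,000

# One-level threshold scheme at w = q (Section 4.1):
key_1lvl = (4 * k / (3 * q)) * log2(n / p)                         # ~53,000
red_1lvl = 4 * k                                                   # 4,000

# Two-level threshold scheme (Section 4.2), b = log(4ek/(p log(1/p))):
b = log2(4 * e * k / (p * log2(1 / p)))
w = 0.5
key_2lvl = (4 / (3 * w)) * b * log2(2 * n / p)                     # ~1,660
red_2lvl = 8 * e * k * log2(2 * e * k / (q * b)) / log2(1 / w)     # ~185,000

# Shortest-key variant in the limit alpha -> infinity (taking w = 1/2):
L = log2(k / p) / log2(log2(k / p))
key_min = (4 / (3 * w)) * L * log2(2 * n / p)                      # ~380
red_min = 8 * e * k * L * log2(2 * e * k / q) / log2(1 / w)        # ~1,290,000
```

The computed values agree with the table to within a percent or two, which suggests the table's rounding conventions rather than different formulas.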
The secret two-level scheme has a short key length but the data redundancy overhead is large. The threshold schemes feature a tradeoff between the length of the personal key and the data redundancy overhead. It is possible to make one parameter very small by increasing the other parameter, and it is also possible to achieve very reasonable results for both measures, as in the last entry. The scheme of Section 4.2 is superior to the secret two-level scheme in all the complexity parameters. It should also be noted that if we are only concerned with decoders which decrypt with probability close to 1 it is possible to get more efficient schemes by defining a scheme for q ≈ 1.
5 Conclusions
We presented threshold tracing schemes which are considerably more efficient than fully-resilient tracing schemes. In many applications there is only a need to trace decoders which decrypt with probability greater than some threshold, and these applications should use threshold tracing schemes to trace the source of illegal decoders. The efficiency of the threshold schemes as a function of the size of a corrupt coalition of users, k, allows for resiliency against rather large coalitions. We remark that in many different applications and scenarios (other than traitor tracing) there is no need for security against adversaries which perform only negligibly better than "guessing the secret". These applications call for threshold security schemes similar to the schemes presented in this work. Such schemes should depend on a parameter q (the threshold) and only protect against adversaries which achieve success probability greater than q.