Four years ago, I wrote a (surprisingly popular) blog post about the notion of wear-out for symmetric encryption schemes. Two years ago, I wrote a thing about extending the nonce used by AES-GCM without introducing foot-guns. This was very recently r…
As I was reading Filippo's newsletter, it occurred to me that a dedicated treatment to how cryptographers discuss the Birthday Bound for symmetric encryption would be beneficial.
Birthday Bound?
The Birthday Bound is rooted in the so-called Birthday Paradox in probability theory. It goes something like this:
How many people (chosen from a uniform random distribution) would you need to have in the same room in order for there to be a 50% or greater chance that two of them have the same birthday?
Even though there are 366 possible calendar days in the year (365 if you ignore leap years), the answer is only 23.
This observation can tell us something interesting about the collision risk in discrete uniformly random samples.
For example, the random number (called an IV in this case) used to encrypt a message with AES-CBC, which is a 128-bit random number. This means that there are possible values. We can simply describe this situation for any distribution; in this case, .
For any random distribution (i.e., random nonces or tweaks for an AEAD scheme) of possible values, you expect to have a 50% chance of collision after about queries. This is a loose, order-of-magnitude estimate.
From our earlier AES-CBC example, this means blocks of data, we'd expect a 50% chance of the two to collide.
My symmetric wear-out blog post went in-depth about the particulars for specific designs, if you'd like to know how the numbers play out.
Is 50/50 Good Enough?
A security policy that cuts off at a 50% chance of a nonce reuse is not fit for the real world. We would call that a YOLO security policy.
AES-GCM, which accepts a 96-bit nonce, is considered unsafe to use for more than random nonces. At this point, the probability of a collision for each subsequent message is greater than or equal to .
Consequently, many people consider the point that the risk exceeds the Birthday Bound for a block cipher mode; after which, a new encryption key must be used.
I certainly don't fault any security policy that keeps the risk of a bad outcome below the mark, but I think there's another way to interpret what NIST did with AES-GCM. Namely, the fact that GCM's risk of is exceeded after samples (for some fixed value ) offers a nice symmetry to the risk analysis.
Three Birthday Bounds
(I was originally trying to find a wordplay that references the children's story Six-Dinner Sid for this section, but ran out of steam and pressed publish before finding an appropriate parody.)
Soatok Editorial Note
This gives rise to three different possible values for a given random distribution that can be considered whenever someone says the phrase "birthday bound".
50% collision risk after samples, which I like to think of as the Hard Birthday Bound.
collision risk after samples, which I like to call the Soft Birthday Bound.
collision risk after samples, which I like to call the Optimal Birthday Bound.
(These labels are not official; a better naming convention may be worth considering, should this framework for discussion prove useful at all.)
Since we're still dealing with approximations, a useful technique for calculating is to just take the cube-root of (a.k.a., divide the exponent by 3).
In the case of AES-GCM, the Soft and Optimal values are equivalent.
In the case of Filippo's XAES-256-GCM design? They differ quite a bit.
Soft: messages for collision probability.
Optimal: messages for collision probability.
Whereas the Soft and Optimal limits for a 96-bit nonce are the same, with a 192-bit nonce, they differ a lot: Soft is about 65,536 times the size as Optimal.
Does Any Of This Matter?
If your accepted risk is hard-coded at , then you don't need to pass Go or collect $200, because your security policy saves you the headache of having to care about math nerd stuff.
However, I do think that the Optimal Birthday Bound is more useful for risk analysis for one simple reason: This is the point at which the probability of a collision is inversely proportional to the number of samples.
Being able to say, "After we've encrypted messages under the same key, the probability of a nonce collision with our next message is approximately 1 in ," is much more deeply satisfying than arbitrarily focusing on as an arbitrary safety limit.
Of course, to most people, and are both ridiculously small numbers that they "might as well be zero".
Whenever designing extended-nonce constructions atop AEAD block cipher modes, it may be worthwhile to discuss both the Soft and Optimal limits for nonce collisions.
So long as the people designing extended-nonce constructions at least consider doing this, my work here is done.
Generally, you don't want to consider a probability space smaller than (which is the inflection point where Optimal becomes larger than Soft).
Probability Space
Hard Bound
Soft Bound
Optimal Bound
The collision probability for any Optimal Bound is 1 divided by the number of messages.
To use this table, select a probability space from the left column. Then, on the same row, find the cell corresponding to the type of bound you are interested in.
No comments:
Post a Comment