Birthday Sharing Odds
However, finding other people that share the same birthday (not yours) is actually quite easy. To those particular people, discovering that they share that same birthday is quite remarkable. But to you, oh reader of this page, you know better.
In fact, get 367 people together, and you are guaranteed to have two people sharing the same birthday. And if you discount leap year babies, you'd only need 366.
Given that certainty, and the fact that you've bothered to read this far, let's ask about smaller social gatherings. Say, a small party. At that party, let's ask the following question:
For two people (n=2):
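The equation that belongs here did not survive in this copy. A reconstruction consistent with the first row of the table further down, assuming a 365-day year with every birthday equally likely (the second guest has to land on the first guest's date, one day out of 365):

```latex
P(2) = 1 - \frac{364}{365} = \frac{1}{365} \approx 0.0027
```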
Adding a third person to the group starts to get a little complicated. Assuming that person #1 and person #2 don't share the same birthday (otherwise our search criterion would have already been met), then person #3 might share a birthday with person #1 or person #2. Put another way, the only way that person #3 won't share a birthday with someone at the party is if he was born on one of the 363 days remaining in the year -- 365 days minus the 2 already "taken" by the first two guests. For this there is a 363/365 chance, but don't forget we are also assuming that the first two guests have different birthdays.
To help visualize this, imagine a huge calendar laid out on the floor. As people arrive, they go and stand on the spot on the calendar corresponding to their birthday. As they do, the calendar fills up. As more people show up, it becomes more and more likely that their birthday "spot" will already be occupied by an earlier guest. Once that happens, our search criterion has been met -- two guests share the same birthday.
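The calendar-on-the-floor picture is easy to act out in code. The sketch below is only an illustration: it assumes 365 equally likely birthdays (no leap years), and the function names are invented for the example. Each simulated guest picks a random spot, and the party stops as soon as someone lands on an occupied one.

```python
import random

DAYS_IN_YEAR = 365  # assumption: leap years ignored, all birthdays equally likely


def guests_until_first_match(rng: random.Random) -> int:
    """Return how many guests have arrived when a calendar spot is first reused."""
    occupied = set()
    while True:
        birthday = rng.randrange(DAYS_IN_YEAR)
        if birthday in occupied:
            # The newcomer completes the first shared pair.
            return len(occupied) + 1
        occupied.add(birthday)


def average_guests_until_match(trials: int = 20_000, seed: int = 1) -> float:
    """Average party size at the moment of the first match, over many parties."""
    rng = random.Random(seed)
    return sum(guests_until_first_match(rng) for _ in range(trials)) / trials


if __name__ == "__main__":
    print(f"average party size at first match: {average_guests_until_match():.1f}")
```

Running many simulated parties gives a feel for how few guests it usually takes before two of them are standing on the same spot.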
In words:
Probability that at least two of the three people at the party have the same birthday =
(100% of the time) - [ Probability (that Person #1 and Person #2 don't share a birthday) and (that Person #3 doesn't share a birthday with either Person #1 or Person #2) ]
In numbers:
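The numbers themselves are missing from this copy. Following the wording above, and assuming a 365-day year with equally likely birthdays, they would work out to:

```latex
P(3) = 1 - \left(\frac{364}{365}\right)\left(\frac{363}{365}\right)
     = 1 - \frac{132132}{133225}
     \approx 0.0082
```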
In practice, the probability rises so fast that we have roughly a 99.9% chance by the time we hit 69 people (and pass 99.9% at 70). So, for all intents and purposes, the chance of finding at least one match in 365 people is already 100%.
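The running product behind this claim can be sketched in a few lines of Python. This is an illustration under the usual assumptions (365 equally likely birthdays, no leap years), and the function names are made up for the example.

```python
def shared_birthday_probability(n: int, days: int = 365) -> float:
    """Chance that at least two of n people share a birthday."""
    if n > days:
        return 1.0  # more guests than days forces a match
    no_match = 1.0
    for already_present in range(n):
        # The next arrival must avoid every date already taken.
        no_match *= (days - already_present) / days
    return 1.0 - no_match


def smallest_group_for(target: float, days: int = 365) -> int:
    """Smallest group size whose match probability reaches the target."""
    n = 1
    while shared_birthday_probability(n, days) < target:
        n += 1
    return n


if __name__ == "__main__":
    for n in (5, 23, 69):
        print(n, round(shared_birthday_probability(n), 4))
    print("even money at", smallest_group_for(0.5), "people")
```

Multiplying one guest at a time keeps every intermediate value between 0 and 1, so nothing overflows along the way.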
Representing this generally for any number of people (n) is not hard, but unfortunately, the result isn't something your handheld calculator is going to be able to stomach. So, I'll be giving you a partial table of probabilities a little later to help with betting. To derive the general expression, let's start by taking the five person example above (n=5):
which we can rearrange to:
You'll notice that all those 365's in the denominator are easily handled by an exponential:
The top part is more tricky. With those descending numbers all being multiplied together, it looks like the factorial 364! (a VERY large number), except the factors don't go all the way to 1. But, if we're willing to do some big-time cancellations (unfortunately, your handheld calculator probably isn't), we can use two factorials to accomplish our goal:
The 360! in the bottom cancels with 360! in the top, leaving us with the desired (364 · 363 · 362 · 361) in the top.
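The equations for these steps are missing from this copy. What follows is a reconstruction from the description above, not the original figures. It assumes a 365-day year with equally likely birthdays, and the symbol P(n) is introduced here for convenience. The last line extends the five-person pattern to any n.

```latex
\begin{align*}
P(5) &= 1 - \frac{364}{365}\cdot\frac{363}{365}\cdot\frac{362}{365}\cdot\frac{361}{365} \\
     &= 1 - \frac{364 \cdot 363 \cdot 362 \cdot 361}{365^{4}} \\
     &= 1 - \frac{364!}{360!\,\cdot 365^{4}} \\[1ex]
P(n) &= 1 - \frac{364!}{(365-n)!\,\cdot 365^{\,n-1}}
\end{align*}
```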
Now, here's that table:
| # of people (n) | Probability | Fair Odds | Recommended odds |
|---|---|---|---|
| 2 | 0.0027 | (1:364) | (1:400) |
| 3 | 0.0082 | (1:121) | (1:150) |
| 4 | 0.0164 | (1:60) | (1:70) |
| 5 | 0.0271 | (1:36) | (1:40) |
| 10 | 0.1169 | (1:8) | (1:10) |
| 15 | 0.2529 | (1:3) | (1:4) |
| 20 | 0.4114 | (2:3) | (1:2) |
| 25 | 0.5687 | 3:2 | even |
| 30 | 0.7063 | 3:1 | 2:1 |
| 35 | 0.8144 | 4:1 | 3:1 |
| 40 | 0.8912 | 8:1 | 5:1 |
| 45 | 0.9410 | 16:1 | 15:1 |
| 50 | 0.9704 | 33:1 | 30:1 |
| 55 | 0.9863 | 72:1 | 50:1 |
| 60 | 0.9941 | 169:1 | 150:1 |
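Where a handheld calculator gives up, a language with arbitrary-precision integers does not. As a rough check of the Probability column, the factorial form can be evaluated exactly in Python. This is a sketch assuming 365 equally likely birthdays; `match_probability` is a name chosen for this example, and the printed odds are the plain ratio p : (1 - p), not the rounded figures in the table.

```python
from fractions import Fraction
from math import factorial


def match_probability(n: int, days: int = 365) -> Fraction:
    """Exact chance of a shared birthday among n people, via factorials."""
    if n > days:
        return Fraction(1)
    # days! / (days - n)! counts the ways n people can all have different birthdays.
    all_different = Fraction(factorial(days), factorial(days - n) * days**n)
    return 1 - all_different


if __name__ == "__main__":
    for n in (2, 3, 4, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60):
        p = match_probability(n)
        odds_for = p / (1 - p)  # fair odds in favour of a match, as a ratio to 1
        print(f"{n:>3}  {float(p):.4f}  {float(odds_for):8.3f} : 1")
```

Using `Fraction` keeps the huge factorials exact until the very last step, where the result is converted to a float for display.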
As long as we're talking about betting, here are some things to remember:
Good question. Indeed, the fact that a group of 23 people has a 50% chance that at least two people share a birthday does strike many people as unusual. After all, the chance of finding a match for your own birthday in a group of 23 people (excluding yourself) is only 6%. You would need to gather a group over 10 times that size (253 people) to have a 50% chance of finding someone with your own birthday.
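The "your own birthday" figures are simple enough to check directly. A minimal sketch, assuming each other person independently has a 1-in-365 chance of sharing your date (the function names are illustrative):

```python
def own_birthday_match_probability(others: int, days: int = 365) -> float:
    """Chance that at least one of `others` people shares your birthday."""
    return 1.0 - ((days - 1) / days) ** others


def others_needed(target: float, days: int = 365) -> int:
    """Smallest number of other people giving at least the target chance."""
    count = 0
    while own_birthday_match_probability(count, days) < target:
        count += 1
    return count


if __name__ == "__main__":
    print(round(own_birthday_match_probability(23), 3))
    print(others_needed(0.5))
```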
So, why the big difference? The general explanation is that, in both cases, your chances of finding a match increase with increasing numbers of people (n). That just makes good sense. However, in the case of searching for people that match your own birthday, the probability that you will find a match with any given person is fixed -- 1/365, or about 0.00274. The situation is different when you no longer care about any particular birthday, but rather just matches in general. In that case, the chance of finding a match with any given person is not fixed, but rather depends on the number of people in the group (n). This makes sense, because as n increases, so does the number of ways we can find a match for any newcomer to the party.
To help understand this, consider the case of a group of 118 people. Instead of asking people their birthday, we'll just roll a die for each person, and we'll consider them a match if we roll a '1'. In both cases (i.e., searching for your birthday, or searching for any two people with the same birthday) we get to roll the die once for each person -- 118 times in each case. However, the type of die in each case is different. In the case of searching for your own birthday, the die has 365 sides (only one represents your birthday). In contrast, when searching for any two people that match you get to roll a 6-sided die (a regular cube-shaped die). As you can imagine, it's much easier to roll a '1' using 118 throws of a 6-sided die than it is to roll a '1' using the same number of throws of a 365-sided die.
OK, sure, rolling a certain number is easier when you use dice with 6 sides than dice with 365 sides. But where are you getting this 6-sided die for 118 people? What's that all about?
Admittedly, the 6-sided die is a little contrived, but in representing the probabilities involved, it is accurate nonetheless. Let me explain.
Like I said before, in the case of finding people that share your birthday, we can simply say y = 1/365 or 0.00274 no matter how big n gets. But in the case of looking for any two people that share the same birthday, y gets bigger as n gets bigger. Exactly how y and n are related in this case is what we just solved for. Unfortunately, as we've encountered before, 365! isn't something your handheld calculator, or even Microsoft Excel, is willing to compute for you, so I've made a graph. I'll also mention a few highlights.
As you can see from the graph, y starts out the same in both cases [this makes sense -- consider a group of 2 people (including yourself)], but as you can see, y gets big quite quickly when you're not looking for matches to your own birthday. In fact, by the time the group has grown to 20 people, y is 10 times larger for finding any two matches than for finding a match to your own birthday.
I mentioned before that 23 people is around the 50/50 spot for finding a pair of people with matching birthdays. At 23 people, y = 0.032 (~1/30). So, continuing with our dice-rolling analogy, you would have 23 rolls to roll a '1' on a 30-sided die. The fact that your chances of doing this are about 50/50 may be intuitive. To be sure, this is a lot easier than rolling a '1' on a 365-sided die in as many tries!
I also mentioned that at 69 people, there is about a 99.9% chance that at least two people will share a birthday. At 69 people, y = 0.096 (~1/10). So, that's like having 69 chances to roll a '1' on a 10-sided die. Too true, one would be very surprised to not roll a '1' during such a trial.
Around 300 people, y gets close to 0.5, which is like flipping a coin for every person, and you need only get a single 'heads' to garner a match. In 300 flips of a coin, there's essentially no way that you won't see it come up 'heads' at least once.
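The formula behind y is not shown in this copy. One reading that seems to come close to the quoted figures is to treat y as the per-roll chance for which n - 1 independent rolls (one per guest after the first) give the same overall chance of a match as the birthday calculation. The sketch below works under that assumption, with 365 equally likely birthdays; the function names are illustrative, not the author's.

```python
import math


def log_no_match(n: int, days: int = 365) -> float:
    """Natural log of the chance that n people all have different birthdays."""
    # Summing logs sidesteps the enormous factorials entirely.
    return sum(math.log1p(-k / days) for k in range(1, n))


def equivalent_roll_chance(n: int, days: int = 365) -> float:
    """Per-roll chance y such that n - 1 rolls all miss as often as n birthdays differ.

    Assumption: (1 - y) ** (n - 1) equals the no-match probability for n people.
    """
    if n < 2:
        raise ValueError("need at least two people")
    if n > days:
        return 1.0  # a match is certain, so every "roll" is a hit
    return -math.expm1(log_no_match(n, days) / (n - 1))


if __name__ == "__main__":
    for n in (2, 20, 23, 69, 118, 300, 365):
        y = equivalent_roll_chance(n)
        print(f"n={n:>3}  y={y:.4f}  roughly a {1 / y:.0f}-sided die")
```

Working in logarithms means no factorial is ever formed, so this runs comfortably even at n = 365.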
Right at 365 people, y is not close to 1, which you might have otherwise expected. Instead, the last entry for y is 0.630. The fact that this is equivalent to 1 - (1/e) is probably no accident, but I'm at a loss to explain why, especially since it doesn't appear to approach this value asymptotically. If you can explain why this happens, I'd love to hear from you.
Happy Birthday.