## Tuesday, October 15, 2013

### Handshakes and Pigeons

Q: A party has a hundred guests. The guests mingle freely and randomly shake hands with one another (no repeat handshakes). What is the probability that any two persons will have the same number of handshakes?
Discrete Mathematics with Applications

A: In order to solve this, a powerful yet simple tool available is the "Pigeonhole Principle". At its heart its a rather simple statement. If you place $$n$$ pigeons in $$m$$ holes where $$n \ge m$$ at least one hole must have more than one pigeon. This is simple to grasp if you look at the picture below which shows the case with 3 pigeons in 2 boxes.
A guest at the party can shake hands 0 to $$n - 1$$ times. Each of these counts can be thought of as states a guest can get into and the resulting distribution is quite arbitrary. For example, there could be guests who shake hands with none, with 3 other guests and so on. Notice how this fits nicely with the pigeon hole principle. In such a setting, a guest can fit into any of the $$n - 1$$ holes or states. Also note, we could get misled to think that there are a total of $$n$$ states given that $$[1..n-1]$$ implies $$n-1$$ states and the zero state is one more, leading to a total $$n$$ states. However, if a guest shakes hands with none, no other guest can go beyond $$n-2$$ handshakes. So the total can never be beyond $$n-1$$ handshakes. This leads us to the conclusion that at least two guests will have the same number of handshakes. Thus the sought probability is 1.

If you are looking to buy some books in probability here are some of the best books to learn the art of Probability

Fifty Challenging Problems in Probability with Solutions (Dover Books on Mathematics)
This book is a great compilation that covers quite a bit of puzzles. What I like about these puzzles are that they are all tractable and don't require too much advanced mathematics to solve.

Introduction to Algorithms
This is a book on algorithms, some of them are probabilistic. But the book is a must have for students, job candidates even full time engineers & data scientists

Introduction to Probability Theory
Overall an excellent book to learn probability, well recommended for undergrads and graduate students

An Introduction to Probability Theory and Its Applications, Vol. 1, 3rd Edition
This is a two volume book and the first volume is what will likely interest a beginner because it covers discrete probability. The book tends to treat probability as a theory on its own

The Probability Tutoring Book: An Intuitive Course for Engineers and Scientists (and Everyone Else!)
A good book for graduate level classes: has some practice problems in them which is a good thing. But that doesn't make this book any less of buy for the beginner.

Introduction to Probability, 2nd Edition
A good book to own. Does not require prior knowledge of other areas, but the book is a bit low on worked out examples.

Bundle of Algorithms in Java, Third Edition, Parts 1-5: Fundamentals, Data Structures, Sorting, Searching, and Graph Algorithms (3rd Edition) (Pts. 1-5)
An excellent resource (students, engineers and even entrepreneurs) if you are looking for some code that you can take and implement directly on the job

Understanding Probability: Chance Rules in Everyday Life
This is a great book to own. The second half of the book may require some knowledge of calculus. It appears to be the right mix for someone who wants to learn but doesn't want to be scared with the "lemmas"

Data Mining: Practical Machine Learning Tools and Techniques, Third Edition (The Morgan Kaufmann Series in Data Management Systems)
This one is a must have if you want to learn machine learning. The book is beautifully written and ideal for the engineer/student who doesn't want to get too much into the details of a machine learned approach but wants a working knowledge of it. There are some great examples and test data in the text book too.

This is a good book if you are new to statistics & probability while simultaneously getting started with a programming language. The book supports R and is written in a casual humorous way making it an easy read. Great for beginners. Some of the data on the companion website could be missing.

Covered in this book are the central limit theorem and other graduate topics in probability. You will need to brush up on some mathematics before you dive in but most of that can be done online

This book has been yellow-flagged with some issues: including sequencing of content that could be an issue. But otherwise its good