What is the Binomial Distribution?
The binomial distribution is a discrete probability distribution that models the number of successes in a fixed number of independent trials, where each trial has two possible outcomes (success or failure) and the probability of success remains constant. It's one of the most important distributions in statistics and probability theory.
Named after the binomial coefficient in its formula, this distribution appears in countless real-world scenarios: coin flips, quality control testing, medical trials, survey responses, and any situation with repeated yes/no outcomes.
Binomial Distribution Parameters
Key Parameters
- n (trials): Number of independent trials or experiments
- p (probability): Probability of success on each trial (0 ≤ p ≤ 1)
- x (successes): Number of successful outcomes we're interested in
- q = 1 - p: Probability of failure on each trial
The Binomial Probability Formula
Probability Mass Function (PMF)
P(X = x) = C(n,x) × p^x × (1-p)^(n-x)
Where C(n,x) is the binomial coefficient:
C(n,x) = n! / (x! × (n-x)!)
Understanding the Formula
- C(n,x): Number of ways to choose x successes from n trials
- p^x: Probability of x successes
- (1-p)^(n-x): Probability of (n-x) failures
Distribution Properties
Mean (Expected Value)
μ = n × p
The expected number of successes in n trials.
Example: Flip a coin 100 times (n=100, p=0.5), expected heads = 50
Variance and Standard Deviation
Variance: σ² = n × p × (1-p)
Standard Deviation: σ = √(n × p × (1-p))
Example: For 100 coin flips, σ = √(100 × 0.5 × 0.5) = 5
Cumulative Probabilities
Often we need cumulative probabilities rather than exact values:
| Notation | Meaning | Calculation |
|---|---|---|
| P(X = x) | Exactly x successes | PMF formula |
| P(X ≤ x) | At most x successes | Sum P(X=0) to P(X=x) |
| P(X ≥ x) | At least x successes | 1 - P(X ≤ x-1) |
| P(X < x) | Fewer than x | P(X ≤ x-1) |
| P(X > x) | More than x | 1 - P(X ≤ x) |
Conditions for Binomial Distribution
The binomial distribution applies when all four conditions (BINS) are met:
Binary Outcomes
Each trial has exactly two possible outcomes: success or failure. Examples: pass/fail, yes/no, heads/tails, defective/good.
Independent Trials
The outcome of one trial doesn't affect others. Sampling with replacement or from a very large population ensures independence.
Number of Trials is Fixed
You must know the number of trials (n) in advance. The experiment runs for exactly n trials.
Same Probability
The probability of success (p) remains constant for every trial. Conditions shouldn't change during the experiment.
Practical Examples
Example 1: Quality Control
A machine produces items with 3% defect rate. In a sample of 20 items:
n = 20, p = 0.03
Q: What's the probability of exactly 1 defective item?
P(X = 1) = C(20,1) × 0.03¹ × 0.97¹⁹ = 0.3364 (33.64%)
Example 2: Medical Trial
A treatment has 70% success rate. For 15 patients:
n = 15, p = 0.70
Q: What's the probability at least 12 patients are cured?
P(X ≥ 12) = P(12) + P(13) + P(14) + P(15) = 0.2969 (29.69%)
Example 3: Sports Predictions
A basketball player has 80% free throw accuracy. For 10 attempts:
n = 10, p = 0.80
Q: What's the probability of making at least 8 shots?
P(X ≥ 8) = 0.6778 (67.78%)
Expected makes: μ = 10 × 0.80 = 8
Binomial vs. Other Distributions
Binomial vs. Bernoulli
Bernoulli is a special case of binomial with n = 1 (single trial). Binomial can be thought of as the sum of n independent Bernoulli trials.
Binomial vs. Poisson
When n is large and p is small (rare events), the binomial distribution approximates the Poisson distribution with λ = np. Use Poisson when counting events in continuous time/space.
Normal Approximation
When n is large and both np ≥ 5 and n(1-p) ≥ 5, the binomial distribution is approximately normal with μ = np and σ = √(np(1-p)). This simplifies calculations for large n.
Applications of Binomial Distribution
Quality Assurance
- Acceptance sampling in manufacturing
- Defect rate analysis
- Reliability testing
Medicine and Biology
- Drug efficacy trials
- Genetic inheritance patterns
- Diagnostic test accuracy
Business and Marketing
- Survey response rates
- Conversion rate analysis
- Risk assessment
Education and Psychology
- Test score analysis
- Guessing on multiple choice exams
- Behavioral studies
Frequently Asked Questions
What's the maximum value of n I can use?
This calculator handles n up to 170 due to factorial limitations. For larger n, use the normal approximation or software with arbitrary precision arithmetic.
When should I NOT use binomial distribution?
Don't use it when: trials aren't independent, probability changes between trials, outcomes aren't binary, or sample size is large relative to population (use hypergeometric instead).
How do I know if results are statistically significant?
Compare your observed probability to the significance level (typically α = 0.05). If P(X ≥ observed) < α for an unusually high count, the result may be significant.
What if I get 0.000000 as my answer?
Very small probabilities may display as zero due to rounding. The actual probability exists but is extremely small. This often happens with extreme values of x relative to n and p.