Siksha Sarovar

Siksha Sarovar (sikshasarovar.com) is a free educational web application that helps students in India learn programming and prepare for academic and competitive exams. The platform offers structured coding courses (C, C++, Python, Java, HTML, CSS, PHP, Power BI, AI, Machine Learning, Data Science), complete university curriculum notes for BCA/MCA students with previous year question papers, Class 10 and Class 12 CBSE/HBSE school notes, and dedicated preparation material for SSC, UPSC, Banking, Railway and other government exams. Browsing the site is completely free and requires no account. Users may optionally sign in with Google solely to save their learning progress, quiz scores and personal preferences across devices.

Privacy Policy | Terms of Service | Contact Siksha Sarovar | About Siksha Sarovar

v4.0.9 · PWA
Siksha Sarovar logo
Siksha Sarovar
Your Learning Universe

Siksha Sarovar is a free e-learning platform for coding courses, BCA university notes and competitive exam preparation. Optional Google sign-in saves your learning progress across devices.

Initializing knowledge base…
Compiling modules 0%

2.2 Probability Distributions in Depth

Lesson 10 of 32 in the free Data Visualisation and Analytics notes on Siksha Sarovar, written by Rohit Jangra.

Probability Distributions: The Theory of Data Patterns

1. Mathematical Foundation

A probability distribution is a mathematical function that describes the likelihood of obtaining various possible outcomes in an experiment.

Study Deep: The Moments of a Distribution

In statistics, "Moments" are quantitative measures that describe the shape of a distribution:

  1. 1st Moment (Mean): Indicates the "center" or expected value.
  2. 2nd Moment (Variance): Indicates the "spread" or width.
  3. 3rd Moment (Skewness): Indicates the "asymmetry" (Left-tailed vs. Right-tailed).
  4. 4th Moment (Kurtosis): Indicates the "peakedness" or "tailedness" (How many outliers are present).

1. Mathematical Foundation

  • Discrete Variables use a Probability Mass Function (PMF), which gives the exact probability of a specific value: P(X = x).
  • Continuous Variables use a Probability Density Function (PDF), where the probability of an exact value is 0. Instead, we measure the probability over a range (area under the curve): P(a ≤ X ≤ b) = ∫ f(x)dx.

2. Key Discrete Distributions

A. Binomial Distribution: Used for binary outcomes (Success/Failure) over n independent trials.

  • Formula: P(X = k) = C(n, k) p^k (1 - p)^(n - k)
  • Where: C(n, k) is the combinations formula, p is probability of success.
  • BCA Use Case: Predicting the number of system crashes in 100 server boots, assuming an independent 2% crash probability per boot.

B. Poisson Distribution: Used for counting the number of events in a fixed interval of time or space.

  • Formula: P(X = k) = (λ^k * e^-λ) / k!
  • Where: λ (lambda) is the average rate of occurrence, e is Euler's number (~2.718).
  • BCA Use Case: Modeling the number of network packets arriving at a router per millisecond.

3. The Normal (Gaussian) Distribution

The most important continuous distribution in statistics, forming the basis for parametric tests.

  • PDF Formula: f(x) = (1 / (σ √(2π))) e^(-(x - μ)² / (2σ²))
  • Characteristics:
  • Perfectly symmetrical (Mean = Median = Mode).
  • Empirical Rule: 68.2% of data within ±1σ, 95.4% within ±2σ, 99.7% within ±3σ.

4. Standardization and Z-Scores

We can convert any Normal distribution into a Standard Normal Distribution (where μ=0, σ=1).

  • Z-Score Formula: Z = (X - μ) / σ
  • Why it matters: It allows us to compare scores from completely different datasets (e.g., comparing a student's GPA to their SAT score) by putting them on a common scale.

5. Moments of a Distribution

  • 1st Moment (Mean): Expected value, the center of mass.
  • 2nd Moment (Variance): Spread of the data around the mean.
  • 3rd Moment (Skewness): Asymmetry. Positive (right tail) or Negative (left tail).
  • 4th Moment (Kurtosis): Peakedness. Leptokurtic (heavy tails/outliers, common in financial data) vs Platykurtic (light tails).