Siksha Sarovar

Siksha Sarovar (sikshasarovar.com) is a free educational web application that helps students in India learn programming and prepare for academic and competitive exams. The platform offers structured coding courses (C, C++, Python, Java, HTML, CSS, PHP, Power BI, AI, Machine Learning, Data Science), complete university curriculum notes for BCA/MCA students with previous year question papers, Class 10 and Class 12 CBSE/HBSE school notes, and dedicated preparation material for SSC, UPSC, Banking, Railway and other government exams. Browsing the site is completely free and requires no account. Users may optionally sign in with Google solely to save their learning progress, quiz scores and personal preferences across devices.

Privacy Policy | Terms of Service | Contact Siksha Sarovar | About Siksha Sarovar

v4.0.9 · PWA
Siksha Sarovar logo
Siksha Sarovar
Your Learning Universe

Siksha Sarovar is a free e-learning platform for coding courses, BCA university notes and competitive exam preparation. Optional Google sign-in saves your learning progress across devices.

Initializing knowledge base…
Compiling modules 0%

Unit I Overview: Combining Different Models

Lesson 1 of 22 in the free Machine Learning II notes on Siksha Sarovar, written by Rohit Jangra.

Unit I Overview: Combining Different Models

Ensemble methods combine multiple base learners to produce a more powerful predictor. Condorcet's Jury Theorem (1785) shows that when independent voters are slightly better than chance, a majority vote approaches certainty. Machine learning exploits this through bagging, boosting, and stacking.

Why Ensembles Work

The expected test error decomposes as: Total Error = Bias^2 + Variance + Irreducible Noise

  • Bagging trains models in parallel on bootstrap samples — reduces variance by averaging.
  • Boosting trains models sequentially, correcting previous errors — reduces bias.
  • Stacking learns a meta-model to optimally weight diverse base learners.

Unit I Roadmap

TopicCore TechniquePrimary Benefit
Model EvaluationCross-validation, AUCReliable performance estimates
Statistical Learning TheoryPAC learning, VC dimensionFormal generalization bounds
BoostingAdaBoost, GradientBoostBias reduction
BaggingBootstrap aggregatingVariance reduction
Random ForestsRandom subspace + baggingBias + variance reduction

No Free Lunch Theorem

No single algorithm outperforms all others on every problem distribution. This motivates ensembles: diverse models cover different hypothesis regions for broadly robust performance.

Requirements for Effective Ensembles

  1. Base learner accuracy must exceed 50% (better than random).
  2. Base learners must make diverse, uncorrelated errors — diversity is paramount.
  3. A combination rule (vote, average, or meta-learner) must aggregate predictions.

Exam-Ready Summary

  • Ensemble error = f(individual errors, inter-model correlation)
  • Lower inter-model correlation means greater variance reduction
  • Bagging: parallel, best with unstable high-variance models (deep trees)
  • Boosting: sequential, best with stable weak learners (decision stumps)
  • Diversity is essential — identical models provide zero ensemble benefit