Siksha Sarovar

Siksha Sarovar (sikshasarovar.com) is a free educational web application that helps students in India learn programming and prepare for academic and competitive exams. The platform offers structured coding courses (C, C++, Python, Java, HTML, CSS, PHP, Power BI, AI, Machine Learning, Data Science), complete university curriculum notes for BCA/MCA students with previous year question papers, Class 10 and Class 12 CBSE/HBSE school notes, and dedicated preparation material for SSC, UPSC, Banking, Railway and other government exams. Browsing the site is completely free and requires no account. Users may optionally sign in with Google solely to save their learning progress, quiz scores and personal preferences across devices.

Privacy Policy | Terms of Service | Contact Siksha Sarovar | About Siksha Sarovar

v4.0.9 · PWA
Siksha Sarovar logo
Siksha Sarovar
Your Learning Universe

Siksha Sarovar is a free e-learning platform for coding courses, BCA university notes and competitive exam preparation. Optional Google sign-in saves your learning progress across devices.

Initializing knowledge base…
Compiling modules 0%

1.4 Analytical Models

Lesson 4 of 32 in the free Data Visualisation and Analytics notes on Siksha Sarovar, written by Rohit Jangra.

Analytical Models: Requirements & Types

1. What is an Analytical Model?

An analytical model is a mathematical or statistical representation of a real-world process.

Study Deep: The Bias-Variance Tradeoff

This is the central challenge in model building:

  1. Bias (Underfitting): The model is too simple. It ignores the signal in the data (e.g., using a straight line to fit a curve).
  2. Variance (Overfitting): The model is too complex. It "memorizes" the noise in the training data rather than the actual pattern.
  • Goal: Find the "Sweet Spot" where total error is minimized.

2. Requirements for a Good Model

For a model to be useful in a professional environment, it must meet several criteria:

  1. Accuracy: How often the model is correct (crucial but not the only metric).
  2. Interpretability: Can a human understand why the model made a certain decision? (Crucial for healthcare/finance).
  3. Scalability: Can the model handle increasing amounts of data efficiently?
  4. Robustness: Does the model perform well on new, unseen data (avoiding overfitting)?
  5. Efficiency: How much computational power/time does it take to train and run?

3. Categories of Analytical Models

CategoryGoalExample
Predictive ModelsForecast future valuesLinear Regression (Sales), Time-Series (Stock)
Descriptive ModelsSummarize historical dataAverages, Cluster analysis (Customer types)
Classification ModelsAssign labels to dataLogistic Regression (Spam/Not Spam), Decision Trees
Optimization ModelsFind the best solutionLinear Programming (Supply chain routes)

4. Bias vs. Variance: The Tradeoff

ProblemBiasVarianceDescriptionSolution
UnderfittingHighLowModel is too simple; misses patternsUse more features, complex model
Good FitLowLowIdeal state
OverfittingLowHighExcellent on training, poor on test dataRegularization, more data, simpler model

5. Model Lifecycle

  1. Define: Clearly state the problem and success metrics.
  2. Train: Teach the model using historical data (typically 70-80% of data).
  3. Validate: Evaluate performance on a validation set (10-15%).
  4. Test: Final evaluation on a held-out test set (10-15%).
  5. Deploy: Put the model into production (APIs, dashboards).
  6. Monitor: Continuously check for accuracy drift, data drift, and concept drift over time.

6. Evaluation Metrics Quick Reference

TaskMetricFormula/Description
ClassificationAccuracy(TP + TN) / Total
ClassificationPrecisionTP / (TP + FP) — "Of predicted positives, how many are correct?"
ClassificationRecallTP / (TP + FN) — "Of actual positives, how many were found?"
RegressionMAEMean Absolute Error — average of absolute differences
RegressionRMSERoot Mean Squared Error — penalizes large errors more
RegressionProportion of variance explained (0 to 1; higher is better)