Siksha Sarovar

Siksha Sarovar (sikshasarovar.com) is a free educational web application that helps students in India learn programming and prepare for academic and competitive exams. The platform offers structured coding courses (C, C++, Python, Java, HTML, CSS, PHP, Power BI, AI, Machine Learning, Data Science), complete university curriculum notes for BCA/MCA students with previous year question papers, Class 10 and Class 12 CBSE/HBSE school notes, and dedicated preparation material for SSC, UPSC, Banking, Railway and other government exams. Browsing the site is completely free and requires no account. Users may optionally sign in with Google solely to save their learning progress, quiz scores and personal preferences across devices.

Privacy Policy | Terms of Service | Contact Siksha Sarovar | About Siksha Sarovar

v4.0.9 · PWA
Siksha Sarovar logo
Siksha Sarovar
Your Learning Universe

Siksha Sarovar is a free e-learning platform for coding courses, BCA university notes and competitive exam preparation. Optional Google sign-in saves your learning progress across devices.

Initializing knowledge base…
Compiling modules 0%

Practical 10: Correlation, Regression & Scatter Plot Analysis

Lesson 10 of 14 in the free Statistical Analysis using Excel Lab notes on Siksha Sarovar, written by Rohit Jangra.

Program Statement

Perform correlation and regression analysis on multiple datasets using Excel statistical functions.

---

Dataset 1: City Population vs Annual Income

CityPopulation (Lakhs)Avg Annual Income (₹)
Delhi190580000
Mumbai200620000
Bangalore130540000
Chennai95490000
Kolkata145460000
Hyderabad100510000
Pune65480000
Ahmedabad80440000

(Population in B2:B9, Income in C2:C9)

=== a. Average Population ===
=AVERAGE(B2:B9)

=== Average Annual Income ===
=AVERAGE(C2:C9)

=== b. Standard Deviation of Population ===
=STDEV(B2:B9)

=== Standard Deviation of Annual Income ===
=STDEV(C2:C9)

=== c. Correlation Coefficient ===
=CORREL(B2:B9, C2:C9)

=== d. Slope (m) of Regression Line ===
=SLOPE(C2:C9, B2:B9)

=== Intercept (c) of Regression Line ===
=INTERCEPT(C2:C9, B2:B9)

=== Regression Equation ===
Y = m*X + c  →  Income = SLOPE * Population + INTERCEPT

Interpretation: If CORREL > 0.7, there is a strong positive relationship between population and income.

Scatter Plot: Select Population & Income → Insert → Scatter → Add Trendline (Linear)

---

Dataset 2: Student Study Hours vs Marks

StudentStudy Hours/DayMarks Obtained
Aman6.588
Priya4.072
Rohit7.594
Neha3.060
Vikas8.097
Riya5.078
Karan6.085
Anjali2.555
Sanjay5.581
Pooja7.090

(Study Hours in E2:E11, Marks in F2:F11)

=== a. Average Study Hours ===
=AVERAGE(E2:E11)

=== b. Average Marks ===
=AVERAGE(F2:F11)

=== c. Std Dev of Study Hours ===
=STDEV(E2:E11)

=== d. Std Dev of Marks ===
=STDEV(F2:F11)

=== e. Correlation Coefficient ===
=CORREL(E2:E11, F2:F11)

=== f. Slope (m) ===
=SLOPE(F2:F11, E2:E11)

=== Intercept (c) ===
=INTERCEPT(F2:F11, E2:E11)

=== Regression Equation ===
Marks = SLOPE * StudyHours + INTERCEPT

=== Predict marks for 4.2 hours ===
=SLOPE(F2:F11,E2:E11)*4.2 + INTERCEPT(F2:F11,E2:E11)

Interpretation of Correlation:

  • r close to +1 → Strong positive correlation (more study → higher marks)
  • r close to 0 → No relationship
  • Limitation: Study hours alone cannot predict marks; factors like aptitude, health, and teaching quality also matter.

---

Dataset 3: Schools vs Candidates Distribution

Class IntervalNo. of SchoolsNo. of Candidates
1–1015200
11–2028450
21–3035600
31–4020380
41–5012250

(Schools in H2:H6, Candidates in I2:I6)

=== Average number of students in distribution ===
=AVERAGE(I2:I6)

=== Standard Deviation ===
=STDEV(I2:I6)

=== Correlation between Schools and Candidates ===
=CORREL(H2:H6, I2:I6)