Siksha Sarovar

Siksha Sarovar (sikshasarovar.com) is a free educational web application that helps students in India learn programming and prepare for academic and competitive exams. The platform offers structured coding courses (C, C++, Python, Java, HTML, CSS, PHP, Power BI, AI, Machine Learning, Data Science), complete university curriculum notes for BCA/MCA students with previous year question papers, Class 10 and Class 12 CBSE/HBSE school notes, and dedicated preparation material for SSC, UPSC, Banking, Railway and other government exams. Browsing the site is completely free and requires no account. Users may optionally sign in with Google solely to save their learning progress, quiz scores and personal preferences across devices.

Privacy Policy | Terms of Service | Contact Siksha Sarovar | About Siksha Sarovar

v4.0.9 · PWA
Siksha Sarovar logo
Siksha Sarovar
Your Learning Universe

Siksha Sarovar is a free e-learning platform for coding courses, BCA university notes and competitive exam preparation. Optional Google sign-in saves your learning progress across devices.

Initializing knowledge base…
Compiling modules 0%

Unit 2.3: Newton’s Method

Lesson 9 of 22 in the free Engineering Optimization notes on Siksha Sarovar, written by Rohit Jangra.

Unit 2.3: Newton’s Method

Newton’s method is a "Second-Order" method. It tries to fix the slow convergence of Steepest Descent by using curvature information.

1. Concept

Steepest Descent approximates the function as a flat plane (linear). Newton's method approximates the function as a Quadratic Surface (bowl shape) using the second derivative (Hessian Matrix).

  • If the actual function is a perfect quadratic (e.g., f(x) = x²), Newton's method finds the minimum in ONE step.

2. Mathematical Formulation

Search Direction (Sᵢ): Instead of just relying on the gradient, we modify it using the inverse of the Hessian Matrix [J]. Sᵢ = -[J]⁻¹ * ∇f(Xᵢ)

Update Formula: X₍ᵢ₊₁₎ = Xᵢ - [J]⁻¹ * ∇f(Xᵢ)

Where:

  • ∇f: Gradient Vector (First derivatives).
  • [J]: Hessian Matrix (Second partial derivatives).

3. Algorithm

  1. Start: Select initial guess X₁.
  2. Compute: Gradient ∇f and Hessian [J] at Xᵢ.
  3. Invert: Compute the inverse matrix [J]⁻¹.
  4. Update: X₍ᵢ₊₁₎ = Xᵢ - [J]⁻¹ * ∇f.
  5. Check Convergence: If gradient is near zero, stop.

4. Pros & Cons

ProsCons
Fast Convergence: Quadratic rate (extremely fast) when close to the optimum.Expensive: Calculating and inverting the Hessian matrix is computationally heavy for large problems.
Direct Path: Takes a more direct, curved path to the minimum, avoiding zig-zags.Stability: If the initial guess is far off, or if the Hessian is not Positive Definite, the method may diverge (fly away from the minimum).