Siksha Sarovar

Siksha Sarovar (sikshasarovar.com) is a free educational web application that helps students in India learn programming and prepare for academic and competitive exams. The platform offers structured coding courses (C, C++, Python, Java, HTML, CSS, PHP, Power BI, AI, Machine Learning, Data Science), complete university curriculum notes for BCA/MCA students with previous year question papers, Class 10 and Class 12 CBSE/HBSE school notes, and dedicated preparation material for SSC, UPSC, Banking, Railway and other government exams. Browsing the site is completely free and requires no account. Users may optionally sign in with Google solely to save their learning progress, quiz scores and personal preferences across devices.

Privacy Policy | Terms of Service | Contact Siksha Sarovar | About Siksha Sarovar

v4.0.9 · PWA
Siksha Sarovar logo
Siksha Sarovar
Your Learning Universe

Siksha Sarovar is a free e-learning platform for coding courses, BCA university notes and competitive exam preparation. Optional Google sign-in saves your learning progress across devices.

Initializing knowledge base…
Compiling modules 0%

Unit III Overview: Learning With Neural Networks

Lesson 13 of 22 in the free Machine Learning II notes on Siksha Sarovar, written by Rohit Jangra.

Unit III Overview: Learning With Neural Networks

Artificial Neural Networks (ANNs) are inspired by the biological neural networks in animal brains. The modern deep learning revolution began with the 2012 ImageNet breakthrough by Krizhevsky, Sutskever, and Hinton (AlexNet), demonstrating that deep convolutional networks could dramatically outperform traditional methods.

Biological Inspiration

A biological neuron:

  1. Receives signals from dendrites (inputs with varying weights)
  2. Integrates signals in the cell body (weighted sum)
  3. Fires an action potential if the sum exceeds a threshold (activation function)
  4. Transmits signal via axon (output)

An artificial neuron computes: y = f(sum(w_i * x_i) + b) where f is the activation function.

Unit III Roadmap

TopicKey ConceptWhy It Matters
PerceptronSingle-layer linear classifierFoundation of all neural networks
MLP + BackpropMulti-layer + gradient descentCore training algorithm
Network StructuresCNNs, RNNs, regularizationArchitecture for different data types
Deep LearningFeature hierarchy learningState-of-the-art performance

Activation Functions

FunctionFormulaProperty
Sigmoid1/(1+e^{-z})Squashes to (0,1), vanishing gradient
Tanh(e^z - e^{-z})/(e^z + e^{-z})Zero-centered, vanishing gradient
ReLUmax(0, z)No vanishing gradient, sparse activation
Leaky ReLUmax(0.01z, z)Fixes dying ReLU problem
Softmaxe^{z_i}/sum(e^{z_j})Multi-class output probabilities

Universal Approximation Theorem

A feedforward network with a single hidden layer containing enough neurons can approximate any continuous function on a compact subset of R^n to arbitrary precision. This justifies why neural networks can, in theory, learn any pattern.

Exam-Ready Summary

  • ANN: layers of weighted sums followed by non-linear activation functions
  • Universal Approximation: 1 hidden layer suffices in theory; depth helps in practice
  • ReLU is the default activation for hidden layers (avoids vanishing gradient)
  • Depth provides compositional representations — exponentially more efficient than shallow networks
  • Modern deep learning relies on GPUs, large datasets, and architectural innovations (BN, dropout, skip connections)