Does Siksha Sarovar have an AI chatbot to answer student doubts?

Yes. Siksha Sarovar has a built-in AI Assistant chatbot accessible from a floating button on every page. It understands English, Hindi and Hinglish, handles typos (for example 'pyhtion' or 'certifecate'), and indexes 165+ destinations including every course, lesson, BCA subject, school chapter, competitive exam topic, FAQ and tool. Most queries return direct link cards in under 5 milliseconds. An AI fallback is available for novel questions.

Can I ask the Siksha Sarovar chatbot questions in Hindi or Hinglish?

Absolutely. The chatbot is built specifically for Indian students — natural Hinglish queries like 'kaise milega certificate', 'free hai kya', 'pyhtion ke datatype kaha hai', 'kaha se shuru karu' are first-class citizens. The matcher strips Hindi filler words and routes you to the right course, lesson or page.

Is the Siksha Sarovar AI chatbot free to use?

Yes. The chatbot is 100% free, requires no signup, and is available on every page. It runs locally in your browser for the vast majority of queries — there is no API cost or usage limit. The optional 'Ask AI' fallback for advanced coding questions uses the Pro AI Tutor.

Is Siksha Sarovar really free?

Yes. Every course, lesson, quiz, online compiler, and notes download is free to use without an account. We offer an optional Pro pass that unlocks longer AI tutor sessions, larger compiler quotas and priority support, but it is not required to learn from the platform. The educational content itself stays free.

Do I need to sign in to use the courses?

No. You can browse any course, read all lessons, run code in the compiler and take quizzes without signing in. Google Sign-In is purely optional and is used only to save your progress, quiz scores and certificate eligibility across devices. We never request access to Gmail, Drive, Calendar, Contacts, or any sensitive Google data.

Are the certificates from Siksha Sarovar recognised?

Our certificates are a record of completion that you can share on LinkedIn or attach to applications, but Siksha Sarovar is an independent platform — not a UGC-recognised university or board. We are upfront about that. The certificate is most useful as a verifiable signal that you have completed the curriculum, not as a substitute for a degree.

Which courses are best for BCA and MCA students?

Our University Curriculum section covers the YMCA BCA/MCA syllabus subject-by-subject — Data Structures, DBMS, Web Based Programming, Computer Networks, Operating Systems, Software Engineering, Data Warehousing and more. Each subject is broken down into the same units your university teaches, with previous year question papers where available.

Can I use Siksha Sarovar to prepare for SSC, UPSC, Banking or Railway exams?

Yes. The Competitive section has dedicated tracks for SSC (CGL, CHSL, MTS), UPSC, IBPS/SBI Banking, RRB Railways and defence exams (NDA, CDS, AFCAT). Topics include quantitative aptitude, reasoning, English grammar, general knowledge and current affairs, written specifically for the Indian exam pattern.

What languages does the online compiler support?

The Siksha Sarovar online compiler supports C, C++, Python, Java, PHP, JavaScript, C# and SQL. The compiler runs your code in a sandboxed environment using Judge0, returns the standard output and error stream, and supports stdin so you can test interactive programs. There is no installation — everything runs in your browser.

How is my personal data handled by Siksha Sarovar?

We follow data minimisation: we collect only what is needed (email, name, profile picture from Google sign-in, and your learning progress). Data is stored on Supabase with HTTPS in transit. We do not sell user data, and we do not use it to train AI models. You can request deletion at any time by emailing contact@sikshasarovar.com — see our Privacy Policy for the full details.

Who founded Siksha Sarovar?

Siksha Sarovar was founded by Rohit Kumar, who serves as CEO and Head Developer. Rohit built the platform to provide free, structured education to students across India — covering programming courses, university notes, school study material and competitive exam preparation.

3.3 Design of HDFS & Core Concepts — Big Data-1 Notes

3.3.1 The HDFS Design Philosophy

The Hadoop Distributed File System (HDFS) is designed to store very large files across machines in a large cluster. It prioritizes Throughput over Latency.

Large Data Sets: Files are typically in the range of gigabytes to terabytes.
Streaming Data Access: Designed for "Write Once, Read Many times." It works best for batch processing rather than interactive user applications.
Hardware Failure: Assume nodes will crash. HDFS is built to be self-healing.

3.3.2 HDFS Key Concepts

1. Blocks

HDFS splits a file into large chunks, called Blocks.

Size: The default is 128 MB in Hadoop 2.x and later (64 MB in Hadoop 1.x), far larger than the 4 KB blocks typical of a local file system.
Reason: Large blocks keep disk seek time small relative to transfer time, and they spare the Namenode from tracking metadata for millions of small pieces.
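
To make the block idea concrete, here is a minimal sketch of how a file size maps onto blocks. The function name `split_into_blocks` is made up for illustration and is not part of any Hadoop API. It only does the arithmetic and does not talk to a cluster.

```python
BLOCK_SIZE = 128 * 1024 * 1024  # 128 MB, the usual default block size


def split_into_blocks(file_size: int, block_size: int = BLOCK_SIZE) -> list[int]:
    """Return the size in bytes of each block needed for a file."""
    full_blocks, remainder = divmod(file_size, block_size)
    sizes = [block_size] * full_blocks
    if remainder:
        # The final block only holds the bytes that are left over.
        sizes.append(remainder)
    return sizes


MB = 1024 * 1024
blocks = split_into_blocks(300 * MB)
print(len(blocks), [size // MB for size in blocks])  # 3 [128, 128, 44]
```

A 300 MB file therefore needs three blocks, and the last one holds only the remaining 44 MB.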

2. Namenode and Datanodes

HDFS follows a Master/Slave architecture.

Component	Responsibility
Namenode	The Master. Manages the file system namespace and metadata (which blocks make up each file and where those blocks are located). It is a Single Point of Failure (SPOF) in Hadoop 1.x.
Datanode	The Slave. Stores the actual data blocks. They periodically send "Heartbeats" to the Namenode to say "I am alive."
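
The heartbeat idea can be sketched as a small bookkeeping class on the master side. This is a toy model, not Hadoop code: the class name `HeartbeatMonitor` and the 30-second timeout are assumptions chosen for illustration, and time is passed in explicitly so the example is deterministic.

```python
class HeartbeatMonitor:
    """Toy model of how a master can track which workers are alive."""

    def __init__(self, timeout_s: float):
        self.timeout_s = timeout_s
        self.last_seen: dict[str, float] = {}

    def heartbeat(self, datanode_id: str, now: float) -> None:
        # Record the latest time this Datanode reported in.
        self.last_seen[datanode_id] = now

    def dead_nodes(self, now: float) -> list[str]:
        # A node is considered dead once it has been silent longer than the timeout.
        return sorted(
            node for node, seen in self.last_seen.items()
            if now - seen > self.timeout_s
        )


monitor = HeartbeatMonitor(timeout_s=30.0)  # illustrative timeout, not the HDFS default
monitor.heartbeat("dn1", now=0.0)
monitor.heartbeat("dn2", now=0.0)
monitor.heartbeat("dn1", now=25.0)   # dn1 keeps reporting, dn2 goes silent
print(monitor.dead_nodes(now=40.0))  # ['dn2']
```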

3.3.3 The Namenode's Persistence: FsImage and Edit Logs

Since the Namenode stores all metadata in memory for speed, it needs a way to persist this to disk.

FsImage: A complete snapshot of the file system namespace at a specific point in time.
Edit Log: A log that records every change made to the file system since the last FsImage was written (creating a file, deleting a folder).

The Restart Process: On start, the Namenode reads the FsImage and then "replays" all the transactions in the Edit Log to get back to the current state.
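
The restart process can be sketched as follows. This is a simplified model, assuming the namespace is just a set of paths and the edit log is a list of `(operation, path)` pairs. The real FsImage and Edit Log are binary formats with many more operation types.

```python
def replay(fsimage: set[str], edit_log: list[tuple[str, str]]) -> set[str]:
    """Rebuild the current namespace from a snapshot plus logged changes."""
    namespace = set(fsimage)  # start from the snapshot, leave the original untouched
    for operation, path in edit_log:
        if operation == "create":
            namespace.add(path)
        elif operation == "delete":
            namespace.discard(path)
        else:
            raise ValueError(f"unknown operation: {operation}")
    return namespace


fsimage = {"/data/a.csv", "/data/b.csv"}
edit_log = [("create", "/data/c.csv"), ("delete", "/data/a.csv")]
print(sorted(replay(fsimage, edit_log)))  # ['/data/b.csv', '/data/c.csv']
```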

3.3.4 The Secondary Namenode (The Checkpointer)

Contrary to its name, the Secondary Namenode is NOT a backup for the Namenode. Its main job is to merge the FsImage and Edit Log periodically, a step known as checkpointing.

Why?: If the Edit Log grows too large, the Namenode will take hours to restart.
How?: The Secondary Namenode pulls the FsImage and Edit Log, merges them into a "new" FsImage, and sends it back to the Primary Namenode.
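
A checkpoint can be pictured as "fold the log into the snapshot, then start a fresh log". The sketch below is a toy model under that assumption: `NamenodeState` and `checkpoint` are hypothetical names, and the namespace is reduced to a mapping from path to file size.

```python
from dataclasses import dataclass, field


@dataclass
class NamenodeState:
    fsimage: dict[str, int] = field(default_factory=dict)   # path -> size in bytes
    edit_log: list[tuple[str, str, int]] = field(default_factory=list)


def checkpoint(state: NamenodeState) -> NamenodeState:
    """Merge the edit log into a new FsImage and return a state with an empty log."""
    merged = dict(state.fsimage)
    for operation, path, size in state.edit_log:
        if operation == "create":
            merged[path] = size
        elif operation == "delete":
            merged.pop(path, None)
    # The new FsImage already contains every logged change, so the log can restart empty.
    return NamenodeState(fsimage=merged, edit_log=[])


state = NamenodeState(
    fsimage={"/a": 10},
    edit_log=[("create", "/b", 20), ("delete", "/a", 0)],
)
new_state = checkpoint(state)
print(new_state.fsimage, len(new_state.edit_log))  # {'/b': 20} 0
```

After the checkpoint there is nothing left to replay, which is why a restart becomes fast again.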

3.3.5 Why 128 MB blocks?

In a typical OS, blocks are 4 KB. If HDFS used 4 KB blocks:

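Metadata Explosion: To store a 100 GB file, the Namenode would need to track about 25 million blocks (versus roughly 800 blocks at 128 MB). Because it keeps every block's metadata in memory, a cluster full of such files would quickly exhaust its heap.
Seek Time: With 128 MB blocks, the time to transfer the data is much larger than the time to find (seek) the block on the disk. For example, with a seek time of about 10 ms and a transfer rate of about 100 MB/s, reading one block takes over a second, so the disk spends roughly 99% of its time transferring data.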
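
The two numbers above can be checked with a few lines of arithmetic. The seek time (10 ms) and transfer rate (100 MB/s) are assumed, typical-looking figures rather than measurements, and decimal units are used so that 100 GB divided by 4 KB gives exactly 25 million.

```python
KB, MB, GB = 10**3, 10**6, 10**9  # decimal units


def block_count(file_bytes: int, block_bytes: int) -> int:
    """Number of blocks needed, rounding up for a partial last block."""
    return -(-file_bytes // block_bytes)


def transfer_efficiency(block_bytes: int, seek_s: float, rate_bytes_per_s: float) -> float:
    """Fraction of the total read time spent transferring data rather than seeking."""
    transfer_s = block_bytes / rate_bytes_per_s
    return transfer_s / (transfer_s + seek_s)


file_size = 100 * GB
print(block_count(file_size, 4 * KB))    # 25000000
print(block_count(file_size, 128 * MB))  # 782

SEEK_S = 0.010     # assumed 10 ms seek
RATE = 100 * MB    # assumed 100 MB/s transfer rate
print(f"{transfer_efficiency(128 * MB, SEEK_S, RATE):.1%}")  # 99.2%
print(f"{transfer_efficiency(4 * KB, SEEK_S, RATE):.1%}")    # 0.4%
```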

Rack Awareness: HDFS keeps several replicas (copies) of every block, and it tries to put those replicas on different "Racks" (collections of servers) so that even if the power supply or network switch for an entire rack fails, the data is still accessible from another rack.
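
Rack-aware placement can be sketched as below. This is a simplified version of the placement rule commonly described for a replication factor of 3: first replica on the writer's node, second on a node in a different rack, third on another node in that same remote rack. The function `place_replicas` and the topology dictionary are hypothetical; the real policy also weighs free space and node load.

```python
import random


def place_replicas(writer: str, topology: dict[str, list[str]], rng: random.Random) -> list[str]:
    """Pick three nodes for a block. topology maps rack name -> list of nodes."""
    rack_of = {node: rack for rack, nodes in topology.items() for node in nodes}
    local_rack = rack_of[writer]
    # Candidate remote racks need at least two nodes to hold replicas 2 and 3.
    remote_racks = [r for r, nodes in topology.items() if r != local_rack and len(nodes) >= 2]
    if not remote_racks:
        raise ValueError("need a second rack with at least two nodes")
    remote_rack = rng.choice(remote_racks)
    second, third = rng.sample(topology[remote_rack], 2)
    return [writer, second, third]


topology = {
    "rack1": ["n1", "n2", "n3"],
    "rack2": ["n4", "n5", "n6"],
}
replicas = place_replicas("n1", topology, random.Random(0))
print(replicas)  # first entry is always the writer; the other two share the remote rack
```

Losing all of rack1 still leaves two replicas on rack2, and losing rack2 still leaves the copy on the writer's node.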

3.3.6 Advanced HDFS: Erasure Coding vs Replication

Standard replication (3x) has a 200% storage overhead. Modern Hadoop (3.x) also supports Erasure Coding as an alternative, typically for colder data.

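Concept: Instead of full copies, it stores parity data computed with Reed-Solomon codes (similar in spirit to RAID-6).
Overhead: With a 6 data + 3 parity layout, overhead drops from 200% to 50%. Any three blocks in a group can be lost and recovered, which is at least as tolerant as 3x replication (which survives two lost copies).
Trade-off: Requires much more CPU and network traffic to recalculate data if a node fails.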
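
The overhead figures and the idea of rebuilding lost data from parity can be illustrated with a short sketch. Note the assumption: the code uses a single XOR parity cell, which can recover only one lost cell. It stands in for Reed-Solomon, which HDFS actually uses and which can recover several. The helper names are made up for this example.

```python
from functools import reduce


def storage_overhead(data_units: int, redundancy_units: int) -> float:
    """Extra storage as a fraction of the original data."""
    return redundancy_units / data_units


def xor_parity(cells: list[bytes]) -> bytes:
    """XOR equal-length cells together, byte by byte."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*cells))


print(storage_overhead(1, 2))  # 2.0 -> 200% for 3x replication (1 original + 2 copies)
print(storage_overhead(6, 3))  # 0.5 -> 50% for 6 data + 3 parity

cells = [b"HDFS", b"DATA", b"CELL"]
parity = xor_parity(cells)

# Pretend the middle cell is lost: XOR of the survivors and the parity restores it.
recovered = xor_parity([cells[0], cells[2], parity])
print(recovered)  # b'DATA'
```

The recovery step has to read every surviving cell and recompute, which is where the extra CPU and network cost comes from.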

3.3.7 Monitoring and Management

Administrators manage HDFS using several tools:

Namenode UI: A web interface (port 9870 by default in Hadoop 3.x, 50070 in Hadoop 2.x) that shows cluster health, safe mode status, and dead nodes.
JMX Metrics: Java Management Extensions expose metrics such as JVM memory usage and RPC latency, which systems like Prometheus can collect through a JMX exporter.
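
As a rough sketch of how a script might read these metrics, the Namenode web server also serves JMX data as JSON under the `/jmx` path. The bean name and the attribute names used below (`NumLiveDataNodes`, `NumDeadDataNodes`) are assumptions to verify against your own Hadoop version, and the sample payload is invented for the example.

```python
import json
from urllib.request import urlopen


def fetch_beans(host: str, port: int = 9870,
                query: str = "Hadoop:service=NameNode,name=FSNamesystemState") -> dict:
    """Fetch JMX beans as JSON from a Namenode (needs a reachable cluster)."""
    url = f"http://{host}:{port}/jmx?qry={query}"
    with urlopen(url, timeout=5) as response:
        return json.load(response)


def datanode_summary(payload: dict) -> dict[str, int]:
    """Pull live/dead Datanode counts out of the first bean in the payload."""
    bean = payload["beans"][0]
    return {"live": bean["NumLiveDataNodes"], "dead": bean["NumDeadDataNodes"]}


# Invented sample payload so the parsing step can be tried without a cluster.
sample = {"beans": [{"NumLiveDataNodes": 3, "NumDeadDataNodes": 1}]}
print(datanode_summary(sample))  # {'live': 3, 'dead': 1}
```

With a running cluster, `datanode_summary(fetch_beans("namenode-host"))` would be the equivalent call, where `namenode-host` is a placeholder.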

3.3.8 High Availability (HA) Architecture

In Hadoop 1.x, the NameNode was a Single Point of Failure. From 2.x onwards, HDFS can run an Active NameNode alongside one or more Standby NameNodes that are ready to take over.

Shared Edits Directory: Both NameNodes have access to the same edit log, either on shared storage (like NFS) or on a group of JournalNodes written to through the Quorum Journal Manager (QJM).
ZK Failover Controller (ZKFC): A ZooKeeper client that monitors NameNode health and handles the automatic transition from Standby to Active if the Active NameNode fails.
Fencing: Ensuring that the "old" Active NameNode is actually dead before the new one takes over, preventing "Split Brain" scenarios where two masters try to write to the cluster.
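
One way to picture fencing is with epoch numbers: each NameNode that becomes Active claims a higher epoch, and the journal refuses writes from anyone holding an older one. The sketch below is a toy model loosely based on that idea, with hypothetical class and function names. It is not the QJM protocol itself.

```python
class JournalNode:
    """Toy journal that rejects writers holding a stale epoch."""

    def __init__(self) -> None:
        self.promised_epoch = 0
        self.edits: list[str] = []

    def new_epoch(self, epoch: int) -> bool:
        # A NameNode becoming Active must claim a strictly higher epoch.
        if epoch <= self.promised_epoch:
            return False
        self.promised_epoch = epoch
        return True

    def write(self, epoch: int, edit: str) -> bool:
        if epoch < self.promised_epoch:
            return False  # stale writer is fenced out
        self.edits.append(edit)
        return True


def quorum_write(journals: list[JournalNode], epoch: int, edit: str) -> bool:
    """A write succeeds only if a majority of journals accept it."""
    acks = sum(journal.write(epoch, edit) for journal in journals)
    return acks > len(journals) // 2


journals = [JournalNode() for _ in range(3)]

for journal in journals:          # first Active NameNode claims epoch 1
    journal.new_epoch(1)
print(quorum_write(journals, 1, "mkdir /logs"))   # True

for journal in journals:          # Standby takes over and claims epoch 2
    journal.new_epoch(2)
print(quorum_write(journals, 1, "delete /logs"))  # False: old Active is fenced
print(quorum_write(journals, 2, "mkdir /data"))   # True
```

Even if the old Active is still running, its writes no longer reach a majority, so only one master can change the namespace.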