Siksha Sarovar

Siksha Sarovar (sikshasarovar.com) is a free educational web application that helps students in India learn programming and prepare for academic and competitive exams. The platform offers structured coding courses (C, C++, Python, Java, HTML, CSS, PHP, Power BI, AI, Machine Learning, Data Science), complete university curriculum notes for BCA/MCA students with previous year question papers, Class 10 and Class 12 CBSE/HBSE school notes, and dedicated preparation material for SSC, UPSC, Banking, Railway and other government exams. Browsing the site is completely free and requires no account. Users may optionally sign in with Google solely to save their learning progress, quiz scores and personal preferences across devices.

Privacy Policy | Terms of Service | Contact Siksha Sarovar | About Siksha Sarovar

v4.0.9 · PWA
Siksha Sarovar logo
Siksha Sarovar
Your Learning Universe

Siksha Sarovar is a free e-learning platform for coding courses, BCA university notes and competitive exam preparation. Optional Google sign-in saves your learning progress across devices.

Initializing knowledge base…
Compiling modules 0%

1.2 Data Types and Examples

Lesson 3 of 36 in the free Big Data-1 notes on Siksha Sarovar, written by Rohit Jangra.

Understanding Unstructured Data

Traditional databases (SQL) are designed for Structured Data—data that fits neatly into rows and columns (like an Excel sheet). However, the vast majority of Big Data is Unstructured.

Definition of Unstructured Data

Unstructured data is information that either does not have a predefined data model or is not organized in a predefined manner. It is typically heavy on text but may contain data such as dates, numbers, and facts as well.

Examples of Unstructured Data:

  • Text Files: Word documents, PDFs, Emails, Spreadsheets.
  • Social Media: Posts, comments, hashtags, and user "likes."
  • Media: Digital photos, audio files, and video recordings.
  • Mobile Data: Text messages, location pings, and app usage logs.

Comparing Data Structures

FeatureStructured DataSemi-Structured DataUnstructured Data
FormatStrictly organized (Tables)Tags/Markers (JSON, XML)No predefined format
FlexibilityRigid schemaMore flexibleHighly flexible
ScaleLow to MediumMedium to HighInfinite
ToolsMySQL, Oracle, PostgreSQLMongoDB, CouchDBHadoop, Spark, NoSQL

Real-World Examples of Big Data in Action

  1. Google Search: Google processes over 3.5 billion searches per day. Each search involves analyzing trillions of web pages in milliseconds to provide the most relevant result.
  2. Netflix Recommendation Engine: Netflix analyzes the viewing habits of over 200 million subscribers. It tracks when you pause, rewind, or fast-forward, using this "Big Data" to recommend your next favorite show.
  3. Boeing Jet Engines: A single Boeing 787 engine generates 500 gigabytes of data per flight. This data is analyzed to predict maintenance needs before a part actually fails, ensuring passenger safety.
  4. NASA Climate Modeling: NASA uses supercomputers to process satellite data, creating complex climate models that involve petabytes of oceanic and atmospheric readings.