Siksha Sarovar

Siksha Sarovar (sikshasarovar.com) is a free educational web application that helps students in India learn programming and prepare for academic and competitive exams. The platform offers structured coding courses (C, C++, Python, Java, HTML, CSS, PHP, Power BI, AI, Machine Learning, Data Science), complete university curriculum notes for BCA/MCA students with previous year question papers, Class 10 and Class 12 CBSE/HBSE school notes, and dedicated preparation material for SSC, UPSC, Banking, Railway and other government exams. Browsing the site is completely free and requires no account. Users may optionally sign in with Google solely to save their learning progress, quiz scores and personal preferences across devices.

Privacy Policy | Terms of Service | Contact Siksha Sarovar | About Siksha Sarovar

v4.0.9 · PWA
Siksha Sarovar logo
Siksha Sarovar
Your Learning Universe

Siksha Sarovar is a free e-learning platform for coding courses, BCA university notes and competitive exam preparation. Optional Google sign-in saves your learning progress across devices.

Initializing knowledge base…
Compiling modules 0%

2.2 Data Cleaning Techniques

Lesson 12 of 62 in the free Power BI notes on Siksha Sarovar, written by Rohit Jangra.

Data Cleaning Techniques in Power Query

Data cleaning (or data cleansing) is the process of fixing or removing incorrect, incomplete, duplicate, or improperly formatted data. Clean data leads to accurate reports and reliable insights.

1. Handling Missing Values (Nulls & Blanks)

Missing data can distort analysis. Power Query provides several ways to handle it.

Identifying Missing Values: • Null values appear as null in the data preview • Blank cells from Excel appear as empty strings or null • Column quality indicator (View tab → Column Quality) shows % of valid, error, and empty values

Methods to Handle Missing Values:

MethodWhen to Use
Replace ValuesReplace null with a default value (e.g., 0, "Unknown")
Remove RowsRemove rows where critical columns are null
Fill Down / Fill UpCopy value from above/below to fill gaps

Steps to Replace Null Values:

  1. Select the column
  2. TransformReplace Values
  3. In "Value to Find" enter null
  4. In "Replace With" enter your default value (e.g., 0)
  5. Click OK

Steps to Fill Down:

  1. Select the column
  2. TransformFillDown
  3. Null cells are filled with the nearest non-null value above

2. Removing Duplicates

Duplicate rows can inflate metrics and skew analysis.

Steps to Remove Duplicates:

  1. Select the column(s) to check for duplicates
  2. HomeRemove RowsRemove Duplicates
  3. Power BI keeps the first occurrence and removes subsequent duplicates

Tips: • Select multiple columns to remove duplicates based on a combination • Use "Remove Alternate Rows" to handle patterned duplicates

3. Splitting & Merging Columns

Splitting a Column: • Useful when a single column contains combined data (e.g., "FirstName LastName" in one column)

Steps:

  1. Select the column
  2. TransformSplit Column → Choose method:
  • By Delimiter (comma, space, colon, custom)
  • By Number of Characters
  • By Positions
  1. Configure options and click OK

Merging Columns: • Combine two or more columns into one

Steps:

  1. Select the columns to merge (Ctrl + Click)
  2. TransformMerge Columns
  3. Choose a separator (space, comma, hyphen, custom)
  4. Name the new merged column
  5. Click OK

4. Changing Data Types

Correct data types are essential for accurate calculations and visualizations.

Common Data Types:

TypeDescriptionExample
TextString values"Rohit", "Mumbai"
Whole NumberIntegers1, 42, 1000
Decimal NumberFloating point3.14, 99.99
DateDate values01/01/2025
Date/TimeDate with time01/01/2025 10:30 AM
True/FalseBooleanTRUE, FALSE

Steps to Change Data Type:

  1. Click the data type icon on the column header (ABC, 123, etc.)
  2. Select the desired data type from the dropdown
  3. Or: Select column → TransformData Type → Choose type

Important: Always set data types correctly before loading data. Incorrect types cause errors in DAX calculations.