Chapter 58: Descriptive Statistics

Descriptive Statistics is simply the art & science of summarizing and describing the data you actually have in front of you — as clearly, honestly, and helpfully as possible.

It answers the question: “What does this pile of numbers / measurements / observations actually look like?”

It does not try to guess what happens outside your data — that’s the job of inferential statistics (which we’ll talk about later). Descriptive statistics stays humble: it only talks about the data you collected.

Step 1: The Four Most Important Jobs of Descriptive Statistics

  1. Measure central tendency — “Where is the middle / typical value?”
  2. Measure spread / variation — “How much do things differ from the middle?”
  3. Show the shape / distribution — “Is the data symmetric, skewed, clumped, full of outliers?”
  4. Show relationships — “Do two things tend to move together?”

These four jobs cover 95% of what people do when they “look at data”.

Step 2: Everyday Hyderabad Examples (You Already Use Descriptive Statistics)

Example 1 – Food delivery times from Swiggy/Zomato (very common)

You order biryani 15 times in the last month and write down the delivery time each time (in minutes):

28, 35, 42, 31, 27, 39, 45, 33, 29, 38, 40, 32, 36, 30, 41

What does descriptive statistics tell you?

  • Mean (average) = (28+35+42+…+41) ÷ 15 = 34.4 minutes → “On average, delivery takes about 34–35 minutes.”
  • Median = sort the list → middle value = 35 minutes → “Half the time it’s 35 min or less, half the time 35 min or more.”
  • Mode = most common time = none clear (all different) → no mode
  • Range = max – min = 45 – 27 = 18 minutes → Deliveries vary by up to 18 minutes.
  • Standard deviation5.2 minutes (calculation later) → “Most deliveries are within about ±5 minutes of the average (roughly 29–39 min).”
  • Box plot / histogram (if you plot it) → Shows most times cluster around 30–40 min, a few outliers at 45 min (rainy day?).

That’s descriptive statistics — it tells you exactly what happened in your 15 orders.

Example 2 – Monthly mobile recharge amounts (very relatable)

You look at last 12 months recharges:

₹399, ₹479, ₹399, ₹666, ₹399, ₹719, ₹399, ₹399, ₹479, ₹555, ₹399, ₹479

Descriptive statistics:

  • Mean ≈ ₹487
  • Median = ₹439 (middle value when sorted)
  • Mode = ₹399 (happened 6 times)
  • Range = ₹719 – ₹399 = ₹320
  • Standard deviation ≈ ₹112 (quite spread out)

What does this tell you? → You mostly stick to ₹399 plan, but sometimes upgrade to higher data packs when you travel or watch more videos.

Step 3: The Most Important Tools of Descriptive Statistics (With Numbers)

  1. Measures of Central Tendency (Where is the “middle”?)

    • Mean (arithmetic average) — add everything, divide by count → Sensitive to outliers (one ₹5,000 recharge would pull it way up)
    • Median — middle value when sorted → Very robust — ignores extreme values
    • Mode — most frequent value → Useful for “most common” questions (most popular biryani spice level?)
  2. Measures of Spread / Dispersion (How much variety?)

    • Range — max – min (simple, but very sensitive to outliers)
    • Interquartile Range (IQR) — Q3 – Q1 (middle 50% spread — robust)
    • Variance → average squared distance from mean
    • Standard Deviation → square root of variance (same unit as data — easy to interpret)

    Rule of thumb: ≈ 68% of data lies within ±1 standard deviation of mean (if roughly normal) ≈ 95% within ±2 SD ≈ 99.7% within ±3 SD

  3. Shape & Distribution (How is the data shaped?)

    • Symmetric (bell curve) → mean ≈ median ≈ mode
    • Right-skewed (positive skew) → long tail on right (many small values, few very large) Example: monthly income in Hyderabad — most people earn ₹20k–₹80k, few earn ₹5 lakh+
    • Left-skewed (negative skew) — long tail on left (few very small values)

    Visual tools:

    • Histogram → bars showing frequency of each range
    • Box plot → shows median, quartiles, outliers

Step 4: Quick Summary Table (Copy This!)

Tool What it tells you Hyderabad Everyday Example Sensitive to outliers?
Mean Arithmetic average Average Swiggy delivery time over 15 orders Yes
Median Middle value when sorted Middle delivery time — ignores one 120-min late delivery No
Mode Most frequent value Most common biryani order price in your area No
Range Max – min Price range of biryani across 10 shops Yes
Standard Deviation Typical variation around mean How much delivery times usually vary Yes
Histogram Shape & frequency distribution Delivery times mostly 25–45 min, few >60 min
Box Plot Median, quartiles, outliers Shows typical rent range + extreme high-rent flats Shows outliers clearly

Final Teacher Words

Descriptive Statistics is the first honest look at any data you collect.

It answers: “What does this data actually look like? Where is the center? How much does it spread? Is it symmetric or skewed? Are there weird extreme values?”

Before you ever train a machine learning model, run a hypothesis test, or make a big decision — always start with descriptive statistics.

In Hyderabad 2026 you use it constantly:

  • When you compare delivery times from 5 apps
  • When you look at your monthly expenses in Google Sheets
  • When you read “average house rent in Kukatpally is ₹28,000” — that’s descriptive stats talking

It’s not fancy — but it’s truthful.

Understood the heart of descriptive statistics now? 🌟

Want to go deeper?

  • How to calculate standard deviation by hand (with small numbers)?
  • Real histogram of Hyderabad flat rents (skewed example)?
  • Why median is better than mean for income data in India?
  • Difference descriptive vs inferential with UPI fraud example?

Just tell me — next class is ready! 🚀

You may also like...

Leave a Reply

Your email address will not be published. Required fields are marked *