Chapter 58: Descriptive Statistics
Descriptive Statistics is simply the art & science of summarizing and describing the data you actually have in front of you — as clearly, honestly, and helpfully as possible.
It answers the question: “What does this pile of numbers / measurements / observations actually look like?”
It does not try to guess what happens outside your data — that’s the job of inferential statistics (which we’ll talk about later). Descriptive statistics stays humble: it only talks about the data you collected.
Step 1: The Four Most Important Jobs of Descriptive Statistics
- Measure central tendency — “Where is the middle / typical value?”
- Measure spread / variation — “How much do things differ from the middle?”
- Show the shape / distribution — “Is the data symmetric, skewed, clumped, full of outliers?”
- Show relationships — “Do two things tend to move together?”
These four jobs cover 95% of what people do when they “look at data”.
Step 2: Everyday Hyderabad Examples (You Already Use Descriptive Statistics)
Example 1 – Food delivery times from Swiggy/Zomato (very common)
You order biryani 15 times in the last month and write down the delivery time each time (in minutes):
28, 35, 42, 31, 27, 39, 45, 33, 29, 38, 40, 32, 36, 30, 41
What does descriptive statistics tell you?
- Mean (average) = (28+35+42+…+41) ÷ 15 = 34.4 minutes → “On average, delivery takes about 34–35 minutes.”
- Median = sort the list → middle value = 35 minutes → “Half the time it’s 35 min or less, half the time 35 min or more.”
- Mode = most common time = none clear (all different) → no mode
- Range = max – min = 45 – 27 = 18 minutes → Deliveries vary by up to 18 minutes.
- Standard deviation ≈ 5.2 minutes (calculation later) → “Most deliveries are within about ±5 minutes of the average (roughly 29–39 min).”
- Box plot / histogram (if you plot it) → Shows most times cluster around 30–40 min, a few outliers at 45 min (rainy day?).
That’s descriptive statistics — it tells you exactly what happened in your 15 orders.
Example 2 – Monthly mobile recharge amounts (very relatable)
You look at last 12 months recharges:
₹399, ₹479, ₹399, ₹666, ₹399, ₹719, ₹399, ₹399, ₹479, ₹555, ₹399, ₹479
Descriptive statistics:
- Mean ≈ ₹487
- Median = ₹439 (middle value when sorted)
- Mode = ₹399 (happened 6 times)
- Range = ₹719 – ₹399 = ₹320
- Standard deviation ≈ ₹112 (quite spread out)
What does this tell you? → You mostly stick to ₹399 plan, but sometimes upgrade to higher data packs when you travel or watch more videos.
Step 3: The Most Important Tools of Descriptive Statistics (With Numbers)
-
Measures of Central Tendency (Where is the “middle”?)
- Mean (arithmetic average) — add everything, divide by count → Sensitive to outliers (one ₹5,000 recharge would pull it way up)
- Median — middle value when sorted → Very robust — ignores extreme values
- Mode — most frequent value → Useful for “most common” questions (most popular biryani spice level?)
-
Measures of Spread / Dispersion (How much variety?)
- Range — max – min (simple, but very sensitive to outliers)
- Interquartile Range (IQR) — Q3 – Q1 (middle 50% spread — robust)
- Variance → average squared distance from mean
- Standard Deviation → square root of variance (same unit as data — easy to interpret)
Rule of thumb: ≈ 68% of data lies within ±1 standard deviation of mean (if roughly normal) ≈ 95% within ±2 SD ≈ 99.7% within ±3 SD
-
Shape & Distribution (How is the data shaped?)
- Symmetric (bell curve) → mean ≈ median ≈ mode
- Right-skewed (positive skew) → long tail on right (many small values, few very large) Example: monthly income in Hyderabad — most people earn ₹20k–₹80k, few earn ₹5 lakh+
- Left-skewed (negative skew) — long tail on left (few very small values)
Visual tools:
- Histogram → bars showing frequency of each range
- Box plot → shows median, quartiles, outliers
Step 4: Quick Summary Table (Copy This!)
| Tool | What it tells you | Hyderabad Everyday Example | Sensitive to outliers? |
|---|---|---|---|
| Mean | Arithmetic average | Average Swiggy delivery time over 15 orders | Yes |
| Median | Middle value when sorted | Middle delivery time — ignores one 120-min late delivery | No |
| Mode | Most frequent value | Most common biryani order price in your area | No |
| Range | Max – min | Price range of biryani across 10 shops | Yes |
| Standard Deviation | Typical variation around mean | How much delivery times usually vary | Yes |
| Histogram | Shape & frequency distribution | Delivery times mostly 25–45 min, few >60 min | — |
| Box Plot | Median, quartiles, outliers | Shows typical rent range + extreme high-rent flats | Shows outliers clearly |
Final Teacher Words
Descriptive Statistics is the first honest look at any data you collect.
It answers: “What does this data actually look like? Where is the center? How much does it spread? Is it symmetric or skewed? Are there weird extreme values?”
Before you ever train a machine learning model, run a hypothesis test, or make a big decision — always start with descriptive statistics.
In Hyderabad 2026 you use it constantly:
- When you compare delivery times from 5 apps
- When you look at your monthly expenses in Google Sheets
- When you read “average house rent in Kukatpally is ₹28,000” — that’s descriptive stats talking
It’s not fancy — but it’s truthful.
Understood the heart of descriptive statistics now? 🌟
Want to go deeper?
- How to calculate standard deviation by hand (with small numbers)?
- Real histogram of Hyderabad flat rents (skewed example)?
- Why median is better than mean for income data in India?
- Difference descriptive vs inferential with UPI fraud example?
Just tell me — next class is ready! 🚀
