-->
Written by: Marlon Colca
Posted on 10 May 2025 - 4 months ago
python pandas analytics
Learn what Pandas is, why it's essential in data science, and how to load your first dataset with it.
If you’re just getting started with data science or data analysis using Python, you’ve probably heard of Pandas.
No, not the animal — the library. But it is a pretty magical tool.
Pandas is a powerful and flexible Python library that makes working with structured data fast and intuitive. Think of it as a mix between a spreadsheet (like Excel) and a database — right inside your Python code.
Because:
Let’s say you have a file called prices_sample.csv
. Here’s how you load it using Pandas:
import pandas as pd
# Read the file
df = pd.read_csv("prices_sample.csv")
# Show the first 5 rows
print(df.head())
This is the usual Pandas workflow:
head()
, info()
, describe()
)Some things you’ll learn in this series:
# Filter products with price greater than 50
expensive = df[df["price"] > 50]
# Group by category and calculate average price
averages = df.groupby("category")["price"].mean()
print(averages)
In the next post, we’ll dive into basic data exploration (EDA) using tools like .describe()
, .value_counts()
, and more.
Get ready to really look into your data.
See you in the next one 🐼
Learn how to explore and understand your dataset using Pandas. From `.head()` to `.describe()` and `.value_counts()`, this post walks you through the essential tools.