5 Pandas Basics Every Data Science Beginner Should Know!

Knowing these functions can kickstart your Data Analysis journey in Python

Harsha S
1 min readJan 30, 2022
Photo by Bruce Hong on Unsplash

Why Pandas?

Pandas is very important library when it comes to Data Science in Python. Since it is built on top of the NumPy library, we can perform mathematical operations on arrays of single-type data(Series) and tables of data(DataFrame).

We can also quickly visualize the plots in a single line of code, without using Matplotlib library.

So, Pandas play a crucial role in Data related tasks in Python.

* Data Input and Output

Reading DataFrames from outside sources utilizing pd.read capacities.

CSV file

pd.read_csv()

pd.to_csv()

Excel file

pd.read_excel()

pd.to_excel()

* Shape of the Data

df.ndim

df.shape

* Display the Data

df.head()

df.tail()

* Selection and Indexing

df.iloc

df.loc

* Handling the Missing Values

df.dropna()

df.isnull().sum()

df.fillna()

These are some of the most functions, all data scientists use while working in Pandas. I’ll come up with another blog post to explain more about Pandas library. Thanks for reading!

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

--

--

Harsha S
Harsha S

Written by Harsha S

NLP Engineer | I love to write about AI in beginner way

No responses yet

Write a response