Home >Backend Development >Python Tutorial >About the basic operations of pandas.DataFrame in python

About the basic operations of pandas.DataFrame in python

Y2J
Y2JOriginal
2017-05-09 14:50:312522browse

This article mainly introduces you to the method of excluding specific rows in pandas.DataFrame in python. The article gives detailed example codes. I believe it has certain reference value for everyone's understanding and learning. Friends who need it can follow Let’s take a look together.

Preface

When you use Python for data analysis, a data structure that you often use is the pandas DataFrame. About the basic operations of pandas.DataFrame in python , you can check this article.

pandas.DataFrame excludes specific rows

If we want to filter only one or certain rows like Excel, we can use isin( ) method, pass in the values ​​of the required rows in a list, you can also pass in a dictionary and specify columns for filtering.

But if we only want all content that does not contain a specific line, there is no isnotin() method. I encountered such a requirement at work today. After searching frequently, I found that I could only use isin() in another way to achieve this requirement.

The example is as follows:

In [3]: df = pd.DataFrame([['GD', 'GX', 'FJ'], ['SD', 'SX', 'BJ'], ['HN', 'HB'
 ...: , 'AH'], ['HEN', 'HEN', 'HLJ'], ['SH', 'TJ', 'CQ']], columns=['p1', 'p2
 ...: ', 'p3'])

In [4]: df
Out[4]:
 p1 p2 p3
0 GD GX FJ
1 SD SX BJ
2 HN HB AH
3 HEN HEN HLJ
4 SH TJ CQ

If you only want the two rows where p1 is GD and HN, you can do this:

In [8]: df[df.p1.isin(['GD', 'HN'])]
Out[8]:
 p1 p2 p3
0 GD GX FJ
2 HN HB AH

But if we want To get data other than these two rows, you need to take a detour.

The principle is to first take out p1 and convert it into a list, then remove unnecessary rows (values) from the list, and then use isin() in the DataFrame

In [9]: ex_list = list(df.p1)

In [10]: ex_list.remove('GD')

In [11]: ex_list.remove('HN')

In [12]: ex_list
Out[12]: ['SD', 'HEN', 'SH']

In [13]: df[df.p1.isin(ex_list)]
Out[13]:
 p1 p2 p3
1 SD SX BJ
3 HEN HEN HLJ
4 SH TJ CQ

Summary

[Related recommendations]

1. Python Free Video Tutorial

2. Python basic introductory tutorial

3. Python meets data collection video tutorial

The above is the detailed content of About the basic operations of pandas.DataFrame in python. For more information, please follow other related articles on the PHP Chinese website!

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn