Home >Backend Development >Python Tutorial >How Can I Filter a Pandas DataFrame Based on Substring Matches?

How Can I Filter a Pandas DataFrame Based on Substring Matches?

Susan Sarandon
Susan SarandonOriginal
2024-12-21 01:29:18125browse

How Can I Filter a Pandas DataFrame Based on Substring Matches?

Filtering a Pandas DataFrame by Substring Criteria

Suppose you have a DataFrame with a column containing string values. You might encounter the need to select rows based on partial string matches, similar to using the idiom re.search(pattern, cell_in_question) in regular expressions. While familiar with the syntax df[df['A'] == "hello world"], finding a way to achieve this for partial string matches, such as 'hello,' can be challenging.

The solution lies in utilizing vectorized string methods, specifically Series.str. By employing this approach, you can perform the following operation:

df[df['A'].str.contains("hello")]

This line of code will return the subset of rows in the DataFrame that contain the substring "hello" in the 'A' column. It's important to note that this capability is available in Pandas version 0.8.1 and subsequent versions.

The above is the detailed content of How Can I Filter a Pandas DataFrame Based on Substring Matches?. For more information, please follow other related articles on the PHP Chinese website!

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn