How to Remove Consecutive Duplicates in Pandas?


Barbara Streisand

Nov 15, 2024, 04:09 AM


Removing Consecutive Duplicates in Pandas

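The drop_duplicates() method in Pandas removes every repeated value, wherever it occurs in the data, so it cannot be restricted to consecutive repeats. To work around this limitation, there are efficient ways to drop only the duplicates that appear back to back.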
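To make the limitation concrete, here is a minimal sketch. The Series a and its values are assumptions chosen for illustration; they do not come from the original article.

```python
import pandas as pd

# Hypothetical sample data: the value 2 appears twice in a row,
# then once more later on, separated by a 3.
a = pd.Series([1, 2, 2, 3, 2], index=[1, 2, 3, 4, 5])

# drop_duplicates() keeps only the first occurrence of each value,
# so the final 2 is removed even though it is not a consecutive repeat.
deduped = a.drop_duplicates()
print(deduped.tolist())
```

The trailing 2 disappears along with the consecutive one, which is why a different technique is needed when only back-to-back repeats should go.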

One approach uses the shift method to compare each value with the one before it:

a.loc[a.shift() != a]

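The comparison returns a Boolean mask in which consecutive duplicates are marked False. loc then selects only the rows where the mask is True, which removes the consecutive duplicates. The first row is always kept, because shift() places a missing value in front of it and a missing value never compares equal to anything.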
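The following sketch walks through the shift comparison step by step. The sample values are assumed for illustration only.

```python
import pandas as pd

# Hypothetical sample data with one consecutive repeat (the second 2).
a = pd.Series([1, 2, 2, 3, 2])

# shift() moves every value down one position; the first slot becomes NaN.
previous = a.shift()

# True where the value differs from the previous one, False for a
# consecutive repeat. NaN != 1 is True, so the first row is kept.
mask = previous != a

# Keep only the rows where the mask is True.
result = a.loc[mask]
print(mask.tolist())
print(result.tolist())
```

Unlike drop_duplicates(), this keeps the later 2, because its previous value is 3.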

Another approach uses the diff method to detect changes:

a.loc[a.diff() != 0]

However, this approach is slower on large datasets because of the overhead of computing the differences.
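A short sketch comparing the two approaches on assumed sample data. Because diff relies on subtraction, it is best suited to numeric values, while the shift comparison also works for strings. The variable names here are hypothetical.

```python
import pandas as pd

# Hypothetical numeric sample data.
a = pd.Series([1, 2, 2, 3, 2])

# diff() gives the difference from the previous row; a consecutive
# repeat yields 0, and the first row yields NaN (which is != 0).
by_diff = a.loc[a.diff() != 0]
by_shift = a.loc[a.shift() != a]

# Both masks select the same rows for numeric data.
print(by_diff.equals(by_shift))

# The shift comparison needs only equality, so it also handles strings.
s = pd.Series(["x", "x", "y", "x"])
s_result = s.loc[s.shift() != s]
print(s_result.tolist())
```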

Update

Because the default shift period is 1, shift() and shift(1) produce the same result:

a.loc[a.shift(1) != a]

This keeps the first value of each run of consecutive duplicates and drops the repeats that follow it.
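As a quick check, the sketch below confirms that shift() and shift(1) agree on assumed sample data. The shift(-1) variant is not part of the original text; it is included only as a contrast, to show how the direction of the shift decides which element of a run survives.

```python
import pandas as pd

# Hypothetical sample data; the run of 2s sits at positions 1 and 2.
a = pd.Series([1, 2, 2, 3, 2])

# The default period is 1, so both calls give the same shifted Series.
same = a.shift().equals(a.shift(1))

# Comparing with the previous value keeps the FIRST row of each run.
kept_first = a.loc[a.shift(1) != a]

# Contrast (an addition for illustration): comparing with the next
# value keeps the LAST row of each run instead.
kept_last = a.loc[a.shift(-1) != a]

print(same)
print(list(kept_first.index))
print(list(kept_last.index))
```

The values are identical in both results; only the surviving index labels differ, which matters when other columns or the index carry meaning.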
