Python Counters: How to use collections.Counter?-Python Tutorial-php.cn

Home

Backend Development

Python Tutorial

Python Counters: How to use collections.Counter?

王林

May 08, 2023 pm 01:34 PM

pythoncollections.counter

1. Introduction

A counter tool provides fast and convenient counting. Counter is a subclass of dict, used to count hashable objects. It is a collection with elements stored like dictionary keys and their counts as values. Counts can be any integer value, including 0 and negative numbers, and the Counter class is a bit like bags or multisets in other languages. To put it simply, it can be counted statistically. Let’s take a look at a few examples to make it clear.
Example:

#计算top10的单词
from collections import Counter
import re
text = &#39;remove an existing key one level down remove an existing key one level down&#39;
words = re.findall(r&#39;\w+&#39;, text)
Counter(words).most_common(10)
[(&#39;remove&#39;, 2),(&#39;an&#39;, 2),(&#39;existing&#39;, 2),(&#39;key&#39;, 2),(&#39;one&#39;, 2)(&#39;level&#39;, 2),(&#39;down&#39;, 2)] 


#计算列表中单词的个数
cnt = Counter()
for word in [&#39;red&#39;, &#39;blue&#39;, &#39;red&#39;, &#39;green&#39;, &#39;blue&#39;, &#39;blue&#39;]:
    cnt[word] += 1
cnt
Counter({&#39;red&#39;: 2, &#39;blue&#39;: 3, &#39;green&#39;: 1})


#上述这样计算有点嘛，下面的方法更简单，直接计算就行
L = [&#39;red&#39;, &#39;blue&#39;, &#39;red&#39;, &#39;green&#39;, &#39;blue&#39;, &#39;blue&#39;] 
Counter(L)
Counter({&#39;red&#39;: 2, &#39;blue&#39;: 3, &#39;green&#39;: 1}

Elements are counted from an iterable or initialized from other mapping (or counter):

from collections import Counter

#字符串计数
Counter(&#39;gallahad&#39;) 
Counter({&#39;g&#39;: 1, &#39;a&#39;: 3, &#39;l&#39;: 2, &#39;h&#39;: 1, &#39;d&#39;: 1})

#字典计数
Counter({&#39;red&#39;: 4, &#39;blue&#39;: 2})  
Counter({&#39;red&#39;: 4, &#39;blue&#39;: 2})

#计数
Counter(cats=4, dogs=8)
Counter({&#39;cats&#39;: 4, &#39;dogs&#39;: 8})

Counter([&#39;red&#39;, &#39;blue&#39;, &#39;red&#39;, &#39;green&#39;, &#39;blue&#39;, &#39;blue&#39;])
Counter({&#39;red&#39;: 2, &#39;blue&#39;: 3, &#39;green&#39;: 1})

2. Basic operations

1. Statistics" The number of occurrences of each element in iterable sequence"

1.1 Effect on list/string

The following are two ways to use it, one is to use it directly, and the other is to instantiate it If you want to call it frequently, obviously the latter one is more concise, because you can easily call various methods in Counter, and the same routine is used for other iterable sequences.

#首先引入该方法
from collections import Counter
#对列表作用
list_01 = [1,9,9,5,0,8,0,9]  #GNZ48-陈珂生日
print(Counter(list_01))  #Counter({9: 3, 0: 2, 1: 1, 5: 1, 8: 1})
 
#对字符串作用
temp = Counter(&#39;abcdeabcdabcaba&#39;)
print(temp)  #Counter({&#39;a&#39;: 5, &#39;b&#39;: 4, &#39;c&#39;: 3, &#39;d&#39;: 2, &#39;e&#39;: 1})
#以上其实是两种使用方法，一种是直接用，一种是实例化以后使用,如果要频繁调用的话，显然后一种更简洁

1.2 Output results

#查看类型
print( type(temp) ) #<class &#39;collections.Counter&#39;>
 
#转换为字典后输出
print( dict(temp) ) #{&#39;b&#39;: 4, &#39;a&#39;: 5, &#39;c&#39;: 3, &#39;d&#39;: 2, &#39;e&#39;: 1}
 
for num,count in enumerate(dict(temp).items()):
    print(count)
"""
(&#39;e&#39;, 1)
(&#39;c&#39;, 3)
(&#39;a&#39;, 5)
(&#39;b&#39;, 4)
(&#39;d&#39;, 2)
"""

1.3 Use the built-in items() method to output

Obviously this method is more convenient than converting to a dictionary and then outputting it:

print(temp.items()) #dict_items([(&#39;e&#39;, 1), (&#39;c&#39;, 3), (&#39;b&#39;, 4), (&#39;d&#39;, 2), (&#39;a&#39;, 5)])
 
for item in temp.items():
    print(item)
"""
(&#39;a&#39;, 5)
(&#39;c&#39;, 3)
(&#39;d&#39;, 2)
(&#39;e&#39;, 1)
(&#39;b&#39;, 4)
"""

2. most_common() counts the elements with the most occurrences

Use the most_common() method to return a list containing the n most common elements and the number of occurrences, in order of commonness Sort to low. If n is omitted or None, most_common() will return all elements in the counter. Elements with equal count values are sorted in the order of first appearance. Words often used to calculate top word frequency:

#求序列中出现次数最多的元素
 
from collections import Counter
 
list_01 = [1,9,9,5,0,8,0,9]
temp = Counter(list_01)
 
#统计出现次数最多的一个元素
print(temp.most_common(1))   #[(9, 3)]  元素“9”出现3次。
print(temp.most_common(2)) #[(9, 3), (0, 2)]  统计出现次数最多个两个元素
 
#没有指定个数，就列出全部
print(temp.most_common())  #[(9, 3), (0, 2), (1, 1), (5, 1), (8, 1)]

Counter(&#39;abracadabra&#39;).most_common(3)
[(&#39;a&#39;, 5), (&#39;b&#39;, 2), (&#39;r&#39;, 2)]

Counter(&#39;abracadabra&#39;).most_common(5)
[(&#39;a&#39;, 5), (&#39;b&#39;, 2), (&#39;r&#39;, 2), (&#39;c&#39;, 1), (&#39;d&#39;, 1)]

3. elements () and sort() methods

Description: Returns an iterator in which each element will be repeated the number of times specified by the count value. Elements are returned in order of first occurrence. If an element's count is less than 1, elements() will ignore it.
Example:

c = Counter(a=4, b=2, c=0, d=-2)
list(c.elements())
[&#39;a&#39;, &#39;a&#39;, &#39;a&#39;, &#39;a&#39;, &#39;b&#39;, &#39;b&#39;]

sorted(c.elements())
[&#39;a&#39;, &#39;a&#39;, &#39;a&#39;, &#39;a&#39;, &#39;b&#39;, &#39;b&#39;]

c = Counter(a=4, b=2, c=0, d=5)
list(c.elements())
[&#39;a&#39;, &#39;a&#39;, &#39;a&#39;, &#39;a&#39;, &#39;b&#39;, &#39;b&#39;, &#39;d&#39;, &#39;d&#39;, &#39;d&#39;, &#39;d&#39;, &#39;d&#39;]

from collections import Counter
 
c = Counter(&#39;ABCABCCC&#39;)
print(c.elements()) #<itertools.chain object at 0x0000027D94126860>
 
#尝试转换为list
print(list(c.elements())) #[&#39;A&#39;, &#39;A&#39;, &#39;C&#39;, &#39;C&#39;, &#39;C&#39;, &#39;C&#39;, &#39;B&#39;, &#39;B&#39;]
 
#或者这种方式
print(sorted(c.elements()))  #[&#39;A&#39;, &#39;A&#39;, &#39;B&#39;, &#39;B&#39;, &#39;C&#39;, &#39;C&#39;, &#39;C&#39;, &#39;C&#39;]
 
#这里与sorted的作用是： list all unique elements，列出所有唯一元素
#例如
print( sorted(c) ) #[&#39;A&#39;, &#39;B&#39;, &#39;C&#39;]

Official document example:

# Knuth&#39;s example for prime factors of 1836:  2**2 * 3**3 * 17**1
prime_factors = Counter({2: 2, 3: 3, 17: 1})
product = 1
for factor in prime_factors.elements():  # loop over factors
    product *= factor  # and multiply them
print(product)  #1836
#1836 = 2*2*3*3*3*17

4. subtract() subtraction operation: the output will not ignore the count whose result is zero or less than zero

Subtract elements from an iterable or mapped object. Both input and output can be 0 or negative.

c = Counter(a=4, b=2, c=0, d=-2)
d = Counter(a=1, b=2, c=3, d=4)
c.subtract(d)
c
Counter({&#39;a&#39;: 3, &#39;b&#39;: 0, &#39;c&#39;: -3, &#39;d&#39;: -6})

#减去一个abcd
str0 = Counter(&#39;aabbccdde&#39;)
str0
Counter({&#39;a&#39;: 2, &#39;b&#39;: 2, &#39;c&#39;: 2, &#39;d&#39;: 2, &#39;e&#39;: 1})

str0.subtract(&#39;abcd&#39;)
str0
Counter({&#39;a&#39;: 1, &#39;b&#39;: 1, &#39;c&#39;: 1, &#39;d&#39;: 1, &#39;e&#39;: 1}

subtract_test01 = Counter("AAB")
subtract_test01.subtract("BCC")
print(subtract_test01)  #Counter({&#39;A&#39;: 2, &#39;B&#39;: 0, &#39;C&#39;: -2})

The count here can be reduced to zero and can include zero and negative numbers:

subtract_test02 = Counter("which")
subtract_test02.subtract("witch")  #从另一个迭代序列中减去元素
subtract_test02.subtract(Counter("watch"))  #^……
 
#查看结果
print( subtract_test02["h"] )  # 0 ,whirch 中两个，减去witch中一个，减去watch中一个，剩0个
print( subtract_test02["w"] )  #-1

5. Dictionary method

Usually dictionary methods can be used for Counter objects, except There are two methods that work differently than dictionaries.

fromkeys(iterable): This class method is not implemented in Counter.
update([iterable-or-mapping]): Count elements from the iterable object or add from another mapping object (or counter), the number of elements is added. In addition, the iteration object should be a sequence element, not a (key, value) pair.

sum(c.values())                 # total of all counts
c.clear()                       # reset all counts
list(c)                         # list unique elements
set(c)                          # convert to a set
dict(c)                         # convert to a regular dictionary
c.items()                       # convert to a list of (elem, cnt) pairs
Counter(dict(list_of_pairs))    # convert from a list of (elem, cnt) pairs
c.most_common(n)                   # n least common elements
+c                              # remove zero and negative counts

6. Mathematical operations

This function is very powerful and provides several mathematical operations that can be combined with Counter objects to produce multisets (elements greater than 0 in the counter ). Addition and subtraction combine counters by adding or subtracting the corresponding count of elements. Intersection and union return the minimum or maximum value of the corresponding count. Each operation accepts signed counts, but the output ignores counts whose result is zero or less than zero.

c = Counter(a=3, b=1)
d = Counter(a=1, b=2)
c + d                       # add two counters together:  c[x] + d[x]
Counter({&#39;a&#39;: 4, &#39;b&#39;: 3})
c - d                       # subtract (keeping only positive counts)
Counter({&#39;a&#39;: 2})
c & d                       # intersection:  min(c[x], d[x]) 
Counter({&#39;a&#39;: 1, &#39;b&#39;: 1})
c | d                       # union:  max(c[x], d[x])
Counter({&#39;a&#39;: 3, &#39;b&#39;: 2})

print(Counter(&#39;AAB&#39;) + Counter(&#39;BCC&#39;))
#Counter({&#39;B&#39;: 2, &#39;C&#39;: 2, &#39;A&#39;: 2})
print(Counter("AAB")-Counter("BCC"))
#Counter({&#39;A&#39;: 2})

And" and "OR" operations:

print(Counter(&#39;AAB&#39;) & Counter(&#39;BBCC&#39;))
#Counter({&#39;B&#39;: 1})
 
print(Counter(&#39;AAB&#39;) | Counter(&#39;BBCC&#39;))
#Counter({&#39;A&#39;: 2, &#39;C&#39;: 2, &#39;B&#39;: 2})

Unidirectional addition and subtraction (unary operators) means adding or subtracting from the empty counter, which is equivalent to multiplying the count value by positive Value or negative value, the output will also ignore the count whose result is zero or less than zero:

c = Counter(a=2, b=-4)
+c
Counter({&#39;a&#39;: 2})
-c
Counter({&#39;b&#39;: 4})

Write an algorithm to calculate text similarity, weighted similarity:

def str_sim(str_0,str_1,topn):
    topn = int(topn)
    collect0 = Counter(dict(Counter(str_0).most_common(topn)))
    collect1 = Counter(dict(Counter(str_1).most_common(topn)))       
    jiao = collect0 & collect1
    bing = collect0 | collect1       
    sim = float(sum(jiao.values()))/float(sum(bing.values()))        
    return(sim)         

str_0 = &#39;定位手机定位汽车定位GPS定位人定位位置查询&#39;         
str_1 = &#39;导航定位手机定位汽车定位GPS定位人定位位置查询&#39;         

str_sim(str_0,str_1,5)    
0.75

7. Calculate the total number of elements, Keys() and Values()

from collections import Counter
 
c = Counter(&#39;ABCABCCC&#39;)
print(sum(c.values()))  # 8  total of all counts
 
print(c.keys())  #dict_keys([&#39;A&#39;, &#39;B&#39;, &#39;C&#39;])
print(c.values())  #dict_values([2, 2, 4])

8. Query single element results

from collections import Counter
c = Counter(&#39;ABBCC&#39;)
#查询具体某个元素的个数
print(c["A"])  #1

9. Add

for elem in &#39;ADD&#39;:  # update counts from an iterabl
    c[elem] += 1
print(c.most_common())  #[(&#39;C&#39;, 2), (&#39;D&#39;, 2), (&#39;A&#39;, 2), (&#39;B&#39;, 2)]
#可以看出“A”增加了一个，新增了两个“D”

10. Delete (del)

del c["D"]
print(c.most_common())  #[(&#39;C&#39;, 2), (&#39;A&#39;, 2), (&#39;B&#39;, 2)]
del c["C"]
print(c.most_common())  #[(&#39;A&#39;, 2), (&#39;B&#39;, 2)]

11. Update update()

d = Counter("CCDD")
c.update(d)
print(c.most_common())  #[(&#39;B&#39;, 2), (&#39;A&#39;, 2), (&#39;C&#39;, 2), (&#39;D&#39;, 2)]

12. Clear clear()

c.clear()
print(c)  #Counter()

3. Summary

Counter is a dict subclass, mainly used to access you The frequency of objects is counted.

Commonly used methods:

elements(): Returns an iterator, the number of repeated calculations for each element, if the count of an element If it is less than 1, it will be ignored.
most_common([n]): Returns a list providing the n most frequently accessed elements and their count
subtract([iterable-or-mapping]): Subtract elements from the iterable object. The input and output can be 0 or negative numbers, which is different from the role of the minus sign -
update ([iterable-or-mapping]): Count elements from an iterable object or add from another mapping object (or counter).

Example:

# 统计字符出现的次数
>>> import collections
>>> collections.Counter(&#39;hello world&#39;)
Counter({&#39;l&#39;: 3, &#39;o&#39;: 2, &#39;h&#39;: 1, &#39;e&#39;: 1, &#39; &#39;: 1, &#39;w&#39;: 1, &#39;r&#39;: 1, &#39;d&#39;: 1})
# 统计单词数
>>> collections.Counter(&#39;hello world hello world hello nihao&#39;.split())
Counter({&#39;hello&#39;: 3, &#39;world&#39;: 2, &#39;nihao&#39;: 1})

Commonly used Method:

>>> c = collections.Counter(&#39;hello world hello world hello nihao&#39;.split())
>>> c
Counter({&#39;hello&#39;: 3, &#39;world&#39;: 2, &#39;nihao&#39;: 1})
# 获取指定对象的访问次数，也可以使用get()方法
>>> c[&#39;hello&#39;]
3
>>> c = collections.Counter(&#39;hello world hello world hello nihao&#39;.split())
# 查看元素
>>> list(c.elements())
[&#39;hello&#39;, &#39;hello&#39;, &#39;hello&#39;, &#39;world&#39;, &#39;world&#39;, &#39;nihao&#39;]
# 追加对象，或者使用c.update(d)
>>> c = collections.Counter(&#39;hello world hello world hello nihao&#39;.split())
>>> d = collections.Counter(&#39;hello world&#39;.split())
>>> c
Counter({&#39;hello&#39;: 3, &#39;world&#39;: 2, &#39;nihao&#39;: 1})
>>> d
Counter({&#39;hello&#39;: 1, &#39;world&#39;: 1})
>>> c + d
Counter({&#39;hello&#39;: 4, &#39;world&#39;: 3, &#39;nihao&#39;: 1})
# 减少对象，或者使用c.subtract(d)
>>> c - d
Counter({&#39;hello&#39;: 2, &#39;world&#39;: 1, &#39;nihao&#39;: 1})
# 清除
>>> c.clear()
>>> c
Counter()

The above is the detailed content of Python Counters: How to use collections.Counter?. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:亿速云. If there is any infringement, please contact admin@php.cn delete

What are the alternatives to concatenate two lists in Python?May 09, 2025 am 12:16 AM

There are many methods to connect two lists in Python: 1. Use operators, which are simple but inefficient in large lists; 2. Use extend method, which is efficient but will modify the original list; 3. Use the = operator, which is both efficient and readable; 4. Use itertools.chain function, which is memory efficient but requires additional import; 5. Use list parsing, which is elegant but may be too complex. The selection method should be based on the code context and requirements.

Python: Efficient Ways to Merge Two ListsMay 09, 2025 am 12:15 AM

There are many ways to merge Python lists: 1. Use operators, which are simple but not memory efficient for large lists; 2. Use extend method, which is efficient but will modify the original list; 3. Use itertools.chain, which is suitable for large data sets; 4. Use * operator, merge small to medium-sized lists in one line of code; 5. Use numpy.concatenate, which is suitable for large data sets and scenarios with high performance requirements; 6. Use append method, which is suitable for small lists but is inefficient. When selecting a method, you need to consider the list size and application scenarios.

Compiled vs Interpreted Languages: pros and consMay 09, 2025 am 12:06 AM

Compiledlanguagesofferspeedandsecurity,whileinterpretedlanguagesprovideeaseofuseandportability.1)CompiledlanguageslikeC arefasterandsecurebuthavelongerdevelopmentcyclesandplatformdependency.2)InterpretedlanguageslikePythonareeasiertouseandmoreportab

Python: For and While Loops, the most complete guideMay 09, 2025 am 12:05 AM

In Python, a for loop is used to traverse iterable objects, and a while loop is used to perform operations repeatedly when the condition is satisfied. 1) For loop example: traverse the list and print the elements. 2) While loop example: guess the number game until you guess it right. Mastering cycle principles and optimization techniques can improve code efficiency and reliability.

Python concatenate lists into a stringMay 09, 2025 am 12:02 AM

To concatenate a list into a string, using the join() method in Python is the best choice. 1) Use the join() method to concatenate the list elements into a string, such as ''.join(my_list). 2) For a list containing numbers, convert map(str, numbers) into a string before concatenating. 3) You can use generator expressions for complex formatting, such as ','.join(f'({fruit})'forfruitinfruits). 4) When processing mixed data types, use map(str, mixed_list) to ensure that all elements can be converted into strings. 5) For large lists, use ''.join(large_li

Python's Hybrid Approach: Compilation and Interpretation CombinedMay 08, 2025 am 12:16 AM

Pythonusesahybridapproach,combiningcompilationtobytecodeandinterpretation.1)Codeiscompiledtoplatform-independentbytecode.2)BytecodeisinterpretedbythePythonVirtualMachine,enhancingefficiencyandportability.

Learn the Differences Between Python's 'for' and 'while' LoopsMay 08, 2025 am 12:11 AM

ThekeydifferencesbetweenPython's"for"and"while"loopsare:1)"For"loopsareidealforiteratingoversequencesorknowniterations,while2)"while"loopsarebetterforcontinuinguntilaconditionismetwithoutpredefinediterations.Un

Python concatenate lists with duplicatesMay 08, 2025 am 12:09 AM

In Python, you can connect lists and manage duplicate elements through a variety of methods: 1) Use operators or extend() to retain all duplicate elements; 2) Convert to sets and then return to lists to remove all duplicate elements, but the original order will be lost; 3) Use loops or list comprehensions to combine sets to remove duplicate elements and maintain the original order.

See all articles

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

How to fix KB5055523 fails to install in Windows 11?

4 weeks agoByDDD

How to fix KB5055518 fails to install in Windows 10?

4 weeks agoByDDD

Roblox: Grow A Garden - Complete Mutation Guide

3 weeks agoByDDD

Roblox: Bubble Gum Simulator Infinity - How To Get And Use Royal Keys

3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

How to fix KB5055612 fails to install in Windows 10?

3 weeks agoByDDD

Hot Tools

Safe Exam Browser

Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

SublimeText3 Linux new version

SublimeText3 Linux latest version

DVWA

Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software