Home > Article > Backend Development > Example of counting the number of words in text using the len function in Python
Application example of len function in Python: How to use it to count the number of words in text
In Python programming, the len function is a very useful function. Used to return the length or number of elements of an object. In this article, we will introduce how to use the len function to count the number of words in text and provide specific code examples.
Before you start writing code, you need to understand how to define a word. In this article, we will use spaces as word separators, that is, any string between two spaces is considered a word.
The following is a simple code example that shows how to use the len function to count the number of words in text:
def count_words(text): word_count = len(text.split()) return word_count text = "Python是一种简单易学但功能强大的编程语言。它被广泛应用于各种领域,包括软件开发、数据分析和人工智能等。" word_count = count_words(text) print("文本中的单词数量为:", word_count)
In the above code, a function named count_words is first defined, It receives a string parameter text. The implementation idea of the function is to split the text string text into words one by one through the split method, and use the len function to obtain the length of the word list, that is, the number of words. Finally, the number of words is returned.
In the sample code, we assign a text containing multiple sentences to the variable text, then call the count_words function to count the number of words in the text, and print the results.
Run the above code and the number of words in the output text will be: 17. This is because there are 17 space-separated strings in the text, which make up 17 words.
Of course, in actual applications, you may need to perform some preprocessing on the text, such as removing punctuation marks and line breaks, etc. The following is an improved code example that shows how to preprocess the text before counting the number of words:
import re def count_words(text): text = re.sub(r'[^ws]', '', text) # 去除标点符号 text = text.replace(' ', '') # 去除换行符 word_count = len(text.split()) return word_count text = """Python是一种简单易学但功能强大的编程语言。 它被广泛应用于各种领域,包括软件开发、数据分析和人工智能等。""" word_count = count_words(text) print("文本中的单词数量为:", word_count)
In the improved code, we first replaced all punctuation marks with regular expressions, using re. sub method is implemented. Then, the newlines were removed using the .replace method. Finally, count the number of words according to the previous method.
Run the improved code and the number of words in the output text will be: 18. By removing punctuation and line breaks, we cleaned up the text and got a more accurate word count.
To summarize, the above is an example of using the len function to count the number of words in the text, splitting the string into words by the split method, and using the len function to obtain the length of the word list. In practical applications, you can perform text preprocessing as needed to obtain more accurate results. I hope this article will help you understand and use the len function!
The above is the detailed content of Example of counting the number of words in text using the len function in Python. For more information, please follow other related articles on the PHP Chinese website!