The principle and Python implementation of pooling-Python Tutorial-php.cn

Home

Backend Development

Python Tutorial

The principle and Python implementation of pooling

高洛峰

Nov 02, 2016 am 10:07 AM

This article first explains the operations corresponding to pooling, then analyzes some of the principles behind pooling, and finally gives the Python implementation of pooling.

1. Operations corresponding to pooling

First of all, we have an intuitive concept of pooling as a whole (that is, describing the input, output and specific functions of pooling, but ignoring the specific implementation details): The input of pooling is a Matrix, the output is a matrix; the completed function is to operate on a local area of the input matrix so that the output corresponding to the area can best represent the characteristics of the area. As shown in Figure 1, the yellow matrix on the left represents the input matrix, and the blue matrix on the right represents the output matrix; the dynamic orange matrix represents a local area of the selected input matrix, and then finds the best representative of the area, and finally All selected representatives are sorted in the output matrix according to the spatial position relationship corresponding to the original input matrix.

This process can be compared to the election process. If you want to elect the mayor of Beijing, a feasible approach is for each district in Beijing to select a representative who best suits the interests of the district, and then the elected representatives decide how to select the mayor of Beijing. Of course, we hope that the representatives elected by each district can best meet the interests of that district. To make a simple analogy with pooling, Beijing 〈-〉 input matrix; Chaoyang District, Haidian District and other 〈-〉 local areas; each district represents the 〈-〉 output matrix (if they sit according to geographical location during a meeting, this is the same as the characteristics of pooling Very similar).

The principle and Python implementation of pooling

2. The reason behind pooling

In the process of selecting representatives in a local area, our general approach is to select the most prestigious person in the area as the representative (corresponding to max pooling) or select the person who best represents the area. People with general characteristics of the area owners are used as representatives (corresponding to mean pooling). Correspondingly, there are two common methods in pooling: the one with the largest local area value wins as the representative of the area or all the values in the area are taken. average as representative of the area.

Choose the most reputable person in the area as a representative vs. select the person who best represents the general characteristics of everyone in the area as a representative. The advantages are:

1) The most reputable person in a local area is not suitable when electing the mayor. There is a deviation, but he may rely on his old age and cannot represent the views of the general public in the area (local maximum values, it is easy to ignore the general characteristics of the area)

2) Although the person who best represents the general characteristics of everyone in the area can represent the The greatest rights and interests of all residents in the region, but due to his limited cognitive ability (the local mean is small, so his cognitive ability is limited), bias is prone to occur when selecting the mayor.

3) If the people in the area have a certain degree of freedom of movement (corresponding to translation and rotation invariance), it will basically have no impact on the above two methods of selecting representatives.

Formal explanation of pooling

According to relevant theories: (1) The variance of the estimated value increases due to the limited size of the neighborhood; (2) The error causes the deviation of the estimated mean. Generally speaking, mean-pooling can reduce the first error and retain more background information of the image, while max-pooling can reduce the second error and retain more texture information.

Generally, the input dimension of pooling is high and the output dimension is low. This can be understood as dimensionality reduction to a certain extent. Based on the above explanation of the pooling principle, we can infer that this dimensionality reduction process greatly retains Some of the most important information to enter. In the actual application of pooling, we need to conduct a detailed analysis based on the characteristics of the actual problem. In fact, once you know the operation and principle of pooling, if it is well combined with specific problems, it will be a good innovation point, haha.

3. Python implementation of pooing

Some of the author’s thoughts when writing the code are as follows. The core is to split a complex problem into a problem that can be directly implemented with code:

1) The input matrix can be mxn, It can also be mxnxp. If you directly consider these two forms when writing code, you will not know where to start (there are a lot of situations to consider, and I am easily confused by multi-dimensional matrices). After careful analysis, I found that if I implement the pooling of the mxn matrix, then the mxnxp matrix can be easily implemented using the implementation of the mxn matrix.

2) For mxn matrix input, it is possible that the orange box in Figure 1 cannot exactly cover the input matrix, so the input matrix needs to be expanded. The expansion is also very simple. As long as the poolSize corresponding to the last poolStride can cover the input matrix, the others can definitely be covered.

3) Finally, the for loop performs similar operations.

def pooling(inputMap,poolSize=3,poolStride=2,mode=&#39;max&#39;):
    """INPUTS:
              inputMap - input array of the pooling layer
              poolSize - X-size(equivalent to Y-size) of receptive field
              poolStride - the stride size between successive pooling squares
       
       OUTPUTS:
               outputMap - output array of the pooling layer
               
       Padding mode - &#39;edge&#39;
    """
    # inputMap sizes
    in_row,in_col = np.shape(inputMap)
    
    # outputMap sizes
    out_row,out_col = int(np.floor(in_row/poolStride)),int(np.floor(in_col/poolStride))
    row_remainder,col_remainder = np.mod(in_row,poolStride),np.mod(in_col,poolStride)
    if row_remainder != 0:
        out_row +=1
    if col_remainder != 0:
        out_col +=1
    outputMap = np.zeros((out_row,out_col))
    
    # padding
    temp_map = np.lib.pad(inputMap, ((0,poolSize-row_remainder),(0,poolSize-col_remainder)), &#39;edge&#39;)
    
    # max pooling
    for r_idx in range(0,out_row):
        for c_idx in range(0,out_col):
            startX = c_idx * poolStride
            startY = r_idx * poolStride
            poolField = temp_map[startY:startY + poolSize, startX:startX + poolSize]
            poolOut = np.max(poolField)
            outputMap[r_idx,c_idx] = poolOut
    
    # retrun outputMap
    return  outputMap
# 测试实例
test = np.array([[1,2,3,4],[5,6,7,8],[9,10,11,12]])
test_result = pooling(test, 2, 2, &#39;max&#39;)
print(test_result)

Test results:

The principle and Python implementation of pooling

Summary: First understand the input, output and functions of a technology; then look for similar examples in life; finally, break down the technology into achievable steps.

Statement

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Python vs. C : Learning Curves and Ease of UseApr 19, 2025 am 12:20 AM

Python is easier to learn and use, while C is more powerful but complex. 1. Python syntax is concise and suitable for beginners. Dynamic typing and automatic memory management make it easy to use, but may cause runtime errors. 2.C provides low-level control and advanced features, suitable for high-performance applications, but has a high learning threshold and requires manual memory and type safety management.

Python vs. C : Memory Management and ControlApr 19, 2025 am 12:17 AM

Python and C have significant differences in memory management and control. 1. Python uses automatic memory management, based on reference counting and garbage collection, simplifying the work of programmers. 2.C requires manual management of memory, providing more control but increasing complexity and error risk. Which language to choose should be based on project requirements and team technology stack.

Python for Scientific Computing: A Detailed LookApr 19, 2025 am 12:15 AM

Python's applications in scientific computing include data analysis, machine learning, numerical simulation and visualization. 1.Numpy provides efficient multi-dimensional arrays and mathematical functions. 2. SciPy extends Numpy functionality and provides optimization and linear algebra tools. 3. Pandas is used for data processing and analysis. 4.Matplotlib is used to generate various graphs and visual results.

Python and C : Finding the Right ToolApr 19, 2025 am 12:04 AM

Whether to choose Python or C depends on project requirements: 1) Python is suitable for rapid development, data science, and scripting because of its concise syntax and rich libraries; 2) C is suitable for scenarios that require high performance and underlying control, such as system programming and game development, because of its compilation and manual memory management.

Python for Data Science and Machine LearningApr 19, 2025 am 12:02 AM

Python is widely used in data science and machine learning, mainly relying on its simplicity and a powerful library ecosystem. 1) Pandas is used for data processing and analysis, 2) Numpy provides efficient numerical calculations, and 3) Scikit-learn is used for machine learning model construction and optimization, these libraries make Python an ideal tool for data science and machine learning.

Learning Python: Is 2 Hours of Daily Study Sufficient?Apr 18, 2025 am 12:22 AM

Is it enough to learn Python for two hours a day? It depends on your goals and learning methods. 1) Develop a clear learning plan, 2) Select appropriate learning resources and methods, 3) Practice and review and consolidate hands-on practice and review and consolidate, and you can gradually master the basic knowledge and advanced functions of Python during this period.

Python for Web Development: Key ApplicationsApr 18, 2025 am 12:20 AM

Key applications of Python in web development include the use of Django and Flask frameworks, API development, data analysis and visualization, machine learning and AI, and performance optimization. 1. Django and Flask framework: Django is suitable for rapid development of complex applications, and Flask is suitable for small or highly customized projects. 2. API development: Use Flask or DjangoRESTFramework to build RESTfulAPI. 3. Data analysis and visualization: Use Python to process data and display it through the web interface. 4. Machine Learning and AI: Python is used to build intelligent web applications. 5. Performance optimization: optimized through asynchronous programming, caching and code

Python vs. C : Exploring Performance and EfficiencyApr 18, 2025 am 12:20 AM

Python is better than C in development efficiency, but C is higher in execution performance. 1. Python's concise syntax and rich libraries improve development efficiency. 2.C's compilation-type characteristics and hardware control improve execution performance. When making a choice, you need to weigh the development speed and execution efficiency based on project needs.

See all articles