How to create a multi-level index (MultiIndex) using Python's pandas library?-Python Tutorial-php.cn

Home

Backend Development

Python Tutorial

How to create a multi-level index (MultiIndex) using Python's pandas library?

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

May 07, 2023 pm 02:55 PM

pythonpandasmultiindex

Introduction

pd.MultiIndex, an index with multiple levels. Through multi-level indexes, we can operate the data of the entire index group. This article mainly introduces 6 ways to create multi-level indexes in Pandas:

pd.MultiIndex.from_arrays(): Multi-dimensional arrays are used as parameters, high dimensions specify high-level indexes, and low dimensions specify low-level indexes. index.
pd.MultiIndex.from_tuples(): List of tuples as argument, each tuple specifying each index (high-dimensional and low-dimensional index).
pd.MultiIndex.from_product(): A list of iterable objects as parameters, created based on the Cartesian product (pairwise combination of elements) of multiple iterable object elements index.
pd.MultiIndex.from_frame: directly generated based on the existing data frame
groupby(): obtained through data grouping statistics
pivot_table(): Generate a pivot table to get

pd.MultiIndex.from_arrays()

In [1] :

import pandas as pd
import numpy as np

is generated through an array, usually specifying the elements in the list:

In [2]:

# 列表元素是字符串和数字
array1 = [["xiaoming","guanyu","zhangfei"], 
          [22,25,27]
         ]
m1 = pd.MultiIndex.from_arrays(array1)
m1

Out[2]:

MultiIndex([(&#39;xiaoming&#39;, 22),            (  &#39;guanyu&#39;, 25),            (&#39;zhangfei&#39;, 27)],
           )

In [3]:

type(m1)  # 查看数据类型

Use the type function to check the data type and find that it is indeed: MultiIndex

Out[3]:

pandas.core.indexes.multi.MultiIndex

is created At the same time, you can specify the name of each level:

In [4]:

# 列表元素全是字符串
array2 = [["xiaoming","guanyu","zhangfei"],
          ["male","male","female"]
         ]
m2 = pd.MultiIndex.from_arrays(
	array2, 
  # 指定姓名和性别
  names=["name","sex"])
m2

Out[4]:

MultiIndex([(&#39;xiaoming&#39;,   &#39;male&#39;),            (  &#39;guanyu&#39;,   &#39;male&#39;),            (&#39;zhangfei&#39;, &#39;female&#39;)],
           names=[&#39;name&#39;, &#39;sex&#39;])

The following example generates an index of three levels and Specify name:

In [5]:

array3 = [["xiaoming","guanyu","zhangfei"],
          ["male","male","female"],
          [22,25,27]
         ]
m3 = pd.MultiIndex.from_arrays(
	array3, 
	names=["姓名","性别","年龄"])
m3

Out[5]:

MultiIndex([(&#39;xiaoming&#39;,   &#39;male&#39;, 22),            (  &#39;guanyu&#39;,   &#39;male&#39;, 25),            (&#39;zhangfei&#39;, &#39;female&#39;, 27)],
           names=[&#39;姓名&#39;, &#39;性别&#39;, &#39;年龄&#39;])

pd.MultiIndex.from_tuples()

Through tuples To generate multi-level indexes in the form:

In [6]:

# 元组的形式
array4 = (("xiaoming","guanyu","zhangfei"), 
          (22,25,27)
         )
m4 = pd.MultiIndex.from_arrays(array4)
m4

Out[6]:

MultiIndex([(&#39;xiaoming&#39;, 22),            (  &#39;guanyu&#39;, 25),            (&#39;zhangfei&#39;, 27)],
           )

In [7]:

# 元组构成的3层索引
array5 = (("xiaoming","guanyu","zhangfei"),
          ("male","male","female"),
          (22,25,27))
m5 = pd.MultiIndex.from_arrays(array5)
m5

Out [7]:

MultiIndex([(&#39;xiaoming&#39;,   &#39;male&#39;, 22),            (  &#39;guanyu&#39;,   &#39;male&#39;, 25),            (&#39;zhangfei&#39;, &#39;female&#39;, 27)],
           )

Lists and tuples can be mixed.

The outermost layer is the list
All are tuples

In [8]:

array6 = [("xiaoming","guanyu","zhangfei"),
          ("male","male","female"),
          (18,35,27)
         ]
# 指定名字
m6 = pd.MultiIndex.from_arrays(array6,names=["姓名","性别","年龄"])
m6

Out[8]:

MultiIndex([(&#39;xiaoming&#39;,   &#39;male&#39;, 18),            (  &#39;guanyu&#39;,   &#39;male&#39;, 35),            (&#39;zhangfei&#39;, &#39;female&#39;, 27)],
           names=[&#39;姓名&#39;, &#39;性别&#39;, &#39;年龄&#39;] # 指定名字
           )

pd.MultiIndex.from_product()

Use a list of iterable objects as parameters to create an index based on the Cartesian product of multiple iterable object elements (a pairwise combination of elements).

In Python, we use the isinstance() function to determine whether a python object is iterable:

# 导入 collections 模块的 Iterable 对比对象
from collections import Iterable

How to create a multi-level index (MultiIndex) using Pythons pandas library?

Through the above examples we summarize: Common strings, lists, sets, tuples, and dictionaries are all iterable objects

The following examples are given to illustrate:

In [18 ]:

names = ["xiaoming","guanyu","zhangfei"]
numbers = [22,25]
m7 = pd.MultiIndex.from_product(
    [names, numbers], 
    names=["name","number"]) # 指定名字
m7

Out[18]:

MultiIndex([(&#39;xiaoming&#39;, 22),            (&#39;xiaoming&#39;, 25),            (  &#39;guanyu&#39;, 22),            (  &#39;guanyu&#39;, 25),            (&#39;zhangfei&#39;, 22),            (&#39;zhangfei&#39;, 25)],
           names=[&#39;name&#39;, &#39;number&#39;])

In [19]:

# 需要展开成列表形式
strings = list("abc") 
lists = [1,2]
m8 = pd.MultiIndex.from_product(
	[strings, lists],
	names=["alpha","number"])
m8

Out[19]:

MultiIndex([(&#39;a&#39;, 1),            (&#39;a&#39;, 2),            (&#39;b&#39;, 1),            (&#39;b&#39;, 2),            (&#39;c&#39;, 1),            (&#39;c&#39;, 2)],
           names=[&#39;alpha&#39;, &#39;number&#39;])

In [20]:

# 使用元组形式
strings = ("a","b","c") 
lists = [1,2]
m9 = pd.MultiIndex.from_product(
	[strings, lists],
	names=["alpha","number"])
m9

Out[20]:

MultiIndex([(&#39;a&#39;, 1),            (&#39;a&#39;, 2),            (&#39;b&#39;, 1),            (&#39;b&#39;, 2),            (&#39;c&#39;, 1),            (&#39;c&#39;, 2)],
           names=[&#39;alpha&#39;, &#39;number&#39;])

In [21]:

# 使用range函数
strings = ("a","b","c")  # 3个元素
lists = range(3)  # 0,1,2  3个元素
m10 = pd.MultiIndex.from_product(
	[strings, lists],
	names=["alpha","number"])
m10

Out[21]:

MultiIndex([(&#39;a&#39;, 0),            (&#39;a&#39;, 1),            (&#39;a&#39;, 2),            (&#39;b&#39;, 0),            (&#39;b&#39;, 1),            (&#39;b&#39;, 2),            (&#39;c&#39;, 0),            (&#39;c&#39;, 1),            (&#39;c&#39;, 2)],
           names=[&#39;alpha&#39;, &#39;number&#39;])

In [22]:

# 使用range函数
strings = ("a","b","c") 
list1 = range(3)  # 0,1,2
list2 = ["x","y"]
m11 = pd.MultiIndex.from_product(
	[strings, list1, list2],
  names=["name","l1","l2"]
  )
m11  # 总个数 3*3*2=18

The total number is ``332=18`:

Out[22]:

MultiIndex([(&#39;a&#39;, 0, &#39;x&#39;),            (&#39;a&#39;, 0, &#39;y&#39;),            (&#39;a&#39;, 1, &#39;x&#39;),            (&#39;a&#39;, 1, &#39;y&#39;),            (&#39;a&#39;, 2, &#39;x&#39;),            (&#39;a&#39;, 2, &#39;y&#39;),            (&#39;b&#39;, 0, &#39;x&#39;),            (&#39;b&#39;, 0, &#39;y&#39;),            (&#39;b&#39;, 1, &#39;x&#39;),            (&#39;b&#39;, 1, &#39;y&#39;),            (&#39;b&#39;, 2, &#39;x&#39;),            (&#39;b&#39;, 2, &#39;y&#39;),            (&#39;c&#39;, 0, &#39;x&#39;),            (&#39;c&#39;, 0, &#39;y&#39;),            (&#39;c&#39;, 1, &#39;x&#39;),            (&#39;c&#39;, 1, &#39;y&#39;),            (&#39;c&#39;, 2, &#39;x&#39;),            (&#39;c&#39;, 2, &#39;y&#39;)],
           names=[&#39;name&#39;, &#39;l1&#39;, &#39;l2&#39;])

pd.MultiIndex.from_frame()

By current Some DataFrames directly generate multi-level indexes:

df = pd.DataFrame({"name":["xiaoming","guanyu","zhaoyun"],
                  "age":[23,39,34],
                  "sex":["male","male","female"]})
df

How to create a multi-level index (MultiIndex) using Pythons pandas library?

The multi-level indexes are directly generated, and the names are the column fields of the existing data frame:

In [24]:

pd.MultiIndex.from_frame(df)

Out[24]:

MultiIndex([(&#39;xiaoming&#39;, 23,   &#39;male&#39;),            (  &#39;guanyu&#39;, 39,   &#39;male&#39;),            ( &#39;zhaoyun&#39;, 34, &#39;female&#39;)],
           names=[&#39;name&#39;, &#39;age&#39;, &#39;sex&#39;])

Specify the name through the names parameter:

In [25]:

# 可以自定义名字
pd.MultiIndex.from_frame(df,names=["col1","col2","col3"])

Out[ 25]:

MultiIndex([(&#39;xiaoming&#39;, 23,   &#39;male&#39;),            (  &#39;guanyu&#39;, 39,   &#39;male&#39;),            ( &#39;zhaoyun&#39;, 34, &#39;female&#39;)],
           names=[&#39;col1&#39;, &#39;col2&#39;, &#39;col3&#39;])

groupby()

is calculated through the grouping function of the groupby function:

In [26]:

df1 = pd.DataFrame({"col1":list("ababbc"),
                   "col2":list("xxyyzz"),
                   "number1":range(90,96),
                   "number2":range(100,106)})
df1

Out[26] :

How to create a multi-level index (MultiIndex) using Pythons pandas library?

df2 = df1.groupby(["col1","col2"]).agg({"number1":sum,
                                        "number2":np.mean})
df2

How to create a multi-level index (MultiIndex) using Pythons pandas library?

View the index of the data:

In [28]:

df2.index

Out [28]:

MultiIndex([(&#39;a&#39;, &#39;x&#39;),            (&#39;a&#39;, &#39;y&#39;),            (&#39;b&#39;, &#39;x&#39;),            (&#39;b&#39;, &#39;y&#39;),            (&#39;b&#39;, &#39;z&#39;),            (&#39;c&#39;, &#39;z&#39;)],
           names=[&#39;col1&#39;, &#39;col2&#39;])

pivot_table()

Obtained through the data pivot function:

In [29]:

df3 = df1.pivot_table(values=["col1","col2"],index=["col1","col2"])
df3

How to create a multi-level index (MultiIndex) using Pythons pandas library?

In [30]:

df3.index

Out[30]:

MultiIndex([(&#39;a&#39;, &#39;x&#39;),            (&#39;a&#39;, &#39;y&#39;),            (&#39;b&#39;, &#39;x&#39;),            (&#39;b&#39;, &#39;y&#39;),            (&#39;b&#39;, &#39;z&#39;),            (&#39;c&#39;, &#39;z&#39;)],
           names=[&#39;col1&#39;, &#39;col2&#39;])

The above is the detailed content of How to create a multi-level index (MultiIndex) using Python's pandas library?. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:亿速云. If there is any infringement, please contact admin@php.cn delete

What is Python Switch Statement?Apr 30, 2025 pm 02:08 PM

The article discusses Python's new "match" statement introduced in version 3.10, which serves as an equivalent to switch statements in other languages. It enhances code readability and offers performance benefits over traditional if-elif-el

What are Exception Groups in Python?Apr 30, 2025 pm 02:07 PM

Exception Groups in Python 3.11 allow handling multiple exceptions simultaneously, improving error management in concurrent scenarios and complex operations.

What are Function Annotations in Python?Apr 30, 2025 pm 02:06 PM

Function annotations in Python add metadata to functions for type checking, documentation, and IDE support. They enhance code readability, maintenance, and are crucial in API development, data science, and library creation.

What are unit tests in Python?Apr 30, 2025 pm 02:05 PM

The article discusses unit tests in Python, their benefits, and how to write them effectively. It highlights tools like unittest and pytest for testing.

What are Access Specifiers in Python?Apr 30, 2025 pm 02:03 PM

Article discusses access specifiers in Python, which use naming conventions to indicate visibility of class members, rather than strict enforcement.

What is __init__() in Python and how does self play a role in it?Apr 30, 2025 pm 02:02 PM

Article discusses Python's \_\_init\_\_() method and self's role in initializing object attributes. Other class methods and inheritance's impact on \_\_init\_\_() are also covered.

What is the difference between @classmethod, @staticmethod and instance methods in Python?Apr 30, 2025 pm 02:01 PM

The article discusses the differences between @classmethod, @staticmethod, and instance methods in Python, detailing their properties, use cases, and benefits. It explains how to choose the right method type based on the required functionality and da

How do you append elements to a Python array?Apr 30, 2025 am 12:19 AM

InPython,youappendelementstoalistusingtheappend()method.1)Useappend()forsingleelements:my_list.append(4).2)Useextend()or =formultipleelements:my_list.extend(another_list)ormy_list =[4,5,6].3)Useinsert()forspecificpositions:my_list.insert(1,5).Beaware

See all articles