What are the fast and easy-to-use Python data visualization methods?-Python Tutorial-php.cn

Home

Backend Development

Python Tutorial

What are the fast and easy-to-use Python data visualization methods?

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

May 29, 2023 pm 05:34 PM

python

Data visualization is a very important part of data science or machine learning projects. Typically, you need to perform exploratory data analysis (EDA) early in a project to gain some understanding of the data, and creating visualizations can really make the task of analysis clearer and easier to understand, especially for large-scale, high-dimensional data. set. Nearing the end of a project, it's also important to present the end result in a clear, concise and compelling way that your audience (who are often non-technical clients) can understand.

Heat Map

A method of using color to represent the value of each element in a data matrix is called a Heat Map. Through matrix indexing, two items or features that need to be compared are associated and different colors are used to represent their different values. Heat maps are suitable for displaying relationships between multiple feature variables because the color can directly reflect the size of the matrix element at that position. You can compare each relationship to other relationships in the data set through other points in the heat map. Because of the intuitive nature of color, it provides us with a simple and easy-to-understand way of interpreting data.

What are the fast and easy-to-use Python data visualization methods?

Now let’s take a look at the implementation code. Compared with "matplotlib", "seaborn" can be used to draw more advanced graphics, which usually requires more components, such as multiple colors, graphics or variables. "matplotlib" can be used to display graphics, "NumPy" can be used to generate data, and "pandas" can be used to process data! Drawing is just a simple function of "seaborn".

# Importing libs
import seaborn as sns
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

# Create a random dataset
data = pd.DataFrame(np.random.random((10,6)), columns=["Iron Man","Captain America","Black Widow","Thor","Hulk", "Hawkeye"])

print(data)

# Plot the heatmap
heatmap_plot = sns.heatmap(data, center=0, cmap=&#39;gist_ncar&#39;)

plt.show()

Two-dimensional density plot

The two-dimensional density plot (2D Density Plot) is an intuitive extension of the one-dimensional version of the density plot. Compared with the one-dimensional version, its advantage is that it can see the relationship between the two probability distribution of a variable. The scale plot on the right uses color to represent the probability of each point in the 2D density plot below. The place where our data has the highest probability of occurrence (that is, where the data points are most concentrated) seems to be around size=0.5 and speed=1.4. As you know by now, 2D density plots are very useful for quickly finding the areas where our data is most concentrated with two variables, as opposed to just one variable like a 1D density plot. Observing the data with a two-dimensional density plot is useful when you have two variables that are important to the output and want to understand how they work together to contribute to the distribution of the output.

What are the fast and easy-to-use Python data visualization methods?

#Facts have once again proven that using "seaborn" to write code is very convenient! This time, we'll create a skewed distribution to make the data visualization more interesting. You can adjust most of the optional parameters to make the visualization look clearer.

# Importing libs
import seaborn as sns
import matplotlib.pyplot as plt
from scipy.stats import skewnorm

# Create the data
speed = skewnorm.rvs(4, size=50) 
size = skewnorm.rvs(4, size=50)

# Create and shor the 2D Density plot
ax = sns.kdeplot(speed, size, cmap="Reds", shade=False, bw=.15, cbar=True)
ax.set(xlabel=&#39;speed&#39;, ylabel=&#39;size&#39;)
plt.show()

Spider plots

Spider plots are one of the best ways to display one-to-many relationships.. In other words, you can plot and view the values of multiple variables in relation to a specific variable or category. In a spider web diagram, the significance of one variable over another is clear and obvious because the area covered and the length from the center become larger in a particular direction. You can plot the different categories of objects described by these variables side by side to see the differences between them. In the chart below, it’s easy to compare the different attributes of the Avengers and see where they each excel! (Please note that these data are randomly set and I am not biased against the members of the Avengers.)

What are the fast and easy-to-use Python data visualization methods?

We can use "matplotlib" to generate visualization results, and No need to use "seaborn". We need to have each attribute equally spaced around the circumference. There will be labels on each corner and we will plot the values as a point whose distance from the center is proportional to its value/size. To show this more clearly, we will fill the area formed by the lines connecting the property points with a semi-transparent color.

# Import libs
import pandas as pd
import seaborn as sns
import numpy as np
import matplotlib.pyplot as plt

# Get the data
df=pd.read_csv("avengers_data.csv")
print(df)

"""
   #             Name  Attack  Defense  Speed  Range  Health
0  1         Iron Man      83       80     75     70      70
1  2  Captain America      60       62     63     80      80
2  3             Thor      80       82     83    100     100
3  3             Hulk      80      100     67     44      92
4  4      Black Widow      52       43     60     50      65
5  5          Hawkeye      58       64     58     80      65

"""

# Get the data for Iron Man
labels=np.array(["Attack","Defense","Speed","Range","Health"])
stats=df.loc[0,labels].values

# Make some calculations for the plot
angles=np.linspace(0, 2*np.pi, len(labels), endpoint=False)
stats=np.concatenate((stats,[stats[0]]))
angles=np.concatenate((angles,[angles[0]]))

# Plot stuff
fig = plt.figure()
ax = fig.add_subplot(111, polar=True)
ax.plot(angles, stats, &#39;o-&#39;, linewidth=2)
ax.fill(angles, stats, alpha=0.25)
ax.set_thetagrids(angles * 180/np.pi, labels)
ax.set_title([df.loc[0,"Name"]])
ax.grid(True)

plt.show()

Treemap

We have learned to use treemaps since elementary school. Because tree diagrams are naturally intuitive, they are easy to understand. Nodes that are directly connected are closely related, while nodes with multiple connections are less similar. In the visualization below, I plotted a tree diagram of a small subset of the Pokemon game's dataset based on Kaggle's statistics (health, attack, defense, special attack, special defense, speed).

因此，统计意义上最匹配的口袋妖怪将被紧密地连接在一起。例如，在图的顶部，阿柏怪和尖嘴鸟是直接连接的，如果我们查看数据，阿柏怪的总分为 438，尖嘴鸟则为 442，二者非常接近！但是如果我们看看拉达，我们可以看到其总得分为 413，这和阿柏怪、尖嘴鸟就具有较大差别了，所以它们在树状图中是被分开的！当我们沿着树往上移动时，绿色组的口袋妖怪彼此之间比它们和红色组中的任何口袋妖怪都更相似，即使这里并没有直接的绿色的连接。

What are the fast and easy-to-use Python data visualization methods?

实际上，我们需要使用「Scipy」来绘制树状图。一旦读取了数据集中的数据，我们就会删除字符串列。这么做只是为了使可视化结果更加直观、便于理解，但在实践中，将这些字符串转换为分类变量会得到更好的结果和对比效果。我们还创建了数据帧的索引，以方便在每个节点上正确引用它的列。告诉大家的最后一件事是：在“Scipy”中，计算和绘制树状图只需一行简单代码。

# Import libs
import pandas as pd
from matplotlib import pyplot as plt
from scipy.cluster import hierarchy
import numpy as np

# Read in the dataset
# Drop any fields that are strings
# Only get the first 40 because this dataset is big
df = pd.read_csv(&#39;Pokemon.csv&#39;)
df = df.set_index(&#39;Name&#39;)
del df.index.name
df = df.drop(["Type 1", "Type 2", "Legendary"], axis=1)
df = df.head(n=40)

# Calculate the distance between each sample
Z = hierarchy.linkage(df, &#39;ward&#39;)

# Orientation our tree
hierarchy.dendrogram(Z, orientation="left", labels=df.index)

plt.show()

The above is the detailed content of What are the fast and easy-to-use Python data visualization methods?. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:亿速云. If there is any infringement, please contact admin@php.cn delete

Merging Lists in Python: Choosing the Right MethodMay 14, 2025 am 12:11 AM

TomergelistsinPython,youcanusethe operator,extendmethod,listcomprehension,oritertools.chain,eachwithspecificadvantages:1)The operatorissimplebutlessefficientforlargelists;2)extendismemory-efficientbutmodifiestheoriginallist;3)listcomprehensionoffersf

How to concatenate two lists in python 3?May 14, 2025 am 12:09 AM

In Python 3, two lists can be connected through a variety of methods: 1) Use operator, which is suitable for small lists, but is inefficient for large lists; 2) Use extend method, which is suitable for large lists, with high memory efficiency, but will modify the original list; 3) Use * operator, which is suitable for merging multiple lists, without modifying the original list; 4) Use itertools.chain, which is suitable for large data sets, with high memory efficiency.

Python concatenate list stringsMay 14, 2025 am 12:08 AM

Using the join() method is the most efficient way to connect strings from lists in Python. 1) Use the join() method to be efficient and easy to read. 2) The cycle uses operators inefficiently for large lists. 3) The combination of list comprehension and join() is suitable for scenarios that require conversion. 4) The reduce() method is suitable for other types of reductions, but is inefficient for string concatenation. The complete sentence ends.

Python execution, what is that?May 14, 2025 am 12:06 AM

PythonexecutionistheprocessoftransformingPythoncodeintoexecutableinstructions.1)Theinterpreterreadsthecode,convertingitintobytecode,whichthePythonVirtualMachine(PVM)executes.2)TheGlobalInterpreterLock(GIL)managesthreadexecution,potentiallylimitingmul

Python: what are the key featuresMay 14, 2025 am 12:02 AM

Key features of Python include: 1. The syntax is concise and easy to understand, suitable for beginners; 2. Dynamic type system, improving development speed; 3. Rich standard library, supporting multiple tasks; 4. Strong community and ecosystem, providing extensive support; 5. Interpretation, suitable for scripting and rapid prototyping; 6. Multi-paradigm support, suitable for various programming styles.

Python: compiler or Interpreter?May 13, 2025 am 12:10 AM

Python is an interpreted language, but it also includes the compilation process. 1) Python code is first compiled into bytecode. 2) Bytecode is interpreted and executed by Python virtual machine. 3) This hybrid mechanism makes Python both flexible and efficient, but not as fast as a fully compiled language.

Python For Loop vs While Loop: When to Use Which?May 13, 2025 am 12:07 AM

Useaforloopwheniteratingoverasequenceorforaspecificnumberoftimes;useawhileloopwhencontinuinguntilaconditionismet.Forloopsareidealforknownsequences,whilewhileloopssuitsituationswithundeterminediterations.

Python loops: The most common errorsMay 13, 2025 am 12:07 AM

Pythonloopscanleadtoerrorslikeinfiniteloops,modifyinglistsduringiteration,off-by-oneerrors,zero-indexingissues,andnestedloopinefficiencies.Toavoidthese:1)Use'i

See all articles

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

How to fix KB5055612 fails to install in Windows 10?

4 weeks agoByDDD

Roblox: Bubble Gum Simulator Infinity - How To Get And Use Royal Keys

4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Roblox: Grow A Garden - Complete Mutation Guide

3 weeks agoByDDD

Nordhold: Fusion System, Explained

4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Mandragora: Whispers Of The Witch Tree - How To Unlock The Grappling Hook

3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

DVWA

Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),