将 Plotly 图表并行转换为图像-Python教程-PHP中文网

首页

后端开发

Python教程

将 Plotly 图表并行转换为图像

Susan Sarandon

Jan 04, 2025 am 02:32 AM

Converting Plotly charts into images in parallel

我们在我工作的公司广泛使用 Plotly 图表。它们可以轻松创建看起来不错的交互式图形。通过 Plotly Express 库获得的 Python 体验非常棒，而且入门门槛很低。

Plotly 图表有两个主要用例：

使用 Plotly Dash 的交互式仪表板。将 Plotly 图表集成到 Dash 中显然很棒。
对于我们的 PDF 报告，我们在渲染 PDF 之前将图表转换为图像。

对于典型的 PDF 报告，我们使用 5-20 个数字来显示特定指标随时间的演变、某些值在多个类别上的分布，或者不同类别之间的比较。

为了创建 PDF 报告，我们结合使用了 Weasyprint、Jinja 和 Plotly 图表。要将报告呈现为 PDF，我们首先必须将所有图表呈现为图像。

使用 Kaleido 渲染图表

为此，我们使用了很棒的 Kaleido 包。它使用 Chrome 浏览器渲染图形并将其保存为图像。该 API 易于使用。

from kaleido.scopes.plotly import PlotlyScope

scope = PlotlyScope()
img_bytes = scope.transform(
    figure=figure, format="png", width=1000, height=1000, scale=4,
)

这会将图中的图形渲染为高度和宽度为 1000 像素、渲染比例为 4 的图像（即图像实际尺寸为 4000 像素 x 4000 像素）。比例越高，最终图像的 DPI 越高，看起来越好，最终的 PDF 也越大。

渲染大量图表

渲染图表需要一点时间，如果您渲染大量图表（10-20），它将占用程序运行时间的很大一部分。为了加快 PDF 渲染管道的速度，我们部署了以下解决方案。

在内部，Kaleido 只是将将图形渲染为图像的问题外包给附带的 Chrome 网络浏览器。这意味着，对于Python本身来说，渲染这个图像基本上是在等待I/O。

为了加速这个特定的过程，并且由于我们只是等待 I/O，所以我们可以使用多线程。

创建随机图

让我们首先创建一个随机图形，如下所示：

import pandas as pd
import numpy as np
import plotly.graph_objects as go

def get_random_figure() -> go.Figure:
    n_bars = 50
    dates = pd.date_range(start="2021-01-01", end="2021-12-31", freq="M")

    figure = go.Figure()
    for i in range(n_bars):
        values = np.random.rand(len(dates))
        figure.add_trace(go.Bar(x=dates, y=values, name=f"Label {i+1}"))

    figure.update_layout(
        dict(
            barmode="group",
            legend=dict(orientation="h", yanchor="top", xanchor="left"),
        )
    )
    figure.update_layout(yaxis=dict(tickformat=".0%"), xaxis=dict(showgrid=False))
    return figure

现在，可以使用上面的代码将图形转换为图像：

from kaleido.scopes.plotly import PlotlyScope
import plotly.graph_objects as go

def figure_to_bytes(figure: go.Figure) -> bytes:
    scope = PlotlyScope()
    return scope.transform(figure=figure, format="png", width=1000, height=1000, scale=4)

最后我们还为以后定义：

def transform_random_figure() -> bytes:
    return figure_to_bytes(get_random_figure())

在线程中运行图像转换

你可能知道，也可能不知道，由于Python中的GIL（全局解释器锁），只有一个线程可以同时执行Python代码。由于图到图像的转换不是Python代码，因此我们可以利用线程同时启动大量图的转换，然后收集结果。

为此，我们定义了一个辅助类：

from kaleido.scopes.plotly import PlotlyScope

scope = PlotlyScope()
img_bytes = scope.transform(
    figure=figure, format="png", width=1000, height=1000, scale=4,
)

这个类将帮助我们检索转换的结果（即图像的字节）。

接下来我们要做的就是遵循在 Python 中使用线程的标准模式：

使用start()方法启动你想要启动的线程。
使用join()方法等待线程返回结果。

我们的线程应该每个调用transform_random_figure()，然后返回字节。在本例中我们启动 10 个线程。

import pandas as pd
import numpy as np
import plotly.graph_objects as go

def get_random_figure() -> go.Figure:
    n_bars = 50
    dates = pd.date_range(start="2021-01-01", end="2021-12-31", freq="M")

    figure = go.Figure()
    for i in range(n_bars):
        values = np.random.rand(len(dates))
        figure.add_trace(go.Bar(x=dates, y=values, name=f"Label {i+1}"))

    figure.update_layout(
        dict(
            barmode="group",
            legend=dict(orientation="h", yanchor="top", xanchor="left"),
        )
    )
    figure.update_layout(yaxis=dict(tickformat=".0%"), xaxis=dict(showgrid=False))
    return figure

start()方法还将调用启动实际逻辑的线程的run()方法（即执行给定的函数，在我们的例子中意味着transform_random_figure()）。

为了收集结果，我们使用线程的 join() 方法并将结果写入文件。

from kaleido.scopes.plotly import PlotlyScope
import plotly.graph_objects as go

def figure_to_bytes(figure: go.Figure) -> bytes:
    scope = PlotlyScope()
    return scope.transform(figure=figure, format="png", width=1000, height=1000, scale=4)

它是如何运作的

这里的主要思想是，每当我们想要将图形转换为图像时，我们都会启动一个线程，并且该线程将在后台等待图形完成。

将整个报告放在一起后，我们在所有线程上调用 join() 并检索所有图形的图像，然后将它们放入报告中。

这样，我们就可以生成没有图表的整个报告，并且无需等待每个图表本身都被转换，从而节省时间。

概括

综上所述，如果您想将多个 Plotly 图表转换为图像，请使用 Python 标准库中的多线程模块来加快转换过程。

您可以非常轻松地做到这一点，只需将 transform() 调用移动到一个线程中，然后等待所有线程完成即可。

附录：守则

def transform_random_figure() -> bytes:
    return figure_to_bytes(get_random_figure())

以上是将 Plotly 图表并行转换为图像的详细内容。更多信息请关注PHP中文网其他相关文章！

声明

本文内容由网友自发贡献，版权归原作者所有，本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容，请联系admin@php.cn

您如何切成python阵列？May 01, 2025 am 12:18 AM

Python列表切片的基本语法是list[start:stop:step]。1.start是包含的第一个元素索引，2.stop是排除的第一个元素索引，3.step决定元素之间的步长。切片不仅用于提取数据，还可以修改和反转列表。

在什么情况下，列表的表现比数组表现更好？May 01, 2025 am 12:06 AM

ListSoutPerformarRaysin：1）DynamicsizicsizingandFrequentInsertions/删除，2）储存的二聚体和3）MemoryFeliceFiceForceforseforsparsedata，butmayhaveslightperformancecostsinclentoperations。

如何将Python数组转换为Python列表？May 01, 2025 am 12:05 AM

toConvertapythonarraytoalist，usEthelist（）constructororageneratorexpression.1）intimpthearraymoduleandcreateanArray.2）USELIST（ARR）或[XFORXINARR] to ConconverTittoalist，请考虑performorefformanceandmemoryfformanceandmemoryfformienceforlargedAtasetset。

当Python中存在列表时，使用数组的目的是什么？May 01, 2025 am 12:04 AM

choosearraysoverlistsinpythonforbetterperformanceandmemoryfliceSpecificScenarios.1）largenumericaldatasets：arraysreducememoryusage.2）绩效 - 临界杂货：arraysoffersoffersOffersOffersOffersPoostSfoostSforsssfortasssfortaskslikeappensearch orearch.3）testessenforcety：arraysenforce：arraysenforc