I am new to spark. I'm trying to flatten the dataframe but am not able to do it via "explode".
The original data frame structure is as follows:
id|approvaljson 1|[{"approvertype":"1st line manager","status":"approved"},{"approvertype":"2nd line manager","status":"approved"}] 2|[{"approvertype":"1st line manager","status":"approved"},{"approvertype":"2nd line manager","status":"rejected"}]
Do I need to convert it to the following schema?
id|approvaltype|status 1|1st line manager|approved 1|2nd line manager|approved 2|1st line manager|approved 2|2nd line manager|rejected
I've tried it
df_exploded = df.withcolumn("approvaljson", explode("approvaljson"))
But I got the error:
Cannot resolve "explode(ApprovalJSON)" due to data type mismatch: parameter 1 requires ("ARRAY" or "MAP") type, however, "ApprovalJSON" is of "STRING" type.;
Correct answer
First parse the json-like string into an array of structures, then use inline
to break the array into rows and columns
df1 = df.withcolumn("approvaljson", f.from_json("approvaljson", schema="array<struct<approvertype string, status string>>")) df1 = df1.select("id", f.inline('approvaljson'))
result
df1.show() +---+----------------+--------+ | ID| ApproverType| Status| +---+----------------+--------+ | 1|1st Line Manager|Approved| | 1|2nd Line Manager|Approved| | 2|1st Line Manager|Approved| | 2|2nd Line Manager|Rejected| +---+----------------+--------+
The above is the detailed content of Unable to decompose nested JSON in Spark dataframe. For more information, please follow other related articles on the PHP Chinese website!

parse 函数解析字符串,将其转换为由分隔符分隔的令牌列表。步骤:1. 从字符串开头搜索第一个非分隔符字符;2. 继续搜索直到遇到分隔符,并在该分隔符处终止字符串;3. 将令牌存储在令牌数组中;4. 重复 1-3 步,直至字符串结束;5. 在数组末尾添加指向 NULL 的指针,表示数组结束。

我有一个jwt令牌,我可以在https://jwt.io/网站上看到解码后的令牌。它不需要我设置任何秘密或声明。所以我正在寻找一种方法来解码令牌以获得过期日期而不提供任何秘密。我正在使用库ngopkg.in/square/go-jose.v2/jwt,下面是我的代码:token,err:=jwt.ParseSigned(jwtToken)返回值token有一个标头字段,其中包括keyid、算法,但它没有给我过期日期。我搜索过这个主题,人们说使用github.com/a

在 C++ 中,有两种将 string 转换为 int 的方法:使用 sto i() 函数,直接接收字符串并返回整数。使用 istringstream 类,将字符串解析为输入流,然后提取整数。选择方法取决于字符串格式:如果格式明确且无非数字字符,stoi() 更简洁;如果字符串可能包含非数字字符或需要自定义转换,则 istringstream 更灵活。

我有一个用例,我们在x-www-form-urlencoded主体中获取嵌套键值,如下所示name=abc&age=12¬es[key1]=value1¬es[key2]=value2我尝试了url.parsequery("name=abc&age=12¬es\[key1\]=value1¬es\[key2\]=value2")但它给出了{"name":"abc","age":12,"notes[key1]":"value1","note

Java中的parse指将字符串或其他表示形式转换为指定类型或对象的处理过程。常见的应用包括将字符串转换为数字类型、日期/时间对象、JSON对象以及从XML文档中提取数据。通过内置方法、格式化器类或第三方库进行parse。

Go语言反射允许在运行时操控变量值,包括修改布尔值、整数、浮点数和字符串。通过获取变量的Value,可以调用SetBool、SetInt、SetFloat和SetString方法进行修改。例如,可以解析JSON字符串为结构体,然后使用反射修改结构体字段的值。需要注意,反射操作较慢,且无法修改不可修改字段,修改结构体字段值时可能不会自动更新相关字段。

Go语言中解析地址的关键函数包括:net.ParseIP():解析IPv4或IPv6地址。net.ParseCIDR():解析CIDR标记。net.ResolveIPAddr():解析主机名或IP地址为IP地址。net.ResolveTCPAddr():解析主机名和端口为TCP地址。net.ResolveUDPAddr():解析主机名和端口为UDP地址。

深入了解PHP:JSONUnicode转中文的实现方法在开发中,我们经常会遇到需要处理JSON数据的情况,而JSON中的Unicode编码在一些场景下会给我们带来一些问题,特别是当需要将Unicode编码转换为中文字符时。在PHP中,有一些方法可以帮助我们实现这个转换过程,下面将介绍一种常用的方法,并提供具体的代码示例。首先,让我们先了解一下JSON中Un


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

DVWA
Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

EditPlus Chinese cracked version
Small size, syntax highlighting, does not support code prompt function

Dreamweaver CS6
Visual web development tools

SublimeText3 Chinese version
Chinese version, very easy to use

MantisBT
Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.
