
Solution to garbled code exported by excel in java

Dec 05, 2019 am 10:41 AM


Wrong way:

String fileName = aicaseNodeDeptService.getFilename(request) + ".xlsx";
response.setCharacterEncoding("UTF-8");
response.setContentType("application/ms-excel");
// Wrong: the header value still contains characters outside ISO8859-1
response.setHeader("Content-Disposition", "inline;filename=" + new String(fileName.getBytes(), "utf-8"));

Correct way:

response.setHeader("Content-Disposition", "inline;filename=" + new String(fileName.getBytes("utf-8"), "iso8859-1"));

Reason: an HTTP header requires its content to be ISO8859-1 encoded, so the UTF-8 bytes of the file name are reinterpreted as ISO8859-1 characters before being placed in the header. After these characters arrive at the destination, the destination program applies the reverse operation, String s_utf8 = new String(s_iso88591.getBytes("ISO8859-1"), "UTF-8"), to get the correct Chinese character "中". This complies with the protocol while still supporting Chinese.
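The transcoding described above can be sketched as a small helper. This is a minimal illustration, not the article's actual service code: the class name HeaderFileName and its three methods are hypothetical, and "\u4e2d" is simply the escaped form of "中" so the example does not depend on the source file's encoding.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper; the class and method names are illustrative only.
final class HeaderFileName {

    // Reinterpret the UTF-8 bytes of the name as ISO-8859-1 characters,
    // so every character in the result carries exactly one byte.
    static String toHeaderSafe(String fileName) {
        return new String(fileName.getBytes(StandardCharsets.UTF_8), StandardCharsets.ISO_8859_1);
    }

    // The reverse step a receiver would apply to recover the original name.
    static String fromHeaderSafe(String headerSafe) {
        return new String(headerSafe.getBytes(StandardCharsets.ISO_8859_1), StandardCharsets.UTF_8);
    }

    // Builds the header value in the same shape as the "correct way" line.
    static String contentDisposition(String fileName) {
        return "inline;filename=" + toHeaderSafe(fileName);
    }

    public static void main(String[] args) {
        String original = "\u4e2d.xlsx"; // "中.xlsx"
        String headerSafe = toHeaderSafe(original);
        // 3 UTF-8 bytes for the Chinese character + 5 ASCII characters
        System.out.println(headerSafe.length()); // prints 8
        System.out.println(fromHeaderSafe(headerSafe).equals(original)); // prints true
    }
}
```

In a servlet, the result of contentDisposition(fileName) would be the value passed to response.setHeader("Content-Disposition", ...). The round trip is lossless because ISO-8859-1 maps every byte value to exactly one character.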

In Java, String's getBytes() method returns a byte array in the platform's default encoding. This means that the result can differ between operating systems and JVM configurations!
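To see which encoding the no-argument getBytes() uses on a given machine, a check along these lines may help. The class name DefaultCharsetDemo and its method are hypothetical; the printed charset depends on the JVM and its settings, so no particular value is assumed here.

```java
import java.nio.charset.Charset;
import java.util.Arrays;

// Hypothetical demo class; names are illustrative only.
final class DefaultCharsetDemo {

    // True when the no-argument getBytes() produces the same bytes as an
    // explicit encode with the JVM's default charset.
    static boolean noArgMatchesDefault(String s) {
        return Arrays.equals(s.getBytes(), s.getBytes(Charset.defaultCharset()));
    }

    public static void main(String[] args) {
        // The value printed here varies by platform and JVM configuration.
        System.out.println("Default charset: " + Charset.defaultCharset());
        System.out.println(noArgMatchesDefault("\u4e2d")); // "中"
    }
}
```

Passing an explicit charset to getBytes, as the following examples do, removes this dependency on the environment.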

The String.getBytes(String decode) method returns the byte array representation of a string in the specified encoding decode, for example:

byte[] b_gbk = "中".getBytes("GBK");
byte[] b_utf8 = "中".getBytes("UTF-8");
byte[] b_iso88591 = "中".getBytes("ISO8859-1");

These calls return the byte array representation of the Chinese character "中" in GBK, UTF-8 and ISO8859-1 encoding respectively. The length of b_gbk is 2, the length of b_utf8 is 3, and the length of b_iso88591 is 1.
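The lengths quoted above can be checked with a short program. This sketch assumes the running JDK ships the GBK charset (it is part of the extended charsets in standard JDK builds, but a trimmed runtime may lack it); the class name ByteLengths is hypothetical.

```java
import java.nio.charset.Charset;

// Hypothetical demo class; names are illustrative only.
final class ByteLengths {

    // Number of bytes the string occupies in the named encoding.
    // Charset.forName throws an unchecked exception if the charset is unavailable.
    static int encodedLength(String s, String charsetName) {
        return s.getBytes(Charset.forName(charsetName)).length;
    }

    public static void main(String[] args) {
        String zhong = "\u4e2d"; // "中"
        for (String name : new String[] {"GBK", "UTF-8", "ISO8859-1"}) {
            if (Charset.isSupported(name)) {
                System.out.println(name + ": " + encodedLength(zhong, name));
            } else {
                System.out.println(name + ": not available in this runtime");
            }
        }
    }
}
```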

Conversely, the "中" character can be restored from the bytes through new String(byte[], decode).

new String(byte[], decode) parses byte[] into a string using the specified encoding decode.

String s_gbk = new String(b_gbk,"GBK");
String s_utf8 = new String(b_utf8,"UTF-8");
String s_iso88591 = new String(b_iso88591,"ISO8859-1");

If you print s_gbk, s_utf8 and s_iso88591, you will find that s_gbk and s_utf8 are both "中", and only s_iso88591 is an unrecognized character (in effect, a garbled character). Why can't the character "中" be restored after encoding it with ISO8859-1 and reassembling it?

The reason is very simple: the encoding table of ISO8859-1 does not contain Chinese characters at all. "中".getBytes("ISO8859-1") therefore cannot produce a correct encoded value for "中" in ISO8859-1, so the character cannot be restored through new String().
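What actually ends up in the byte array can be inspected directly. In this small sketch (the class name LossyIsoDemo is hypothetical), the single byte produced for "中" is the charset's replacement byte, which for ISO-8859-1 is the question mark, so the original character is gone before new String() is ever called.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; names are illustrative only.
final class LossyIsoDemo {

    // Encode to ISO-8859-1 and decode again; unmappable characters are lost.
    static String roundTripThroughIso(String s) {
        byte[] bytes = s.getBytes(StandardCharsets.ISO_8859_1);
        return new String(bytes, StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        byte[] b = "\u4e2d".getBytes(StandardCharsets.ISO_8859_1); // "中"
        System.out.printf("0x%02X%n", b[0]);                 // prints 0x3F, the byte for '?'
        System.out.println(roundTripThroughIso("\u4e2d"));   // prints ?
    }
}
```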

Therefore, when obtaining byte[] through the String.getBytes(String decode) method, you must make sure that the characters in the String actually exist in the decode encoding table, so that the obtained byte[] array can be restored correctly.
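One way to perform that check up front is to ask the charset's encoder whether it can represent the string. This is a sketch of one possible approach rather than something the original article prescribes; the class name EncodableCheck and the method canRepresent are hypothetical, while CharsetEncoder.canEncode is part of java.nio.charset.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; names are illustrative only.
final class EncodableCheck {

    // True when every character of s has a mapping in the given charset.
    static boolean canRepresent(String s, Charset charset) {
        return charset.newEncoder().canEncode(s);
    }

    public static void main(String[] args) {
        String zhong = "\u4e2d"; // "中"
        System.out.println(canRepresent(zhong, StandardCharsets.ISO_8859_1)); // prints false
        System.out.println(canRepresent(zhong, StandardCharsets.UTF_8));      // prints true
    }
}
```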


Note:

Sometimes, in order to adapt Chinese characters to certain special requirements (such as an HTTP header requiring that its content be ISO8859-1 encoded), the bytes of the Chinese characters are encoded a second time, for example:

String s_iso88591 = new String("中".getBytes("UTF-8"), "ISO8859-1");

The s_iso88591 string obtained in this way is actually three ISO8859-1 characters. After these characters are passed to the destination, the destination program applies the reverse operation, String s_utf8 = new String(s_iso88591.getBytes("ISO8859-1"), "UTF-8"), to get the correct Chinese character "中". This both complies with the protocol and supports Chinese.
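To make the "three ISO8859-1 characters" concrete, the intermediate string can be dumped character by character. The sketch below uses a hypothetical class TranscodeSteps; the values E4, B8 and AD are the three UTF-8 bytes of "中" (U+4E2D), each now carried as one character.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; names are illustrative only.
final class TranscodeSteps {

    // Code points of the intermediate string obtained by decoding the
    // UTF-8 bytes of s as ISO-8859-1.
    static int[] isoCodePoints(String s) {
        String iso = new String(s.getBytes(StandardCharsets.UTF_8), StandardCharsets.ISO_8859_1);
        return iso.chars().toArray();
    }

    public static void main(String[] args) {
        for (int cp : isoCodePoints("\u4e2d")) { // "中"
            System.out.printf("%02X ", cp);      // prints E4 B8 AD
        }
        System.out.println();
    }
}
```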


