search
HomeJavaJavaBaseSolution to Java reading Chinese garbled characters

Solution to Java reading Chinese garbled characters

Dec 14, 2019 pm 03:27 PM
javaGarbled characters

Solution to Java reading Chinese garbled characters

Java reads text files (such as csv files, txt files, etc.), and when encountering Chinese, it becomes garbled. The reading code is as follows:

List<String> lines=new ArrayList<String>();    
BufferedReader br = new BufferedReader(new FileReader(fileName));  
String line = null;  
while ((line = br.readLine()) != null) {   
      lines.add(line);  
}  
br.close();

Java's I/O class processing is as shown in the figure:

Solution to Java reading Chinese garbled characters

The Reader class is the parent of the read characters in Java's I/O class, and the InputStream class is the parent class for reading bytes. The InputStreamReader class is the bridge that associates bytes to characters. It is responsible for processing the conversion of read bytes into characters during the I/O process, and the decoding of specific bytes into characters. It is implemented by StreamDecoder, and the Charset encoding format must be specified by the user during the decoding process of StreamDecoder. It is worth noting that if you do not specify Charset, the default character set in the local environment will be used. For example, in the Chinese environment, GBK encoding will be used.

Summary: When Java reads the data stream, you must specify the encoding method of the data stream, otherwise the default character set in the local environment will be used.

After the above analysis, the modified code is as follows:

List<String> lines=new ArrayList<String>();  
BufferedReader br=new BufferedReader(new InputStreamReader(new FileInputStream(fileName),"UTF-8"));  
String line = null;  
while ((line = br.readLine()) != null) {  
      lines.add(line);  
}  
br.close();

For more java knowledge, please pay attention to the java basic tutorial column.

The above is the detailed content of Solution to Java reading Chinese garbled characters. For more information, please follow other related articles on the PHP Chinese website!

Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

MantisBT

MantisBT

Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

VSCode Windows 64-bit Download

VSCode Windows 64-bit Download

A free and powerful IDE editor launched by Microsoft

WebStorm Mac version

WebStorm Mac version

Useful JavaScript development tools

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

SublimeText3 Linux new version

SublimeText3 Linux new version

SublimeText3 Linux latest version