Home  >  Article  >  Web Front-end  >  Nodejs reads garbled files

Nodejs reads garbled files

WBOY
WBOYOriginal
2023-05-27 20:23:05928browse

Overview

As Node.js is more and more widely used in web development, many developers will encounter the problem of reading garbled files. Although this problem is not unsolvable, it will consume a lot of time and energy. This article will introduce how to solve the problem of garbled files read by Node.js.

Problem Analysis

In Node.js, we can use the fs module to read local files. When reading a file, we need to specify the encoding method of the file, otherwise Node.js will read the file in binary mode by default, resulting in garbled characters when reading the file.

The following is an example code for reading a Chinese file:

const fs = require('fs');

fs.readFile('test.txt', 'utf8', function (err, data) {
    if (err) {
        console.error(err);
    } else {
        console.log(data);
    }
});

In this example, we read test by calling fs.readFile() .txt file. In the second parameter, we specify the encoding method of the file as utf8. However, even if we specify the encoding method of the file, the read file will still be garbled.

Solution

Node.js reads garbled files. There are several solutions:

  1. Confirm the file encoding method

Before determining that the second parameter of fs.readFile() specifies the correct encoding method, we need to confirm whether the encoding method of the file is correct. In Windows, you can view the file encoding by right-clicking the file, selecting Properties, and then selecting the General tab.

If the file encoding method is not utf8, we need to specify the correct encoding method when calling fs.readFile(). Common file encoding methods are:

  • UTF-8: used to support various languages ​​and special characters.
  • GB2312: Applicable to Simplified Chinese.
  • BIG5: Applicable to Traditional Chinese.
  1. Use the iconv-lite module

If you confirm that the file encoding method is correct but garbled characters still appear, we can use the iconv-lite module Perform encoding conversion.

iconv-lite is a Node.js module specially used for encoding conversion. The content of the read binary file can be read by calling its decode() method. Perform encoding conversion.

The code using the iconv-lite module is as follows:

const fs = require('fs');
const iconv = require('iconv-lite');

fs.readFile('test.txt', function (err, data) {
    if (err) {
        console.error(err);
    } else {
        const content = iconv.decode(data, 'gbk'); // 将读取出的二进制文件解码为GBK
        console.log(content);
    }
});

In this example, we read the The extracted binary file is decoded into GBK encoding. In order to decode correctly, we need to specify the correct encoding.

Use Buffer to convert encoding
  1. Another solution is to use the
Buffer

object provided by Node.js for encoding conversion. When reading a file, we can specify the file encoding as null, which will cause fs.readFile() to return a Buffer object. Then we can use the decode() method in the iconv-lite module to convert the Buffer object into text with the specified encoding method. The code for converting encoding using

Buffer

is as follows: <pre class='brush:php;toolbar:false;'>const fs = require('fs'); const iconv = require('iconv-lite'); fs.readFile('test.txt', function (err, data) { if (err) { console.error(err); } else { const buffer = Buffer.from(data); const content = iconv.decode(buffer, 'gbk'); // 将Buffer对象解码为GBK console.log(content); } });</pre>In this example, we will read by calling the

Buffer.from()

method Convert the outgoing content to a Buffer object, and then use the iconv.decode() method to convert it to GBK-encoded text. Summary

The problem of Node.js reading garbled files requires choosing a solution based on the actual situation. If you confirm that the file encoding method is correct but garbled characters still appear, we can try to use the

iconv-lite

module or Buffer object for encoding conversion. When using the fs module to read files, reasonably specifying the file encoding method is the basic method to avoid garbled characters.

The above is the detailed content of Nodejs reads garbled files. For more information, please follow other related articles on the PHP Chinese website!

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn