search
HomeBackend DevelopmentGolanggolang zip Chinese garbled code

With the continuous popularity and application of Golang in the field of web development, Zip compressed files have become an essential functional module. However, some developers encountered a common problem when using Golang's Zip package for file compression - Chinese file names appear garbled.

This is a very troublesome problem, because it not only makes the originally beautiful file name look nondescript, but may also cause a series of other errors. Below we will explore the causes and solutions to this problem.

Cause of the problem

Zip format is a binary format that contains the file name, file directory, compression method and compressed data. Among this information, the file name is a very critical part, because it determines the name and storage path of the file after the user decompresses it.

However, different file systems and encoding formats may result in different parsing results for the same file name. For example, on Windows systems, the default encoding used for file names is GBK, while on UNIX/Linux systems, the file names use UTF-8. If we do not handle these different encoding methods when using the Zip package in Golang for compression, the file name may be treated as garbled characters.

Solution

We have a variety of solutions for the problem of garbled Chinese file names. Below we will introduce some of the more feasible methods.

Method 1: GB18030 encoding conversion

Because the Zip format uses CP437 encoding to represent the file name, we can convert the file name from GB18030 encoding to CP437 encoding, so as to ensure that File names can be correctly parsed in any environment. In Go, you can use the golang.org/x/text/encoding/simplifiedchinese package to convert between GB18030 encoding and CP437 encoding.

import (
    "golang.org/x/text/encoding/simplifiedchinese"
    "golang.org/x/text/transform"
)

func GbkToUtf8(data []byte) ([]byte, error) {
    return transform.NewReader(bytes.NewReader(data), simplifiedchinese.GB18030.NewDecoder()).ReadAll()
}

func Utf8ToGbk(data []byte) ([]byte, error) {
    return transform.NewReader(bytes.NewReader(data), simplifiedchinese.GB18030.NewEncoder()).ReadAll()
}

Method 2: Use reflect

We can also use the StructTag option in the reflect package to force the use of the specified encoding method. Specifically, you can add a zip tag to the structure and a chinese-utf8 tag to the Tag. The sample code is as follows:

type File struct {
    Name string `zip:"filename=测试文件,chinese-utf8"`
}

func main() {
    zhName := "测试文件"
    utf8Name, _ := GbkToUtf8([]byte(zhName))
    f := &File{Name: string(utf8Name)}
    // 压缩文件...
}

Method 3: Use the name attribute in fileheader and FileInfo

In the Golang Zip package, we can also call fileheader and ##name attribute of #FileInfo to manually specify the correct encoding for each file.

import "archive/zip"

func zipFiles(filePaths []string, dest string) error {
    // 创建文件
    newZipFile, err := os.Create(dest)
    if err != nil {
        return err
    }
    defer newZipFile.Close()

    // 创建 ZIP writer 对象
    zipWriter := zip.NewWriter(newZipFile)
    defer zipWriter.Close()

    // 遍历 filePaths,为每个文件设置正确的编码方式
    for _, filePath := range filePaths {
        zipFile, err := os.Open(filePath)
        if err != nil {
            return err
        }
        defer zipFile.Close()

        // 解析文件名,并转换编码
        zipFileInfo, _ := zip.FileInfoHeader(zipFile.Stat())
        zipFileInfo.Name, _ = GbkToUtf8([]byte(zipFileInfo.Name))

        // 创建 Zip 文件写入器
        zipWriterNewFile, err := zipWriter.CreateHeader(zipFileInfo)
        if err != nil {
            return err
        }

        // 读取文件并写入 Zip 文件中
        _, err = io.Copy(zipWriterNewFile, zipFile)
        if err != nil {
            return err
        }
    }

    return nil
}

Conclusion

Golang’s Zip package provides a variety of solutions to the problem of garbled Chinese file names. If the encoding format of the file name is not uniform, it is easy for the file name to be garbled. We can easily avoid this problem by simply choosing the right solution according to our needs.

The above is the detailed content of golang zip Chinese garbled code. For more information, please follow other related articles on the PHP Chinese website!

Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
Golang vs. C  : Code Examples and Performance AnalysisGolang vs. C : Code Examples and Performance AnalysisApr 15, 2025 am 12:03 AM

Golang is suitable for rapid development and concurrent programming, while C is more suitable for projects that require extreme performance and underlying control. 1) Golang's concurrency model simplifies concurrency programming through goroutine and channel. 2) C's template programming provides generic code and performance optimization. 3) Golang's garbage collection is convenient but may affect performance. C's memory management is complex but the control is fine.

Golang's Impact: Speed, Efficiency, and SimplicityGolang's Impact: Speed, Efficiency, and SimplicityApr 14, 2025 am 12:11 AM

Goimpactsdevelopmentpositivelythroughspeed,efficiency,andsimplicity.1)Speed:Gocompilesquicklyandrunsefficiently,idealforlargeprojects.2)Efficiency:Itscomprehensivestandardlibraryreducesexternaldependencies,enhancingdevelopmentefficiency.3)Simplicity:

C   and Golang: When Performance is CrucialC and Golang: When Performance is CrucialApr 13, 2025 am 12:11 AM

C is more suitable for scenarios where direct control of hardware resources and high performance optimization is required, while Golang is more suitable for scenarios where rapid development and high concurrency processing are required. 1.C's advantage lies in its close to hardware characteristics and high optimization capabilities, which are suitable for high-performance needs such as game development. 2.Golang's advantage lies in its concise syntax and natural concurrency support, which is suitable for high concurrency service development.

Golang in Action: Real-World Examples and ApplicationsGolang in Action: Real-World Examples and ApplicationsApr 12, 2025 am 12:11 AM

Golang excels in practical applications and is known for its simplicity, efficiency and concurrency. 1) Concurrent programming is implemented through Goroutines and Channels, 2) Flexible code is written using interfaces and polymorphisms, 3) Simplify network programming with net/http packages, 4) Build efficient concurrent crawlers, 5) Debugging and optimizing through tools and best practices.

Golang: The Go Programming Language ExplainedGolang: The Go Programming Language ExplainedApr 10, 2025 am 11:18 AM

The core features of Go include garbage collection, static linking and concurrency support. 1. The concurrency model of Go language realizes efficient concurrent programming through goroutine and channel. 2. Interfaces and polymorphisms are implemented through interface methods, so that different types can be processed in a unified manner. 3. The basic usage demonstrates the efficiency of function definition and call. 4. In advanced usage, slices provide powerful functions of dynamic resizing. 5. Common errors such as race conditions can be detected and resolved through getest-race. 6. Performance optimization Reuse objects through sync.Pool to reduce garbage collection pressure.

Golang's Purpose: Building Efficient and Scalable SystemsGolang's Purpose: Building Efficient and Scalable SystemsApr 09, 2025 pm 05:17 PM

Go language performs well in building efficient and scalable systems. Its advantages include: 1. High performance: compiled into machine code, fast running speed; 2. Concurrent programming: simplify multitasking through goroutines and channels; 3. Simplicity: concise syntax, reducing learning and maintenance costs; 4. Cross-platform: supports cross-platform compilation, easy deployment.

Why do the results of ORDER BY statements in SQL sorting sometimes seem random?Why do the results of ORDER BY statements in SQL sorting sometimes seem random?Apr 02, 2025 pm 05:24 PM

Confused about the sorting of SQL query results. In the process of learning SQL, you often encounter some confusing problems. Recently, the author is reading "MICK-SQL Basics"...

Is technology stack convergence just a process of technology stack selection?Is technology stack convergence just a process of technology stack selection?Apr 02, 2025 pm 05:21 PM

The relationship between technology stack convergence and technology selection In software development, the selection and management of technology stacks are a very critical issue. Recently, some readers have proposed...

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Best Graphic Settings
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. How to Fix Audio if You Can't Hear Anyone
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
WWE 2K25: How To Unlock Everything In MyRise
1 months agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

DVWA

DVWA

Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

EditPlus Chinese cracked version

EditPlus Chinese cracked version

Small size, syntax highlighting, does not support code prompt function

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Safe Exam Browser

Safe Exam Browser

Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.