Home >Backend Development >Golang >How to Convert ANSI Text to UTF-8 in Go?

How to Convert ANSI Text to UTF-8 in Go?

Patricia Arquette
Patricia ArquetteOriginal
2024-11-26 13:48:10792browse

How to Convert ANSI Text to UTF-8 in Go?

Converting ANSI Text to UTF-8 in Go

In Go, all strings are stored in UTF-8 format. However, you might encounter situations where you need to convert ANSI text, which uses a different character encoding, to UTF-8. Here's how you can achieve this:

According to the Go language specification, all strings are handled as UTF-8 internally. Therefore, there is no need for explicit conversion. If you have a byte array representing ANSI text, you can simply use the following snippet to convert it to a Go string:

import (
    "bytes"
    "unicode/utf8"
)

func convertANSItoUTF8(ansiBytes []byte) string {
    reader := bytes.NewReader(ansiBytes)
    decoder := utf8.RuneReader(reader)

    var utf8String []rune
    for {
        r, size, err := decoder.ReadRune()
        if err != nil {
            break
        }
        utf8String = append(utf8String, r)
    }

    return string(utf8String)
}

This function uses the utf8.RuneReader function to iterate over the ANSI bytes and convert them to UTF-8 runes. The runes are then appended to a slice, which is finally converted to a UTF-8 string.

Note: This function assumes that the ANSI input is valid. If there are any invalid sequences, the behavior of the function is undefined.

The above is the detailed content of How to Convert ANSI Text to UTF-8 in Go?. For more information, please follow other related articles on the PHP Chinese website!

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn