What is Base64 encoding? Before answering this question, we need to understand the classification of files in computers. For computers, files can be divided into two categories, one is text files and the other is binary files.
For binary files, their content is represented in binary, which is not immediately understandable to humans. If you try to open a binary file with a text editor, you may see gibberish. This is because the encoding method of binary files is different from the encoding method of text files, so when the text editor tries to translate the binary files into text content, garbled characters will appear.
For text files, there are many encoding methods, such as the earliest ASCII encoding and the currently commonly used encoding methods such as UTF-8 and UTF-16. Even text files may see garbled characters if you open them using a different encoding.
So whether it is a text file or a binary file, the encoding format needs to be unified. In other words, what the encoding of writing looks like, then the encoding of data reading should also match it.
Base64 encoding is actually an encoding method that encodes binary data into visual ASCII characters.
Why is there such a requirement?
We know that the development of the computer world does not happen overnight. It is a process of slow growth. For character encoding, it only supports ASCII encoding at first, and later it was expanded to Unicode and so on. Therefore, for many applications, encoding formats other than ASCII encoding are not supported. So how to display non-ASCII code in these systems?
The solution is to perform encoding mapping to map non-ASCII characters to ASCII characters. Base64 is such an encoding method.
The common place to use Base64 is in web pages. Sometimes we need to display images on web pages, so we can base64 encode the images and then fill them into html.
Another application is to base64 encode the file and then send it as an email attachment.
Since base64 encoding is so easy to use, let’s take a look at the base64 implementation in JAVA.
There is a corresponding base64 implementation in java, called java.util.Base64. This class is a tool class for Base64, which was introduced by JDK in version 1.8.
Base64 provides three getEncoder and getDecoder methods. By obtaining the corresponding Encoder and Decoder, you can then call the encoder's encode and decode methods to encode and decode the data, which is very convenient.
Let’s first take a look at the basic usage examples of Base64:
// 使用encoder进行编码 String encodedString = Base64.getEncoder().encodeToString("what is your name baby?".getBytes("utf-8")); System.out.println("Base64编码过后的字符串 :" + encodedString); // 使用encoder进行解码 byte[] decodedBytes = Base64.getDecoder().decode(encodedString); System.out.println("解码过后的字符串: " + new String(decodedBytes, "utf-8"));
As a tool class, the Base64 tool class provided in the JDK is still very useful.
I won’t explain its use in detail here. This article mainly analyzes how Base64 is implemented in JDK.
The Base64 class in JDK provides three encoder methods, namely getEncoder, getUrlEncoder and getMimeEncoder:
public static Encoder getEncoder() { return Encoder.RFC4648; } public static Encoder getUrlEncoder() { return Encoder.RFC4648_URLSAFE; } public static Encoder getMimeEncoder() { return Encoder.RFC2045; }
Similarly, it Three corresponding decoder are also provided, namely getDecoder, getUrlDecoder, getMimeDecoder:
public static Decoder getDecoder() { return Decoder.RFC4648; } public static Decoder getUrlDecoder() { return Decoder.RFC4648_URLSAFE; } public static Decoder getMimeDecoder() { return Decoder.RFC2045; }
As can be seen from the code, these three encodings correspond to RFC4648, RFC4648_URLSAFE and RFC2045 respectively.
These three are variants of base64 encoding. Let’s take a look at their differences:
Encoding name | Encoded characters | Encoded characters | Encoded characters |
---|---|---|---|
The 62nd digit | The 63rd digit | Complete character | |
RFC 2045: Base64 transfer encoding for MIME | |
/ |
= mandatory |
RFC 4648: base64 (standard) | |
/ |
= optional |
-
|
_
|
= optional
|
private static final char[] toBase64 = { 'A', 'B', 'C', 'D', 'E', 'F', 'G', 'H', 'I', 'J', 'K', 'L', 'M', 'N', 'O', 'P', 'Q', 'R', 'S', 'T', 'U', 'V', 'W', 'X', 'Y', 'Z', 'a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j', 'k', 'l', 'm', 'n', 'o', 'p', 'q', 'r', 's', 't', 'u', 'v', 'w', 'x', 'y', 'z', '0', '1', '2', '3', '4', '5', '6', '7', '8', '9', '+', '/' };
private static final char[] toBase64URL = { 'A', 'B', 'C', 'D', 'E', 'F', 'G', 'H', 'I', 'J', 'K', 'L', 'M', 'N', 'O', 'P', 'Q', 'R', 'S', 'T', 'U', 'V', 'W', 'X', 'Y', 'Z', 'a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j', 'k', 'l', 'm', 'n', 'o', 'p', 'q', 'r', 's', 't', 'u', 'v', 'w', 'x', 'y', 'z', '0', '1', '2', '3', '4', '5', '6', '7', '8', '9', '-', '_' };For MIME, the maximum number of characters in a line and the newline character are defined:
private static final int MIMELINEMAX = 76; private static final byte[] CRLF = new byte[] {'\r', '\n'};Advanced usage of Base64Generally, the length of the object we encode with Base64 is fixed. We only need to convert the input object into a byte array to call the encode or decode method. But in some cases we need to convert stream data. At this time, we can use the two methods of wrapping Stream provided in Base64:
public OutputStream wrap(OutputStream os) { Objects.requireNonNull(os); return new EncOutputStream(os, isURL ? toBase64URL : toBase64, newline, linemax, doPadding); }
public InputStream wrap(InputStream is) { Objects.requireNonNull(is); return new DecInputStream(is, isURL ? fromBase64URL : fromBase64, isMIME); }These two methods are respectively Corresponds to encoder and decoder.
The above is the detailed content of How to implement base64 encoder in Java. For more information, please follow other related articles on the PHP Chinese website!