Use character stream to read files (non-text)
Take Java's character stream to read files as an example: it can only read For characters between 0-65535, it can be seen that the characters are all positive numbers, but the binary byte can be negative numbers. But when reading, it will be read as a positive number, or a character that cannot be found in the encoding table will return a strange symbol (you may have seen that strange "?").
But in some cases, characters must be used to display binary data, and there is no way. Let’s introduce one of our methods -base64 encoding .
Base64 encoding
Introduction to base64 encoding
base64 is one of the common encoding methods used to transmit 8Bit bytecode on the Internet. Base64 is a coding method based on A method of representing binary data with 64 printable characters. Base64 encoding is a process from binary to characters and can be used to transmit longer identification information in an HTTP environment. Base64 encoding is unreadable and needs to be decoded before it can be read. Its Chinese name is based on 64 printable characters to represent binary data.
Encoding rules
1. Convert 3 bytes into 4 bytes.
2. No 76 characters plus a newline character.
3. The final terminator must also be processed.
Disadvantages of the encoding method
As can be seen from the encoding rules, base64 requires that every three 8Bit bytes be converted into four 6Bit characters (38 = 46 = 24) , and then add two high-bit 0s to the 6Bit to form four 8Bit bytes. In other words, the converted string will theoretically be 1/3 (33%) longer than the original one.
Here is an introduction to a concept and more detailed content. If you are interested, you can collect it to learn more.
Application of Base64 in Java
Java's Base64 tool class provides a set of static methods to obtain the following three BASE64 codecs:
Basic: The output is mapped to a set of characters A-Za-z0-9/, encoding does not add any line markers, and decoding of the output only supports A-Za-z0-9/.
URL: The output maps to a set of characters A-Za-z0-9 _, the output is a URL and a file.
MIME: Output is implicitly mapped to a MIME friendly format. The output should be no longer than 76 characters per line and separated by '\r' followed by '\n'. The encoded output ends up with no line splitting.
corresponds to the following methods:
Encoder basicEncoder = Base64.getEncoder(); Encoder mimeEncoder = Base64.getMimeEncoder(); Encoder urlEncoder = Base64.getUrlEncoder();
I wrote a simple tool class to testBasic(basic) encoder . .
package com.dragon; import java.io.BufferedInputStream; import java.io.ByteArrayOutputStream; import java.io.File; import java.io.FileInputStream; import java.io.FileNotFoundException; import java.io.IOException; import java.io.InputStream; import java.io.UnsupportedEncodingException; import java.util.Base64; import java.util.Base64.Decoder; import java.util.Base64.Encoder; /** * @author Alfred * */ public class Base64Util { private static Encoder encoder = Base64.getEncoder(); private static Decoder decoder = Base64.getDecoder(); private static String ENCODE = "UTF-8"; private static int LENGTH = 1024; /** * 静态方法: * 将文件等二进制数据(文本和非文本都可以) * 转为base64字符串。 * @throws IOException * @throws FileNotFoundException * * */ public static String dataToBase64(File src) throws FileNotFoundException, IOException { Encoder encoder = Base64.getEncoder(); int len = (int)src.length(); byte[] bar = new byte[(int)len]; int hasRead = 0; byte[] b = new byte[LENGTH]; //使用专门处理 byte 的IO流比较方便,一次性读取较大文件对内存压力较大 try (InputStream in = new BufferedInputStream(new FileInputStream(src)); ByteArrayOutputStream bos = new ByteArrayOutputStream(len)) { while ((hasRead = in.read(b)) != -1) { bos.write(b, 0, hasRead); } bar = bos.toByteArray(); } return encoder.encodeToString(bar); } public static String dataToBase64(String src) throws UnsupportedEncodingException { return encoder.encodeToString(src.getBytes(ENCODE)); } public static byte[] base64ToData(String src) { return decoder.decode(src); } }
Convert the image to Base64 string for reading and writing
import java.io.BufferedReader; import java.io.BufferedWriter; import java.io.File; import java.io.FileNotFoundException; import java.io.FileReader; import java.io.FileWriter; import java.io.IOException; import java.io.UnsupportedEncodingException; import java.io.Writer; import java.nio.file.Path; import java.nio.file.Paths; public class Base64Test { public static void main(String[] args) throws FileNotFoundException, IOException { testPic(); } static void testPic() throws FileNotFoundException, IOException { // 测试图片文件。 Path picPath = Paths.get("./src/com/dragon/001.jpg"); File picFile = picPath.toFile(); String picToBase64 = Base64Util.dataToBase64(picFile); System.out.println(picToBase64); long oldSize = picFile.length(); long newSize = picToBase64.getBytes("UTF-8").length; System.out.println("图片原始大小(字节):" + oldSize); System.out.println("转换后数据大小(字节):" + newSize); System.out.println("转换后比原来扩大的比例为:" + (double)(newSize-oldSize)/(double)oldSize + " %"); //将数据写入文件 try (Writer writer = new BufferedWriter(new FileWriter("./src/com/dragon/002.txt"))) { writer.write(picToBase64); } //从文件中读取数据 String line = null; try (BufferedReader reader = new BufferedReader(new FileReader("./src/com/dragon/002.txt"))){ line = reader.readLine(); } System.out.println(picToBase64.equals(line)); } }
Run screenshot
So, the reading of the image data is completed. Maybe you said here that you are not reading the binary data of the image, but in fact, all files are stored in binary! Moreover, this base64 string can also be used directly as an image.
Note: I have selected a very small picture here. You can see that the original size is only 3639 bytes, which is less than 4 KB, but if it is converted into text, it will be a lot (So, it will appear very long, very long. ).
Test picture
image.html
<!DOCTYPE> <html> <head> <meta charset="UTF-8"/> <title>base测试</title> </head> <body> <img src="/static/imghwm/default1.png" data-src="" class="lazy" / alt="How to use character stream to read and write non-text files in Java" > </body> <html>
Open the browser to test it
public class Base64Test {
public static void main(String[] args) throws FileNotFoundException, IOException {
testStr("I love you yesterday and today!");
}
static void testStr(String src) throws UnsupportedEncodingException {
//测试文本数据。
String strToBase64 = Base64Util.dataToBase64(src);
System.out.println("base64编码:" + strToBase64);
String base64ToStr = new String(Base64Util.base64ToData(strToBase64));
System.out.println("base64解码:" + base64ToStr);
}
}
Test screenshot
注2:可以观察一下这个base64字符串的特点,我上次学习Java爬虫的时候,爬了一个网站,发现这个网站的一个 script 脚本中,含有一个json对象,其中有一个属性是 url,但是对应的链接却看不懂(base64字符串是不可读的),但是我感觉它就是base64字符串,所以我利用base64编解码网站解码一看,真的是一个网站的地址。然后,就可以写一个解码方法,当爬到这个数据时,给它解码了,哈哈。
举一个简单的例子:
{"url":"aHR0cHMlM0ElMkYlMkZ3d3cuYmFpZHUuY29tJTJG"}
import java.io.UnsupportedEncodingException; import java.net.URLDecoder; import java.net.URLEncoder; import java.util.Base64; import java.util.Base64.Decoder; import java.util.Base64.Encoder; public class TestALittle { public static void main(String[] args) throws UnsupportedEncodingException { String base64Str = "aHR0cHMlM0ElMkYlMkZ3d3cuYmFpZHUuY29tJTJG"; String de_str = base64ToUrlEncoderToURL(base64Str); System.out.println("解码:" + de_str); } //base64解密为urlencoder,再解码为url public static String base64ToUrlEncoderToURL(String base64Str) throws UnsupportedEncodingException { Decoder decoder = Base64.getDecoder(); byte[] bt = decoder.decode(base64Str); String en_str = new String(bt, 0, bt.length); return URLDecoder.decode(en_str, "UTF-8"); } }
说明: 这个例子中的 url 进行了两次编码,第一次是将url中的非西欧字符编码(可以去了解一下为什么这么做?),然后再使用base64编码。但是,如果你掌握了解码技术,解码也是很简单的。(但是如果你看不出来它是base64编码,那估计就没有办法了!)
The above is the detailed content of How to use character stream to read and write non-text files in Java. For more information, please follow other related articles on the PHP Chinese website!

本篇文章给大家带来了关于java的相关知识,其中主要介绍了关于结构化数据处理开源库SPL的相关问题,下面就一起来看一下java下理想的结构化数据处理类库,希望对大家有帮助。

本篇文章给大家带来了关于java的相关知识,其中主要介绍了关于PriorityQueue优先级队列的相关知识,Java集合框架中提供了PriorityQueue和PriorityBlockingQueue两种类型的优先级队列,PriorityQueue是线程不安全的,PriorityBlockingQueue是线程安全的,下面一起来看一下,希望对大家有帮助。

本篇文章给大家带来了关于java的相关知识,其中主要介绍了关于java锁的相关问题,包括了独占锁、悲观锁、乐观锁、共享锁等等内容,下面一起来看一下,希望对大家有帮助。

本篇文章给大家带来了关于java的相关知识,其中主要介绍了关于多线程的相关问题,包括了线程安装、线程加锁与线程不安全的原因、线程安全的标准类等等内容,希望对大家有帮助。

本篇文章给大家带来了关于Java的相关知识,其中主要介绍了关于关键字中this和super的相关问题,以及他们的一些区别,下面一起来看一下,希望对大家有帮助。

本篇文章给大家带来了关于java的相关知识,其中主要介绍了关于枚举的相关问题,包括了枚举的基本操作、集合类对枚举的支持等等内容,下面一起来看一下,希望对大家有帮助。

封装是一种信息隐藏技术,是指一种将抽象性函式接口的实现细节部分包装、隐藏起来的方法;封装可以被认为是一个保护屏障,防止指定类的代码和数据被外部类定义的代码随机访问。封装可以通过关键字private,protected和public实现。

本篇文章给大家带来了关于java的相关知识,其中主要介绍了关于平衡二叉树(AVL树)的相关知识,AVL树本质上是带了平衡功能的二叉查找树,下面一起来看一下,希望对大家有帮助。


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

EditPlus Chinese cracked version
Small size, syntax highlighting, does not support code prompt function

SublimeText3 Linux new version
SublimeText3 Linux latest version

ZendStudio 13.5.1 Mac
Powerful PHP integrated development environment

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 English version
Recommended: Win version, supports code prompts!
