The bytes occupied by char in java: 1. The char in the internal code in java is encoded in UTF16, and one char occupies two bytes; 2. The char in the foreign code in java is encoded in UTF8. One character occupies [1~6] bytes.
#Before discussing this issue, we need to distinguish between unicode and UTF.
unicode: A unified character number, which only provides mapping between characters and numbers. The number of symbols is constantly increasing and has exceeded one million. Details: [https://zh.wikipedia.org/zh-cn/Unicode]
UTF: unicode transformation format. Defines the encoding method of numbers in unicode. UTF8 and UTF16 are two of the implementation methods. Among them, utf8 is a variable-length representation, and the length may be 1 to 6 bytes; utf16 is a variable-length representation, and the length may be 2 or 4 bytes. Details: UTF8 [https://zh.wikipedia.org/zh-cn/UTF-8] UTF16 [https://zh.wikipedia.org/zh-cn/UTF-16]
Next, we need to distinguish between internal encoding and external encoding.
Inner code: The encoding method of char and string in memory when a certain language is running.
Outer code: Except for the inner code, all are outer codes.
It should be noted that the encoding method in the object code file (executable file or class file) generated by source code compilation belongs to foreign code.
Let’s take a look at the internal code first
The internal code in JVM uses UTF16. In the early days, UTF16 was encoded using a fixed-length 2-byte encoding. Two bytes can represent 65536 symbols (in fact, it can actually represent less than this), which was enough to represent all characters in Unicode at that time. However, with the increase of characters in Unicode, 2 bytes cannot represent all characters. UTF16 uses 2 bytes or 4 bytes to complete the encoding. To deal with this situation, Java uses a pair of char to represent characters that require 4 bytes, taking into account forward compatibility requirements. Therefore, char in Java takes up two bytes, but some characters require two chars to represent them.
Foreign code
Java's class file uses UTF8 to store characters, that is to say, the characters in the class occupy 1 to 6 bytes.
During Java serialization, characters are also encoded in UTF8, accounting for 1 to 6 characters.
Summary:
The char in the internal code (running memory) of Java is encoded using UTF16. One char occupies two bytes, but some characters require Represented by two chars. So, one character will occupy 2 or 4 bytes.
char in java Chinese and foreign code is encoded using UTF8, and one character occupies 1 to 6 bytes.
In UTF16 encoding, English characters occupy two bytes; most Chinese characters (especially commonly used Chinese characters) occupy two bytes, and individual Chinese characters (unicode-encoded Chinese characters will be added later) , usually rare words that are rarely used) occupy four bytes.
In UTF8 encoding, English characters occupy one byte; most Chinese characters occupy three bytes, and some Chinese characters occupy four bytes.
EOF
Related free learning recommendations: java basic tutorial
The above is the detailed content of How many bytes does char occupy in java?. For more information, please follow other related articles on the PHP Chinese website!

本篇文章给大家带来了关于java的相关知识,其中主要介绍了关于结构化数据处理开源库SPL的相关问题,下面就一起来看一下java下理想的结构化数据处理类库,希望对大家有帮助。

本篇文章给大家带来了关于java的相关知识,其中主要介绍了关于PriorityQueue优先级队列的相关知识,Java集合框架中提供了PriorityQueue和PriorityBlockingQueue两种类型的优先级队列,PriorityQueue是线程不安全的,PriorityBlockingQueue是线程安全的,下面一起来看一下,希望对大家有帮助。

本篇文章给大家带来了关于java的相关知识,其中主要介绍了关于java锁的相关问题,包括了独占锁、悲观锁、乐观锁、共享锁等等内容,下面一起来看一下,希望对大家有帮助。

本篇文章给大家带来了关于java的相关知识,其中主要介绍了关于多线程的相关问题,包括了线程安装、线程加锁与线程不安全的原因、线程安全的标准类等等内容,希望对大家有帮助。

本篇文章给大家带来了关于java的相关知识,其中主要介绍了关于枚举的相关问题,包括了枚举的基本操作、集合类对枚举的支持等等内容,下面一起来看一下,希望对大家有帮助。

本篇文章给大家带来了关于Java的相关知识,其中主要介绍了关于关键字中this和super的相关问题,以及他们的一些区别,下面一起来看一下,希望对大家有帮助。

封装是一种信息隐藏技术,是指一种将抽象性函式接口的实现细节部分包装、隐藏起来的方法;封装可以被认为是一个保护屏障,防止指定类的代码和数据被外部类定义的代码随机访问。封装可以通过关键字private,protected和public实现。

本篇文章给大家带来了关于java的相关知识,其中主要介绍了关于平衡二叉树(AVL树)的相关知识,AVL树本质上是带了平衡功能的二叉查找树,下面一起来看一下,希望对大家有帮助。


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Dreamweaver Mac version
Visual web development tools

SublimeText3 Chinese version
Chinese version, very easy to use

SAP NetWeaver Server Adapter for Eclipse
Integrate Eclipse with SAP NetWeaver application server.

Safe Exam Browser
Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

VSCode Windows 64-bit Download
A free and powerful IDE editor launched by Microsoft
