Foregoing
VARCHAR and CHAR are the two main string types. Unfortunately, it is difficult to explain exactly how these values are stored on disk and in memory, as it depends on the specific implementation of the storage engine. The following description assumes that the storage engine used is InnoDB and/or MyISAM. If you are not using these two storage engines, please refer to the documentation of the storage engine you are using.
Let’s first look at how VARCHAR and CHAR values are usually stored on disk. Please note that the way the storage engine stores CHAR or VARCHAR values may be different in memory than on disk, so the value read by the MySQL server from the storage engine may need to be converted to another storage format.
VARCHAR type
The VARCHAR type is used to store variable-length strings and is the most common string data type. It is more space-efficient than fixed-length types because it uses only the necessary space (for example, shorter strings use less space). There is an exception. If the MySQL table is created with ROW_FORMAT=FIXED, each row will use fixed-length storage, which will waste space.
VARCHAR requires 1 or 2 extra bytes to record the length of the string: if the maximum length of the column is less than or equal to 255 bytes, only 1 byte is used to represent it, otherwise 2 bytes are used. Assuming the latin1 character set is used, a VARCHAR(10) column requires 11 bytes of storage space. The VARCHAR(1000) column requires 1002 bytes because 2 bytes are needed to store the length information.
VARCHAR saves storage space, so it also helps performance. However, because the row is variable-length, the row may become longer than the original during UPDATE, which requires additional work. If the space occupied by a row grows and there is no more space to store in the page, different storage engines handle this situation differently. For example, MyISAM will split the rows into different fragments for storage, and InnoDB needs to split the page so that the rows can fit into the page. Some other storage engines may never update data in the original data location.
VARCHAR applicable situations
It is appropriate to apply VARCHAR in the following situations:
The maximum length of the string column is much larger than the average length
Columns are rarely updated, so fragmentation is not an issue
A complex character set like UTF-8 is used, each character is stored using a different number of bytes
CHAR type
The CHAR type is fixed-length: MySQL always allocates enough space according to the defined string length. When storing CHAR values, MySQL removes all trailing spaces. The CHAR value will be padded with spaces as needed to facilitate comparison.
CHAR is suitable for storing very short strings, or all values are close to the same length. For example, CHAR is ideal for storing the MD5 value of a password because it is a fixed-length value. For data that changes frequently, CHAR is also better than VARCHAR because the fixed-length CHAR type is not prone to fragmentation. CHAR is also more storage space efficient than VARCHAR for very short columns. For example, CHAR(1) is used to store only Y and N values. If a single-byte character set is used, only one byte is needed, but VARCHAR(1) requires two bytes because there is an extra byte for the record length. .
Test
The following is an example to illustrate the difference in behavior between CHAR and VARCHAR. First, we create a table with only one CHAR(10) field, and Insert some values into it:
CREATE TABLE char_test ( char_col CHAR(10) ); INSERT INTO char_test VALUES ('string1'). (' string2'). ('string3 ');
When we retrieve these values, we will find that the spaces at the end of string3 are truncated.
SELECT CONCAT("'", char_col, "'") FROM char_test;
Execution results:
If you use the VARCHAR(10) field to store the same value, you can get the following results:
CREATE TABLE varchar_test ( varchar_col VARCHAR(10) ); INSERT INTO varchar_test VALUES ('string1'). (' string2'). ('string3 '); SELECT CONCAT("'", varchar_col, "'") FROM varchar_test;
Execution Result
The difference between VARCHAR(5) and VARCHAR(200)
If we use VARCHAR(5) and VARCHAR(200) ) to store 'hello', we know that the space overhead of the two is the same. So can we keep the length of VARCHAR always large? Are there any advantages to using shorter columns?
Facts have proved to have great advantages. Longer columns consume more memory because MySQL typically allocates fixed-size blocks of memory to hold internal values. This is especially bad when using in-memory temporary tables for sorting or operations. It's equally bad when sorting using disk temporary tables.
So the best strategy is to allocate only the space you really need.
Summary
When we select a type for a string type field, to determine whether to choose VARCHAR or CHAR, we can consider the following aspects:
Is there a very small difference between the average length and the maximum length of the field data set? If the difference is small, the CHAR type will be given priority. Otherwise, the VARCHAR type will be considered.
If the field stores a hash value after MD5, or some fixed-length value, the CHAR type is preferred.
If the field needs to be updated frequently, the CHAR type is given priority. Since the CHAR type is of fixed length, it is not prone to fragmentation.
For field values that store very small information, such as gender, the CHAR type is preferred, because the VARCHAR type will occupy additional bytes to store string length information.
In a word, when we can choose the CHAR type, or when space consumption is relatively not the focus of the influencing factors, try to choose the CHAR type, because in other aspects, the CHAR type has more or less The advantages. When space consumption becomes a big influencing factor, we will consider using the VARCHAR type.
Recommended tutorial: "Mysql Tutorial"
The above is the detailed content of The difference between CHAR and VARCHAR in MySQL. For more information, please follow other related articles on the PHP Chinese website!

ACID attributes include atomicity, consistency, isolation and durability, and are the cornerstone of database design. 1. Atomicity ensures that the transaction is either completely successful or completely failed. 2. Consistency ensures that the database remains consistent before and after a transaction. 3. Isolation ensures that transactions do not interfere with each other. 4. Persistence ensures that data is permanently saved after transaction submission.

MySQL is not only a database management system (DBMS) but also closely related to programming languages. 1) As a DBMS, MySQL is used to store, organize and retrieve data, and optimizing indexes can improve query performance. 2) Combining SQL with programming languages, embedded in Python, using ORM tools such as SQLAlchemy can simplify operations. 3) Performance optimization includes indexing, querying, caching, library and table division and transaction management.

MySQL uses SQL commands to manage data. 1. Basic commands include SELECT, INSERT, UPDATE and DELETE. 2. Advanced usage involves JOIN, subquery and aggregate functions. 3. Common errors include syntax, logic and performance issues. 4. Optimization tips include using indexes, avoiding SELECT* and using LIMIT.

MySQL is an efficient relational database management system suitable for storing and managing data. Its advantages include high-performance queries, flexible transaction processing and rich data types. In practical applications, MySQL is often used in e-commerce platforms, social networks and content management systems, but attention should be paid to performance optimization, data security and scalability.

The relationship between SQL and MySQL is the relationship between standard languages and specific implementations. 1.SQL is a standard language used to manage and operate relational databases, allowing data addition, deletion, modification and query. 2.MySQL is a specific database management system that uses SQL as its operating language and provides efficient data storage and management.

InnoDB uses redologs and undologs to ensure data consistency and reliability. 1.redologs record data page modification to ensure crash recovery and transaction persistence. 2.undologs records the original data value and supports transaction rollback and MVCC.

Key metrics for EXPLAIN commands include type, key, rows, and Extra. 1) The type reflects the access type of the query. The higher the value, the higher the efficiency, such as const is better than ALL. 2) The key displays the index used, and NULL indicates no index. 3) rows estimates the number of scanned rows, affecting query performance. 4) Extra provides additional information, such as Usingfilesort prompts that it needs to be optimized.

Usingtemporary indicates that the need to create temporary tables in MySQL queries, which are commonly found in ORDERBY using DISTINCT, GROUPBY, or non-indexed columns. You can avoid the occurrence of indexes and rewrite queries and improve query performance. Specifically, when Usingtemporary appears in EXPLAIN output, it means that MySQL needs to create temporary tables to handle queries. This usually occurs when: 1) deduplication or grouping when using DISTINCT or GROUPBY; 2) sort when ORDERBY contains non-index columns; 3) use complex subquery or join operations. Optimization methods include: 1) ORDERBY and GROUPB


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

SAP NetWeaver Server Adapter for Eclipse
Integrate Eclipse with SAP NetWeaver application server.

Dreamweaver CS6
Visual web development tools

Zend Studio 13.0.1
Powerful PHP integrated development environment

EditPlus Chinese cracked version
Small size, syntax highlighting, does not support code prompt function

MinGW - Minimalist GNU for Windows
This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.