Getting UTF-8 Encoding in Java Webapps
Problem: Implementing UTF-8 encoding to support non-Latin characters for text and specific alphabets.
Environment:
- Development: Windows XP
- Production: Debian
- Database: MySQL 5.x
- Browsers: Firefox2, Opera 9.x, FF3, IE7, Google Chrome
Solution:
-
Configure Tomcat's server.xml:
- Enable UTF-8 encoding for GET parameters:
- Enable UTF-8 encoding for GET parameters:
-
CharsetFilter:
- Force Java app to handle all requests and responses in UTF-8.
-
Add this filter to the web.xml:
<filter> <filter-name>CharsetFilter</filter-name> <filter-class>fi.foo.filters.CharsetFilter</filter-class> <init-param> <param-name>requestEncoding</param-name> <param-value>UTF-8</param-value> </init-param> </filter> <filter-mapping> <filter-name>CharsetFilter</filter-name> <url-pattern>/*</url-pattern> </filter-mapping>
-
JSP Page Encoding:
- Specify encoding for JSP pages in web.xml or add the following meta tag to each page:
- Specify encoding for JSP pages in web.xml or add the following meta tag to each page:
-
JDBC Connection:
- Use ?useEncoding=true&characterEncoding=UTF-8 in connection URL.
-
MySQL Database and Tables:
- Create database and tables with DEFAULT CHARACTER SET=utf8 COLLATE=utf8_swedish_ci.
-
MySQL Server Configuration:
- Set default-character-set=utf8 in my.ini (Windows) or my.cnf (Linux).
-
MySQL Procedures and Functions:
- Specify UTF-8 character set explicitly, e.g.:
CREATE FUNCTION ... RETURNS TEXT CHARACTER SET utf8
- Specify UTF-8 character set explicitly, e.g.:
Handling GET Requests:
- By default, URLs are encoded in Latin1, causing problems with non-ASCII characters.
- To address this, define URL encoding in server.xml as UTF-8.
- Instruct browsers to read pages in UTF-8 using meta-tags and request headers.
UTF-8 vs. Latin1 in GET Requests:
- POST requests are encoded in UTF-8 by browsers.
- For GET requests, while the page is defined as UTF-8, some characters may still be encoded in Latin1. This results in mixed encoding, making it difficult for the webapp to handle request parameters correctly.
References:
- http://tagunov.tripod.com/i18n/i18n.html
- http://wiki.apache.org/tomcat/Tomcat/UTF-8
- http://java.sun.com/developer/technicalArticles/Intl/HTTPCharset/
- http://dev.mysql.com/doc/refman/5.0/en/charset-syntax.html
- http://cagan327.blogspot.com/2006/05/utf-8-encoding-fix-tomcat-jsp-etc.html
- http://cagan327.blogspot.com/2006/05/utf-8-encoding-fix-for-mysql-tomcat.html
- http://jeppesn.dk/utf-8.html
- http://www.nabble.com/request-parameters-mishandle-utf-8-encoding-td18720039.html
- http://www.utoronto.ca/webdocs/HTMLdocs/NewHTML/iso_table.html
- http://www.utf8-chartable.de/
The above is the detailed content of How to Properly Implement UTF-8 Encoding in a Java Web Application?. For more information, please follow other related articles on the PHP Chinese website!

This article analyzes the top four JavaScript frameworks (React, Angular, Vue, Svelte) in 2025, comparing their performance, scalability, and future prospects. While all remain dominant due to strong communities and ecosystems, their relative popul

The article discusses implementing multi-level caching in Java using Caffeine and Guava Cache to enhance application performance. It covers setup, integration, and performance benefits, along with configuration and eviction policy management best pra

Node.js 20 significantly enhances performance via V8 engine improvements, notably faster garbage collection and I/O. New features include better WebAssembly support and refined debugging tools, boosting developer productivity and application speed.

Java's classloading involves loading, linking, and initializing classes using a hierarchical system with Bootstrap, Extension, and Application classloaders. The parent delegation model ensures core classes are loaded first, affecting custom class loa

Iceberg, an open table format for large analytical datasets, improves data lake performance and scalability. It addresses limitations of Parquet/ORC through internal metadata management, enabling efficient schema evolution, time travel, concurrent w

This article addresses the CVE-2022-1471 vulnerability in SnakeYAML, a critical flaw allowing remote code execution. It details how upgrading Spring Boot applications to SnakeYAML 1.33 or later mitigates this risk, emphasizing that dependency updat

This article explores integrating functional programming into Java using lambda expressions, Streams API, method references, and Optional. It highlights benefits like improved code readability and maintainability through conciseness and immutability

The article discusses using Maven and Gradle for Java project management, build automation, and dependency resolution, comparing their approaches and optimization strategies.


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

SublimeText3 Chinese version
Chinese version, very easy to use

SublimeText3 Mac version
God-level code editing software (SublimeText3)

MantisBT
Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

Dreamweaver CS6
Visual web development tools

DVWA
Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software
