Home  >  Article  >  Backend Development  >  PHP generates csv with garbled characters under mac

PHP generates csv with garbled characters under mac

不言
不言Original
2018-04-24 09:20:001965browse

This article mainly introduces php to generate csv garbled code under mac. It has a certain reference value. Now I share it with everyone. Friends in need can refer to it


<?php
     $file_name = date(&#39;Ymd&#39;, time()) . &#39;.csv&#39;;  //设置文件名
     header(&#39;Content-Type: application/vnd.ms-excel&#39;);
header(&#39;Content-Disposition: attachment;filename=&#39;.$file_name);
header(&#39;Cache-Control: max-age=0&#39;);
$fp = fopen(&#39;php://output&#39;, &#39;a&#39;);
$row = [
&#39;name&#39; => &#39;测试&#39;,
&#39;email&#39; => 111111,
&#39;mobile&#39; => 22222,
&#39;weixinid&#39; => &#39;微信号&#39;,
];
fwrite($fp,"\xEF\xBB\xBF");
fputcsv($fp, $row);


Solution to the garbled problem of php exporting csv files



Before talking about this question, let’s first talk about what is a CSV file? Comma Separator Value (comma separated value) is also. Intermediate files that are often used for data conversion exist, such as exporting data from Mysql to CSV and importing CSV into SqlServer. Use a PHP script under Linux to export the table data from the Mysql database into csv according to conditions. Use utf-8 encoding to export the CSV file. After opening, the Chinese inside becomes garbled (CSV files under Windows are associated with Microsoft Excel by default). Use Notepad or Word opens normally, but the layout is messy. Reason: BOM is in trouble, Microsoft is in trouble.

What is BOM? Byte Order Mark (bit order mark) is also.

In order to identify Unicode files, Microsoft recommends that all Unicode files should start with ZERO WIDTH NOBREAK SPACE characters. This serves as a "signature" or "byte-order mark (BOM)" to identify the encoding and byte order (big-endian or little-endian) used in the file. The specific correspondence is shown in the table below .

Bytes Encoding Form

##00 00 FE FFUTF-32, big-endianFF FE 00 00UTF-32, little-endianFE FFUTF-16, big-endianFF FEUTF-16, little-endianEF BB BFUTF-8

BOM is not used in Unix-like systems because it will break the existing ASCII file syntax convention.


Implementation code if

Note: When writing the csv file, ensure that the php source code is utf-8, has no BOM, and does not output anything.




When Excel reads csv, it reads the file header The BOM is used to identify the encoding. If there is no BOM information in the file header, it will be read according to the Unicode encoding by default. (This BOM is a file header protocol defined by Microsoft itself. As the name implies, it is stored in the file header. The stored content is the information that identifies the file encoding.)

The platform we use to generate CSV does not necessarily follow Microsoft's BOM protocol, causing if a non-unicode encoded csv file (such as utf-8) is output and no BOM information is generated, Excel will automatically read according to the unicode encoding, and garbled characters will occur.

After mastering this, I believe that garbled characters can no longer stop us from moving forward: just open the non-unicode encoded csv file with a text editor (notepad is recommended) and convert it to the encoding form with BOM (specifically The encoding method is arbitrary), the problem is solved.

Related recommendations:

PHP generates a unique RequestID class

Detailed example of how to generate a .csv suffix file table in PHP


The above is the detailed content of PHP generates csv with garbled characters under mac. For more information, please follow other related articles on the PHP Chinese website!

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn