Rumah > Artikel > pembangunan bahagian belakang > PHP 截取 中文
函数巧妙地运用了正则表达式, 用起来很方便, 就像 substr 的用法一样, 可以正向截取也可反相截取, 思路值得学习.
PHP代码:
function c_substr ( $string, $from, $length = null ){
preg_match_all ( '/[x80-xff]?./', $string, $match );
if ( is_null ( $length )){
$result = implode ( '', array_slice ( $match [ 0 ], $from ));
} else {
$result = implode ( '', array_slice ( $match [ 0 ], $from, $length ));
}
return $result;
$str = "zhon华人min共和guo";
$from = 3;
$length = 7;
echo (c_substr ( $str, $from, $length ));
// 输出: n华人min共
//还有utf-8的
Regarding windix's function to handle UTF-8 strings:
one can use the "u" modifier on the regular expression so that the pattern string is treated as UTF-8
(available from PHP 4.1.0 or greater on Unix and from PHP 4.2.3 on win32).
This way the function works for other encodings too (like Greek for example).
The modified function would read like this:
function utf8_substr ( $str, $start ) {
$null = "";
preg_match_all ( "/./u", $str, $ar );
if ( func_num_args () >= 3 ) {
$end = func_get_arg ( 2 );
return join ( $null, array_slice ( $ar [ 0 ], $start, $end ));
} else {
return join ( $null, array_slice ( $ar [ 0 ], $start ));
}