PHP intercepts string length (Chinese and English mixed string)

Home

Backend Development

PHP Tutorial

PHP intercepts string length (Chinese and English mixed string)_PHP tutorial

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Jul 13, 2016 pm 04:56 PM

phpChinesefunctionstringinterceptsupportarticleat lastofBring your ownlength

The article introduces the string interception function from the interception function that comes with PHP to finally supporting Chinese, English and mixed Chinese and English string interception methods. Friends in need can refer to it.

Get part of the string.

Syntax: string substr(string string, int start, int [length]);

Return value: String

Function type: Data processing

Content Description

This function extracts length characters from the start position of the string string. If start is a negative number, it starts from the end of the string. If the omitted parameter length exists but is a negative number, it means that the length character from the bottom is obtained.

Usage Example

The code is as follows

Copy code

代码如下	复制代码
echo substr("abcdef", 1, 3); // 返回 "bcd" echo substr("abcdef", -2); // 返回 "ef" echo substr("abcdef", -3, 1); // 返回 "d" echo substr("abcdef", 1, -1); // 返回 "bcde" ?>

echo substr("abcdef", 1, 3); // Return "bcd"

echo substr("abcdef", -2); // Return "ef"

echo substr("abcdef", -3, 1); // return "d"
echo substr("abcdef", 1, -1); // return "bcde"

代码如下	复制代码
//截取中文字符串 function mysubstr($str, $start, $len) { $tmpstr = ""; $strlen = $start + $len; for($i = 0; $i if(ord(substr($str, $i, 1)) > 0xa0) { $tmpstr .= substr($str, $i, 2); $i++; } else $tmpstr .= substr($str, $i, 1); } return $tmpstr; } ?>

The above only supports English and not Chinese

代码如下	复制代码
//截取utf8字符串 function utf8Substr($str, $from, $len) { return preg_replace('#^(?:[x00-x7F]\|[xC0-xFF][x80-xBF]+){0,'.$from.'}'. '((?:[x00-x7F]\|[xC0-xFF][x80-xBF]+){0,'.$len.'}).*#s', '',$str); } ?>

Intercept GB2312 Chinese string

The code is as follows	Copy code
//Intercept Chinese string function mysubstr($str, $start, $len) { $tmpstr = ""; $strlen = $start + $len; for($i = 0; $i If(ord(substr($str, $i, 1)) > 0xa0) { $tmpstr .= substr($str, $i, 2); $i++; } else $tmpstr .= substr($str, $i, 1); } Return $tmpstr; } ?>

Intercept utf8 encoded multi-byte string

The code is as follows	Copy code
//Intercept utf8 string function utf8Substr($str, $from, $len) { Return preg_replace('#^(?:[x00-x7F]\|[xC0-xFF][x80-xBF]+){0,'.$from.'}'. ‘((?:[x00-x7F]\|[xC0-xFF][x80-xBF]+){0,'.$len.'}).*#s', ‘$1’,$str); } ?>

/*
* Function: The function is the same as substr, except that it will not cause garbled characters
* Parameter:
* Return:
*/

The code is as follows

Copy code

function utf8_substr( $str , $start , $length=null ){

                   // Intercept normally first.
           $res = substr( $str, $start, $length);
           $strlen = strlen( $str);

/* Then determine whether the first and last 6 bytes are complete (not incomplete) */

                            // If the parameter start is a positive number
             if ( $start >= 0 ){
                         // intercept about 6 bytes forward
                 $next_start = $start + $length; // Initial position
                 $next_len = $next_start + 6                   $next_segm = substr( $str , $next_start , $next_len );

// If the first byte is not the first byte of the complete character, then intercept about 6 bytes
$prev_start = $start - 6 > 0 ? $start - 6 : 0;
                 $prev_segm = substr( $str , $prev_start , $start - $prev_start );
}
// start is a negative number
        else{
                         // intercept about 6 bytes forward
                 $next_start = $strlen + $start + $length; // Initial position
                 $next_len = $next_start + 6                   $next_segm = substr( $str , $next_start , $next_len );
                                                                      // If the first byte is not the first byte of the complete character, then intercept about 6 bytes.
                $start = $strlen + $start;
$prev_start = $start - 6 > 0 ? $start - 6 : 0;
                 $prev_segm = substr( $str , $prev_start , $start - $prev_start );
}

// Determine whether the first 6 bytes comply with utf8 rules

If ( preg_match( '@^([x80-xBF]{0,5})[xC0-xFD]?@' , $next_segm , $bytes ) ){
If ( !empty( $bytes[1] ) ){
$bytes = $bytes[1];
$res .= $bytes;
}
}

// Determine whether the last 6 bytes comply with utf8 rules
         $ord0 = ord( $res[0] );
If ( 128 = $ord0 ){
// Take it back and add it in front of the res.
If ( preg_match( '@[xC0-xFD][x80-xBF]{0,5}$@' , $prev_segm , $bytes ) ){
If ( !empty( $bytes[0] ) ) {
$bytes = $bytes[0];
                           $res = $bytes . $res;
                }
            }
}

return $res;
}

Test data::

The code is as follows

代码如下	复制代码
$str = 'dfjdjf测13f试65&2数据ｆｄｊ（1就mfe&……就'; var_dump( utf8_substr( $str , 22 , 12 ) ); echo ' '; var_dump( utf8_substr( $str , 22 , -6 ) ); echo ' '; var_dump( utf8_substr( $str , 9 , 12 ) ); echo ' '; var_dump( utf8_substr( $str , 19 , 12 ) ); echo ' '; var_dump( utf8_substr( $str , 28 , -6 ) ); echo ' ';

Copy code

$str = 'dfjdjf test 13f test 65&2 dataｽｽ（1 is mfe&...just'; var_dump( utf8_substr( $str , 22 , 12 ) ); echo '
'; var_dump( utf8_substr( $str , 22 , -6 ) ); echo '
'; var_dump( utf8_substr( $str , 9 , 12 ) ); echo '
'; var_dump( utf8_substr( $str , 19 , 12 ) ); echo '
'; var_dump( utf8_substr( $str , 28 , -6 ) ); echo '
';

显示结果::(截取无乱码, 欢迎大家测试, 提交bug)
string(12) "据ｆｄｊ"
string(26) "据ｆｄｊ（1就mfe&…"
string(13) "13f试65&2数"
string(12) "数据ｆｄ"
string(20) "ｄｊ（1就mfe&…"

把我常用的分享出来

下面我们再来看中文截函数吧。

代码如下

复制代码

function MooCutstr($string, $length, $dot = ' ...') {
global $charset;

if(strlen($string) return $string;
}
$string = str_replace(array('&', '"', '<', '>'), array('&', '"', ''), $string);
$strcut = '';
if(strtolower($charset) == 'utf-8') {
$n = $tn = $noc = 0;
while($n    $t = ord($string[$n]);
   if($t == 9 || $t == 10 || (32     $tn = 1; $n++; $noc++;
   } elseif (194     $tn = 2; $n += 2; $noc += 2;
   } elseif (224     $tn = 3; $n += 3; $noc += 2;
   } elseif (240     $tn = 4; $n += 4; $noc += 2;
   } elseif (248     $tn = 5; $n += 5; $noc += 2;
   } elseif ($t == 252 || $t == 253) {
    $tn = 6; $n += 6; $noc += 2;
   } else {
    $n++;
   }
   if($noc >= $length) {
    break;
   }
}
if($noc > $length) {
   $n -= $tn;
}
$strcut = substr($string, 0, $n);
} else {
for($i = 0; $i    $strcut .= ord($string[$i]) > 127 ? $string[$i].$string[++$i] : $string[$i];
}
}
//$strcut = str_replace(array('&', '"', ''), array('&', '"', '<', '>'), $strcut);

return $strcut.$dot;
}

Statement

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

What is the difference between unset() and session_destroy()?May 04, 2025 am 12:19 AM

Thedifferencebetweenunset()andsession_destroy()isthatunset()clearsspecificsessionvariableswhilekeepingthesessionactive,whereassession_destroy()terminatestheentiresession.1)Useunset()toremovespecificsessionvariableswithoutaffectingthesession'soveralls

What is sticky sessions (session affinity) in the context of load balancing?May 04, 2025 am 12:16 AM

Stickysessionsensureuserrequestsareroutedtothesameserverforsessiondataconsistency.1)SessionIdentificationassignsuserstoserversusingcookiesorURLmodifications.2)ConsistentRoutingdirectssubsequentrequeststothesameserver.3)LoadBalancingdistributesnewuser

What are the different session save handlers available in PHP?May 04, 2025 am 12:14 AM

PHPoffersvarioussessionsavehandlers:1)Files:Default,simplebutmaybottleneckonhigh-trafficsites.2)Memcached:High-performance,idealforspeed-criticalapplications.3)Redis:SimilartoMemcached,withaddedpersistence.4)Databases:Offerscontrol,usefulforintegrati

What is a session in PHP, and why are they used?May 04, 2025 am 12:12 AM

Session in PHP is a mechanism for saving user data on the server side to maintain state between multiple requests. Specifically, 1) the session is started by the session_start() function, and data is stored and read through the $_SESSION super global array; 2) the session data is stored in the server's temporary files by default, but can be optimized through database or memory storage; 3) the session can be used to realize user login status tracking and shopping cart management functions; 4) Pay attention to the secure transmission and performance optimization of the session to ensure the security and efficiency of the application.

Explain the lifecycle of a PHP session.May 04, 2025 am 12:04 AM

PHPsessionsstartwithsession_start(),whichgeneratesauniqueIDandcreatesaserverfile;theypersistacrossrequestsandcanbemanuallyendedwithsession_destroy().1)Sessionsbeginwhensession_start()iscalled,creatingauniqueIDandserverfile.2)Theycontinueasdataisloade

What is the difference between absolute and idle session timeouts?May 03, 2025 am 12:21 AM

Absolute session timeout starts at the time of session creation, while an idle session timeout starts at the time of user's no operation. Absolute session timeout is suitable for scenarios where strict control of the session life cycle is required, such as financial applications; idle session timeout is suitable for applications that want users to keep their session active for a long time, such as social media.

What steps would you take if sessions aren't working on your server?May 03, 2025 am 12:19 AM

The server session failure can be solved through the following steps: 1. Check the server configuration to ensure that the session is set correctly. 2. Verify client cookies, confirm that the browser supports it and send it correctly. 3. Check session storage services, such as Redis, to ensure that they are running normally. 4. Review the application code to ensure the correct session logic. Through these steps, conversation problems can be effectively diagnosed and repaired and user experience can be improved.

What is the significance of the session_start() function?May 03, 2025 am 12:18 AM

session_start()iscrucialinPHPformanagingusersessions.1)Itinitiatesanewsessionifnoneexists,2)resumesanexistingsession,and3)setsasessioncookieforcontinuityacrossrequests,enablingapplicationslikeuserauthenticationandpersonalizedcontent.

See all articles

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

How to fix KB5055523 fails to install in Windows 11?

3 weeks agoByDDD

How to fix KB5055518 fails to install in Windows 10?

3 weeks agoByDDD

Roblox: Dead Rails - How To Tame Wolves

4 weeks agoByDDD

Strength Levels for Every Enemy & Monster in R.E.P.O.

4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Roblox: Grow A Garden - Complete Mutation Guide

2 weeks agoByDDD

Hot Tools

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),