search
HomeBackend DevelopmentPHP TutorialPHP intercepts string length (Chinese and English mixed string)_PHP tutorial

PHP intercepts string length (Chinese and English mixed string)_PHP tutorial

Jul 13, 2016 pm 04:56 PM
phpChinesefunctionstringinterceptsupportarticleat lastofBring your ownlength

The article introduces the string interception function from the interception function that comes with PHP to finally supporting Chinese, English and mixed Chinese and English string interception methods. Friends in need can refer to it.

Get part of the string.

Syntax: string substr(string string, int start, int [length]);

Return value: String

Function type: Data processing

Content Description

This function extracts length characters from the start position of the string string. If start is a negative number, it starts from the end of the string. If the omitted parameter length exists but is a negative number, it means that the length character from the bottom is obtained.

Usage Example

The code is as follows Copy code
 代码如下 复制代码


echo substr("abcdef", 1, 3);  // 返回 "bcd"
echo substr("abcdef", -2);    // 返回 "ef"
echo substr("abcdef", -3, 1); // 返回 "d"
echo substr("abcdef", 1, -1); // 返回 "bcde"
?>

echo substr("abcdef", 1, 3); // Return "bcd"

echo substr("abcdef", -2); // Return "ef"

echo substr("abcdef", -3, 1); // return "d"
echo substr("abcdef", 1, -1); // return "bcde"

?>
 代码如下 复制代码

 //截取中文字符串
 function mysubstr($str, $start, $len) {
     $tmpstr = "";
     $strlen = $start + $len;
     for($i = 0; $i          if(ord(substr($str, $i, 1)) > 0xa0) {
             $tmpstr .= substr($str, $i, 2);
             $i++;
         } else
             $tmpstr .= substr($str, $i, 1);
     }
     return $tmpstr;
 }
 ?>

The above only supports English and not Chinese

 代码如下 复制代码

 //截取utf8字符串
 function utf8Substr($str, $from, $len)
 {
     return preg_replace('#^(?:[x00-x7F]|[xC0-xFF][x80-xBF]+){0,'.$from.'}'.
                        '((?:[x00-x7F]|[xC0-xFF][x80-xBF]+){0,'.$len.'}).*#s',
                        '',$str);
 }
 ?>

Intercept GB2312 Chinese string
The code is as follows Copy code
//Intercept Chinese string function mysubstr($str, $start, $len) { $tmpstr = ""; $strlen = $start + $len; for($i = 0; $i If(ord(substr($str, $i, 1)) > 0xa0) {                    $tmpstr .= substr($str, $i, 2);                 $i++;            } else                    $tmpstr .= substr($str, $i, 1); }   Return $tmpstr; } ?>
Intercept utf8 encoded multi-byte string
The code is as follows Copy code
//Intercept utf8 string function utf8Substr($str, $from, $len) { Return preg_replace('#^(?:[x00-x7F]|[xC0-xFF][x80-xBF]+){0,'.$from.'}'. ‘((?:[x00-x7F]|[xC0-xFF][x80-xBF]+){0,'.$len.'}).*#s', ‘$1’,$str); } ?>

/*
* Function: The function is the same as substr, except that it will not cause garbled characters
* Parameter:
* Return:
*/

The code is as follows Copy code

function utf8_substr( $str , $start , $length=null ){
         
                   // Intercept normally first.
           $res = substr( $str, $start, $length);
           $strlen = strlen( $str);
         
/* Then determine whether the first and last 6 bytes are complete (not incomplete) */

                            // If the parameter start is a positive number
             if ( $start >= 0 ){
                         // intercept about 6 bytes forward
                 $next_start = $start + $length; // Initial position
                 $next_len = $next_start + 6                   $next_segm = substr( $str , $next_start , $next_len );

// If the first byte is not the first byte of the complete character, then intercept about 6 bytes
$prev_start = $start - 6 > 0 ? $start - 6 : 0;
                 $prev_segm = substr( $str , $prev_start , $start - $prev_start );
}
​​​​ // start is a negative number
        else{
                         // intercept about 6 bytes forward
                 $next_start = $strlen + $start + $length; // Initial position
                 $next_len = $next_start + 6                   $next_segm = substr( $str , $next_start , $next_len );
                                                                      // If the first byte is not the first byte of the complete character, then intercept about 6 bytes.
                $start = $strlen + $start;
$prev_start = $start - 6 > 0 ? $start - 6 : 0;
                 $prev_segm = substr( $str , $prev_start , $start - $prev_start );
}

// Determine whether the first 6 bytes comply with utf8 rules

If ( preg_match( '@^([x80-xBF]{0,5})[xC0-xFD]?@' , $next_segm , $bytes ) ){
If ( !empty( $bytes[1] ) ){
$bytes = $bytes[1];
                      $res .= $bytes;
            }
}

// Determine whether the last 6 bytes comply with utf8 rules
         $ord0 = ord( $res[0] );
If ( 128 = $ord0 ){
// Take it back and add it in front of the res.
If ( preg_match( '@[xC0-xFD][x80-xBF]{0,5}$@' , $prev_segm , $bytes ) ){
If ( !empty( $bytes[0] ) ) {
$bytes = $bytes[0];
                           $res = $bytes . $res;
                }
            }
}

return $res;
}

Test data::

The code is as follows
 代码如下 复制代码
    $str = 'dfjdjf测13f试65&2数据fdj(1就mfe&……就';
    var_dump( utf8_substr( $str , 22 , 12 ) );  echo '
';
    var_dump( utf8_substr( $str , 22 , -6 ) ); echo '
';
    var_dump( utf8_substr( $str , 9 , 12 ) ); echo '
';
    var_dump( utf8_substr( $str , 19 , 12 ) ); echo '
';
    var_dump( utf8_substr( $str , 28 , -6 ) ); echo '
';
Copy code
$str = 'dfjdjf test 13f test 65&2 dataスス(1 is mfe&...just'; var_dump( utf8_substr( $str , 22 , 12 ) ); echo '
'; var_dump( utf8_substr( $str , 22 , -6 ) ); echo '
'; var_dump( utf8_substr( $str , 9 , 12 ) ); echo '
'; var_dump( utf8_substr( $str , 19 , 12 ) ); echo '
'; var_dump( utf8_substr( $str , 28 , -6 ) ); echo '
';

显示结果::(截取无乱码, 欢迎大家测试, 提交bug)
string(12) "据fdj"
string(26) "据fdj(1就mfe&…"
string(13) "13f试65&2数"
string(12) "数据fd"
string(20) "dj(1就mfe&…"

把我常用的分享出来

下面我们再来看中文截函数吧。

 代码如下 复制代码

function MooCutstr($string, $length, $dot = ' ...') {
 global $charset;

 if(strlen($string)   return $string;
 }
 $string = str_replace(array('&', '"', '<', '>'), array('&', '"', ''), $string);
 $strcut = '';
 if(strtolower($charset) == 'utf-8') {
  $n = $tn = $noc = 0;
  while($n    $t = ord($string[$n]);
   if($t == 9 || $t == 10 || (32     $tn = 1; $n++; $noc++;
   } elseif (194     $tn = 2; $n += 2; $noc += 2;
   } elseif (224     $tn = 3; $n += 3; $noc += 2;
   } elseif (240     $tn = 4; $n += 4; $noc += 2;
   } elseif (248     $tn = 5; $n += 5; $noc += 2;
   } elseif ($t == 252 || $t == 253) {
    $tn = 6; $n += 6; $noc += 2;
   } else {
    $n++;
   }
   if($noc >= $length) {
    break;
   }
  }
  if($noc > $length) {
   $n -= $tn;
  }
  $strcut = substr($string, 0, $n);
 } else {
  for($i = 0; $i    $strcut .= ord($string[$i]) > 127 ? $string[$i].$string[++$i] : $string[$i];
  }
 }
 //$strcut = str_replace(array('&', '"', ''), array('&', '"', '<', '>'), $strcut);

 return $strcut.$dot;
}

www.bkjia.comtruehttp://www.bkjia.com/PHPjc/631589.htmlTechArticle文章介绍了字符串截取函数从php自带的截取函数到最后支持中文,英文和中英文混合字符串截取方法介绍,有需要的朋友可参考一下。 取部...
Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
What is the difference between unset() and session_destroy()?What is the difference between unset() and session_destroy()?May 04, 2025 am 12:19 AM

Thedifferencebetweenunset()andsession_destroy()isthatunset()clearsspecificsessionvariableswhilekeepingthesessionactive,whereassession_destroy()terminatestheentiresession.1)Useunset()toremovespecificsessionvariableswithoutaffectingthesession'soveralls

What is sticky sessions (session affinity) in the context of load balancing?What is sticky sessions (session affinity) in the context of load balancing?May 04, 2025 am 12:16 AM

Stickysessionsensureuserrequestsareroutedtothesameserverforsessiondataconsistency.1)SessionIdentificationassignsuserstoserversusingcookiesorURLmodifications.2)ConsistentRoutingdirectssubsequentrequeststothesameserver.3)LoadBalancingdistributesnewuser

What are the different session save handlers available in PHP?What are the different session save handlers available in PHP?May 04, 2025 am 12:14 AM

PHPoffersvarioussessionsavehandlers:1)Files:Default,simplebutmaybottleneckonhigh-trafficsites.2)Memcached:High-performance,idealforspeed-criticalapplications.3)Redis:SimilartoMemcached,withaddedpersistence.4)Databases:Offerscontrol,usefulforintegrati

What is a session in PHP, and why are they used?What is a session in PHP, and why are they used?May 04, 2025 am 12:12 AM

Session in PHP is a mechanism for saving user data on the server side to maintain state between multiple requests. Specifically, 1) the session is started by the session_start() function, and data is stored and read through the $_SESSION super global array; 2) the session data is stored in the server's temporary files by default, but can be optimized through database or memory storage; 3) the session can be used to realize user login status tracking and shopping cart management functions; 4) Pay attention to the secure transmission and performance optimization of the session to ensure the security and efficiency of the application.

Explain the lifecycle of a PHP session.Explain the lifecycle of a PHP session.May 04, 2025 am 12:04 AM

PHPsessionsstartwithsession_start(),whichgeneratesauniqueIDandcreatesaserverfile;theypersistacrossrequestsandcanbemanuallyendedwithsession_destroy().1)Sessionsbeginwhensession_start()iscalled,creatingauniqueIDandserverfile.2)Theycontinueasdataisloade

What is the difference between absolute and idle session timeouts?What is the difference between absolute and idle session timeouts?May 03, 2025 am 12:21 AM

Absolute session timeout starts at the time of session creation, while an idle session timeout starts at the time of user's no operation. Absolute session timeout is suitable for scenarios where strict control of the session life cycle is required, such as financial applications; idle session timeout is suitable for applications that want users to keep their session active for a long time, such as social media.

What steps would you take if sessions aren't working on your server?What steps would you take if sessions aren't working on your server?May 03, 2025 am 12:19 AM

The server session failure can be solved through the following steps: 1. Check the server configuration to ensure that the session is set correctly. 2. Verify client cookies, confirm that the browser supports it and send it correctly. 3. Check session storage services, such as Redis, to ensure that they are running normally. 4. Review the application code to ensure the correct session logic. Through these steps, conversation problems can be effectively diagnosed and repaired and user experience can be improved.

What is the significance of the session_start() function?What is the significance of the session_start() function?May 03, 2025 am 12:18 AM

session_start()iscrucialinPHPformanagingusersessions.1)Itinitiatesanewsessionifnoneexists,2)resumesanexistingsession,and3)setsasessioncookieforcontinuityacrossrequests,enablingapplicationslikeuserauthenticationandpersonalizedcontent.

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

SublimeText3 Linux new version

SublimeText3 Linux new version

SublimeText3 Linux latest version

Dreamweaver Mac version

Dreamweaver Mac version

Visual web development tools

MantisBT

MantisBT

Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor