Home  >  Article  >  Backend Development  >  Intercept Chinese string in php

Intercept Chinese string in php

WBOY
WBOYOriginal
2016-07-25 08:42:12944browse
1. Intercept GB2312 Chinese string
  1. < ?php
  2. //Intercept Chinese string
  3. function mysubstr($str, $start, $len) {
  4. $tmpstr = "";
  5. $strlen = $start + $len;
  6. for($i = 0; $i < $strlen; $i++) {
  7. if(ord(substr($str, $i, 1)) > 0xa0) {
  8. $tmpstr .= substr($str, $i, 2);
  9. $i++;
  10. } else
  11. $tmpstr .= substr($str, $i, 1);
  12. }
  13. return $tmpstr;
  14. }
  15. ?> ;
Copy code
2. Intercept the utf8 encoded multi-byte string
  1. < ?php
  2. //Intercept the utf8 string
  3. function utf8Substr($str, $from, $len)
  4. {
  5. return preg_replace( '#^(?:[x00-x7F]|[xC0-xFF][x80-xBF]+){0,'.$from.'}'.
  6. '((?:[x00-x7F]|[xC0 -xFF][x80-xBF]+){0,'.$len.'}).*#s',
  7. '$1',$str);
  8. }
  9. ?>
Copy code
3. Chinese character interception function supported by UTF-8 and GB2312
  1. < ?php
  2. /*
  3. Utf-8 and gb2312 Chinese character interception function supported by
  4. cut_str(string, interception length, Start length, encoding);
  5. Encoding defaults to utf-8
  6. Start length defaults to 0
  7. */
  8. function cut_str($string, $sublen, $start = 0, $code = 'UTF-8')
  9. {
  10. if($code == 'UTF-8')
  11. {
  12. $pa = "/[x01-x7f]|[xc2-xdf][x80-xbf]|xe0[xa0-xbf][x80-xbf]|[ xe1-xef][x80-xbf][x80-xbf]|xf0[x90-xbf][x80-xbf][x80-xbf]|[xf1-xf7][x80-xbf][x80-xbf][x80- xbf]/";
  13. preg_match_all($pa, $string, $t_string);
  14. if(count($t_string[0]) - $start > $sublen) return join('', array_slice($t_string[0 ], $start, $sublen))."...";
  15. return join('', array_slice($t_string[0], $start, $sublen));
  16. }
  17. else
  18. {
  19. $start = $ start*2;
  20. $sublen = $sublen*2;
  21. $strlen = strlen($string);
  22. $tmpstr = '';
  23. for($i=0; $i< $strlen; $i++)
  24. {
  25. if($i>=$start && $i< ($start+$sublen))
  26. {
  27. if(ord(substr($string, $i, 1))>129)
  28. {
  29. $tmpstr.= substr ($string, $i, 2);
  30. }
  31. else
  32. {
  33. $tmpstr.= substr($string, $i, 1);
  34. }
  35. }
  36. if(ord(substr($string, $i, 1) ))>129) $i++;
  37. }
  38. if(strlen($tmpstr)< $strlen ) $tmpstr.= "...";
  39. return $tmpstr;
  40. }
  41. }
  42. $str = "abcd The string that needs to be intercepted";
  43. echo cut_str($str, 8, 0, 'gb2312');
  44. ?>
Copy code
4. BugFree’s character interception function
  1. < ?php
  2. /**
  3. * @package BugFree
  4. * @version $Id: FunctionsMain.inc.php,v 1.32 2005/09/24 11:38:37 wwccss Exp $
  5. *
  6. *
  7. * Return part of a string(Enhance the function substr())
  8. *
  9. * @author Chunsheng Wang
  10. * @param string $String the string to cut.
  11. * @param int $Length the length of returned string.
  12. * @param booble $Append whether append "...": false|true
  13. * @return string the cutted string.
  14. */
  15. function sysSubStr($String,$Length,$Append = false)
  16. {
  17. if (strlen($String) < ; = $Length )
  18. {
  19. return $String;
  20. }
  21. else
  22. {
  23. $I = 0;
  24. while ($I < $Length)
  25. {
  26. $StringTMP = substr($String,$I,1) ;
  27. if ( ord($StringTMP) >=224 )
  28. {
  29. $StringTMP = substr($String,$I,3);
  30. $I = $I + 3;
  31. }
  32. elseif( ord($StringTMP) >=192 )
  33. {
  34. $StringTMP = substr($String,$I,2);
  35. $I = $I + 2;
  36. }
  37. else
  38. {
  39. $I = $I + 1;
  40. }
  41. $ StringLast[] = $StringTMP;
  42. }
  43. $StringLast = implode("",$StringLast);
  44. if($Append)
  45. {
  46. $StringLast .= "...";
  47. }
  48. return $StringLast;
  49. }
  50. }
  51. $String = "17test.info is at the forefront of automated testing in China";
  52. $Length = "18";
  53. $Append = false;
  54. echo sysSubStr($String,$Length,$Append);
  55. ? >
Copy code

php


Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn