Home  >  Article  >  Backend Development  >  Implementation method of putting PHP word segmentation into MVC framework

Implementation method of putting PHP word segmentation into MVC framework

小云云
小云云Original
2018-03-13 10:27:231198browse


This article mainly shares with you the implementation method of putting PHP word segmentation into the MVC framework. It is placed directly in the root directory of the website for testing, and in the thinkphp directory, and the class is compressed [ Util.rar】Extract to \ThinkPHP\Library\Org\Util

Code block

The code block syntax follows the standard markdown code, for example:

 <?php
namespace Org\Util;
// 严格开发模式
ini_set(&#39;display_errors&#39;, &#39;On&#39;);
ini_set(&#39;memory_limit&#39;, &#39;64M&#39;);
error_reporting(E_ALL);

$t1 = $ntime = microtime(true);
$endtime = &#39;未执行任何操作,不统计!&#39;;
function print_memory($rc, &$infostr)
{    global $ntime;

    $cutime = microtime(true);
    $etime = sprintf(&#39;%0.4f&#39;, $cutime - $ntime);
    $m = sprintf(&#39;%0.2f&#39;, memory_get_usage()/1024/1024);
    $infostr .= "{$rc}:  {$m} MB 用时:{$etime} 秒<br />\n";
    $ntime = $cutime;
}

header(&#39;Content-Type: text/html; charset=utf-8&#39;);

$memory_info = &#39;&#39;;
print_memory(&#39;没任何操作&#39;, $memory_info);

require_once &#39;/ThinkPHP/Library/Org/Util/Phpanalysis.class.php&#39;;

$str = (isset($_POST[&#39;source&#39;]) ? $_POST[&#39;source&#39;] : &#39;&#39;);

$loadtime = $endtime1  = $endtime2 = $slen = 0;

$do_fork = $do_unit = true;
$do_multi = $do_prop = $pri_dict = false;if($str != &#39;&#39;)
{
    //岐义处理
    $do_fork = empty($_POST[&#39;do_fork&#39;]) ? false : true;
    //新词识别
    $do_unit = empty($_POST[&#39;do_unit&#39;]) ? false : true;
    //多元切分
    $do_multi = empty($_POST[&#39;do_multi&#39;]) ? false : true;
    //词性标注
    $do_prop = empty($_POST[&#39;do_prop&#39;]) ? false : true;
    //是否预载全部词条
    $pri_dict = empty($_POST[&#39;pri_dict&#39;]) ? false : true;

    $tall = microtime(true);

    //初始化类
    PhpAnalysis::$loadInit = false;
    $pa = new PhpAnalysis(&#39;utf-8&#39;, &#39;utf-8&#39;, $pri_dict);
    print_memory(&#39;初始化对象&#39;, $memory_info);

    //载入词典
    $pa->LoadDict();
    print_memory(&#39;载入基本词典&#39;, $memory_info);    

    //执行分词
    $pa->SetSource($str);
    $pa->differMax = $do_multi;
    $pa->unitWord = $do_unit;

    $pa->StartAnalysis( $do_fork );
    print_memory(&#39;执行分词&#39;, $memory_info);

    $okresult = $pa->GetFinallyResult(&#39; &#39;, $do_prop);
    print_memory(&#39;输出分词结果&#39;, $memory_info);

    $pa_foundWordStr = $pa->foundWordStr;

    $t2 = microtime(true);
    $endtime = sprintf(&#39;%0.4f&#39;, $t2 - $t1);

    $slen = strlen($str);
    $slen = sprintf(&#39;%0.2f&#39;, $slen/1024);

    $pa = &#39;&#39;;

}

$teststr = "2010年1月,美国国际消费电子展 (CES)上,联想将展出一款基于ARM架构的新产品,这有可能是传统四大PC厂商首次推出的基于ARM架构的消费电子产品,也意味着在移动互联网和产业融合趋势下,传统的PC芯片霸主英特尔正在遭遇挑战。
11月12日,联想集团副总裁兼中国区总裁夏立向本报证实,联想基于ARM架构的新产品正在筹备中。
英特尔新闻发言人孟轶嘉表示,对第三方合作伙伴信息不便评论。
正面交锋
ARM内部人士透露,11月5日,ARM高级副总裁lanDrew参观了联想研究院,拜访了联想负责消费产品的负责人,进一步商讨基于ARM架构的新产品。ARM是英国芯片设计厂商,全球几乎95%的手机都采用ARM设计的芯片。
据悉,这是一款采用高通芯片(基于ARM架构)的新产品,高通产品市场总监钱志军表示,联想对此次项目很谨慎,对于产品细节不方便透露。
夏立告诉记者,联想研究院正在考虑多种方案,此款基于ARM架构的新产品应用邻域多样化,并不是替代传统的PC,而是更丰富的满足用户的需求。目前,客户调研还没有完成,“设计、研发更前瞻一些,最终还要看市场、用户接受程度。”";

?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<title>分词测试</title>
</head>
<body>
<table width=&#39;90%&#39; align=&#39;center&#39;>
<tr>
    <td>    

<hr size=&#39;1&#39; />

<form id="form1" name="form1" method="post" action="?ac=done" style="margin:0px;padding:0px;line-height:24px;">
  <b>源文本:</b>  <a href="dict_build_new.php" target="_blank">[更新词典]</a> <br/>
    <textarea name="source" style="width:98%;height:150px;font-size:14px;"><?php echo (isset($_POST[&#39;source&#39;]) ? $_POST[&#39;source&#39;] : $teststr); ?></textarea>
    <br/>
    <input type=&#39;checkbox&#39; name=&#39;do_fork&#39; value=&#39;1&#39; <?php echo ($do_fork ? "checked=&#39;1&#39;" : &#39;&#39;); ?>/>岐义处理
    <input type=&#39;checkbox&#39; name=&#39;do_unit&#39; value=&#39;1&#39; <?php echo ($do_unit ? "checked=&#39;1&#39;" : &#39;&#39;); ?>/>新词识别
    <input type=&#39;checkbox&#39; name=&#39;do_multi&#39; value=&#39;1&#39; <?php echo ($do_multi ? "checked=&#39;1&#39;" : &#39;&#39;); ?>/>多元切分
    <input type=&#39;checkbox&#39; name=&#39;do_prop&#39; value=&#39;1&#39; <?php echo ($do_prop ? "checked=&#39;1&#39;" : &#39;&#39;); ?>/>词性标注
    <input type=&#39;checkbox&#39; name=&#39;pri_dict&#39; value=&#39;1&#39; <?php echo ($pri_dict ? "checked=&#39;1&#39;" : &#39;&#39;); ?>/>预载全部词条
    <br/>
    <input type="submit" name="Submit" value="提交进行分词" />
         
    <input type="reset" name="Submit2" value="重设表单数据" />
</form>
<br />
<textarea name="result" id="result" style="width:98%;height:120px;font-size:14px;color:#555"><?php echo (isset($okresult) ? $okresult : &#39;&#39;); ?></textarea>
<br /><br />
<b>调试信息:</b>
<hr />
<font color=&#39;blue&#39;>字串长度:</font><?php echo $slen; ?>K <font color=&#39;blue&#39;>自动识别词:</font><?php echo (isset($pa_foundWordStr)) ? $pa_foundWordStr : &#39;&#39;; ?><br />
<hr />
<font color=&#39;blue&#39;>内存占用及执行时间:</font>(表示完成某个动作后正在占用的内存)<hr />
<?php echo $memory_info; ?>
总用时:<?php echo $endtime; ?> 秒
</td>
</tr>
</table>
</body>
</html>

Baidu download【Word segmentation .rar】

The above is the detailed content of Implementation method of putting PHP word segmentation into MVC framework. For more information, please follow other related articles on the PHP Chinese website!

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn