search
HomeBackend DevelopmentPHP TutorialTeach you step by step how to do a keyword matching project (search engine) ---- Day 22, teach you how to do it on the 22nd day_PHP Tutorial

Teach you step by step how to do keyword matching project (search engine) ---- Day 22, teach you how to do it on day 22

Latest interview experience: Interview Feelings (2), Interview feelings

The latest architecture: high-concurrency data collection architecture application (Redis application)

Comment: I just adjusted my mentality today and continued to write the articles that I had not finished before. I have also taken a break in the past few months. I went home to do hard work, which was also regarded as exercising my body. After all, anything It can't change your health. I also recommend handsome guys in the IT industry to exercise more and exercise other parts of your body.

Day 22

Starting point: Teach you step by step how to do keyword matching project (search engine) ---- Day 1

Review: Teach you step by step how to do keyword matching project (search engine) ---- Day 21

Xiao Shuaishuai is a person who likes to make summaries. Based on the knowledge he has learned before, he summarized it as follows:

1. The problem of expansion and type of baby attributes has been well controlled initially, but there are still big obstacles in promotion, operation and maintenance.

2. To split keywords, we use scws extension and our own native business word splitting solution. Word splitting effectively solves the difficulty of matching phrases.

3. All the initial work seems to have been completed, and only the final finishing work is needed to make the project officially operational.

Xiao Shuaishuai has a strong sense of initiative. He wrote a code by himself without asking Boss Yu. The code is mainly to connect all the steps.

For the construction of extended CharList of baby attributes, please refer to: Teach you step by step how to do keyword matching projects (search engine) ---- Day 12 ~ Teach you step by step how to do keyword matching projects (search engine) ---- Day 18

The main steps of Selector are as follows:

1. Get baby attributes.

2. Use business knowledge to expand baby attributes to form CharList

3. Get keywords from the thesaurus

4. Keyword splitting algorithm

5. Matching degree algorithm

6. Return the matching keyword list

The code is as follows:

<span> 1</span> <?<span>php
</span><span> 2</span> <span>#</span><span>@Filename:selector/Selector.php</span>
<span> 3</span> <span>#</span><span>@Author:oshine</span>
<span> 4</span> 
<span> 5</span> <span>require_once</span> <span>dirname</span>(<span>__FILE__</span>) . '/SelectorItem.php'<span>;
</span><span> 6</span> <span>require_once</span> <span>dirname</span>(<span>__FILE__</span>) . '/charlist/CharList.php'<span>;
</span><span> 7</span> <span>require_once</span> <span>dirname</span>(<span>__FILE__</span>) . '/charlist/CharlistHandle.php'<span>;
</span><span> 8</span> <span>require_once</span> <span>dirname</span>(<span>dirname</span>(<span>__FILE__</span>)) . '/lib/Logger.php'<span>;
</span><span> 9</span> 
<span>10</span> <span>class</span><span> Selector
</span><span>11</span> <span>{
</span><span>12</span> 
<span>13</span>     <span>private</span> <span>static</span> <span>$charListHandle</span> = <span>array</span><span>(
</span><span>14</span>         "黑名单" => "BacklistCharListHandle",
<span>15</span>         "近义词" => "LinklistCharListHandle"
<span>16</span> <span>    );
</span><span>17</span> 
<span>18</span>     <span>public</span> <span>static</span> <span>function</span> select(<span>$num_iid</span><span>)
</span><span>19</span> <span>    {
</span><span>20</span>         <span>$selectorItem</span> = SelectorItem::createFromApi(<span>$num_iid</span><span>);
</span><span>21</span> 
<span>22</span>         Logger::trace(<span>$selectorItem</span>-><span>props_name);
</span><span>23</span> 
<span>24</span>         <span>$charlist</span> = <span>new</span><span> CharList();
</span><span>25</span> 
<span>26</span>         <span>foreach</span> (self::<span>$charListHandle</span> <span>as</span> <span>$matchKey</span> => <span>$className</span><span>) {
</span><span>27</span> 
<span>28</span>             <span>$handle</span> = self::createCharListHandle(<span>$className</span>, <span>$charlist</span>, <span>$selectorItem</span><span>);
</span><span>29</span>             <span>$handle</span>-><span>exec</span><span>();
</span><span>30</span> 
<span>31</span> <span>        }
</span><span>32</span> 
<span>33</span>         <span>$selectWords</span> = <span>array</span><span>();
</span><span>34</span> 
<span>35</span>         <span>$keywords</span> = DB::makeArray("select word from keywords"<span>);
</span><span>36</span>         <span>foreach</span> (<span>$keywords</span> <span>as</span> <span>$val</span><span>) {
</span><span>37</span>             <span>#</span><span> code...</span>
<span>38</span>             <span>$keywordEntity</span> = SplitterApp::<span>split</span>(<span>$val</span>["word"<span>]);
</span><span>39</span>             
<span>40</span>                 <span>#</span><span> code...</span>
<span>41</span>             <span>if</span>(MacthExector::macth(<span>$keywordEntity</span>,<span>$charlist</span><span>)){
</span><span>42</span>                 <span>$selectWords</span>[] = <span>$val</span>["word"<span>];
</span><span>43</span> <span>            }           
</span><span>44</span> 
<span>45</span> <span>        }
</span><span>46</span> 
<span>47</span>         <span>return</span> <span>$selectWords</span><span>;
</span><span>48</span> <span>    }
</span><span>49</span> 
<span>50</span>     <span>public</span> <span>static</span> <span>function</span> createCharListHandle(<span>$className</span>, <span>$charlist</span>, <span>$selectorItem</span><span>)
</span><span>51</span> <span>    {
</span><span>52</span>         <span>if</span> (<span>class_exists</span>(<span>$className</span><span>)) {
</span><span>53</span>             <span>return</span> <span>new</span> <span>$className</span>(<span>$charlist</span>, <span>$selectorItem</span><span>);
</span><span>54</span> <span>        }
</span><span>55</span>         <span>throw</span> <span>new</span> <span>Exception</span>("class not exists", 0<span>);
</span><span>56</span> <span>    }
</span><span>57</span> }

For test-driven code programming, please refer to:

The same principle is also used. Write the test code first, and then complete the MatchExector code.

The main function of MatchExector is to calculate the matching degree.

1. If there is only one word in the blacklist, the matching degree will definitely be zero.

2. If it is a core word, then calculate it according to the algorithm mentioned before, please refer to: Teach you step-by-step keyword matching project (search engine) ---- Day 19

<span> 1</span> <?<span>php
</span><span> 2</span> <span>#</span><span>@Filename:mathes/MatchExector.php</span>
<span> 3</span> <span>#</span><span>@Author:oshine</span>
<span> 4</span> 
<span> 5</span> <span>class</span><span> MatchExector {
</span><span> 6</span> 
<span> 7</span>     <span>public</span> <span>static</span> <span>function</span> match(KeywordEntity <span>$keywordEntity</span>,CharList <span>$charlist</span><span>){
</span><span> 8</span> 
<span> 9</span>         <span>$matchingDegree</span> = 0<span>;
</span><span>10</span>         <span>$elementWords</span> = <span>$keywordEntity</span>-><span>getElementWords();
</span><span>11</span>         <span>foreach</span> (<span>$elementWords</span> <span>as</span> <span>$word</span><span>) {
</span><span>12</span>             <span>#</span><span> code...</span>
<span>13</span>             <span>if</span>(<span>in_array</span>(<span>$word</span>, <span>$charlist</span>-><span>getBlacklist()))
</span><span>14</span>                 <span>return</span> <span>false</span><span>;
</span><span>15</span>             <span>if</span>(<span>in_array</span>(<span>$word</span>, <span>$charlist</span>-><span>getCore()))
</span><span>16</span>                 <span>$matchingDegree</span>+=<span>$keywordEntity</span>->calculateWeight(<span>$word</span><span>);
</span><span>17</span> 
<span>18</span> <span>        }
</span><span>19</span> 
<span>20</span>         <span>if</span>(<span>$matchingDegree</span>>0.8<span>)
</span><span>21</span>             <span>return</span> <span>true</span><span>;
</span><span>22</span>         <span>return</span> <span>false</span><span>;
</span><span>23</span> 
<span>24</span> <span>    }
</span><span>25</span>     
<span>26</span> }

Relatively speaking, the entire code has achieved the functions it should have. Xiao Shuaishuai is very happy, because after the project is completed, there will definitely be a project bonus, and maybe a rich dinner,

My mouth is watering just thinking about it.

Xiao Shuai Shuai handed the code to Boss Yu and waited expectantly for Boss Yu’s final affirmation.

What will be Boss Yu’s reaction after reading it? Please pay attention to Chapter 3: In-depth study of keyword matching projects (1)

Chapter 2 has been completed, source code address: Teach you step by step how to do keyword matching projects (Chapter 2 completed)

www.bkjia.comtruehttp: //www.bkjia.com/PHPjc/931389.htmlTechArticleTeach you step by step how to do keyword matching projects (search engines) ---- Day 22, teach you The latest interview experience on the 22nd day: interview feelings (2), interview feelings, the latest structure...
Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
解释一下explorer.exe进程是什么解释一下explorer.exe进程是什么Feb 18, 2024 pm 12:11 PM

explorer.exe是什么进程在我们使用Windows操作系统的时候,经常会听到一个名词"explorer.exe".那么,你是否好奇这个进程到底是什么?在本文中,我们将详细解释explorer.exe是什么进程以及其功能和作用。首先,explorer.exe是Windows操作系统的一个关键进程,它负责管理和控制Windows资源管理器(Window

r5 5600x最高能带动什么显卡 最新用5600X搭配RX6800XT性能r5 5600x最高能带动什么显卡 最新用5600X搭配RX6800XT性能Feb 25, 2024 am 10:34 AM

10月29日,AMD终于发布了备受用户期待的重磅产品,即基于全新RDNA2架构的RX6000系列游戏显卡。这款显卡与之前推出的基于全新ZEN3架构的锐龙5000系列处理器相辅相成,形成了一个全新的双A组合。这一次的发布不仅使得竞争对手“双英”黯然失色,也对整个DIY硬件圈产生了重大影响。接下来,围绕笔者手中这套AMD锐龙5600X和RX6800XT的组合作为测试例子,来见证下现如今的AMD究竟有多么Yse?首先说说CPU处理器部分,上一代采用ZEN2架构的AMD锐龙3000系列处理器其实已经令用

内存频率和时序哪个对性能影响更大内存频率和时序哪个对性能影响更大Feb 19, 2024 am 08:58 AM

内存是计算机中非常重要的组件之一,它对计算机的性能和稳定性有着重要影响。在选择内存时,人们往往会关注两个重要的参数,即时序和频率。那么,对于内存性能来说,时序和频率哪个更重要呢?首先,我们来了解一下时序和频率的概念。时序指的是内存芯片在接收和处理数据时所需的时间间隔。它通常以CL值(CASLatency)来表示,CL值越小,内存的处理速度越快。而频率则是内

发生0x0000004e错误代表了什么问题发生0x0000004e错误代表了什么问题Feb 18, 2024 pm 01:54 PM

0x0000004e是什么故障在计算机系统中,故障是一个常见的问题。当计算机遇到故障时,系统通常会因为无法正常运行而出现停机、崩溃或者出现错误提示。而在Windows系统中,有一个特定的故障代码0x0000004e,这是一个蓝屏错误代码,表示系统遇到了一个严重的错误。0x0000004e蓝屏错误是由于系统内核或驱动程序问题导致的。这种错误通常会导致计算机系统

教你使用 iOS 17.4「失窃设备保护」新进阶功能教你使用 iOS 17.4「失窃设备保护」新进阶功能Mar 10, 2024 pm 04:34 PM

Apple在周二推出了iOS17.4更新,为iPhone带来了一系列新功能和修复。这次更新包括了全新的表情符号,同时欧盟用户也能够下载其他应用商店。此外,更新还加强了对iPhone安全性的控制,引入了更多的「失窃设备保护」设置选项,为用户提供更多选择和保障。"iOS17.3首次引入了“失窃设备保护”功能,为用户的敏感资料增加了额外的安全保障。当用户不在家等熟悉地点时,该功能要求用户首次输入生物特征信息,并在一小时后再次输入信息才能访问和更改某些数据,如修改AppleID密码或关闭失窃设备保护功能

Microsoft Edge在哪设置显示下载按钮-Microsoft Edge设置显示下载按钮的方法Microsoft Edge在哪设置显示下载按钮-Microsoft Edge设置显示下载按钮的方法Mar 06, 2024 am 11:49 AM

大家知道MicrosoftEdge在哪设置显示下载按钮吗?下文小编就带来了MicrosoftEdge设置显示下载按钮的方法,希望对大家能够有所帮助,一起跟着小编来学习一下吧!第一步:首先打开MicrosoftEdge浏览器,单击右上角【...】标识,如下图所示。第二步:然后在弹出菜单中,单击【设置】,如下图所示。第三步:接着单击界面左侧【外观】,如下图所示。第四步:最后单击【显示下载按钮】右侧按钮,由灰变蓝即可,如下图所示。上面就是小编为大家带来的MicrosoftEdge在哪设置显示下载按钮的

哪些免费的dll修复工具可以使用?哪些免费的dll修复工具可以使用?Feb 19, 2024 pm 08:35 PM

免费的dll修复工具有哪些导语:随着电脑使用的频繁,有时我们可能会遇到一些dll文件损坏或丢失的问题,这会导致某些软件无法正常运行,给用户带来了很大的困扰。幸运的是,市面上有一些免费的dll修复工具可以帮助我们解决这个问题。本文将介绍几款常用的免费dll修复工具,并对其功能和特点进行分析。一、DLL-FilesFixerDLL-FilesFixer是一

专访 OSL Group CEO:交易所也可以“不走寻常路”专访 OSL Group CEO:交易所也可以“不走寻常路”Feb 29, 2024 pm 07:25 PM

整理:Babywhale,ForesightNews目前获得了香港虚拟资产交易所牌照的交易所有两家:OSL和HaskKeyExchange。很多人在不同的Web3媒体平台上会看到HashKeyExchange上线各种代币的消息,而OSL的类似新闻却相对较少。但实际上,OSL正积极进取,只不过它选择了一条似乎与传统交易所不同的发展道路。潘志勇在获得了BGX的投资后,担任了OSL的新董事局主席和首席执行官。OSL此前在公众眼中相对低调,现在面临着转变的压力,潘志勇将如何引领公司走向前景,这将成为一个

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
Repo: How To Revive Teammates
1 months agoBy尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

SublimeText3 Linux new version

SublimeText3 Linux new version

SublimeText3 Linux latest version

MantisBT

MantisBT

Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

SAP NetWeaver Server Adapter for Eclipse

SAP NetWeaver Server Adapter for Eclipse

Integrate Eclipse with SAP NetWeaver application server.