


Teach you step by step how to do keyword matching project (search engine) ---- Day 22, teach you how to do it on day 22
Latest interview experience: Interview Feelings (2), Interview feelings
The latest architecture: high-concurrency data collection architecture application (Redis application)
Comment: I just adjusted my mentality today and continued to write the articles that I had not finished before. I have also taken a break in the past few months. I went home to do hard work, which was also regarded as exercising my body. After all, anything It can't change your health. I also recommend handsome guys in the IT industry to exercise more and exercise other parts of your body.
Day 22
Starting point: Teach you step by step how to do keyword matching project (search engine) ---- Day 1
Review: Teach you step by step how to do keyword matching project (search engine) ---- Day 21
Xiao Shuaishuai is a person who likes to make summaries. Based on the knowledge he has learned before, he summarized it as follows:
1. The problem of expansion and type of baby attributes has been well controlled initially, but there are still big obstacles in promotion, operation and maintenance.
2. To split keywords, we use scws extension and our own native business word splitting solution. Word splitting effectively solves the difficulty of matching phrases.
3. All the initial work seems to have been completed, and only the final finishing work is needed to make the project officially operational.
Xiao Shuaishuai has a strong sense of initiative. He wrote a code by himself without asking Boss Yu. The code is mainly to connect all the steps.
For the construction of extended CharList of baby attributes, please refer to: Teach you step by step how to do keyword matching projects (search engine) ---- Day 12 ~ Teach you step by step how to do keyword matching projects (search engine) ---- Day 18
The main steps of Selector are as follows:
1. Get baby attributes.
2. Use business knowledge to expand baby attributes to form CharList
3. Get keywords from the thesaurus
4. Keyword splitting algorithm
5. Matching degree algorithm
6. Return the matching keyword list
The code is as follows:
<span> 1</span> <?<span>php </span><span> 2</span> <span>#</span><span>@Filename:selector/Selector.php</span> <span> 3</span> <span>#</span><span>@Author:oshine</span> <span> 4</span> <span> 5</span> <span>require_once</span> <span>dirname</span>(<span>__FILE__</span>) . '/SelectorItem.php'<span>; </span><span> 6</span> <span>require_once</span> <span>dirname</span>(<span>__FILE__</span>) . '/charlist/CharList.php'<span>; </span><span> 7</span> <span>require_once</span> <span>dirname</span>(<span>__FILE__</span>) . '/charlist/CharlistHandle.php'<span>; </span><span> 8</span> <span>require_once</span> <span>dirname</span>(<span>dirname</span>(<span>__FILE__</span>)) . '/lib/Logger.php'<span>; </span><span> 9</span> <span>10</span> <span>class</span><span> Selector </span><span>11</span> <span>{ </span><span>12</span> <span>13</span> <span>private</span> <span>static</span> <span>$charListHandle</span> = <span>array</span><span>( </span><span>14</span> "黑名单" => "BacklistCharListHandle", <span>15</span> "近义词" => "LinklistCharListHandle" <span>16</span> <span> ); </span><span>17</span> <span>18</span> <span>public</span> <span>static</span> <span>function</span> select(<span>$num_iid</span><span>) </span><span>19</span> <span> { </span><span>20</span> <span>$selectorItem</span> = SelectorItem::createFromApi(<span>$num_iid</span><span>); </span><span>21</span> <span>22</span> Logger::trace(<span>$selectorItem</span>-><span>props_name); </span><span>23</span> <span>24</span> <span>$charlist</span> = <span>new</span><span> CharList(); </span><span>25</span> <span>26</span> <span>foreach</span> (self::<span>$charListHandle</span> <span>as</span> <span>$matchKey</span> => <span>$className</span><span>) { </span><span>27</span> <span>28</span> <span>$handle</span> = self::createCharListHandle(<span>$className</span>, <span>$charlist</span>, <span>$selectorItem</span><span>); </span><span>29</span> <span>$handle</span>-><span>exec</span><span>(); </span><span>30</span> <span>31</span> <span> } </span><span>32</span> <span>33</span> <span>$selectWords</span> = <span>array</span><span>(); </span><span>34</span> <span>35</span> <span>$keywords</span> = DB::makeArray("select word from keywords"<span>); </span><span>36</span> <span>foreach</span> (<span>$keywords</span> <span>as</span> <span>$val</span><span>) { </span><span>37</span> <span>#</span><span> code...</span> <span>38</span> <span>$keywordEntity</span> = SplitterApp::<span>split</span>(<span>$val</span>["word"<span>]); </span><span>39</span> <span>40</span> <span>#</span><span> code...</span> <span>41</span> <span>if</span>(MacthExector::macth(<span>$keywordEntity</span>,<span>$charlist</span><span>)){ </span><span>42</span> <span>$selectWords</span>[] = <span>$val</span>["word"<span>]; </span><span>43</span> <span> } </span><span>44</span> <span>45</span> <span> } </span><span>46</span> <span>47</span> <span>return</span> <span>$selectWords</span><span>; </span><span>48</span> <span> } </span><span>49</span> <span>50</span> <span>public</span> <span>static</span> <span>function</span> createCharListHandle(<span>$className</span>, <span>$charlist</span>, <span>$selectorItem</span><span>) </span><span>51</span> <span> { </span><span>52</span> <span>if</span> (<span>class_exists</span>(<span>$className</span><span>)) { </span><span>53</span> <span>return</span> <span>new</span> <span>$className</span>(<span>$charlist</span>, <span>$selectorItem</span><span>); </span><span>54</span> <span> } </span><span>55</span> <span>throw</span> <span>new</span> <span>Exception</span>("class not exists", 0<span>); </span><span>56</span> <span> } </span><span>57</span> }
For test-driven code programming, please refer to:
The same principle is also used. Write the test code first, and then complete the MatchExector code.
The main function of MatchExector is to calculate the matching degree.
1. If there is only one word in the blacklist, the matching degree will definitely be zero.
2. If it is a core word, then calculate it according to the algorithm mentioned before, please refer to: Teach you step-by-step keyword matching project (search engine) ---- Day 19
<span> 1</span> <?<span>php </span><span> 2</span> <span>#</span><span>@Filename:mathes/MatchExector.php</span> <span> 3</span> <span>#</span><span>@Author:oshine</span> <span> 4</span> <span> 5</span> <span>class</span><span> MatchExector { </span><span> 6</span> <span> 7</span> <span>public</span> <span>static</span> <span>function</span> match(KeywordEntity <span>$keywordEntity</span>,CharList <span>$charlist</span><span>){ </span><span> 8</span> <span> 9</span> <span>$matchingDegree</span> = 0<span>; </span><span>10</span> <span>$elementWords</span> = <span>$keywordEntity</span>-><span>getElementWords(); </span><span>11</span> <span>foreach</span> (<span>$elementWords</span> <span>as</span> <span>$word</span><span>) { </span><span>12</span> <span>#</span><span> code...</span> <span>13</span> <span>if</span>(<span>in_array</span>(<span>$word</span>, <span>$charlist</span>-><span>getBlacklist())) </span><span>14</span> <span>return</span> <span>false</span><span>; </span><span>15</span> <span>if</span>(<span>in_array</span>(<span>$word</span>, <span>$charlist</span>-><span>getCore())) </span><span>16</span> <span>$matchingDegree</span>+=<span>$keywordEntity</span>->calculateWeight(<span>$word</span><span>); </span><span>17</span> <span>18</span> <span> } </span><span>19</span> <span>20</span> <span>if</span>(<span>$matchingDegree</span>>0.8<span>) </span><span>21</span> <span>return</span> <span>true</span><span>; </span><span>22</span> <span>return</span> <span>false</span><span>; </span><span>23</span> <span>24</span> <span> } </span><span>25</span> <span>26</span> }
Relatively speaking, the entire code has achieved the functions it should have. Xiao Shuaishuai is very happy, because after the project is completed, there will definitely be a project bonus, and maybe a rich dinner,
My mouth is watering just thinking about it.
Xiao Shuai Shuai handed the code to Boss Yu and waited expectantly for Boss Yu’s final affirmation.
What will be Boss Yu’s reaction after reading it? Please pay attention to Chapter 3: In-depth study of keyword matching projects (1)
Chapter 2 has been completed, source code address: Teach you step by step how to do keyword matching projects (Chapter 2 completed)

explorer.exe是什么进程在我们使用Windows操作系统的时候,经常会听到一个名词"explorer.exe".那么,你是否好奇这个进程到底是什么?在本文中,我们将详细解释explorer.exe是什么进程以及其功能和作用。首先,explorer.exe是Windows操作系统的一个关键进程,它负责管理和控制Windows资源管理器(Window

10月29日,AMD终于发布了备受用户期待的重磅产品,即基于全新RDNA2架构的RX6000系列游戏显卡。这款显卡与之前推出的基于全新ZEN3架构的锐龙5000系列处理器相辅相成,形成了一个全新的双A组合。这一次的发布不仅使得竞争对手“双英”黯然失色,也对整个DIY硬件圈产生了重大影响。接下来,围绕笔者手中这套AMD锐龙5600X和RX6800XT的组合作为测试例子,来见证下现如今的AMD究竟有多么Yse?首先说说CPU处理器部分,上一代采用ZEN2架构的AMD锐龙3000系列处理器其实已经令用

内存是计算机中非常重要的组件之一,它对计算机的性能和稳定性有着重要影响。在选择内存时,人们往往会关注两个重要的参数,即时序和频率。那么,对于内存性能来说,时序和频率哪个更重要呢?首先,我们来了解一下时序和频率的概念。时序指的是内存芯片在接收和处理数据时所需的时间间隔。它通常以CL值(CASLatency)来表示,CL值越小,内存的处理速度越快。而频率则是内

0x0000004e是什么故障在计算机系统中,故障是一个常见的问题。当计算机遇到故障时,系统通常会因为无法正常运行而出现停机、崩溃或者出现错误提示。而在Windows系统中,有一个特定的故障代码0x0000004e,这是一个蓝屏错误代码,表示系统遇到了一个严重的错误。0x0000004e蓝屏错误是由于系统内核或驱动程序问题导致的。这种错误通常会导致计算机系统

Apple在周二推出了iOS17.4更新,为iPhone带来了一系列新功能和修复。这次更新包括了全新的表情符号,同时欧盟用户也能够下载其他应用商店。此外,更新还加强了对iPhone安全性的控制,引入了更多的「失窃设备保护」设置选项,为用户提供更多选择和保障。"iOS17.3首次引入了“失窃设备保护”功能,为用户的敏感资料增加了额外的安全保障。当用户不在家等熟悉地点时,该功能要求用户首次输入生物特征信息,并在一小时后再次输入信息才能访问和更改某些数据,如修改AppleID密码或关闭失窃设备保护功能

大家知道MicrosoftEdge在哪设置显示下载按钮吗?下文小编就带来了MicrosoftEdge设置显示下载按钮的方法,希望对大家能够有所帮助,一起跟着小编来学习一下吧!第一步:首先打开MicrosoftEdge浏览器,单击右上角【...】标识,如下图所示。第二步:然后在弹出菜单中,单击【设置】,如下图所示。第三步:接着单击界面左侧【外观】,如下图所示。第四步:最后单击【显示下载按钮】右侧按钮,由灰变蓝即可,如下图所示。上面就是小编为大家带来的MicrosoftEdge在哪设置显示下载按钮的

免费的dll修复工具有哪些导语:随着电脑使用的频繁,有时我们可能会遇到一些dll文件损坏或丢失的问题,这会导致某些软件无法正常运行,给用户带来了很大的困扰。幸运的是,市面上有一些免费的dll修复工具可以帮助我们解决这个问题。本文将介绍几款常用的免费dll修复工具,并对其功能和特点进行分析。一、DLL-FilesFixerDLL-FilesFixer是一

整理:Babywhale,ForesightNews目前获得了香港虚拟资产交易所牌照的交易所有两家:OSL和HaskKeyExchange。很多人在不同的Web3媒体平台上会看到HashKeyExchange上线各种代币的消息,而OSL的类似新闻却相对较少。但实际上,OSL正积极进取,只不过它选择了一条似乎与传统交易所不同的发展道路。潘志勇在获得了BGX的投资后,担任了OSL的新董事局主席和首席执行官。OSL此前在公众眼中相对低调,现在面临着转变的压力,潘志勇将如何引领公司走向前景,这将成为一个


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

SublimeText3 Chinese version
Chinese version, very easy to use

mPDF
mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

SublimeText3 Linux new version
SublimeText3 Linux latest version

MantisBT
Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

SAP NetWeaver Server Adapter for Eclipse
Integrate Eclipse with SAP NetWeaver application server.
