Home  >  Article  >  Backend Development  >  In-depth study of keyword matching projects-the introduction of filters

In-depth study of keyword matching projects-the introduction of filters

WBOY
WBOYOriginal
2016-08-08 09:32:03969browse

In-depth study of keyword matching project (1) - Introduction of filters

When you start reading this article, please first understand the step-by-step instructions on how to do a keyword matching project (search engine)----Day 1~ The step-by-step guide on how to do a keyword matching project (search engine)----Twenty Two days

Foreword

Continuing from the previous chapter, after Xiao Shuai Shuai presented the completed code and test results, Boss Yu was asked to go online without saying anything. The reason is very simple: everyone wants to contribute, and benefits are indispensable. Yes, we will upgrade the things we encounter later. During that meal, Xiao Shuai Shuai was very happy and satisfied. Of course, Xiao Ding Ding, Boss Yu, and Director Wang were also with him.

Cause of the problem

With the operation of the project, the application of collecting keywords is becoming more and more widespread. Suddenly one day, Xiao Ding Ding came to find Xiao Shuai Shuai. Of course, Xiao Shuai Shuai was very happy, but after hearing the conversation that followed, he was greatly disappointed.

"Xiao Shuai Shuai, look, why do these words have &,?,%, there are a lot of messy words, these words are not suitable for use, please help me how to remove them." Xiao Ding Ding's eyes are also enough It was so powerful that it made Xiao Shuai Shuai tremble.

“Sister, okay, I’ll go take a look first. This matter will be taken care of by me. I’ll let you know when it’s resolved.”

了 Xiaoshuai went back to check and found that there were a lot of such weird words in the database. Xiao Shuai was panicked and had to find Yu Boss again.

“There are many words like &,?,% in the dictionary keywords now, and according to the matching algorithm, these words are all available, what should we do?”.

              Boss Yu listened to Xiao Shuaishuai’s explanation and came up with a plan. We control these keywords. Since these keywords are not used in the business anyway, we don’t want them to be included in the database. This can reduce the number of keywords in the database. quantity, and can ensure high availability of keywords.

                                                                                                      Xiao Shuai Shuai is different from the boss, and he is beyond his reach.

Solution

                                                                                                              Added a function to control the entry of etymology into the database, the admission that conforms to the rules, and the discard that does not conform to the rules - the function of the filter.

Design plan

The upgraded version is decisive and colorful.

Class diagram:

Code plan

1. The codes of InputSource, FileSource and TaobaoHotsSource remain unchanged.

2. New filter code is added to the Source code, as follows:

<span> 1</span> <span>class</span><span> Source {
</span><span> 2</span> 
<span> 3</span>     <span>public</span> <span>$keywords</span><span>;
</span><span> 4</span> 
<span> 5</span>     <span>public</span> <span>function</span><span> run() {
</span><span> 6</span> 
<span> 7</span>         <span>foreach</span> (<span>$this</span>->keywords <span>as</span> <span>$word</span><span>) {
</span><span> 8</span>             <span>#</span><span> code...</span>
<span> 9</span>             <span>if</span>(Filter::is(<span>$word</span>)) <span>continue</span><span>;
</span><span>10</span>             
<span>11</span>             <span>$keyword</span> = <span>new</span><span> Keyword();
</span><span>12</span>             <span>$keyword</span>->word = <span>$word</span><span>;
</span><span>13</span>             <span>$keyword</span>-><span>save();
</span><span>14</span> <span>        }
</span><span>15</span> <span>    }
</span><span>16</span> 
<span>17</span> }

3. Added Filter code:

<span> 1</span> <?<span>php
</span><span> 2</span> 
<span> 3</span> <span>#</span><span>@Filename: filter/Filter.php</span>
<span> 4</span> <span>#</span><span>@Author: oShine</span>
<span> 5</span> 
<span> 6</span> <span>class</span><span> Filter {
</span><span> 7</span> 
<span> 8</span>     <span>private</span> <span>static</span> <span>$filterWords</span> = <span>array</span>("*","?","%",".","&"<span>);
</span><span> 9</span> 
<span>10</span>     <span>public</span> <span>static</span> <span>function</span> is(<span>$word</span><span>){
</span><span>11</span>         <span>$pattern</span> = "/(".<span>implode</span>("|", self::<span>$filterWords</span>).")/"<span>;
</span><span>12</span>         <span>return</span> <span>preg_match</span>(pattern, <span>$word</span><span>);
</span><span>13</span> <span>    }
</span><span>14</span> 
<span>15</span> }

Summary

Good code structure, adding and removing functions is very simple, and work efficiency is improved. Xiao Shuai Shuai fell in love with this job deeply.

The above introduces the in-depth study of the keyword matching project - the introduction of filters, including aspects of it. I hope it will be helpful to friends who are interested in PHP tutorials.

Statement:
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn