MySQL query efficiency problem

WBOY
2016-08-18 09:16:15

The problem is this:
The cate field in the database stores comma-separated values, such as: 1,2,3,4,5,
The parameter passed in at query time is an array, such as: array(1,2);
My clumsy approach is to loop over this array and join like conditions together, such as:

<code>(cate like '%1,%' or cate like '%2,%')</code>

But this is far too slow when the table is large. Is there a more efficient way? Thanks.

Reply content:

<code>Loop over the array you pass in and, inside the loop body, call MySQL's find_in_set function. When the string is not too long, find_in_set is much faster than like.</code>
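The loop described in this reply can be sketched as a small helper that builds one FIND_IN_SET test per array element. This is only an illustration: the name build_cate_filter is hypothetical, and the %s placeholders assume a driver that uses that parameter style (as PyMySQL and mysqlclient do). Binding the values as parameters, rather than splicing them into the string, also avoids SQL injection.

```python
def build_cate_filter(cate_ids):
    """Return (sql_fragment, params) matching rows whose cate list holds any id.

    Hypothetical helper; the column name `cate` comes from the question.
    """
    if not cate_ids:
        # Nothing to match: emit a constant-false condition instead of "()".
        return "0", []
    # One FIND_IN_SET test per id, joined with OR.
    clause = " OR ".join(["FIND_IN_SET(%s, cate)"] * len(cate_ids))
    # FIND_IN_SET compares strings, so each id is passed as text.
    return "(" + clause + ")", [str(c) for c in cate_ids]


sql, params = build_cate_filter([1, 2])
print("SELECT * FROM t WHERE " + sql, params)
```

The table name t in the last line is a placeholder. FIND_IN_SET matches whole list elements, so unlike the '%1,%' pattern it does not match 11 or 21 when looking for 1, but it still has to examine every row.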

1. Change the field type to SET. FIND_IN_SET is much faster than like, but it is still a string operation, and applying it to a column means the whole table must be scanned. The SET type has limitations of its own. It suits collections of states with a small, fixed range of values that are queried as a whole. For example, to record which provinces of China a person has visited: you can directly compare whether two people have visited the same provinces, or directly compute the difference. You can efficiently find the people who have visited exactly a given province or set of provinces, but finding everyone who has visited a given province, possibly among others, still scans the whole table.

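<code>FIND_IN_SET(str,strlist)
Returns a value in the range 1 to N if the string str is in the string list strlist, which consists of N comma-separated substrings. Returns 0 if str is not in the list.
</code>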
SQL> SELECT FIND_IN_SET('b','a,b,c,d');
SELECT FIND_IN_SET('b','a,b,c,d')
2
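To see why whole-element matching matters for the query in the question, here is a minimal Python sketch that imitates the two behaviours. Both functions are simplified stand-ins written for this illustration, not MySQL's implementation: NULL handling and collation rules are ignored.

```python
def find_in_set(needle, strlist):
    """Imitate FIND_IN_SET: 1-based position of needle in a comma list, else 0."""
    if strlist == "":
        return 0
    for position, item in enumerate(strlist.split(","), start=1):
        if item == needle:
            return position
    return 0


def like_contains(fragment, value):
    """Imitate LIKE '%fragment%' for a fragment that has no wildcards in it."""
    return fragment in value


cate = "11,21,"  # a row that does NOT belong to category 1

# The substring pattern '%1,%' matches anyway, because "11," ends in "1,".
false_positive = like_contains("1,", cate)

# Element-wise comparison correctly reports that 1 is absent.
position = find_in_set("1", cate)
```

So besides being slow, the original like '%1,%' condition can also return rows for categories 11, 21, 31 and so on.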

2. Use an intermediate (mapping) table, so that queries can take advantage of an index.
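A minimal sketch of the mapping-table idea follows. It uses Python's built-in sqlite3 module purely as a stand-in so the example can run anywhere; the table and column names (item, item_cate, cate_id) and the sample rows are assumptions made for illustration, and in MySQL the same schema and IN query would apply with that server's own types.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE item (id INTEGER PRIMARY KEY, name TEXT);
    -- One row per (item, category) pair replaces the comma-separated field.
    CREATE TABLE item_cate (
        item_id INTEGER NOT NULL,
        cate_id INTEGER NOT NULL,
        PRIMARY KEY (item_id, cate_id)
    );
    -- Index led by cate_id so lookups by category do not scan the table.
    CREATE INDEX idx_item_cate_cate ON item_cate (cate_id, item_id);
""")

# Sample rows in the old format, split into the mapping table.
legacy = {1: "1,2,3,", 2: "3,4,", 3: "11,21,"}
for item_id, cate in legacy.items():
    conn.execute("INSERT INTO item (id, name) VALUES (?, ?)",
                 (item_id, f"item{item_id}"))
    for token in cate.split(","):
        if token:  # skip the empty piece left by the trailing comma
            conn.execute("INSERT INTO item_cate VALUES (?, ?)",
                         (item_id, int(token)))


def items_in_any(conn, cate_ids):
    """Ids of items that belong to at least one of the given categories."""
    if not cate_ids:
        return []
    marks = ",".join("?" * len(cate_ids))
    rows = conn.execute(
        "SELECT DISTINCT item_id FROM item_cate "
        f"WHERE cate_id IN ({marks}) ORDER BY item_id",
        list(cate_ids),
    )
    return [row[0] for row in rows]
```

Calling items_in_any(conn, [1, 2]) plays the role of the original array(1,2) query: the comparison is on integers, and the index on cate_id can be used.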

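No matter how you write it or rewrite it, with functions or without, this data structure becomes very slow as soon as the data grows a little, and indexes are almost useless.
The only fix is to change how you store the data: don't put 1,2,3,4,5, into a single field. Split it up, or use NoSQL.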

You could consider MySQL's regular expression queries.

Redesign the database!!! However you look at it, string matching is never cheap. Pull the cate field out into a new relation table.

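If the query runs frequently, I suggest caching the data and having the program query the cache, or redesigning the table. Regular expressions in MySQL are very slow.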

You should go and ask the person who designed the table.

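I suggest changing the storage format.... It's not that storing the data this way is a big problem in itself.

It's that, having stored it this way, you now need fast, efficient lookups... which is bound to be making trouble for yourself. If you can change it, change it as soon as possible.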

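where cate like '%1,%' or cate like '%2,%': this condition has two problems, each of which prevents an index from being used:
1. the or condition; 2. the like '%...' pattern, which starts with a wildcard.
Optimizations for this query:
1. Change or to union all (rows that match both conditions are then returned twice; union removes the duplicates).
2. Do the fuzzy matching in the program: fetch all the data (or fetch 10% at a time, ten times) and loop over it doing a strpos match. Because PHP array operations are very fast, this can be quicker than querying the database directly, but it costs memory.
Also, with this approach, things like pagination can only be done in the program.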

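Really, the database design is the problem. Storing a string like '1,2,3..' in a field makes MySQL very inefficient. I suggest designing the cate field around bit operations; if there are many values, cate can be a bigint. Queries will then be very fast, and you avoid like and or, which are the bottlenecks in MySQL here.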
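The bit-operation design suggested here might look like the following sketch, in which category n is stored as bit n-1 of an integer column. It runs against Python's built-in sqlite3 as a stand-in for MySQL; the column name cate_mask, the helper names, and the cap of 63 categories (chosen to fit a signed BIGINT) are assumptions for illustration. The test is integer arithmetic rather than string matching, though an ordinary index still cannot serve the & condition.

```python
import sqlite3

MAX_CATE = 63  # assumed cap so the mask fits in a signed 64-bit integer


def cates_to_mask(cate_ids):
    """Pack category ids into one integer: category n sets bit n-1."""
    mask = 0
    for cate_id in cate_ids:
        if not 1 <= cate_id <= MAX_CATE:
            raise ValueError(f"category id out of range: {cate_id}")
        mask |= 1 << (cate_id - 1)
    return mask


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE item (id INTEGER PRIMARY KEY, cate_mask INTEGER NOT NULL)"
)
conn.executemany(
    "INSERT INTO item VALUES (?, ?)",
    [
        (1, cates_to_mask([1, 2, 3])),
        (2, cates_to_mask([3, 4])),
        (3, cates_to_mask([11, 21])),
    ],
)


def items_in_any(conn, cate_ids):
    """Ids of items sharing at least one bit with the requested categories."""
    wanted = cates_to_mask(cate_ids)
    rows = conn.execute(
        "SELECT id FROM item WHERE (cate_mask & ?) != 0 ORDER BY id",
        (wanted,),
    )
    return [row[0] for row in rows]
```

Here items_in_any(conn, [1, 2]) replaces the original pair of like conditions with a single bitwise test. The limit on the number of distinct categories is the main trade-off of this design.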

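I don't recommend using MySQL's built-in functions. I didn't quite understand what exactly you mean by 1,2 and 1,2,3,4,5; my guess is that they are the types each record belongs to,
for example: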

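<code>What are the "old nine subjects"? They are Chinese, Mathematics, English, Chemistry, Biology, Physics, Geography, History, and Politics.
All nine share the type "old nine subjects", and among the nine there are also science subjects and liberal arts subjects.
"Old nine subjects", science, and liberal arts are 1, 2, and 3 respectively.
So the type field for Chinese, Mathematics, and English is 1,2,3
    the type field for Chemistry, Physics, and Biology is 1,2
    the type field for Geography, History, and Politics is 1,3</code>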

(I suggest not putting business logic into MySQL; the simpler the MySQL field types, the better. This is used below.)

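<code>Chinese, Mathematics, English, Chemistry, Biology, Physics, Geography, History, and Politics each already have the types they belong to.
Could you define 1,2,3  1,2  1,3 as arrays instead?
Here it is written in PHP:
            $arr = array(
                        1=>array(1,2,3),
                        2=>array(1,2),
                        3=>array(1,3)
                    );
 When inserting rows into the table, the cate field then only needs to store 1, 2, or 3, and you can query directly for 1 or 2 or 3. Handle the business logic in the script;
 don't do too much processing in MySQL. The simpler the better.
 If that is still unclear, you could also use
             $arr = array(
                        '1,2,3'=>1,
                        '1,2'=>2,
                        '1,3'=>3
                    );
            and query the corresponding MySQL data after the program has done the processing.</code>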