Python操作ES的方式及与Mysql数据同步的方法-mysql教程-PHP中文网

首页

数据库

mysql教程

Python操作ES的方式及与Mysql数据同步的方法

王林

Jun 01, 2023 pm 09:49 PM

mysqlpythones

Python操作Elasticsearch的两种方式

# 官方提供的：Elasticsearch
# pip install elasticsearch
# GUI：pyhon能做图形化界面编程吗？
	-Tkinter
  -pyqt
# 使用（查询是重点）
# pip3 install elasticsearch
https://github.com/elastic/elasticsearch-py
from elasticsearch import Elasticsearch
obj = Elasticsearch([&#39;127.0.0.1:9200&#39;,&#39;192.168.1.1:9200&#39;,&#39;192.168.1.2:9200&#39;],)
# 创建索引（Index）
# body:用来干什么？mapping：{},setting:{}
# result = obj.indices.create(index=&#39;user&#39;,ignore=400)
# print(result)
# 删除索引
# result = obj.indices.delete(index=&#39;user&#39;, ignore=[400, 404])
# 插入和查询数据（文档的增删查改），是最重要
# 插入数据
# POST news/politics/1
# {&#39;userid&#39;: &#39;1&#39;, &#39;username&#39;: &#39;lqz&#39;,&#39;password&#39;:&#39;123&#39;}
# data = {&#39;userid&#39;: &#39;1&#39;, &#39;username&#39;: &#39;lqz&#39;,&#39;password&#39;:&#39;123&#39;}
# result = obj.create(index=&#39;news&#39;, doc_type=&#39;politics&#39;, id=1, body=data)
# print(result)
# 更新数据
&#39;&#39;&#39;
不用doc包裹会报错
ActionRequestValidationException[Validation Failed: 1: script or doc is missing
&#39;&#39;&#39;
# data ={&#39;doc&#39;:{&#39;userid&#39;: &#39;1&#39;, &#39;username&#39;: &#39;lqz&#39;,&#39;password&#39;:&#39;123ee&#39;,&#39;test&#39;:&#39;test&#39;}}
# result = obj.update(index=&#39;news&#39;, doc_type=&#39;politics&#39;, body=data, id=1)
# print(result)
# 删除数据
# result = obj.delete(index=&#39;news&#39;, doc_type=&#39;politics&#39;, id=1)
# 查询
# 查找所有文档
# query = {&#39;query&#39;: {&#39;match_all&#39;: {}}}
#  查找名字叫做jack的所有文档
# query = {&#39;query&#39;: {&#39;match&#39;: {&#39;desc&#39;: &#39;娇憨可爱&#39;}}}
# query = {&#39;query&#39;: {&#39;term&#39;: {&#39;from&#39;: &#39;sheng&#39;}}}
query = {&#39;query&#39;: {&#39;term&#39;: {&#39;name&#39;: &#39;娘子&#39;}}}
# term和match的区别
# term是短语查询，不会对term的东西进行分词
# match 会多match的东西进行分词，再去查询
# 查找年龄大于11的所有文档
# allDoc = obj.search(index=&#39;lqz&#39;, doc_type=&#39;doc&#39;, body=query)
allDoc = obj.search(index=&#39;lqz&#39;, doc_type=&#39;doc&#39;, body=query)
print(allDoc)
import json
print(json.dumps(allDoc))
# print(allDoc[&#39;hits&#39;][&#39;hits&#39;][0][&#39;_source&#39;])
# 如何集成到django项目中：创建索引，提前创建好就行了
# 插入数据，查询数据，修改数据
# query = {&#39;query&#39;: {&#39;term&#39;: {&#39;name&#39;: &#39;娘子&#39;}}}
# allDoc = obj.search(index=&#39;lqz&#39;, doc_type=&#39;doc&#39;, body=query)
# json格式直接返回
# saas ：软件即服务，不是用人家服务，而是写服务给别人用----》正常的开发
# 舆情监测系统：（爬虫）
# 只监控微博---》宜家：微博，百度贴吧，上市公司
# 公安：负面的，---》追踪到哪个用户发的---》找上门了
# qq群，微信群----》舆情监控（第三方做不了，腾讯出的舆情监控，第三方机构跟腾讯合作，腾讯提供接口，第三方公司做）
# 平台开发出来，别人买服务---》买一年的微博关键字监控

ERP：公司财务，供应链

某个大公司，金蝶，用友，开发了软件----》你们公司自己买服务器---》软件跑在你服务器上
saas模式：公司买服务，10年服务----》账号密码---》登进去就能操作---》出了问题找用友---》服务器在别人那---》政务云，各种云---所有东西上云

---政府花钱买的东西---》用友敢泄露吗？
---未来的云计算---》只能能上网---》计算机运算能力有限---》上云买服务---》计算1+。。。+100 ---》买了计算服务，直接拿到结果

# 第二种使用方式
# https://github.com/elastic/elasticsearch-dsl-py
# pip3 install elasticsearch-dsl
from datetime import datetime
from elasticsearch_dsl import Document, Date, Nested, Boolean,analyzer, InnerDoc, Completion, Keyword, Text,Integer
from elasticsearch_dsl.connections import connections
connections.create_connection(hosts=["localhost"])
class Article(Document):
    title = Text(analyzer=&#39;ik_max_word&#39;, search_analyzer="ik_max_word", fields={&#39;title&#39;: Keyword()})
    author = Text()
    class Index:
        name = &#39;myindex&#39;  # 索引名
    def save(self, ** kwargs):
        return super(Article, self).save(** kwargs)
if __name__ == &#39;__main__&#39;:
    # Article.init()  # 创建映射
    # 保存数据
    # article = Article()
    # article.title = "测试数据"
    # article.author = "egon"
    # article.save()  # 数据就保存了
    #查询数据
    # s=Article.search()
    # s = s.filter(&#39;match&#39;, title="测试")
    # results = s.execute()
    # # 类比queryset对象，列表中一个个对象
    # # es中叫Response，当成一个列表，列表中放一个个对象
    # print(results)
    #删除数据
    # s = Article.search()
    # s = s.filter(&#39;match&#39;, title="测试").delete()
    #修改数据
    s = Article().search()
    s = s.filter(&#39;match&#39;, title="测试")
    results = s.execute()
    print(results[0])
    results[0].title="xxx"
    results[0].save()
    # 其他操作，参见文档

mysql和Elasticsearch同步数据

# 只要article表插入一条数据，就自动同步到es中
# 第一种方案：
	-每当aritcle表插入一条数据（视图类中，Article.objects.create()，update）
  -往es中插入一条
  -缺陷：代码耦合度高，改好多地方
# 第二种方案：
	-重写create方法，重写update方法
  -缺陷：同步操作---》es中插入必须返回结果才能继续往下走
# 第三种方案：
	-用celery，做异步
  -缺陷：引入celery，还得有消息队列。。。
# 第四种方案：（用的最多）
	-重写create方法，重写update方法，用信号存入，异步操作
  -缺陷：有代码侵入
# 第五种方案：（项目不写代码，自动同步），第三方开源的插件
	-https://github.com/siddontang/go-mysql-elasticsearch----go写
  -你可以用python重写一个，放到git上给别人用（读了mysql的日志）
  -跟平台无关，跟语言无关
  -如何使用：
  	-源码下载---》交叉编译---》可执行文件--》运行起来--》配置文件配好，就完事了
    # 配置文件
    [[source]]
    schema = "数据库名"
    tables = ["article"]
    [[rule]]
    schema = "数据库名"
    table = "表明"
    index = "索引名"
    type = "类型名"
  # 缺陷：
  	-es跟mysql同步时，不希望把表所有字段都同步，mysql的多个表对着es的一个类型
  # 话术升级：
  	-一开始同步
    -用了开源插件（读取mysql日志，连接上es，进行同步）
    -用信号自己写的
    -再高端：仿着他的逻辑，用python自己写的，----》（把这个东西开源出来）

haystack的使用

django上的一个第三方模块 ---》你使用过的django第三方模块有哪些？
可以在django上实现全文检索
相当于orm--》对接es，solr，whoosh
https://www.yisu.com/article/218631.htm
不支持es，6以上版本
haystack+Elasticsearch实现全文检索
es的原生操作：ELasticsearch Elasticsearch-dsl

Redis补充

#1  只有5种数据结构：
	-多种数据结构：字符串，hash，列表，集合，有序集合
#2  单线程，速度为什么这么快？
  -本质还是因为是内存数据库
  -epoll模型（io多路复用）
  -单线程，没有线程，进程间的通信
#3 linux上 安装redis#下载
  https://redis.io/download/
  #解压
  tar -xzf redis-5.0.7.tar.gz
  #建立软连接
  ln -s redis-5.0.7 redis
  cd redis
  make&&make install
  # bin路径下几个命令：redis-cli,redis-server,redis-sentinel
  # 在任意位置能够执行redis-server 如何做？配置环境变量
#4  启动redis的三种方式
  	-方式一：(一般不用，没有配置文件)
    	-redis-server
    -方式二：（用的也很少）
    	redis-serve --port 6380
    -方式三：（都用这种，配置文件）
    	daemonize yes #是否以守护进程启动
      pidfile /var/run/redis.pid   #进程号的位置，删除
      port 6379    #端口号
      dir "/opt/soft/redis/data"  #工作目录
      logfile 6379.log #日志位置  
      # 启动：redis-server redis.conf1
#5 客户端连接
  redis-cli -h 127.0.0.1 -p 6379
#6 使用场景
  -看md文档

以上是Python操作ES的方式及与Mysql数据同步的方法的详细内容。更多信息请关注PHP中文网其他相关文章！

声明

本文转载于：亿速云。如有侵权，请联系admin@php.cn删除

在MySQL中使用视图的局限性是什么？May 14, 2025 am 12:10 AM

mysqlviewshavelimitations：1）他们不使用Supportallsqloperations，限制DatamanipulationThroughViewSwithJoinSorsubqueries.2）他们canimpactperformance，尤其是withcomplexcomplexclexeriesorlargedatasets.3）

确保您的MySQL数据库：添加用户并授予特权May 14, 2025 am 12:09 AM

porthusermanagementInmysqliscialforenhancingsEcurityAndsingsmenting效率databaseoperation.1）usecReateusertoAddusers，指定connectionsourcewith@'localhost'or@'％'。

哪些因素会影响我可以在MySQL中使用的触发器数量？May 14, 2025 am 12:08 AM

mysqldoes notimposeahardlimitontriggers，butacticalfactorsdeterminetheireffactective：1）serverConfiguration impactactStriggerGermanagement; 2）复杂的TriggerSincreaseSySystemsystem load; 3）largertablesslowtriggerperfermance; 4）highConconcConcrencerCancancancancanceTigrignecentign; 5）; 5）

mysql：存储斑点安全吗？May 14, 2025 am 12:07 AM

Yes,it'ssafetostoreBLOBdatainMySQL,butconsiderthesefactors:1)StorageSpace:BLOBscanconsumesignificantspace,potentiallyincreasingcostsandslowingperformance.2)Performance:LargerrowsizesduetoBLOBsmayslowdownqueries.3)BackupandRecovery:Theseprocessescanbe

mySQL：通过PHP Web界面添加用户May 14, 2025 am 12:04 AM

通过PHP网页界面添加MySQL用户可以使用MySQLi扩展。步骤如下：1.连接MySQL数据库，使用MySQLi扩展。2.创建用户，使用CREATEUSER语句，并使用PASSWORD()函数加密密码。3.防止SQL注入，使用mysqli_real_escape_string()函数处理用户输入。4.为新用户分配权限，使用GRANT语句。

mysql：blob和其他无-SQL存储，有什么区别？May 13, 2025 am 12:14 AM

mysql'sblobissuitableForStoringBinaryDataWithInareLationalDatabase，而alenosqloptionslikemongodb，redis和calablesolutionsoluntionsoluntionsoluntionsolundortionsolunsolunsstructureddata.blobobobsimplobissimplobisslowderperformandperformanceperformancewithlararengelitiate;

mySQL添加用户：语法，选项和安全性最佳实践May 13, 2025 am 12:12 AM

toaddauserinmysql，使用：createUser'username'@'host'Indessify'password'; there'showtodoitsecurely：1）choosethehostcarecarefullytocon trolaccess.2）setResourcelimitswithoptionslikemax_queries_per_hour.3）usestrong，iniquepasswords.4）Enforcessl/tlsconnectionswith

MySQL：如何避免字符串数据类型常见错误？May 13, 2025 am 12:09 AM

toAvoidCommonMistakeswithStringDatatatPesInMysQl，CloseStringTypenuances，chosethirtightType，andManageEngencodingAndCollationsEttingsefectery.1）usecharforfixed lengengters lengengtings，varchar forbariaible lengength，varchariable length，andtext/blobforlabforlargerdata.2 seterters seterters seterters seterters

See all articles