Home  >  Q&A  >  body text

python2 爬虫builtwith模块

看了书教的 识别网站所有技术
教程识别的网站返回很清楚

但是我试了俩个都貌似没什么效果,而且有的还报了错误

伊谢尔伦伊谢尔伦2765 days ago1108

reply all(1)I'll reply

  • 伊谢尔伦

    伊谢尔伦2017-04-18 10:19:18

    This is used to check which technologies the website uses. If no technology is used, the returned information will be less. The author can check more websites!

    In [5]: import builtwith
    
    In [6]: builtwith.parse('http://httpbin.org')
    Out[6]: {u'web-servers': [u'Nginx']}
    
    In [7]: builtwith.parse('http://ctb988.com')
    Out[7]: {u'programming-languages': [u'Java']}
    
    In [8]: builtwith.parse('http://m8128.com')
    Out[8]: {u'programming-languages': [u'Java'], u'web-frameworks': [u'JavaServer Pages']}
    
    In [9]: builtwith.parse('http://www.douban.com')
    Out[9]: 
    {u'analytics': [u'Piwik'],
     u'javascript-frameworks': [u'jQuery'],
     u'tag-managers': [u'Google Tag Manager']}
    
    In [10]: builtwith.parse('http://example.webscraping.com')
    Out[10]: 
    {u'javascript-frameworks': [u'jQuery', u'Modernizr', u'jQuery UI'],
     u'programming-languages': [u'Python'],
     u'web-frameworks': [u'Web2py', u'Twitter Bootstrap'],
     u'web-servers': [u'Nginx']}
    

    reply
    0
  • Cancelreply