Node.js implements WeChat applet to capture web content-JS Tutorial-php.cn

Home

Web Front-end

JS Tutorial

Node.js implements WeChat applet to capture web content

不言

Oct 20, 2018 pm 05:17 PM

cheerionode.jsWeChat appletweb scraping

The content of this article is about node.js implementing WeChat applet to capture web content. It has certain reference value. Friends in need can refer to it. I hope it will be helpful to you.

Recently I am studying the cloud development function of WeChat applet. The biggest advantage of cloud development is that there is no need to build a server on the front end. You can use cloud capabilities to write a WeChat mini program from scratch, which avoids the cost of buying a server. For individuals who want to try to practice WeChat mini programs from the front end to the backend Development is still a good choice. It is possible to launch a WeChat mini program in one day.

Advantages of cloud development

Cloud development provides developers with complete cloud support, weakening the back-end and operation and maintenance concepts. There is no need to build a server and use the API provided by the platform. Core business development can achieve rapid launch and iteration. At the same time, this capability is compatible with the cloud services already used by developers and is not mutually exclusive.

Cloud development currently provides three basic capabilities to support:

Cloud function: code running in the cloud, WeChat private protocol is naturally authenticated, and developers only need to write their own Business logic code
Database: A JSON database that can be operated on the front end of the mini program and read and written in cloud functions
Storage : Upload/download cloud files directly on the front end of the mini program, and manage them visually in the cloud development console

Okay, I have introduced so much knowledge about cloud development, students with perceptual knowledge can study it Research. Official document address: https://developers.weixin.qq....

Web content capture

The applet is about answering questions, so the source of the questions is a problem. Searching on the Internet, pasting one question one by one is one way to do it, but with such repetitive work, I would probably give up after about 10 posts. So I thought of web scraping. I just happened to pick up the node I learned before.

Must-have tools:

Cheerio. A package similar to server-side JQuery. It is mainly used to analyze and filter the crawled content.
fs module of node. This is the module that comes with node and is used to read and write files. This is used to write the parsed data into a json file.
Axios (optional). Used to crawl the HTML pages of the website. Because the data I want is rendered after clicking a button on the web page, it cannot be captured by directly accessing this URL. I have no choice but to copy the desired content, save it as a string, and parse the string.

Next, you can use npm init to initialize a node project, and press Enter to generate a package.json file.
Then npm install --save axios cheerio installs the cheerio and axios packages.

The key is to use cheerio to implement a function similar to jquery. Just cheerio.load(quesitons) the captured content, and then you can follow the jquery operation to get the DOM and assemble the data you want.

Finally use fs.writeFile to save the data to the json file, and you're done.

The specific code is as follows

let axios = require("axios");

let cheerio = require("cheerio");

let fs = require("fs");

// 我的html结构大致如下，有很多条数据
const questions = `

举头望明月，__________。

回首白云低

低头思故乡

当春乃发生

红掌拨清波

__________，却话巴山夜雨时。

何当共剪西窗烛

在天愿做比翼鸟

世味年来薄似纱

两岸青山相对出

.......... `; const $ = cheerio.load(quesitons); var arr = []; for (var i = 0; i { if (err) throw err; console.log("json文件已成功保存！"); });

The file format after saving to json is as follows, so that it can be uploaded to the cloud server through the json file.

Node.js implements WeChat applet to capture web content

Notes

For the database developed by WeChat applet cloud, the data format for uploading json files is required Note that I was always prompted with a format error before, but later I discovered that the JSON data is not an array, but similar to JSON Lines, that is, each record object is separated by n instead of commas. Therefore, it is necessary to do a small process on the json file written by node before it can be uploaded successfully.

The above is the detailed content of Node.js implements WeChat applet to capture web content. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:segmentfault思否. If there is any infringement, please contact admin@php.cn delete

Python vs. JavaScript: A Comparative Analysis for DevelopersMay 09, 2025 am 12:22 AM

The main difference between Python and JavaScript is the type system and application scenarios. 1. Python uses dynamic types, suitable for scientific computing and data analysis. 2. JavaScript adopts weak types and is widely used in front-end and full-stack development. The two have their own advantages in asynchronous programming and performance optimization, and should be decided according to project requirements when choosing.

Python vs. JavaScript: Choosing the Right Tool for the JobMay 08, 2025 am 12:10 AM

Whether to choose Python or JavaScript depends on the project type: 1) Choose Python for data science and automation tasks; 2) Choose JavaScript for front-end and full-stack development. Python is favored for its powerful library in data processing and automation, while JavaScript is indispensable for its advantages in web interaction and full-stack development.

Python and JavaScript: Understanding the Strengths of EachMay 06, 2025 am 12:15 AM

Python and JavaScript each have their own advantages, and the choice depends on project needs and personal preferences. 1. Python is easy to learn, with concise syntax, suitable for data science and back-end development, but has a slow execution speed. 2. JavaScript is everywhere in front-end development and has strong asynchronous programming capabilities. Node.js makes it suitable for full-stack development, but the syntax may be complex and error-prone.

JavaScript's Core: Is It Built on C or C ?May 05, 2025 am 12:07 AM

JavaScriptisnotbuiltonCorC ;it'saninterpretedlanguagethatrunsonenginesoftenwritteninC .1)JavaScriptwasdesignedasalightweight,interpretedlanguageforwebbrowsers.2)EnginesevolvedfromsimpleinterpreterstoJITcompilers,typicallyinC ,improvingperformance.

JavaScript Applications: From Front-End to Back-EndMay 04, 2025 am 12:12 AM

JavaScript can be used for front-end and back-end development. The front-end enhances the user experience through DOM operations, and the back-end handles server tasks through Node.js. 1. Front-end example: Change the content of the web page text. 2. Backend example: Create a Node.js server.

Python vs. JavaScript: Which Language Should You Learn?May 03, 2025 am 12:10 AM

Choosing Python or JavaScript should be based on career development, learning curve and ecosystem: 1) Career development: Python is suitable for data science and back-end development, while JavaScript is suitable for front-end and full-stack development. 2) Learning curve: Python syntax is concise and suitable for beginners; JavaScript syntax is flexible. 3) Ecosystem: Python has rich scientific computing libraries, and JavaScript has a powerful front-end framework.

JavaScript Frameworks: Powering Modern Web DevelopmentMay 02, 2025 am 12:04 AM

The power of the JavaScript framework lies in simplifying development, improving user experience and application performance. When choosing a framework, consider: 1. Project size and complexity, 2. Team experience, 3. Ecosystem and community support.

The Relationship Between JavaScript, C , and BrowsersMay 01, 2025 am 12:06 AM

Introduction I know you may find it strange, what exactly does JavaScript, C and browser have to do? They seem to be unrelated, but in fact, they play a very important role in modern web development. Today we will discuss the close connection between these three. Through this article, you will learn how JavaScript runs in the browser, the role of C in the browser engine, and how they work together to drive rendering and interaction of web pages. We all know the relationship between JavaScript and browser. JavaScript is the core language of front-end development. It runs directly in the browser, making web pages vivid and interesting. Have you ever wondered why JavaScr

See all articles