Home >Backend Development >PHP Tutorial >PHP simulates login and gets data through CURL
cURL is a powerful PHP library. Using PHP's cURL library, you can simply and effectively crawl web pages and collect content. Set cookies to complete simulated login web pages. Curl provides With its rich functions, developers can get more information about cURL from the PHP manual.
/*Simulate login and obtain data through CURL
Some websites require permission authentication, and you must log in to the website to be effective Catch the web page and collect the content.
This requires curl to set cookies to simulate logging in to the web page. PHP's curl is very efficient in capturing web page content.
It is relatively high and supports multi-threading, while file_get_contents() is slightly less efficient
. The code to simulate login is as follows:
*/
function login_post($url, $cookie, $post){
$ch = curl_init(); //Initialize the curl module
curl_setopt($ch, CURLOPT_URL, $url); //Log in and submit the address
curl_setopt($ ch, CURLOPT_HEADER, 0); //Whether to display header information
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 0); //Whether to automatically display the returned information
curl_setopt ($ch, CURLOPT_COOKIEJAR, $cookie); //Set cookie information to be saved in the specified folder
curl_setopt($ch, CURLOPT_POST, 1); //Submit in POST mode
curl_setopt($ch, CURLOPT_POSTFIELDS, http_build_query($post));//Information to be executed
curl_exec($ch); //Execute CURL
curl_close($ch);
}
/*
The function login_post() declared in the above example needs to provide a url address, a cookie file to save
, and post data (user name, password and other information). Note that PHP comes with The http_build_query()
function can convert the array into a connected string. If the login is successful through this function, we need to obtain the
successful login page information. . The code to declare the function is as follows:
*/
function get_content($url, $cookie){
$ch = curl_init(); //Initialize the curl module
curl_setopt($ch, CURLOPT_URL, $url); //Log in to submit the address
curl_setopt($ch, CURLOPT_HEADER , 0); //Whether to display header information
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1); //Whether to automatically display the returned information
curl_setopt($ch , CURLOPT_COOKIEJAR, $cookie);//Set cookie information and save it in the specified folder
$rs = curl_exec($ch); //Execute curl to transfer the page content
curl_close($ch);
return $rs; //return string
}
/*
Use curlopt_cookiefile in get_content() to read the cookie information saved by login
Finally, the page content is returned. Our purpose is to obtain the information after simulated login , that is,
is only useful information that can be obtained after a successful normal login. The following is an example code
*/
$post = array(
'_username' => 'g@ydma.cn',
'_password' => '123456',
'_submit' => ; 'Login'
);
$url = "http://www.ydma.cn/login/check"; //Login address, and original Website consistent
$cookie = dirname(__FILE__).'/cookie_ydma.txt'; //Set the path where cookies are saved
$url2 = "http:// www.ydma.cn/course/59"; //The address to obtain information after logging in
login_post($url, $cookie, $post); //Calling simulated login
$content = get_content($url2, $cookie); //After logging in, call the get_content() function to obtain the page information specified after logging in
@ unlik($cookie); //Delete the cookie file
file_put_contents('save.txt', $content); //Save the captured page content
Related recommendations:
How to get php parameters with # and other special symbols in the URL
The above is the detailed content of PHP simulates login and gets data through CURL. For more information, please follow other related articles on the PHP Chinese website!