15 PHP interview questions about high concurrency (summary)-PHP Tutorial-php.cn

Home

Backend Development

PHP Tutorial

15 PHP interview questions about high concurrency (summary)

青灯夜游

Jul 20, 2020 pm 05:13 PM

phpHigh concurrency

15 PHP interview questions about high concurrency (summary)

Recommended related articles: "In-depth discussion of solutions and solutions for "high concurrency and large traffic" access"

1. What is rabbitmq

A message queue technology using the AMQP advanced message queue protocol. The biggest feature is that consumption does not require ensuring that the provider exists, achieving a high degree of decoupling between services

2. Why use rabbitmq

in distributed The system has a series of advanced functions such as asynchronous, peak clipping, load balancing, etc.;
has a persistence mechanism, and process messages and information in the queue can also be saved.
Realize decoupling between consumers and producers.
For high-concurrency scenarios, the use of message queues can turn synchronous access into serial access to reach a certain amount of current limit, which is beneficial to database operations.
You can use the message queue to achieve the effect of asynchronous order placement. During the queue, logical orders are placed in the background.

3. Scenarios using rabbitmq

Asynchronous communication between services
Sequential consumption
Timing task
Request peak cutting

4. How to ensure that messages are sent to RabbitMQ correctly? How to ensure that the message receiver consumes the message?

Sender confirmation mode

Set the channel to confirm mode (sender confirmation mode), then all Messages published on the channel are assigned a unique ID.

Once the message is delivered to the destination queue, or the message is written to disk (persistent message), the channel will send a confirmation to the producer (containing the unique message ID).

If an internal error occurs in RabbitMQ and the message is lost, a nack (not acknowledged, unconfirmed) message will be sent.

The sender confirmation mode is asynchronous, and the producer application can continue to send messages while waiting for confirmation. When the confirmation message reaches the producer application, the callback method of the producer application will be triggered to process the confirmation message.

Receiver confirmation mechanism

Receiver message confirmation mechanism

Consumer receives each message All messages must be confirmed (message reception and message confirmation are two different operations). Only when the consumer confirms the message can RabbitMQ safely delete the message from the queue.

The timeout mechanism is not used here. RabbitMQ only confirms whether the message needs to be resent through the interruption of the Consumer connection. In other words, as long as the connection is not interrupted, RabbitMQ gives the Consumer enough time to process the message. Ensure the ultimate consistency of data;

The following are some special cases

If the consumer receives the message and disconnects before confirming it When connecting or unsubscribing, RabbitMQ will consider that the message has not been distributed and re-distribute it to the next subscribed consumer. (There may be hidden dangers of repeated consumption of messages and duplication needs to be removed)
If the consumer receives the message but does not confirm the message and the connection is not disconnected, RabbitMQ considers the consumer busy. No further messages will be distributed to this consumer.

#5. How to avoid repeated delivery or repeated consumption of messages?

When producing messages, MQ internally generates an inner-msg-id for each message sent by the producer, which is used as the basis for deduplication (message delivery fails and is retransmitted) to avoid duplication. The message enters the queue;

When consuming messages, it is required that there must be a bizId in the message body (globally unique for the same business, such as payment ID, order ID, post ID, etc.) as the basis for deduplication to avoid The same message is consumed repeatedly.

6. On what basis is the message transmitted?

Because the creation and destruction of TCP connections is expensive, and the number of concurrency is limited by system resources, it will cause a performance bottleneck. RabbitMQ uses channels to transmit data. A channel is a virtual connection established within a real TCP connection, and there is no limit to the number of channels on each TCP connection.

7. How to distribute messages?

If the queue has at least one consumer subscription, the message will be sent to the consumer in a round-robin manner. Each message will only be distributed to one subscribed consumer (provided the consumer can process the message and confirm it normally). Multiple consumption functions can be realized through routing

#8. How to route messages?

Message provider->Routing->One to multiple queues When a message is published to the exchange, the message will have a routing key (routing key), which is set when the message is created. Queues can be bound to switches through queue routing keys.

After the message reaches the exchanger, RabbitMQ will match the routing key of the message with the routing key of the queue (there are different routing rules for different exchangers);

Commonly used exchangers are mainly Divided into three types

fanout: If the exchange receives a message, it will be broadcast to all bound queues
direct: If the routing keys match exactly, the message is delivered to the corresponding queue
topic: allows messages from different sources to reach the same queue. When using the topic exchanger, you can use wildcards

9. How to ensure that messages are not lost?

Message persistence, of course, the premise is that the queue must be persistent. The way RabbitMQ ensures that persistent messages can be recovered from server restarts is to write them to a persistent log file on disk. , when publishing a persistent message to a durable exchange, Rabbit will not send a response until the message is submitted to the log file.

Once a consumer consumes a persistent message from the persistent queue, RabbitMQ will mark the message in the persistence log as waiting for garbage collection. If RabbitMQ is restarted before a persistent message is consumed, Rabbit will automatically rebuild the exchange and queues (and bindings) and republish the message in the persistence log file to the appropriate queue.

10. What are the benefits of using RabbitMQ?

1. High degree of decoupling between services

2. High asynchronous communication performance

3. Traffic peak shaving

11, rabbitmq cluster

Mirror cluster mode

The queue you create, regardless of metadata or messages in the queue, will It exists on multiple instances, and every time you write a message to the queue, the message will be automatically sent to the queues of multiple instances for message synchronization.

The advantage is that if any of your machines goes down, it doesn't matter, other machines can be used. The disadvantages are, first, the performance overhead is too high. The messages are synchronized on all machines, resulting in heavy network bandwidth pressure and consumption! Second, if you play like this, there is no scalability. If a queue is heavily loaded and you add a machine, the new machine will also contain all the data of the queue, and there is no way to linearly expand your queue

12.mq Disadvantages

Reduced system availability

The more external dependencies introduced into the system, The easier it is to hang up, originally you are just the interface for system A to call the three systems BCD and system ABCD. There is no problem. If you add MQ, what will happen if MQ hangs up? If MQ hangs up and the entire system collapses, wouldn't you be doomed?

Increased system complexity

If you add MQ abruptly, how can you ensure that messages are not consumed repeatedly? How to deal with message loss? How to ensure the order of message delivery? Big head, big head, a lot of problems, endless pain

13. Consistency problem

A system returns directly after processing, success, people They all thought that your request was successful; but the problem is, if among the three systems of BCD and the two systems of BD successfully write the library, but the C system fails to write the library, what should be done? Your data is inconsistent.

So the message queue is actually a very complex architecture. There are many benefits to introducing it, but you also have to make various additional technical solutions and architectures to avoid the disadvantages it brings. It is best to do it later , you will find, oh my God, the system complexity has increased by an order of magnitude, maybe 10 times more complicated. But at critical moments, you still have to use it

14. Distributed transactions

Segmented submission. There will be an arbiter and then send messages to all nodes. Success will occur only after all nodes are acked. Otherwise, you have to wait for resending.

15. How to design for a sudden large traffic situation like live broadcast.

NGINX plus machine
cdn cache static page
redis Queue, let users come in slowly.
Add cache. Cache user data, such as user information.
Database uses master-slave
Elastic expansion
Current limiting circuit breaker

Related tutorial recommendations: "PHP Tutorial"

The above is the detailed content of 15 PHP interview questions about high concurrency (summary). For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:博客园. If there is any infringement, please contact admin@php.cn delete

How can you protect against Cross-Site Scripting (XSS) attacks related to sessions?Apr 23, 2025 am 12:16 AM

To protect the application from session-related XSS attacks, the following measures are required: 1. Set the HttpOnly and Secure flags to protect the session cookies. 2. Export codes for all user inputs. 3. Implement content security policy (CSP) to limit script sources. Through these policies, session-related XSS attacks can be effectively protected and user data can be ensured.

How can you optimize PHP session performance?Apr 23, 2025 am 12:13 AM

Methods to optimize PHP session performance include: 1. Delay session start, 2. Use database to store sessions, 3. Compress session data, 4. Manage session life cycle, and 5. Implement session sharing. These strategies can significantly improve the efficiency of applications in high concurrency environments.

What is the session.gc_maxlifetime configuration setting?Apr 23, 2025 am 12:10 AM

Thesession.gc_maxlifetimesettinginPHPdeterminesthelifespanofsessiondata,setinseconds.1)It'sconfiguredinphp.iniorviaini_set().2)Abalanceisneededtoavoidperformanceissuesandunexpectedlogouts.3)PHP'sgarbagecollectionisprobabilistic,influencedbygc_probabi

How do you configure the session name in PHP?Apr 23, 2025 am 12:08 AM

In PHP, you can use the session_name() function to configure the session name. The specific steps are as follows: 1. Use the session_name() function to set the session name, such as session_name("my_session"). 2. After setting the session name, call session_start() to start the session. Configuring session names can avoid session data conflicts between multiple applications and enhance security, but pay attention to the uniqueness, security, length and setting timing of session names.

How often should you regenerate session IDs?Apr 23, 2025 am 12:03 AM

The session ID should be regenerated regularly at login, before sensitive operations, and every 30 minutes. 1. Regenerate the session ID when logging in to prevent session fixed attacks. 2. Regenerate before sensitive operations to improve safety. 3. Regular regeneration reduces long-term utilization risks, but the user experience needs to be weighed.

How do you set the session cookie parameters in PHP?Apr 22, 2025 pm 05:33 PM

Setting session cookie parameters in PHP can be achieved through the session_set_cookie_params() function. 1) Use this function to set parameters, such as expiration time, path, domain name, security flag, etc.; 2) Call session_start() to make the parameters take effect; 3) Dynamically adjust parameters according to needs, such as user login status; 4) Pay attention to setting secure and httponly flags to improve security.

What is the main purpose of using sessions in PHP?Apr 22, 2025 pm 05:25 PM

The main purpose of using sessions in PHP is to maintain the status of the user between different pages. 1) The session is started through the session_start() function, creating a unique session ID and storing it in the user cookie. 2) Session data is saved on the server, allowing data to be passed between different requests, such as login status and shopping cart content.

How can you share sessions across subdomains?Apr 22, 2025 pm 05:21 PM

How to share a session between subdomains? Implemented by setting session cookies for common domain names. 1. Set the domain of the session cookie to .example.com on the server side. 2. Choose the appropriate session storage method, such as memory, database or distributed cache. 3. Pass the session ID through cookies, and the server retrieves and updates the session data based on the ID.

See all articles