This figure illustrates the application of learning techniques in a drug recommendation system.-AI-php.cn

Home

Technology peripherals

This figure illustrates the application of learning techniques in a drug recommendation system.

PHPz

May 08, 2023 pm 08:07 PM

AISmart medical

This figure illustrates the application of learning techniques in a drug recommendation system.

Introduction: The topic shared this time is the application of graph representation learning technology in drug recommendation systems.

mainly includes the following four parts:

Research background and challenges
Discriminative drug package recommendation
Generative drugs Package recommendation
Summary and outlook

1. Research background and challenges

1. Research background

#The overall shortage of medical resources and uneven distribution bring heavy pressure

Drug recommendation is a sub-problem of smart medical care. Let’s start with the general background of smart medical care. There is an urgency for smart medical care in our country. As the population grows and the aging population intensifies, , people’s demand for high-quality medical services continues to rise. Two sets of data in the figure: first, the number of visits to medical institutions across the country was 6.05 billion, a year-on-year increase of 22.4%; second, statistics on the medical and health conditions of various countries in The Lancet show that only 57.4% of Chinese doctors have a bachelor's degree or above, including doctors, The number of practitioners per 10,000 people in 16 categories of health occupations, including nurses and community health workers, in China is only one-third of that in the United States. The number of people diagnosed and treated in our country continues to rise, but medical resources and medical standards are still insufficient compared with developed countries. In addition, there is also the problem of uneven distribution of medical resources. The medical level of primary medical institutions is relatively limited, while the supply of top-level institutions exceeds demand. Therefore, how to make full use of the diagnosis and treatment experience of high-level medical institutions to help improve the medical level of primary medical institutions is an important issue that needs to be solved urgently.

Smart medical care, artificial intelligence technology brings the dawn

With the acceleration of the digitalization of medical institutions in recent years, a large number of medical institutions in my country, especially high-level medical institutions such as tertiary hospitals, have accumulated very rich electronic medical record data. If big data artificial intelligence technology can be used to fully mine this information and extract relevant knowledge, it may help us understand some of the diagnosis and treatment methods and ideas of medical experts in these high-level institutions, and then support smart follow-up visits, medical image analysis, chronic disease follow-up, etc. A series of downstream smart medical applications are of significant significance.

2. Research Challenges

Nowadays, more and more medical AI technologies are becoming more widely used. It also promotes the fairness and universalization of medical services. Some AI technologies, such as medical image analysis, have achieved some impressive results, but they are rarely used in drug recommendation systems. The reason is that drug recommendation systems are very different from traditional recommendation systems, and there are also technical problems with many difficulties.

Pack recommendation system

This figure illustrates the application of learning techniques in a drug recommendation system.

##The first challenge is that the application scenario of traditional recommendation systems based on collaborative filtering and other methods is mainly to recommend one item to one user at a time. Their input is the representation of a single item and a single user, and the output is one of the two. Score the degree of matching between them. However, in drug recommendation, doctors often need to prescribe a group of drugs to patients at one time. The drug recommendation system is actually a package recommendation system, called a package recommendation system, which recommends a set of drugs to a user at the same time. How to combine drug recommendation with the package recommendation system is the first big challenge we face.

Drug-drug interactions

This figure illustrates the application of learning techniques in a drug recommendation system.

The second challenge of the drug recommendation system is the diverse interactions between drugs. Some drugs have synergistic effects that promote each other's effects, while some drugs have antagonisms that offset each other's effects. Even the combined use of some drugs can lead to toxicity or other side effects. The patient in the picture is suffering from some kind of kidney disease. The left part shows the medicines prescribed by the doctor for the patient. Some of the medicines have synergistic effects and can promote the effectiveness of the medicine. The right part shows the symptomatic high-frequency drugs analyzed statistically. It can be seen that these drugs may not be selected due to some antagonistic effects. The following drugs may be toxic to some existing drugs, so they are not used by this patient.

#In addition, drug interaction effects are individualized. We found in the statistics that a large number of drugs with antagonistic or even toxic effects are used simultaneously. According to analysis, doctors actually prescribe drugs based on the patient's condition and considering the interaction effects. For example, some patients with healthy kidneys can often tolerate a certain amount of drug nephrotoxicity. Therefore, we need to conduct personalized modeling and analysis of the interactions between drugs.

3. Graph representation learning technology has become a new possibility

This figure illustrates the application of learning techniques in a drug recommendation system.

In summary, combined with the above challenges, graph representation learning technology is very suitable for solving problems existing in drug recommendation systems. With the rapid development of graph neural networks, people realize that graph neural network technology can very effectively model the combination effect between nodes and the relationship between nodes. This inspires us that graph representation learning technology may become a useful tool for building drug recommendation systems. A sharp tool.

#For example, in the figure, we can construct a drug package into a graph based on the interactions within it, and model it through the existing graph neural network. Based on the above ideas, we used graph deep learning technology to do two works on the drug recommendation system, which were published in WWW and TOIS journals respectively. The following is a detailed introduction.

2. Discriminative medicine package recommendation

First introduce the medicine package recommendation we published on WWW2021 paper. This article adopts the discriminative model definition method widely used in package recommendation systems for modeling, and also uses graph representation learning technology as the core technology part.

1. Data description

Electronic case data

This figure illustrates the application of learning techniques in a drug recommendation system.

#First introduce the data description used in the work.

The electronic medical records we used in our research work are from a real electronic medical record database of a large tertiary hospital. Each electronic medical record includes the following information: Type information: First, the patient's basic information, including the patient's age, gender, medical insurance, etc.; second, the patient's laboratory information, including abnormalities in laboratory results that the doctor is concerned about, and the type of abnormality: high, low, positive or not etc.; the third is the condition description written by the doctor for the patient: including information such as why the patient was admitted to the hospital, as well as preliminary physical examination; and finally, a set of medicines prescribed by the doctor for the patient.

#This electronic medical record data is heterogeneous data, including structured information such as age, gender, laboratory tests, and unstructured text information such as disease description.

Drug data

This figure illustrates the application of learning techniques in a drug recommendation system.

In order to study the interactions between drugs, we collected some drug attributes and interaction data from two large online open source drug knowledge bases, DrugBank and Pharmaceutical Network. Drug interactions are natural language descriptions based on some templates. As shown in the picture above, the description column talks about how a certain drug may increase metabolism or weaken metabolism, etc. The middle word is the template, and the front and back are the filled drug names. Therefore, as long as the model classification is clear, all drug interactions in the database can be marked.

Therefore, under the guidance of professional doctors, we consider drug interactions into no interaction, synergy and antagonism. In the third category, the templates were annotated and the classification of drug interactions was obtained.

2. Data preprocessing and problem definition

In terms of data preprocessing, for electronic medical record data, we divide it into two Part: The patient's basic information and laboratory information, we process it into a one-hot vector; for the disease description text part, we convert it into fixed-length text through some Padding and Cut off. For drug interaction data: we convert it into a drug interaction matrix.

At the same time, the problem is defined as follows: given a group of patient descriptions and corresponding Ground-truth drug packages, we will train a personalized scoring function, which A given patient and sample package can be input and a match score output. Clearly, this is how a discriminant model is defined.

3. Model Overview

This figure illustrates the application of learning techniques in a drug recommendation system. The thesis title proposed in this article is DPR: Drug Package Recommendation via Interaction -aware Graph Induction. The model consists of three parts:

Pre-training part, we obtain the initial representation of patients and drugs based on the NCF framework.

#Drug package construction part, we propose a method to construct a drug package into a drug graph based on the type of drug interaction relationship.

The last part is the graph-based drug package recommendation framework, in which two different variants are designed to understand how to build it from two different perspectives. interactions between model drugs.

Pre-training

This figure illustrates the application of learning techniques in a drug recommendation system.

##First of all, the pre-training part is conducted according to the traditional one-to-one recommendation method. Given a case, the drugs that the doctor used for the patient are positive cases, and the drugs that have not been used are negative cases. Pre-trained by BPRLoss, used drugs score higher than those that have not been used.

The pre-training part is mainly to capture basic drug effect information, providing a basis for capturing more complex interactions later. For the One-hot part, we use MLP to extract features; for the text part, we use LSTM to extract text features.

This figure illustrates the application of learning techniques in a drug recommendation system. ##Compared with traditional recommendation, the core issue of drug recommendation is how to consider the interaction between drugs and obtain the representation of the drug package. Based on this, this paper proposes a pharmaceutical package modeling method based on a graphical model.

First, the marked drug interaction relationships will be converted into a drug interaction matrix, in which different values represent different interaction types. Then based on this moment, any given drug package can be converted into a heterogeneous drug graph. The nodes in the graph correspond to the drugs in the drug package, and the node attributes are that the nodes correspond to the pre-trained Embedding in the previous step. At the same time, in order to avoid excessive calculations, we did not construct the drug graph into a complete graph, that is, we did not allow an edge between any two drugs, but selectively retained them. Specifically, only those labeled The edges of drug pairs that have been passed and the edges whose frequency exceeds a certain threshold.

Drug map construction

In order to carry out the drug map Effective Representation,We propose two ways to formalize edge attributes on,drug graphs.

This figure illustrates the application of learning techniques in a drug recommendation system.

The first form is DPR-WG, which uses a weighted graph to represent the drug graph. The first step is to initialize the full value of the edge based on the marked drug interactions, where -1 represents antagonism, 1 represents synergy, and 0 represents no interaction or unknown. The mask vector is then used to perform personalized updates to the edge weights in the drug graph.

This mask vector reflects the interaction of different drugs and the degree of personalized impact on individual patients. Its calculation method is to use a nonlinear layer plus a Sigmoid function such that The value of each dimension is from 0 to 1, thereby realizing the role of feature selection and personalized adjustment of drug interactions. The drug graph update process is to first calculate an update factor in DPR-WG, and then update the update factor by multiplying or adding the weight of the corresponding edge. In subsequent experiments, it was found that the update method had little impact on the results. In the process of drug graph representation, we designed a method to represent drugs based on weighted graphs.

In summary, we first designed an information update process for weighted graphs: aggregating neighbor information. During the aggregation process, based on the edge weight , personalize the degree of aggregation. We then used a Self Attention mechanism to calculate the weights between different nodes, and used an aggregation MLP to aggregate the graph to obtain the final representation of the entire drug graph. Subsequently, the patient representation and drug image representation are input into the scoring function, and the output can be obtained for recommendation.

In addition, this article uses BPRLoss to train the model and introduces a negative sampling method, corresponding to 1 positive sample and 10 negative samples.

This figure illustrates the application of learning techniques in a drug recommendation system. #The second variant is to use attribute graphs to represent drug graphs. The first step is to initialize the edge vector by fusing the node vectors at both ends of the edge through an MLP. Then the mask vector is also used to update the edge vector. At this time, the update method is no longer an update factor, but calculates an update vector. The update vector is multiplied element by element with the edge vector of the drug to obtain the updated edge attributes. vector. We specially designed a GNN for attribute graphs. The message passing process first calculates the message based on the edge vector and the node Embedding at both ends for propagation, and obtains the Graph Embedding through se

lf attention and aggregation methods.

We can also use BPRLoss for training. The difference is that we have introduced an additional cross-entropy loss function for edge classification. We hope that the edge vector can include the interaction between drugs. Function category information. Because the initialized sign in the previous variant naturally retains this information, but the graph of this variant does not, this information is supplemented by introducing a loss function.

From the experimental results, our two models exceed other discriminant models in different evaluation indicators. At the same time, we also conducted a case analysis: using the t-SNE method to project the previously mentioned mask vector onto a two-dimensional space. As shown in the figure, for example, the drugs used by pregnant women, infants, and liver patients have a very obvious tendency to cluster into clusters, which proves the effectiveness of our method.

3. Generative drug package recommendation

The above discriminant model can only be used in existing drug packages Selection, without the ability to generate new drug packages, will affect the recommendation effect. Next, we will introduce the extension work of the previous work published in the TOIS journal. The purpose is to hope that the model can generate new medicines tailored for new patients. Medicine package.

This work retains the core idea of graph representation learning in the previous paper, while completely changing the problem definition, defining the model as a generative model, and introducing sequences Generative and reinforcement learning technologies have greatly improved the recommendation effect.

1. Discriminant recommendation -> Generative recommendation

This figure illustrates the application of learning techniques in a drug recommendation system.

##Discriminant The core difference between the formula model and the generative model is that the discriminant model scores the matching degree between a given patient and a given drug package, while the generative model generates candidate drug packages for the patient and selects the best drug package.

2. Heuristic generation method

This figure illustrates the application of learning techniques in a drug recommendation system.

In view of the above mentioned To address the shortcomings of the discriminant model, we designed some heuristic generation methods: by adding and deleting some drugs in drug packages for similar patients, some drug packages that have never appeared in the historical records are formed for the model to select. Experimental results prove that this simple method is very effective and provides a basis for subsequent methods.

3. Model Overview

This figure illustrates the application of learning techniques in a drug recommendation system.

The following is published in TOIS Interaction-aware Drug Package Recommendation via Policy Gradient article. The model proposed in this article is called DPG, which is different from the DPR in the previous article. G here is Generation.

This model mainly contains three parts, namely information dissemination on the drug interaction diagram, patient characterization and drug package generation module, and the above The biggest difference is the medicine package generation module.

Drug Interaction Chart

This figure illustrates the application of learning techniques in a drug recommendation system.

First build the drug interaction graph part. This article retains the method of graph neural network to capture the interaction between drugs. The difference is that in the discriminant model, the drug package is given and can be easily converted into a drug graph. In the generative model, the drug graph is not fixed. Due to the amount of calculation, it is impossible to construct all drug packages into graphs.

This article includes all drugs in a drug interaction graph. It also uses Attributed graph to formalize the graph, while also retaining the edge classification loss. function, retaining the Embedding information of the edges, and finally a GNN based on this drug interaction graph was constructed.

After several rounds (usually 2) of message passing, we extract the node Embedding as the drug Embedding to be used.

Patient characterization

This figure illustrates the application of learning techniques in a drug recommendation system.

The patient characterization part is also used. MLP and LSTM are used to extract the patient's representation vector, and also calculate the mask vector, which is subsequently used to capture the patient's personalized representation vector.

Drug package generation based on sequence generation

This figure illustrates the application of learning techniques in a drug recommendation system.

The drug package generation task can be regarded as a sequence generation task, which is implemented using the recurrent neural network RNN. However, this method also brings two major challenges:

The first challenge is how to consider the generated drugs and existing drugs during the generation process. interactions between. To this end, we propose a method based on drug interaction vectors to explicitly model the interactions between drugs.

#The second challenge is that the sample package is a set, which is inherently unordered, but sequence generation tasks often target ordered sequence methods. To this end, we proposed a reinforcement learning method based on policy gradient, and added a method based on SCST to improve the effect and stability of this algorithm.

Drug package generation based on maximum likelihood

This figure illustrates the application of learning techniques in a drug recommendation system.

First introduce how to consider the interaction between drugs in the process of generating drug packages based on the great nature. This part is also the basis for the reinforcement learning part that will be used later. Sequence generation methods based on maximum likelihood have been widely used in the field of NLP. During the generation process, each generated drug depends on other previously generated drugs.

In order to take into account the interaction between drugs without bringing excessive computational burden to the model, we propose to explicitly Calculate the interaction vector between the newly generated drug and the previous drug. This vector calculation method comes from a layer in the previous graph neural network.

At the same time, we add the mask vector and the interaction vector to multiply the corresponding elements to introduce the patient's personalized information.

Finally, sum the interaction vectors of all drugs and use MLP to fuse them to obtain a comprehensive interaction vector.

The subsequent integration of this vector into the classic sequence model for generation solves the first challenge.

This figure illustrates the application of learning techniques in a drug recommendation system.

Different from the classic sequence generation, the medicine package is actually a collection, and there should be no duplicate medicines, so we Later, a restriction was added to prevent the model from generating the drug that has already been generated, ensuring that the generated result must be a set. Finally, we used the MLE loss function based on maximum likelihood to train the model.

Drug package generation based on reinforcement learning

This figure illustrates the application of learning techniques in a drug recommendation system.

The biggest disadvantage of the above method based on maximum likelihood is that the drug package has a strict order. Some methods of manually specifying the order for the drugs, such as sorting by frequency, sorting by first letter, etc., will destroy the drugs. The characteristics of the package collection will also lose part of the model's performance, so we proposed a drug package generation model based on reinforcement learning. The goal of the model in reinforcement learning is to maximize the artificially set reward function. After the model generates a complete drug package, giving a reward loss function that is independent of order can reduce the model's dependence on order.

This article uses F-value as reward, which is an order-independent function and is the evaluation index we are concerned about. This article uses F-value as the evaluation index, and adopts a training method based on policy gradient in the training method, and will not be deduced in detail here.

This figure illustrates the application of learning techniques in a drug recommendation system.

Among the training methods based on policy gradient, one of the most well-known methods is to use a baseline to reduce the variance of the gradient estimate. , thereby increasing the stability of training. Therefore, we used a training method based on SCST, namely the Self-critical sequence training method. The baseline also comes from the reward obtained by the drug package generated by the model itself. The way I generate it is the normal sequence generation method of Greedy search.

We hope that the reward of the drug package sampled by the model based on Policy gradient is higher than the drug package generated by the traditional Greedy search. Based on this, this article designs a loss function for reinforcement learning, as shown in the figure. The derivation process will not be introduced in detail here.

Maximum likelihood pre-training reinforcement learning

This figure illustrates the application of learning techniques in a drug recommendation system.

In addition, one of the characteristics of reinforcement learning is that training is difficult, so we combine the above two training methods. First, we use the extremely natural estimation method to pre-train the model, and then use the reinforcement learning method. method to fine-tune the model parameters.

Experimental results

The next step is the experiment of the model result.

This figure illustrates the application of learning techniques in a drug recommendation system.

In the above table, all medicine packages are generated using Greedy search. First of all, the performance of methods based on generative models is generally better than methods based on discriminative models. This experiment proves that generative models will be a better choice. This model surpasses all other baselines in F value. In addition, the performance of the model based on reinforcement learning greatly surpassed the model based on maximum likelihood, proving the effectiveness of the reinforcement learning method.

This figure illustrates the application of learning techniques in a drug recommendation system.

We then conducted a series of ablation experiments. We removed the interaction graph, including the interaction mask vector and the reinforcement learning module for ablation, and the results proved that each of our modules is effective. At the same time, it can be seen that if the SCST module is removed, the model effect drops significantly, which proves that reinforcement learning is indeed difficult to train. Without Baseline restrictions, the entire training process will be very jittery.

This figure illustrates the application of learning techniques in a drug recommendation system.

Finally, we have also done a lot of case analysis, and we can see that pregnant women and babies have obvious personalized preferences. At the same time, we added some additional common diseases such as stomach disease, heart disease, etc. The mask vectors of these diseases are very scattered and do not form clusters. The conditions of patients with common diseases are diverse, and there will be no particularly personalized situations. Unlike pregnant women and infants, there are very obvious screenings for drugs. For example, certain pediatric drugs need to be designated, and some drugs cannot be used by pregnant women.

At the same time, we projected the interaction vector of the drugs. We can see that the two drugs, synergy and antagonism, form two different oppositions when interacting with each other. situation, indicating that the model captures the different effects brought about by two different interactions.

4. Summary and Outlook

In summary, our research is mainly about the personalization of interaction perception Drug package recommendations include discriminative drug package recommendations and generative drug package recommendations.

The two have in common that they both use graph representation learning technology to model the interaction between drugs, and both use mask vectors to consider the impact of patient conditions on each other. Personalized perception of effects.

The biggest difference between the two works is the difference in problem definition. For the discriminant model, what we want is a scoring function, so for the generative model, what we want is a generating function. It has been proved through experiments that the generative model is actually a better definition of the problem.

The above is the detailed content of This figure illustrates the application of learning techniques in a drug recommendation system.. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete

You Must Build Workplace AI Behind A Veil Of IgnoranceApr 29, 2025 am 11:15 AM

In John Rawls' seminal 1971 book The Theory of Justice, he proposed a thought experiment that we should take as the core of today's AI design and use decision-making: the veil of ignorance. This philosophy provides a simple tool for understanding equity and also provides a blueprint for leaders to use this understanding to design and implement AI equitably. Imagine that you are making rules for a new society. But there is a premise: you don’t know in advance what role you will play in this society. You may end up being rich or poor, healthy or disabled, belonging to a majority or marginal minority. Operating under this "veil of ignorance" prevents rule makers from making decisions that benefit themselves. On the contrary, people will be more motivated to formulate public

Decisions, Decisions… Next Steps For Practical Applied AIApr 29, 2025 am 11:14 AM

Numerous companies specialize in robotic process automation (RPA), offering bots to automate repetitive tasks—UiPath, Automation Anywhere, Blue Prism, and others. Meanwhile, process mining, orchestration, and intelligent document processing speciali

The Agents Are Coming – More On What We Will Do Next To AI PartnersApr 29, 2025 am 11:13 AM

The future of AI is moving beyond simple word prediction and conversational simulation; AI agents are emerging, capable of independent action and task completion. This shift is already evident in tools like Anthropic's Claude. AI Agents: Research a

Why Empathy Is More Important Than Control For Leaders In An AI-Driven FutureApr 29, 2025 am 11:12 AM

Rapid technological advancements necessitate a forward-looking perspective on the future of work. What happens when AI transcends mere productivity enhancement and begins shaping our societal structures? Topher McDougal's upcoming book, Gaia Wakes:

AI For Product Classification: Can Machines Master Tax Law?Apr 29, 2025 am 11:11 AM

Product classification, often involving complex codes like "HS 8471.30" from systems such as the Harmonized System (HS), is crucial for international trade and domestic sales. These codes ensure correct tax application, impacting every inv

Could Data Center Demand Spark A Climate Tech Rebound?Apr 29, 2025 am 11:10 AM

The future of energy consumption in data centers and climate technology investment This article explores the surge in energy consumption in AI-driven data centers and its impact on climate change, and analyzes innovative solutions and policy recommendations to address this challenge. Challenges of energy demand: Large and ultra-large-scale data centers consume huge power, comparable to the sum of hundreds of thousands of ordinary North American families, and emerging AI ultra-large-scale centers consume dozens of times more power than this. In the first eight months of 2024, Microsoft, Meta, Google and Amazon have invested approximately US$125 billion in the construction and operation of AI data centers (JP Morgan, 2024) (Table 1). Growing energy demand is both a challenge and an opportunity. According to Canary Media, the looming electricity

AI And Hollywood's Next Golden AgeApr 29, 2025 am 11:09 AM

Generative AI is revolutionizing film and television production. Luma's Ray 2 model, as well as Runway's Gen-4, OpenAI's Sora, Google's Veo and other new models, are improving the quality of generated videos at an unprecedented speed. These models can easily create complex special effects and realistic scenes, even short video clips and camera-perceived motion effects have been achieved. While the manipulation and consistency of these tools still need to be improved, the speed of progress is amazing. Generative video is becoming an independent medium. Some models are good at animation production, while others are good at live-action images. It is worth noting that Adobe's Firefly and Moonvalley's Ma

Is ChatGPT Slowly Becoming AI's Biggest Yes-Man?Apr 29, 2025 am 11:08 AM

ChatGPT user experience declines: is it a model degradation or user expectations? Recently, a large number of ChatGPT paid users have complained about their performance degradation, which has attracted widespread attention. Users reported slower responses to models, shorter answers, lack of help, and even more hallucinations. Some users expressed dissatisfaction on social media, pointing out that ChatGPT has become “too flattering” and tends to verify user views rather than provide critical feedback. This not only affects the user experience, but also brings actual losses to corporate customers, such as reduced productivity and waste of computing resources. Evidence of performance degradation Many users have reported significant degradation in ChatGPT performance, especially in older models such as GPT-4 (which will soon be discontinued from service at the end of this month). this

See all articles