Home > Article > Technology peripherals > Application of multimodal DNN models in drug interaction prediction tasks
First of all, let me share with you the relevant background of drug discovery.
##Drugs The field of discovery has been very hot in recent years, especially the use of AI to assist pharmaceutical companies in their drug discovery work, including drug research and development. The drug research and development process is very long. Usually, the research and development process of the first drug for a certain type of clinical disease requires billions of funds and more than ten years. It is mainly divided into the following stages:
# (1) Research on disease targets and confirmation of disease core proteins.
# (2) Verify the effectiveness of the drug before clinical trials: including research on drug toxicity, effectiveness, taking methods, etc.
# (3) Clinical trials.
# (4) FDA approval and certification.
#So the traditional drug research and development process cycle is very long. In addition, from the initial recognition of more than 10,000 drugs, 5 drugs entered the clinical trial stage, and finally only 1 drug was approved for marketing. In this context, how to assist pharmaceutical companies to more quickly select effective drugs from candidate drugs, and how to quickly pre-investigate the effects, effects, and effectiveness of drugs in the clinical testing phase has become a hot research topic. field, in which AI, especially deep neural network technology, can greatly accelerate the drug development process.
The content shared today does not involve drug screening. It mainly focuses on the side effects and efficacy of candidate drugs. The main research purpose is Reduce drug toxicity and improve drug effectiveness. As shown in the figure above, DDI (Drug-Drug Interaction) refers to the interaction between drugs. Conduct cross-analysis between research drugs and existing drugs to discover the side effects of the research drugs, such as their effects on the body, etc., and discover and classify them in advance through experiments. For a simple example, "medicine is divided into three parts by poison". Where is the toxicity of the drug mainly reflected? In many cases, it occurs when a drug is combined with other drugs, a chemical interaction between two or more drugs. The picture in the lower right corner shows three drugs. Itraconazole is a tumor-related drug. If it is mixed with abemaciclib, it will cause serious side effects, such as liver failure, liver and kidney failure. Use of these two drugs in dysfunctional patients can have serious consequences. If abeciclib and dabrafenib are mixed, it will cause a decrease in serum concentration and cause other diseases. Therefore, a large number of tests are required in the process of developing new drugs, but it is impossible to use real people to test, and can only be tested with mice or other animals. The content shared today is to use multi-modal neural networks to predict drug DDI in advance based on existing (including under development and known) drug ingredients, allergies, etc.As shown in the figure above, drug interactions can be summarized as a DDI matrix, which describes the results of drug interactions, such as drugs d1 (abexiclib) and d2 (dabrafenib) Will cause y1 (decreased serum concentration). This study included 37,264 DDI data, involving 572 drugs (d) and 65 reaction results (y, such as decrease in serum concentration, etc.). And based on this data, a Drug Knowledge Graph (DKG) was constructed: the nodes are drugs and the edges are the relationships between drugs. The DKG triple is {D: drug, R: inter-drug relationship, T: tail entity}.
#In addition to the above data, the multimodal model also incorporates the heterogeneous features of the drug (HF, Heterogeneous Features): {Target: target, substructure: Composition/chemical structure, enzyme: enzyme}, the dimensions of each feature are different, for example, the target information is a protein. Finally, the DDI matrix, DKG and HF are fused on the same probability distribution for modeling.
Next, we will introduce the framework of the heterogeneous multi-modal MDNN model.
## This model is referred to as MDNN. The basic data is mainly divided into two parts: DDI matrix and heterogeneous data. The model framework mainly consists of the following three parts:
# #(1) Based on the DKG part: Mainly expresses information about the components of the drug itself (active ingredients, toxic ingredients), relationships between drugs, etc. by constructing a drug knowledge graph.
(2) Based on the HF part: Describe the drug itself by integrating heterogeneous characteristic data such as targets, enzymes, and molecular structures. Basic feature information.
(3) Multi-modal fusion neural network: Effectively fuse the two parts of feature data, DKG and HF, and perform Unified modeling.
2. DKG module constructionThe following will introduce the construction process based on DKG.
The above picture shows the main content of the drug knowledge graph. The left picture is the DDI matrix. The matrix contains nearly 600 drugs, and the composition and action information of these drugs are stored in the database (DrugBank, that is, "Drug Bank"). The picture on the right shows an example of drug information in the "Drug Bank", such as heterogeneous basic features such as enzymes, carriers, targets, etc. Among them, 4 features are relatively important. Taking the drug DB05812 as an example, in addition to target, In addition to enzymes and molecular structures, there are also carriers and transporters. However, these two types of data are relatively sparse and do not have as many dimensions as other features. The distinction in the current data set is not large. Therefore, these two data are not used for the time being. The main data used are targets, enzymes, and molecular structures.
As shown in the figure above, the knowledge graph is mainly composed of nodes and edges, where the nodes are drugs and ingredients, and the edges are relationships. The relationship displayed by the triplet in the example is the toxic component relationship, that is, there is a toxic component relationship between the node drug "DB05812" and the node component "P02768". Based on the drug ingredient relationship triples obtained from the "Drug Bank", the DKG knowledge graph is formed, which contains 572 kinds of drugs. The edges (relationships) of the triples are called semantic relations. There are 157 relationship types in total. Ingredients There are 1043 types of tail entity nodes. Each DKG can extract corresponding information from the "Drug Bank" according to task requirements and construct it, so the DKG is equivalent to a subgraph in the "Drug Bank" knowledge graph.
Based on DKG, two types of information are summarized. The above figure shows the semantic relationship information model of construction. Based on the toxic components, first calculate the inner product of the drug (d) and the relationship (r) in the previous layer, and sum it up through the current layer weight (W1) to get the π function, that is, the edge and node information of the drug are summed together through the π function , and then perform a weighted summation of the π function and the previous layer component (t) to obtain e, that is, the edge information is obtained.
Similarly, the above figure shows the construction of the topological structure information model of the graph. In addition to toxic components, drugs may also contain multiple other component relationships. Multiple relationships (edges, i.e. e) of the same drug and their corresponding weights W2 can be connected to finally obtain the E corresponding to each drug. Through the above method, DKG edges and topological structure information are effectively fused and represented.
As shown below, in addition to the side information and DDI information mentioned above, There is also very rich multi-modal information in drugs: the same drug can act on multiple targets; various drugs also have different molecular structures, representing their corresponding molecular properties; drugs will react under the action of different enzymes. combined with different targets. These three types of information are vectorized, and then the similarity between drugs is measured through simple Jaccard similarity to obtain the corresponding similarity matrix.
#Final fusion The three similarity matrices obtain the E' corresponding to each drug, that is, the information based on the isomeric characteristics of the drug is obtained. The dimension of this feature vector is also small, including information on 1162 targets, 583 structures and 202 enzymes.
As shown in the figure above, the DKG result E and HF result E' of each drug are finally spliced and fused through the fusion layer to obtain:
Then use the softmax function to get the output layer:
##
The entire model framework is as shown above. The framework structure is not complicated, but it combines drug information relatively effectively.
Next, I will share with you the model effect.
The above figure shows the comparison results with the more commonly used algorithm currently, the MDNN algorithm It has achieved State of the Art results in Acc, AUC, F1, AUPR, Precision, Recall and other evaluation indicators. (The above algorithm is not incorporated into the GNN algorithm.)
##The above figure shows the MDNN fusion of DKG, Differences between effects of HK and no fusion. It is easy to see that the result of the fusion is better than using either method alone.
Regarding multi-modal parameter adjustment, that is, the analysis of parameter sensitivity is as shown in the figure above, respectively. It shows the various evaluation indicators when the number of neural network layers l, the number of nodes Ns and other parameters change. corresponding fluctuations.
In addition, multi-task analysis was also conducted. The task A prediction model was constructed through the training set. To predict the DDI between drugs in the training set and the drugs in the test set; the prediction model of task B is also built through the training set, but predicts the DDI between drugs in the test set. If the training set and test set drugs are strictly separated, the model prediction effect will be significantly reduced.
#There are still many problems that need to be solved in the field of drug research and development: how to effectively discover/screen drugs instead of just studying DDI.
Finally, to summarize the content shared this time, the MDNN algorithm itself is not complicated. The important work is the use of multi-modal data and structural information:
(1) Constructing drugs based on drug effects The knowledge graph constructs multi-modal data features based on the characteristics of the drug itself (target, molecular structure, enzyme), and finally integrates these two features to build an MDNN model.(2) Improved the prediction problem of DDI;
(3) Comparison with existing methods ,MDNN works best on the data set.
#But in practical applications, the MDNN model still has many areas that need further optimization and improvement, such as better methodology or better data.
A1: The knowledge graph data set of this study is public, and the original data set "Drug Bank" is also a public data set. However, depending on the drugs in each research field, the knowledge graph data sets constructed will also be different, and there is no unified and universal knowledge graph.
A2: In addition to biopharmaceuticals, there are many applications in the e-commerce field. For example, in the User-Item class data set, User has a lot of multi-modal information, such as occupation, age, shopping tags, etc. Item products also have a lot of information, and there are many relationships between them, such as shopping, evaluation, favorites, clicks, etc. Behavior. If the domain data conforms to the heterogeneous form, you can try to use these methods for analysis. The difficulty lies in building a knowledge graph in a specific domain.
The above is the detailed content of Application of multimodal DNN models in drug interaction prediction tasks. For more information, please follow other related articles on the PHP Chinese website!