Home >Technology peripherals >AI >How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends

How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends

王林
王林forward
2023-04-08 17:31:111761browse

Preface

In the past few years, artificial intelligence technology headed by neural networks has deepened the understanding of different types of data. Excavation has profoundly changed human life and greatly promoted the process of social development [1]. As one of the most active research directions in the field of artificial intelligence, graph neural networks (GNNs) have been widely used in daily life such as personalized recommendations due to their excellent performance. They are also used in cutting-edge scientific fields such as new crown drug research and development. With the vigorous development of graph neural network technology, people have found that graph neural network systems with task performance as the single design goal still have problems such as vulnerability to malicious attacks. Therefore, people are increasingly eager to build reliable graph neural networks.

#In recent years, building a trustworthy artificial intelligence system has become a general consensus among countries around the world [2][3]. How to comprehensively establish a trustworthy graph neural network has become a major problem that needs to be solved urgently. This article is the latest review of trustworthy graph neural networks by the Monash team (Shirui Pan, Xingliang Yuan, Bang Wu, He Zhang) together with Hanghang Tong (UIUC) and Jian Pei (SFU, who will join Duke) (36 double-column pages, 299 documents).

This review starts from the research background and characteristics, proposes an open framework of trustworthy graph neural network, and focuses on the "trustworthy graph neural network" Believe in the six dimensions of GNN (robustness, explainability, privacy, fairness, accountability, environmental well-being) and its technical methods. At the same time, this review explores the interactive relationships between different trustworthiness dimensions, proposes future research directions for trustworthy graph neural networks, and draws a detailed and comprehensive technical roadmap for establishing trustworthy graph neural networks.

How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends

Review name: Trustworthy Graph Neural Networks: Aspects, Methods and Trends

Full text link: https://arxiv.org/pdf/2205.07424. pdf

Github: https://github.com/Radical3-HeZhang/Awesome-Trustworthy-GNNs

1 Introduction

Graphs are a kind of graphics with extremely strong representation ability. Data types, by describing the characteristics of entities and depicting the relationships between entities, have been widely used to describe data in many fields such as biology, chemistry, physics, linguistics, and social sciences. In recent years, the vigorous development of graph neural network technology has revolutionized the performance of various graph computing tasks and promoted its widespread application in real life.

In daily life, graph neural networks can provide users with personalized search and service in consumer applications such as information streaming media, online shopping, and social software by considering the interactive relationship between users and user/service content. Recommended services. In the frontiers of science, by using graph data to represent complex systems, researchers can use graph neural networks to discover the hidden patterns behind the motion of celestial bodies. By applying it to fake news detection and COVID-19 drug development, graph neural networks have greatly improved the well-being of our society.

Although researchers have designed methods to further improve the performance of graph neural networks from many perspectives (such as self-supervised learning, improving model depth, etc.), in some key areas, task performance is not the design of graph neural networks. the only goal. For example, anomaly detection systems based on graph neural networks need to be robust to malicious attacks, credit scoring systems based on graph neural networks should not reject loan applications due to factors such as age and gender of users, and drug discovery applications based on graph neural networks Researchers should be provided with a full explanation of their results.

Based on the above needs, people are increasingly eager for graph neural network-based systems to be credible. Against this background, this review aims to summarize the latest progress of "Trustworthy GNNs", provide a technical roadmap for relevant researchers and practitioners, and provide a basis for future research and development of trustworthy GNNs. Provide direction for industrial development.

The main contributions of this review are: 1) It describes the trustworthy graph neural network with an open framework that contains many trustworthy dimensions, and proposes the application of graph neural network and other common artificial intelligence technologies (such as CNN) in Typical differences in trustworthiness research; 2) A comprehensive and comprehensive summary of existing methods for different trustworthiness dimensions of graph neural networks; 3) Proposed that the relationship between different trustworthiness dimensions is important for building trustworthy graph neural networks The network system is crucial, and the existing research work is summarized from both the method and effectiveness levels; 4) By taking the concept of trustworthy graph neural network as a whole, potential future research directions are proposed.

2 Graph neural network and trustworthiness

In order to facilitate readers’ understanding, this article first introduces the following core concepts.

Graph data: Graph How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends is generally composed of node set How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends and edge set How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends. The number of nodes in this graph is How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends, and the number of edges between nodes is How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends. Given graph How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends, the corresponding topology can be represented by the adjacency matrix How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends, where How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends represents the connection relationship between node How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends and node How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends. That is, if nodes How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends and How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends are connected to each other, then How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends, otherwise How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends. If the nodes in the graph How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends have attributes, a feature matrix How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends can be used to describe these attribute information. Therefore, the graph How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends can also be represented as How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends.

Graph neural networks (GNNs): Graph neural networks are a general term for a series of neural networks that can be used for computing tasks on graph data (such as node classification, link prediction, graph classification). A typical operation step in graph neural networks is message passing. During the message propagation process, the graph neural network updates the representation of the current node by aggregating the information of all neighbor nodes. On the basis of combining with other operations (such as nonlinear excitation), the graph neural network can calculate the corresponding data representation after multiple representation update iterations.

Trustworthiness: Trustworthy is used to describe a system that is worthy of being trusted. It describes the trust relationship between the trust initiator (the trustor) and the trust receiver (the trustee). . In the context of trusted graph neural networks, the trust receiver (the trustee) is the graph neural network system, and the trust initiator (the trustor) can be users, developers, regulatory authorities or even the entire society.

Trustworthy GNNs are defined as graph neural networks that take into account both trustworthiness and excellent performance. These trustworthy dimensions include but are not limited to the robustness, interpretability, privacy, and Fairness, responsibility and well-being. The original definition is as follows: “In this survey, we define trustworthy GNNs as competent GNNs that incorporate core aspects of trustworthiness, including robustness, explainability, privacy, fairness, accountability, well-being, and other trust-oriented characteristics in the context of GNNs. ”

How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends

3 Review Framework

This review mainly introduces the research background, the definition of trustworthy graph neural network, the definitions of different trustworthiness dimensions, measurement and research differences, and the relationship with existing reviews in the first chapter. and its main contributions. In Chapter 2, it introduces the basic concepts and computational tasks of graph neural networks. In Chapters 3 to 8, it starts from robustness, explainability, privacy, fairness, accountability, and environmental well-being respectively. Six aspects introduce and summarize typical technical methods and discuss future research directions. In Chapter 9, this review summarizes the complex relationship between the above six credibility dimensions from both the method and effectiveness levels. Finally, Chapter 10 of the review considers the trustworthy graph neural network as a whole, and proposes five directions for future research and industrialization in order to comprehensively construct a trustworthy graph neural network system.

4 Robustness

Robustness refers to the ability of a graph neural network to maintain stable prediction results when faced with interference. The prediction results of graph neural networks can be affected by a variety of perturbations (especially various attacks on graph neural networks). This brings severe challenges to the application of graph neural networks in scenarios involving personal and property safety, such as fraud detection in banking financial systems and traffic prediction and planning for autonomous driving systems. Therefore, research on robustness is an indispensable key link in trustworthy graph neural networks.

This review summarizes the current related work on the robustness of graph neural networks, and introduces the classification of adversarial attacks and defenses and some typical methods. Among them, the classification of adversarial attacks is derived from the analysis of the attack threat model (threat model), as shown in the figure below; the classification of defenses is more based on the specific execution phase of the technology (target phase).

In introducing the different categories of methods, the authors discuss the differences between the characteristics of these methods and those used in other common artificial techniques (such as CNN). For example, different classifications and name variations for specific types of perturbation operations and attack goals.

How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends

In addition, the author also discusses and summarizes the applicability of these attack and defense methods. For example, the author compared and analyzed different types of defense methods from three perspectives: stage of application, modularity, and deployment compatibility.

Finally, this review proposes two future research directions based on the current status quo, namely developing standardized robustness evaluations and improving the scalability of existing defense methods (defence scalability). .

5 Explainability (Explainability)

Explainability refers to the ability to make the prediction results of the graph neural network understandable by humans. If its predictions cannot be understood by people, then people will not trust graph neural networks. The lack of trust will further limit its application in scenarios involving fairness (such as credit risk prediction), information security (such as chip design), and life safety (such as autonomous driving). Therefore, a trustworthy graph neural network system needs to provide explanations for its predictions.

After introducing basic concepts such as explanation forms and categories of explanation methods, this review divides the work involving the interpretability of graph neural networks into self-explanation (interpretable GNNs) and post-hoc explainers. Two categories. Self-explanation (Interpretable GNNs) mainly includes contribution estimation, introduction of interpretable module, embedding prototype learning and rationale generation methods. Post-hoc explainers mainly include gradient/feature-based methods, perturbation-based methods, surrogate methods, and decomposition methods. ), generation methods, and other methods.

How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends

After introducing the principles of typical methods, this review makes a comprehensive comparison of these methods, as shown in the following table:

How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends

In addition, this review covers self-explanation and post-explanation (interpretability and explainability), background knowledge required to provide explanations (white/grey/black-box knowledge), reasoning principles for obtaining explanations (reasoning rationale), and other limitations. The current work is summarized in four aspects: other limitations. Finally, this review proposes that establishing strictly model-agnostic methods and evaluation benchmarks for real applications are two directions for future research on the interpretability of graph neural networks. .

6 Privacy

Privacy is also a trustworthy dimension that cannot be ignored when building a trustworthy graph neural network. In the process of building and maintaining a trusted graph neural network system, sensitive and private information such as the model itself or graph data is at risk of being leaked. Therefore, this review first summarizes the current research related to privacy data leakage, and then introduces various privacy protection methods.

In response to the issue of privacy leakage, this review starts with the current threat model of privacy attacks. It first introduces the goals and capabilities of potential threats, and then introduces model extraction attack and membership inference attack. , model inversion attack (model inversion attack), these three common privacy attacks and potential privacy leakage risks in other scenarios.

Then, this review introduces the four types of federated learning, differential privacy, insusceptible training, and security computation in the graph neural network scenario. Privacy protection technology. In addition, after classifying and introducing the above technologies, the author also discussed their specific application scenarios and the trade-offs between privacy protection, model accuracy, and implementation efficiency brought about by the end of privacy protection.

How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends

At the end of the chapter, in view of the current status of current research, the author believes that the current defense against privacy risks (leakage from gradient) and privacy attacks is The research on attacks) is still short-term and needs further attention and exploration in the future.

7 Fairness

By protecting the key interests of vulnerable groups or individuals, a fair system can win people’s trust. A fair graph neural network system means that its prediction results exclude bias against certain groups or individuals. Currently, graph neural networks mainly complete graph computing tasks in a data-driven manner. However, the message propagation mechanism in graph neural networks may further amplify biases already present in the data. Since then, due to factors such as personal preferences or behavioral biases, people will further deepen the bias in graph data during their interaction with graph neural network services.

How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends

Introduced basic concepts such as fairness definition (group fairness, individual fairness, counterfactual fairness), usage stages (pre-processing, in-processing, post-processing) Finally, this review divides the current methods for fairness research in graph neural networks into fair representation learning methods and fair prediction enhancement methods. After introducing the basic principles of these methods, a comprehensive comparison of these methods is performed, as shown in the table below.

How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends

Finally, this review proposes to explore the definition and evaluation of fairness, its influence on task performance, and the explanation of unfairness. Source (revealing unfairness) is a research direction that needs to be focused on in future fairness research on graph neural networks.

8 Accountability

With the emergence of more and more extensive application scenarios and complex system structures, individuals, enterprises and governments Institutions have put forward higher requirements for effective accountability in trustworthy graph neural networks. In recent years, enterprises and government agencies in China, the United States, and Europe have put forward their own plans and guidance on how to build an accountability framework for artificial intelligence. Based on the above content, this review summarizes three requirements for the graph neural network accountability framework:

(1) Reasonable assessment and certification processes should be designed and accompanied by the entire graph Development and operation cycle of neural network system;

(2) The assurance of auditability of the development and operation process should be ensured;

(3) Sufficient coordination and feedback mechanisms should be established to facilitate human intervention in the system (adjusting) , remediating and other measures, and can punish inappropriate behavior.

Then, this review introduces two major categories of work currently available for building accountability frameworks in trustworthy graph neural network systems: benchmarking and security evaluation.

This review introduces the research on benchmarking analysis according to the different development stages of graph neural networks—model design (architecture design), model training (model training), and model validation (model validation). For security assessment, the author mainly introduces related research on verifying system integrity (integrity verification), and introduces the verification of data integrity (data integrity) and program integrity (procedure integrity) according to different verification objects.

Finally, this review proposes three research directions for the accountability of trustworthy graph neural networks. First, provide more violation detection for violations of different natures; second, cover the entire process of program integrity testing and data integrity testing of all components; third, continue to improve the auditability and auditability of the system. Establish more coordination and feedback mechanisms.

9 Environmental well-being

Trusted graph neural networks should conform to the social values ​​of the environment in which they are deployed. Currently, global warming is a major environmental problem that human society urgently needs to solve. Achieving the ambitious goal of carbon neutrality requires the joint efforts of all walks of life. In order to reduce the environmental impact of graph neural network systems, after introducing related metrics such as the number of nodes per joule, this review summarizes various methods to improve the efficiency of graph neural networks.

(1) Scalable graph neural network and efficient data communication: With the explosive growth of graph data, large-scale data sets pose challenges to the efficient operation of graph neural networks. In order to meet this challenge, current technologies mainly include sampling methods, scalable architectures, industrial applications, efficient data communication, etc.

(2) Model compression technology: With the development of technology, researchers have proposed deeper and more complex graph neural network models to improve their performance. However, the scale of these models limits their deployment on edge computing devices with limited computing resources. Therefore, model compression technology is an effective way to solve this challenge. Related technologies include knowledge distillation, model pruning, reducing parameter size, model quantisation, etc.

(3) Development framework and accelerator: The irregularity of graph data, the alternation of sparse and dense calculations in the model, and the diversity of models and applications make the graph neural network system need to use a specially designed framework. and accelerator to improve its efficiency. In order to solve this problem, current methods mainly include software frameworks (SW frameworks), hardware accelerators (HW accelerators), efficiency bottleneck analysis (analysing the efficiency bottleneck), software and hardware co-design (SW-HW co-design), etc.

Finally, this review proposes that the exploration of efficient GNNs and the study of accelerators for GNNs are two future research directions that will promote the well-being of the graph neural network environment. .

10 The relationship between different credibility aspects

The current research on promoting the credibility of graph neural networks mainly focuses on the above six dimensions One of the reviews, this review proposes that the relationship between the above six credibility dimensions cannot be ignored when constructing a trusted graph neural network, and summarizes this from the following two perspectives:

1) From the trusted graph How the methods from one aspect of trustworthy GNNs are adapted to address objectives in other aspects.

2) Why advancing one aspect of trustworthy GNNs can promote or inhibit other aspects (why advancing one aspect of trustworthy GNNs can promote or inhibit other aspects).

How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends

11 Future Research Directions

Aiming at potential research hotspots, this review treats the trustworthy graph neural network as a whole and analyzes the limitations of current methods. In order to fill the current research gaps and promote the industrialization of trustworthy graph neural networks, this review proposes the following five research directions:

A. Embracing trustworthy design concepts (shift to trustworthy GNNs)

Building a trustworthy graph neural network requires researchers and graph neural network practitioners to fully embrace the concept of trustworthiness. When designing a graph neural network, not only must its task performance be considered, but also the concept of trustworthiness must be introduced into the graph neural network. in design philosophy. Some existing work has taken into account both interpretability and fairness in design, which has greatly improved the credibility of graph neural networks. In addition, addressing a series of open issues faced in the move to trustworthy graph neural networks, such as balancing and trade-offs between different trustworthiness dimensions (such as robustness in autonomous driving and environmental well-being) in specific applications, is also A challenging research direction.

B. Exploring other aspects of trustworthy GNNs

Trustworthy graph neural networks actually contain more content than the six dimensions introduced in this review. For example, generalization is also considered an important dimension of trustworthy systems. Some current research explores the relationship between extrapolation of graph neural networks and the activation functions they use. These works enrich the connotation of trustworthiness and promote the construction of trustworthy graph neural networks. In addition, the review proposed that properly handling the design principles related to trusted systems (such as the "New Generation Artificial Intelligence Governance Principles - Developing Responsible Artificial Intelligence" issued by the National New Generation Artificial Intelligence Governance Professional Committee) are also important for trustworthy graph neural networks. important research content in future development.

C. Studying diversified relations

This review only touches on part of the complex relationships between different dimensions of trustworthy graph neural networks. Exploring other interrelationships, such as explainability and fairness, is critical to fully understanding and building trustworthy graph neural network systems. Furthermore, these relationships are not only complex but exist on multiple levels. For example, counterfactual fairness and robustness are conceptually similar. Therefore, exploring the interrelationships between different dimensions of trustworthy graph neural networks from different levels such as concepts, methods, and effectiveness is also a promising research direction.

D. Design model-agnostic methods

Currently, many methods to improve the credibility of graph neural networks require the use of specially designed graph neural network architectures. These methods will not work if the target network infrastructure cannot be accessed or modified (such as using a cloud service). This greatly reduces the usefulness of these credibility enhancement methods in real-world scenarios. In contrast, model-agnostic methods can be flexibly applied to graph neural network systems in a plug-and-play manner. In addition, such methods can also be combined in the form of functional modules. Therefore, designing a model-agnostic approach will greatly improve its practicality and facilitate the construction of trustworthy graph neural networks.

E. Establishing a technology ecosystem for trustworthy GNNs

As a booming field, the development of trustworthy graph neural networks cannot be separated from the support of the technology ecosystem. The technology ecosystem here includes but is not limited to toolkits, datasets, metrics, and pipelines. Due to the inherent characteristics of graph data, some current toolkits (tools) such as IBM's AI360 may not be directly used to evaluate graph neural networks. For example, the existence of edges between nodes breaks the independent and identically distributed (IID) assumption on the nodes, which leads to the need to consider the interdependence between nodes when studying the fairness of graph neural networks. In addition, due to the diversity of application scenarios, building a trusted graph neural network also requires supporting technical facilities such as data sets, metrics, evaluation standards, and software platforms suitable for different tasks and scenarios. Therefore, establishing the corresponding technology ecosystem is a key step in the research and industrialization of trustworthy graph neural networks. ​

The above is the detailed content of How to build a trustworthy GNN? The latest review is here! Trustworthy Graph Neural Networks: Dimensions, Methods, Trends. For more information, please follow other related articles on the PHP Chinese website!

Statement:
This article is reproduced at:51cto.com. If there is any infringement, please contact admin@php.cn delete