Home > Article > Technology peripherals > Financial event analysis and application based on pre-trained models
Financial event analysis The main tasks can be divided into three parts:
① The first part is the intelligent analysis of unstructured data. Information in the financial field has its own unique features compared with Internet information. Information in the financial field often exists in unstructured form, and there are some special file formats, such as PDF, which poses a greater challenge to extracting clean and accurate data from files or data. The PDF format is a typesetting and printing format and does not have as clear paragraphs as other file types. PDF is more for typesetting, so there is only some location information in the file. It is more challenging to parse accurately formatted and semantically clear text from unstructured data. Moreover, when the format semantics in the document are unclear, event analysis will bring noise, and these dirty data will cause a lot of interference in model training and inference. Therefore, in order to improve the accuracy of the model, the unstructured data should be parsed first.
② The second part is the understanding of event semantics, which is a technically important part. This mainly involves event detection, event element extraction and event relationship extraction.
③ Based on the understanding of events, the third module that is more important in the task is introduced, event graph analysis, which includes event chain analysis and event prediction.
In order to complete the above tasks, two more important systems are introduced. The first is the financial event system. A financial event system includes relevant entities in the financial field, and these entities also have different application scenarios. In order to better support these subjects and scenarios, a corresponding event system needs to be established, which involves a lot of domain knowledge and requires domain experts to provide corresponding knowledge as input. This will help us build a more complete and scientific system that can cover corresponding scenarios. Of course, in addition to expert knowledge, technology will also be needed for inductive learning, which can also provide a complete scenario-based and scalable event system.
Because the introduction of expert knowledge is mainly for the more important events in the scene. For some medium and long-tail events, it is mainly solved by some technologies based on learning. With the financial event graph, we combine event extraction technology. After event extraction, event relationship classification, and learning event representation, we can construct a graph for analysis and prediction.
With clear tasks and technical support, we can process news or documents and summarize and answer many questions. For example, what type of event occurred in which company, and what elements were involved in the event, such as common elements time, location, people, or elements related to the event type. For example, the event "company stock issuance" includes the elements issuance price, Circulation etc. In addition, we can also pay attention to this information, such as people's evaluation of this event (sentiment analysis). and predictions of what will happen to the company in the future after certain types of events occur. If you can answer the above questions, it will be valuable in many scenarios.
Let’s look at a specific example.
The above figure divides the financial event system into two levels. The first level has five categories, mainly based on specific objects. Distinguish, such as corporate events, stock events, industry and macro events, debt events and fund events. Different event types are defined for different event objects. These are the most common objects in finance. In the second level, we further subdivide each type of object. For example, common types of events in a company, such as shareholder reduction, etc. (refer to the picture above). Take events such as shareholders reducing their holdings as an example. The event elements it contains include the time of the reduction, the shareholders who reduced their holdings, the transaction price at the time of the reduction, and other information. Defining a complete event system that is targeted to the scenario is an important prerequisite for event analysis to achieve its goals. The degree of detail in the definition of the event system will determine the degree of fine-grained event information that the final event analysis can achieve.
#The event graph is introduced below.
Event graphs belong to graphs and contain nodes and edges. In the event graph, event nodes can be events or entities in the events, such as companies. An edge is the relationship between events, or between events and entities, or between entities.
Let’s look at an example. The picture above is a news article introducing Amazon’s acquisition of iRobot. A total of four events were described in this news report, two of which were acquisitions, occurring at different times. The other two are the events of founding a company and the events of cooperation. These events are connected by the chronological sequence of their occurrence. In addition to events, this news also contains other entities and times, and these entities and times are also connected by corresponding relationships.
#In this way, it looks like an unstructured web page information. By parsing out the text and paragraphs in the document, performing semantic analysis on the paragraphs, and extracting event entities and relationships. , we can construct an event graph. That is to convert unstructured data into structured information. With structured information, it is easier to understand and process the information. Such information can be used in information acquisition scenarios such as search and question and answer, or in business scenarios such as risk monitoring and quantitative investment in the financial field.
Event chain is a special event graph and a simplified mode of the event graph. It mainly focuses on a series of events that occur to a participant and the relationship between events. As can be seen from the above figure, the event graph on the left can be simplified into the two event chains on the right. The event chain only contains simplified nodes, and other elements can be regarded as attribute information of the event chain. This simplification is helpful for the practical application of event graphs. After the relationships and nodes of the event graph are simplified, the model's learning and processing of the graph only focuses on one kind of node and one kind of relationship between events, which reduces the complexity of this problem. Although simplified, the main information in the event will still be retained, such as event type, event subject and other information.
Let’s look at how an event chain is predicted.
#The above picture is a prediction model of an event chain. It mainly consists of three parts. The first part is event representation, where there are current events, historical events and ultimately the events in the text that we want to predict. Historical events and events to be predicted, the trigger words and event elements they represent can be spliced together to better capture historical information. Combining the context representation and historical representation of events, we enter the second part, the sequence representation of events. Here, an LSTM network structure is used, which can capture the temporal relationship between events and integrate event sequence information into Event representation comes in. Finally, event representations containing chronological relationships are fed into a dynamic network, which can be used to predict candidate events and events in the given news. For example, if there are dozens of candidate event types, then we can predict which event may be a larger event type that will happen to this subject in the future based on the event types we have seen in the current news. The result is a distribution of event types.
What needs to be emphasized here is that the prediction results of this model are not to predict events that will definitely occur in the company in the future, but to provide such auxiliary information for analysis, prediction, Provide basis for research and judgment.
#The event chain and event prediction mentioned above are predictions and judgments about situations that have not yet occurred. Event prediction can actually provide some useful help for events that have already occurred.
Let’s look at another example. This example is based on events that have already occurred to provide some useful help.
This kind of event question and answer based on the event library can support natural language search for events. We first extract from the corpus and build a structured event library. Each type of event will create a table. Each field in the table represents different event elements. Combined with NL2SQL technology, the query statement can be converted into a database query statement. In this way we can find the exact event type in the table we just created. This is different from the experience of searching for events with the general search engines currently on the market. The results returned by the search engine are many kinds of news about this event type. Even if relevant news is found, the result you see is a complete report rather than specific information about a specific event. Through event Q&A based on the event database, we can conduct Q&A searches for events more accurately.
Through the example above, you can see how event analysis can be performed in specific scenarios. The stock price fluctuations of companies are the focus of the market. Event-based stock price fluctuations are also a scenario where NLP and the financial field are combined in event analysis, which can realize the transformation of market information and stock price fluctuations. The figure above shows two features in the model. One type represents a specific enterprise, and the other type represents the relationship between enterprises. We introduce event analysis to extract event types and different companies involved in the event. The extracted event types and two types of information that the company can provide, one is the representation of the event, which can become a feature and be integrated with the company's features. The relationship between enterprises and events can construct the association relationship module between enterprises. Then add existing features, such as enterprise-specific technical indicators or enterprise relationship maps. Combining the two we can construct a graph with enterprise information and relationship information. Based on this graph, we use Graph Attention Network to capture the impact of the relationship between companies on the stock price fluctuation performance of companies. Using stock price fluctuations as the target of model learning to train the model, the model network structure we obtained can achieve a judgment on the future characteristics of the company from features such as the news industry map and corporate relationship map. At the same time, through the analysis of the Attention model, we can also You can know which modules the fluctuations mainly come from, whether it is an event type module or an event relationship module. And the modeling of relationships between enterprises can also analyze how this impact is spread among related enterprises. This can also be called the utility of momentum spillover in the financial field.
#Comprehensive of the various event analysis technologies and scenarios we introduced earlier. We summarized a complete event analysis framework. (As shown below)
It provides several major capabilities:
① First, it can search for a certain event or various events of a certain subject.
② Secondly, it can provide search for various related events of a company in the same industry through the industrial chain, which can be an industrial chain or a company chain.
③ This framework can also predict possible future events based on a company’s search results.
The advantage of this framework is that it is based on a pre-trained model, has the ability to transfer learning, can support zero-shot/few-shot training, and has a fast cold start . Moreover, this systematic method has relatively comprehensive capabilities and can support a variety of scenarios.
#Not all scenarios require all modules in the framework, and the modules can be split out and used individually according to needs.
The two more important contents in financial event analysis technology are events. Detection and event extraction.
The goal of the event detection task is to extract events from text and classify them. Usually, trigger words are extracted for event detection, but there are also some methods for event detection without trigger word extraction. The challenge for this type of problem is the labeling of trigger words, or when problems such as semantic migration, event splitting, and event definition changes occur, we need to re-label the data. This type of problem often occurs in actual scenarios. Although we have our own set of event detection solutions, when applied to different fields or faced with different customer needs, their event systems are very different. How to combine existing models and data to better Supporting changes in the event system is the biggest challenge we currently face. The current approach is event detection based on cue words and pre-trained models. According to the prompt of the event type, we can better perform data migration or model training in few-shot scenarios.
The above figure shows the model and method based on trigger words. According to the prompt of the event type, the input of the model can be annotated at the token level, so that the trigger words of the corresponding events can be extracted. If the trigger word is transformed based on the current Prompt, the corresponding event type and trigger word can be changed accordingly. Through data comparison on public data sets, it can be seen that in the full amount of data, the performance of our model has been greatly improved compared to previous methods. Even in the case of few-shot, the performance of our model is better than other models. .
#The event extraction task is an extension of the event detection task function. It not only identifies the event type and trigger words, but also extracts the corresponding elements in the event. This has a greater effect on scenarios where event analysis requires relatively complete information. The example in the above figure can extract more fine-grained information. The challenges of event extraction include that in a complete event system, there will be many event elements, data annotation needs domain experts to complete, the cost is relatively high, and it is difficult to expand to many event types. Our current solution is to use pre-trained models and prompts. The generative method based on
can achieve good model performance in scenarios with less annotated data, and has higher data utilization and flexibility. , making it easier to extend to new event types.
#The event extraction task is an extension of the event detection task function. It not only identifies the event type and trigger words, but also extracts the corresponding elements in the event. This has a greater effect on scenarios where event analysis requires relatively complete information. The example in the above figure can extract more fine-grained information. The challenges of event extraction include that in a complete event system, there will be many event elements, data annotation needs domain experts to complete, the cost is relatively high, and it is difficult to expand to many event types. Our current solution is to use pre-trained models and prompts. The generative method can also achieve good model performance in scenarios with less annotated data, has higher data utilization and flexibility, and is easier to expand to new event type.
The above figure is a template-based generative event extraction method. This extraction method defines several important concepts. First of all, we need to pre-define the template before extraction. With the template, it can be provided to the model during training. When defining a new event, some elements are related to previous events. This information can also be provided to the model in advance. In this way, the model can learn new event types with a small amount of data.
Although the above method can solve the event extraction problem to a certain extent, there is still a lot of room for improvement. For example, the connection between trigger words and event elements of the current method is weak; all event types need to be polled during inference, which requires a large amount of calculation and long inference time; the event detection and event element extraction modules are organized by Pipeline, which leads to error propagation.
#We improved the model based on the questions raised above, Another generative event extraction method TDE-GTEE based on enhanced trigger word detection is proposed.
(Ge Shi, Yunyue Su, Yongliang Ma and Ming Zhou (2023). A Hybrid Detection and Generation Framework with Separate Encoders for Event Extraction. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. Association for Computational Linguistics.)
In this method, the event detection module provides information about event trigger words. We will mark the trigger words in the input and select the corresponding template based on the event type information. This input and event template are later combined to form the representation of the event detection module. This representation is fed into the topic model and finally the filled template is obtained. The difference between this model and the previous model is that this model has an additional trigger word representation and also combines contextual information. This solves the first problem just mentioned, the problem that there are not many connections between trigger words and trigger word extraction elements. This model can construct the display relationship between the two. At the same time, through trigger word input, we can know the event type related to the trigger word. This way we can filter out irrelevant event types. The event detection module will only extract events for relevant event types. Finally, the event detection module and event extraction module can form an end-to-end joint training. This model can solve the other two problems mentioned above.
The TDE-GTEE model has reached SOTA level on both public datasets ACE and ERE. In the experiments we used the full amount of data. In order to verify the performance of this model with a small number of samples, we also made some adjustments to the event detection module, such as replacing the original multi-classification method with a Prompt-based method. Models can also achieve good results on zero-shot/few-shot. We believe that this model can be widely used in practical scenarios.
##3. Summary and OutlookEvent analysis technology is widely used in the financial field and has many scenarios. A comprehensive event analysis system framework will provide strong technical support for information understanding in the financial field.
Methods such as TDE-GTEE proposed by Lanzhou Technology in the field of financial event analysis can reach the SOTA level and are also very good in few-sample and zero-sample scenarios. Performance. We will continue to promote technological progress and productization in the field of financial event analysis.
4. Question and Answer Session#A1: There is a database specifically for event graphs. Some data providers provide such databases. For example, companies such as Lanzhou Technology and Ant Group will also build their own databases within their companies. Neo4j can be used to store and manage event graphs. Depending on the usage scenario, Neo4j is more suitable for storage and management in complex scenarios. If simplified into an event chain, the representation of events can be more flexible. What we do is to extract the data from the text to generate an event chain as training data for the model. There is no graph operation involved in training. A2: There are two main ways. In the process of constructing and evaluating specific models, we rely more on information from the stock market. It can be understood as backtesting in quantitative trading. We use historical data for training and evaluation. The other is that in real scenarios, we will use manual judgment, such as predicting stock price fluctuations in the next month based on corporate news and technical indicators of stock volume and price trading. We will look specifically at whether stock price fluctuations will actually occur this month. If fluctuations occur, the corresponding event type, corresponding stock price volume and price information and other elements will be analyzed to determine whether there is a strong causal relationship. Q2: How to evaluate the effect of stock fluctuation analysis?
The above is the detailed content of Financial event analysis and application based on pre-trained models. For more information, please follow other related articles on the PHP Chinese website!