Exploration of Large Model Applications—Enterprise Knowledge Steward-AI-php.cn

Home

Technology peripherals

Exploration of Large Model Applications—Enterprise Knowledge Steward

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Jan 08, 2024 am 08:49 AM

databaselarge modelEnterprise knowledge manager

Exploration of Large Model Applications—Enterprise Knowledge Steward

1. Background and challenges of traditional knowledge management

1. The necessity of enterprise knowledge management

In modern enterprises, knowledge Management is a crucial link. It can help enterprises effectively organize and utilize internal and external knowledge resources, thereby improving the efficiency and competitiveness of enterprises. In order to better manage knowledge, many companies have introduced the concept of knowledge stewards. Knowledge steward is a role or system specifically responsible for managing and disseminating enterprise knowledge. Through knowledge stewards, enterprises can better collect and organize

Exploration of Large Model Applications—Enterprise Knowledge Steward

##With the rapid development and Knowledge is growing explosively, and companies are faced with the challenge of sharing knowledge. How to effectively transfer and share knowledge within an enterprise has become an important issue. Through knowledge sharing, companies can not only improve work efficiency, but also avoid duplication of work.

Another way is to adopt a knowledge sharing model to establish a mechanism that can empower enterprises, thereby better optimizing processes and results, and improving enterprise operating efficiency. This model allows employees within the enterprise to share their knowledge and experience so that everyone on the team can benefit. By sharing knowledge, companies can avoid duplication of effort, reduce errors and mistakes, and be better able to respond to challenges and changes. This

In addition, as a knowledge steward, it can also provide key information and data to decision-makers to help them make more informed decisions. Knowledge Butler has powerful information retrieval and analysis capabilities, and can extract useful information from massive data, integrate and analyze it. This information and data can include market trends, competitor analysis, consumer insights, technology development, etc.

In addition, a very key factor is to reduce the workload of corporate employees and prevent information loss, and improve employee work efficiency and customer service levels, thereby achieving the goals of reducing costs and improving efficiency.

2. Enterprise knowledge management challenges

Before there was a large model, the logic of building a knowledge steward was quite complicated. Usually, we use the concept of knowledge base to build a knowledge base with the help of enterprise knowledge graph or internal data of the enterprise. However, there are many challenges faced during this construction process. First, the construction of a knowledge base requires a lot of manpower and time investment. Collecting, organizing and summarizing knowledge and information within an enterprise is a tedious and time-consuming task. A professional team is needed to process and manage this data and ensure its

Exploration of Large Model Applications—Enterprise Knowledge Steward

Knowledge fragmentation

Knowledge fragmentation is mainly reflected in two aspects. One aspect is that the enterprise's data is very scattered. For example, the data of the OA system has different departments and different teams. On the other hand, these data are basically provided in unstructured forms, such as Word, PDF, pictures, videos, etc. In the process of building knowledge stewards, how to quickly centralize the fragmented information is the first challenge.

Information overload

##In the rapid development of enterprise business, they are faced with a large amount of information and data How to establish a screening mechanism in massive amounts of data to ensure the accuracy and timeliness of information is also a major challenge under the ever-increasing situation.

Data security risks

Enterprises generally do not share their private data with Other institutions or organizations generally pay more attention to the data security of corporate private domain data, so they also need to deal with data security risks.

Difficulty in knowledge sharing and communication

Different companies have different organizational structures, some Some are more technical, some are more business-oriented, and some are a mixture of technology and business. In the process of communication between business and technology, poor communication is a problem that every enterprise will face in knowledge sharing.

2. Knowledge steward solution

1. What is enterprise knowledge steward

Enterprise knowledge steward is similar to a person’s brain to assist in the storage and understanding of the entire knowledge and create knowledge.

Exploration of Large Model Applications—Enterprise Knowledge Steward

Enterprise knowledge stewards are generally divided into three levels: the first level is the functional and technical needs, mainly responsible for the management of enterprise knowledge, including enterprise Data import, automatic classification and archiving of documents, and other basic functional requirements; the middle layer is the requirement of the application side, including providing some intelligent question and answer, intelligent search, summary generation, auxiliary writing and other functions; the upper layer is the requirement of the business side , including contract review, insurance customer service, and industry report generation.

There are generally three modes of interfaces presented by Knowledge Butler: the first interface is similar to a text box, providing knowledge exploration and analysis; the other is to use API tokens to Intelligent Agents involved in different application scenarios are published as API Tokens to integrate with the enterprise's business system; the third method is intelligent Agent, which explores and analyzes knowledge through conversation mode.

2. Enterprise knowledge steward solution

Enterprise knowledge steward is mainly responsible for enterprise-specific knowledge management and creation, including the following business scenarios:

Exploration of Large Model Applications—Enterprise Knowledge Steward

Intelligent Q&A

Combined with the company’s own private domain data, through After vectorization, it is stored in a vector database, and uses the question and answer mode to create intelligent question and answer scenarios. Through these scenarios, many more specific business needs can be derived.

Self-service document analysis

## Do some exploration and analysis through documents, such as To explore the paper, you can ask questions about the content of the paper, and you can also conduct independent analysis of the document, providing segmented preview, contextual retrieval, summary summary and other capabilities of the entire document.

Customized role scenario

Combined with the private domain data of different roles within the enterprise, Coupled with the prompt word mode, it provides the design of some customized scenarios, such as assisted writing of documents, intelligent meeting minutes, etc.

Contract review

adopts the human-computer dialogue mode to conduct various audits of the enterprise Review the contract information on some key terms to see if the corresponding information is accurate.

The main functions of the Enterprise Knowledge Butler product include:

Exploration of Large Model Applications—Enterprise Knowledge Steward

Intelligent Q&A : Combining specific questions and obtaining a source-based answer by retrieving the context.
Multi-role creative Q&A: Build intelligent application scenarios through prompt words and corporate private domain data.
#Document analysis: Import the entire document for summary or exploratory analysis.
Knowledge management: Enterprise data is fully automatically managed through the knowledge manager, and the entire process adopts a very simple model.
#Agent Build: Development platform, i.e. large model IDE functionality.

Functional architecture of Knowledge Butler:

Exploration of Large Model Applications—Enterprise Knowledge Steward

The bottom is the GPU calculation Power includes two categories, one is reasoning computing power, and the other is fine-tuning computing power. The middle layer is a secure and trustworthy enterprise private domain data memory - DingoDB multi-mode vector database.

The next layer is the functional points of the entire technical layer, including model fine-tuning management, knowledge document management, and intelligent application management.

The top one is for business scenario needs. In intelligent Q&A, you can customize some dialogues of roles, standard QA Q&A, and agents for intelligent applications, document-based auxiliary reading, contract review, and insurance. personal assistant.

##3. Exploration of core technology of knowledge steward

1. Knowledge steward construction process

Next, we will introduce the entire knowledge steward construction process through the intelligent question and answer scenario.

Exploration of Large Model Applications—Enterprise Knowledge Steward

First of all, there needs to be a data source. There may be structured and unstructured data. Generally speaking, the construction of knowledge base is based on unstructured data. Mainly, such as Word, PDF, Excel, as well as enterprise systems, Jira, knowledge management platforms, etc.

These data go through the knowledge processing link and are converted into vectors and stored in the database. You need to load the document first, then give the layout information or structure information of the document, do document vector analysis to generate file blocks, and then call the corresponding Embedding model based on the file blocks to convert them into vectors and store the vectors.

The process of intelligent question and answer interaction: after the user raises a question, first use the intelligent assistant to vectorize the question, and then go to the database to perform semantic retrieval to obtain the context of the article with similar semantics. By combining the context with the prompt words and reasoning through the large model, the answer is finally returned.

The overall process is a process of continuous iteration and feedback optimization. Only in this way can we obtain the exclusive intelligent expert role based on the enterprise's private domain data.

Exploration of Large Model Applications—Enterprise Knowledge Steward

#2. Knowledge steward construction core technology exploration

Unstructured data processing

Exploration of Large Model Applications—Enterprise Knowledge Steward

Unstructured data ETL processing requires the help of some tools. Knowledge Manager provides some special operators from the technical model. These operators can clean the entire Map, Filter, and Window-based changes, and convert data through the entire ETL Pipeline.

By parsing various files (such as PDF parsers), and then passing through the Hub Operators of different application scenarios corresponding to the middle layer, the Pipeline Hub can be quickly constructed, and then After the data is cleaned and converted, it is Embedding and finally stored in the vector database.

Accuracy and integrity data guarantee-lossless data parsing

To get a good To improve the model debugging effect, it is necessary to ensure accurate and complete data and have good data processing quality.

Exploration of Large Model Applications—Enterprise Knowledge Steward

Constructing a traditional data retrieval is very simple, but the actual knowledge is more complicated. In addition to the information in the text itself, there are also pictures and table data , paragraph information, etc. In this regard, Jiuzhang Yunji DataCanvas provides Layout parsing mode, which can realize the full storage of multi-modal data such as Layout information, tables, and pictures, and comprehensively improves the quality of the data parsing process.

Strong correlation retrieval-Reranking secondary filtering

After the document is vectorized , after saving to the DingoDB multi-modal vector database, retrieval is performed through Query. The retrieval results will include the results of the retrieval content itself, as well as the correlation results. At this time, it is necessary to perform secondary screening of Reranking on the Chunks recalled by the retrieval.

Exploration of Large Model Applications—Enterprise Knowledge Steward

#During Reranking secondary screening, the Retrieval Chunk and the corresponding Query must be related to each other. The analysis includes finding the closest semantic match, and then re-pushing the retrieval Chunk after secondary screening to the large language model.

Secure and trusted answer generation-multi-instruction fine-tuning

Exploration of Large Model Applications—Enterprise Knowledge Steward

In order to ensure the security and credibility of the answer generation process, Jiuzhang Yunji DataCanvas is based on the general large speech model, limits the prompt words for the recalled data, and combines the enterprise's private domain data with the large model Fine-tuning vertical knowledge and adding a wind direction control mechanism ensure high accuracy in answer generation.

Storage and retrieval capabilities-DingoDB multi-mode vector database

DingoDB can provide a variety of The standardized API supports data query through SQL and Python toolkits, and also provides an integrated way to implement structured and unstructured joint queries. For real-time scenarios, DingoDB provides the ability to query in real-time by writing in real-time, and can perform real-time retrieval while importing data.

Exploration of Large Model Applications—Enterprise Knowledge Steward

##DingoDB also provides calculation acceleration capabilities and supports pre- and post-filtering of Meta. , and range search based on similarity. DingoDB also provides multi-copy tools that can perform partial migration and data migration. It also provides diversified operation and maintenance and monitoring tools to reduce operation and maintenance costs. DingoDB can also provide automatic elastic sharding capabilities, which can dynamically balance data to different machines to achieve load balancing on each node.

Secure and trustworthy exclusive LLM-fine-tuned Pipeline

In enterprise private domain data For general scenarios, fine-tuning is needed to build a large language model exclusive to the enterprise in a certain scenario. The knowledge manager summarizes the pain points in the entire fine-tuning process and provides a tool-based approach in the product. Data on all problems can be obtained by uploading documents. After having the data, fine-tuning can be performed directly on the interface by configuring parameters. At the same time, the product also provides some fine-tuning data indicators to evaluate the results of fine-tuning.

Exploration of Large Model Applications—Enterprise Knowledge Steward

Quickly build large model applications-Large Model IDE

Traditional large model applications are often complex to build. Knowledge Butler built its own large model IDE based on Jiuzhang Yunji DataCanvas's own FS capabilities, which can provide a wealth of components and tools, and use a concise application construction method to build The template is published as an agent for intelligent applications.

Exploration of Large Model Applications—Enterprise Knowledge Steward

##4. Summary and Outlook

1. Knowledge Summary of the Butler Solution

The technical highlights of Knowledge Butler mainly include the following six aspects: high-precision retrieval, convenient ETL Pipeline, high availability and scalability, security compliance, intelligent data fusion, and rich scenarios .

Exploration of Large Model Applications—Enterprise Knowledge Steward

The core values of Knowledge Butler include: providing the basic capabilities of knowledge management and intelligent inspiration, and providing a safe and trustworthy application private Deployment mode includes all data of the enterprise, enabling knowledge integration and intelligent interaction. As an intelligent base, it provides flexible expansion capabilities and can develop new Agents based on large models on Knowledge Manager.

Exploration of Large Model Applications—Enterprise Knowledge Steward

2. Future Outlook

Knowledge Manager is AIFS based on Jiuzhang Yunji DataCanvas, providing a complete set of GPU computing power and model scheduling from bare metal to above, and realizing model fine-tuning. Pipeline mode. It uses the general language model and the company's private domain data to perform combination and fine-tuning to form the company's own large language model. Based on the scalability of the large language model, combined with the DingoDB multi-modal vector database, it can realize search Q&A, summary generation and other applications in the enterprise, and carry out enterprise knowledge management.

Exploration of Large Model Applications—Enterprise Knowledge Steward

The above is the detailed content of Exploration of Large Model Applications—Enterprise Knowledge Steward. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete

Tesla's Robovan Was The Hidden Gem In 2024's Robotaxi TeaserApr 22, 2025 am 11:48 AM

Since 2008, I've championed the shared-ride van—initially dubbed the "robotjitney," later the "vansit"—as the future of urban transportation. I foresee these vehicles as the 21st century's next-generation transit solution, surpas

Sam's Club Bets On AI To Eliminate Receipt Checks And Enhance RetailApr 22, 2025 am 11:29 AM

Revolutionizing the Checkout Experience Sam's Club's innovative "Just Go" system builds on its existing AI-powered "Scan & Go" technology, allowing members to scan purchases via the Sam's Club app during their shopping trip.

Nvidia's AI Omniverse Expands At GTC 2025Apr 22, 2025 am 11:28 AM

Nvidia's Enhanced Predictability and New Product Lineup at GTC 2025 Nvidia, a key player in AI infrastructure, is focusing on increased predictability for its clients. This involves consistent product delivery, meeting performance expectations, and

Exploring the Capabilities of Google's Gemma 2 ModelsApr 22, 2025 am 11:26 AM

Google's Gemma 2: A Powerful, Efficient Language Model Google's Gemma family of language models, celebrated for efficiency and performance, has expanded with the arrival of Gemma 2. This latest release comprises two models: a 27-billion parameter ver

The Next Wave of GenAI: Perspectives with Dr. Kirk Borne - Analytics VidhyaApr 22, 2025 am 11:21 AM

This Leading with Data episode features Dr. Kirk Borne, a leading data scientist, astrophysicist, and TEDx speaker. A renowned expert in big data, AI, and machine learning, Dr. Borne offers invaluable insights into the current state and future traje

AI For Runners And Athletes: We're Making Excellent ProgressApr 22, 2025 am 11:12 AM

There were some very insightful perspectives in this speech—background information about engineering that showed us why artificial intelligence is so good at supporting people’s physical exercise. I will outline a core idea from each contributor’s perspective to demonstrate three design aspects that are an important part of our exploration of the application of artificial intelligence in sports. Edge devices and raw personal data This idea about artificial intelligence actually contains two components—one related to where we place large language models and the other is related to the differences between our human language and the language that our vital signs “express” when measured in real time. Alexander Amini knows a lot about running and tennis, but he still

Jamie Engstrom On Technology, Talent And Transformation At CaterpillarApr 22, 2025 am 11:10 AM

Caterpillar's Chief Information Officer and Senior Vice President of IT, Jamie Engstrom, leads a global team of over 2,200 IT professionals across 28 countries. With 26 years at Caterpillar, including four and a half years in her current role, Engst

New Google Photos Update Makes Any Photo Pop With Ultra HDR QualityApr 22, 2025 am 11:09 AM

Google Photos' New Ultra HDR Tool: A Quick Guide Enhance your photos with Google Photos' new Ultra HDR tool, transforming standard images into vibrant, high-dynamic-range masterpieces. Ideal for social media, this tool boosts the impact of any photo,

See all articles

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Assassin's Creed Shadows: Seashell Riddle Solution

3 weeks agoByDDD

What's New in Windows 11 KB5054979 & How to Fix Update Issues

2 weeks agoByDDD

Where to find the Crane Control Keycard in Atomfall

3 weeks agoByDDD

Roblox: Dead Rails - How To Complete Every Challenge

4 weeks agoByDDD

Atomfall guide: item locations, quest guides, and tips

4 weeks agoByDDD

Hot Tools

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Safe Exam Browser

Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

Atom editor mac version download

The most popular open source editor

EditPlus Chinese cracked version

Small size, syntax highlighting, does not support code prompt function

SecLists

SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.

Hot Topics

Where is the login entrance for gmail email?

7651

CakePHP Tutorial

1392

What is the format of the account name of steam

win11 activation key permanent

nyt mini crossword answers

110