search
HomeCommon ProblemWhat are the outstanding features of a data warehouse compared to an operational database?

The outstanding features are “massive data support” and “fast retrieval technology”. Data warehouse is a structured data environment for decision support systems and online analysis application data sources, and the database is the core of the entire data warehouse environment, where data is stored and provides support for data retrieval; compared with manipulative databases, it is outstanding It is characterized by support for massive data and fast retrieval technology.

What are the outstanding features of a data warehouse compared to an operational database?

The operating environment of this tutorial: Windows 7 system, Dell G3 computer.

Compared with operational databases, the outstanding features of data warehouses are “massive data support” and “fast retrieval technology”.

Data warehouse, the English name is Data Warehouse, which can be abbreviated as DW or DWH. A data warehouse is a strategic collection that provides support for all types of data for decision-making processes at all levels of an enterprise. It is a single data store created for analytical reporting and decision support purposes. Provides guidance on business process improvement, monitoring time, cost, quality and control for enterprises in need of business intelligence.

The data warehouse is a structured data environment for decision support systems (dss) and online analysis application data sources. Data warehousing studies and solves the problems of obtaining information from databases. Data warehouses are characterized by subject orientation, integration, stability and time variability.

Characteristics of data warehouse

Data warehouse is used for further mining of data resources and decision-making needs when a large number of databases already exist. And generated, it is not the so-called "large database". The purpose of building a data warehouse solution is to serve as the basis for front-end query and analysis. Due to the large redundancy, the required storage is also large. In order to better serve front-end applications, data warehouses often have the following characteristics:

1. The efficiency is high enough.

The analysis data of the data warehouse is generally divided into days, weeks, months, quarters, years, etc. It can be seen that the daily cycle data requires the highest efficiency, requiring 24 hours or even 12 hours. Customers can see yesterday's data analysis. Because some companies have a large amount of data every day, problems often occur with poorly designed data warehouses, and data can only be provided after a delay of 1-3 days, which is obviously not possible.

2. Data quality.

The various information provided by the data warehouse must be accurate. However, since the data warehouse process is usually divided into multiple steps, including data cleaning, loading, query, display, etc., it is complicated. The architecture will have more layers, so dirty data in the data source or imprecise code can lead to data distortion. When customers see wrong information, they may make wrong decisions through analysis, causing losses rather than benefits.

3. Extensibility.

The reason why some large-scale data warehouse system architectures are complex is because they take into account the scalability in the next 3-5 years. In this case, there is no need to spend money to rebuild the data warehouse system too quickly in the future. Can run very stably. Mainly reflected in the rationality of data modeling, there are some more middle layers in the data warehouse solution, so that the massive data flow has enough buffer, so that the amount of data will not be much larger and it will not be able to run.

As can be seen from the above introduction, data warehouse technology can awaken the data accumulated by enterprises for many years. It not only manages these massive data for enterprises, but also taps the potential value of data, thus becoming an operation and maintenance system for communication enterprises. One of the highlights.

Broadly speaking, a decision support system based on data warehouse consists of three components: data warehouse technology, online analytical processing technology and data mining technology. Data warehouse technology is the core of the system. What follows in this series In this article, we will focus on data warehouse technology, introduce the main technologies of modern data warehouses and the main steps of data processing, and discuss how to use these technologies to help operation and maintenance in communication operation and maintenance systems.

4. Topic-oriented

The data organization of the operational database is oriented towards transaction processing tasks. Each business system is separated from each other, and the data in the data warehouse is organized according to certain organized by subject area. The theme corresponds to the application-oriented nature of traditional databases. It is an abstract concept that synthesizes, classifies, analyzes and utilizes data in enterprise information systems at a higher level. Each topic corresponds to a macro analysis area. The data warehouse eliminates data that is not useful for decision-making and provides a concise view of a specific subject.

Composition of data warehouse

Data extraction tool

Extract data from various Take it out of the storage method, perform necessary transformation and organization, and then store it in the data warehouse. The ability to access various data storage methods is the key to data extraction tools. It should be able to generate COBOL programs, MVS job control language (JCL), UNIX scripts, and SQL statements to access different data. Data transformation includes deleting data segments that are not meaningful for decision-making applications; converting to unified data names and definitions; calculating statistics and derived data; assigning default values ​​to missing data; and unifying different data definition methods.

Database

is the core of the entire data warehouse environment, where data is stored and provides support for data retrieval. Compared with manipulative databases, its outstanding features are support for massive data and fast retrieval technology.

Metadata

Metadata is the data that describes the structure and creation method of the data in the data warehouse. It can be divided into two categories according to different uses, technical metadata and commercial metadata.

Technical metadata is the data used by data warehouse designers and managers to develop and daily manage the use of data warehouses. Including: data source information; description of data transformation; definition of objects and data structures in the data warehouse; rules for data cleaning and data updating; mapping of source data to destination data; user access rights, data backup history, and data import Historical records, information release history, etc.

Business metadata describes the data in the data warehouse from the perspective of commercial business. Including: description of business topics, included data, queries, and reports;

Metadata provides an information directory (information directory) for accessing the data warehouse. This directory comprehensively describes what data is in the data warehouse. How the data is obtained and how to access the data. It is the center of data warehouse operation and maintenance. The data warehouse server uses it to store and update data, and users use it to understand and access data.

Data Mart

A part of the data that is independent from the data warehouse for a specific application purpose or application scope, which can also be called department data or subject data. (subject area). In the implementation process of data warehouse, you can often start with the data mart of one department, and then use several data marts to form a complete data warehouse. What needs to be noted is that when implementing different data marts, field definitions with the same meaning must be compatible, so that it will not cause big trouble when implementing a data warehouse in the future.

In the well-known foreign Garnter report on data mart products, the agile business intelligence products in the first quadrant include QlikView, Tableau and SpotView, all of which are full-memory computing data mart products. They are very important in terms of big data. Traditional business intelligence product giants pose a challenge. Domestic BI products started late. Well-known agile business intelligence products include PowerBI, Yonghong Technology's Z-Suite, SmartBI, FineBI business intelligence software, etc. Among them, Yonghong Technology's Z-Data Mart is a hot memory computing data Market products. Domestic Deon Information is also a system integrator of data mart products.

Data warehouse management

Security and privilege management; track data updates; data quality checks; manage and update metadata; audit and report data warehouse usage and status ;Delete data;Copy, split and distribute data;Backup and recovery;Storage management.

Information publishing system

Sends data in the data warehouse or other related data to different locations or users. Web-based information publishing system is the most effective way to deal with multi-user access.

Access tool

Provides means for users to access the data warehouse. There are data query and reporting tools; application development tools; management information system (EIS) tools; online analysis (OLAP) tools; and data mining tools.

For more related knowledge, please visit the FAQ column!

The above is the detailed content of What are the outstanding features of a data warehouse compared to an operational database?. For more information, please follow other related articles on the PHP Chinese website!

Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
深入理解MySQL索引优化器工作原理深入理解MySQL索引优化器工作原理Nov 09, 2022 pm 02:05 PM

本篇文章给大家带来了关于mysql的相关知识,其中主要介绍了关于索引优化器工作原理的相关内容,其中包括了MySQL Server的组成,MySQL优化器选择索引额原理以及SQL成本分析,最后通过 select 查询总结整个查询过程,下面一起来看一下,希望对大家有帮助。

sybase是什么数据库sybase是什么数据库Sep 22, 2021 am 11:39 AM

sybase是基于客户/服务器体系结构的数据库,是一个开放的、高性能的、可编程的数据库,可使用事件驱动的触发器、多线索化等来提高性能。

visual foxpro数据库文件是什么visual foxpro数据库文件是什么Jul 23, 2021 pm 04:53 PM

visual foxpro数据库文件是管理数据库对象的系统文件。在VFP中,用户数据是存放在“.DBF”表文件中;VFP的数据库文件(“.DBC”)中不存放用户数据,它只起将属于某一数据库的 数据库表与视图、连接、存储过程等关联起来的作用。

数据库系统的构成包括哪些数据库系统的构成包括哪些Jul 15, 2022 am 11:58 AM

数据库系统由4个部分构成:1、数据库,是指长期存储在计算机内的,有组织,可共享的数据的集合;2、硬件,是指构成计算机系统的各种物理设备,包括存储所需的外部设备;3、软件,包括操作系统、数据库管理系统及应用程序;4、人员,包括系统分析员和数据库设计人员、应用程序员(负责编写使用数据库的应用程序)、最终用户(利用接口或查询语言访问数据库)、数据库管理员(负责数据库的总体信息控制)。

microsoft sql server是什么软件microsoft sql server是什么软件Feb 28, 2023 pm 03:00 PM

microsoft sql server是Microsoft公司推出的关系型数据库管理系统,是一个全面的数据库平台,使用集成的商业智能(BI)工具提供了企业级的数据管理,具有使用方便可伸缩性好与相关软件集成程度高等优点。SQL Server数据库引擎为关系型数据和结构化数据提供了更安全可靠的存储功能,使用户可以构建和管理用于业务的高可用和高性能的数据应用程序。

数据库的什么是指数据的正确性和相容性数据库的什么是指数据的正确性和相容性Jul 04, 2022 pm 04:59 PM

数据库的“完整性”是指数据的正确性和相容性。完整性是指数据库中数据在逻辑上的一致性、正确性、有效性和相容性。完整性对于数据库系统的重要性:1、数据库完整性约束能够防止合法用户使用数据库时向数据库中添加不合语义的数据;2、合理的数据库完整性设计,能够同时兼顾数据库的完整性和系统的效能;3、完善的数据库完整性有助于尽早发现应用软件的错误。

go语言可以写数据库么go语言可以写数据库么Jan 06, 2023 am 10:35 AM

go语言可以写数据库。Go语言和其他语言不同的地方是,Go官方没有提供数据库驱动,而是编写了开发数据库驱动的标准接口,开发者可以根据定义的接口来开发相应的数据库驱动;这样做的好处在于,只要是按照标准接口开发的代码,以后迁移数据库时,不需要做任何修改,极大方便了后期的架构调整。

mysql查询慢的因素除了索引,还有什么?mysql查询慢的因素除了索引,还有什么?Jul 19, 2022 pm 08:22 PM

mysql查询为什么会慢,关于这个问题,在实际开发经常会遇到,而面试中,也是个高频题。遇到这种问题,我们一般也会想到是因为索引。那除开索引之外,还有哪些因素会导致数据库查询变慢呢?

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
Repo: How To Revive Teammates
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

PhpStorm Mac version

PhpStorm Mac version

The latest (2018.2.1) professional PHP integrated development tool

Safe Exam Browser

Safe Exam Browser

Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

SublimeText3 English version

SublimeText3 English version

Recommended: Win version, supports code prompts!

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)