search
HomeCommon ProblemWhat are hive built-in functions?

What are hive built-in functions?

Feb 26, 2021 pm 12:02 PM
hivebuilt-in functions

hive built-in functions: 1. User-defined functions to process data; 2. Used to solve the need to input one line and output multiple lines [(On-to-many mapping)]; 3. User-defined aggregation Function, operates on multiple data rows and produces one data row.

What are hive built-in functions?

#The operating environment of this article: Windows 7 system, Dell G3 computer.

hive built-in function:

Definition:

UDF (User-Defined-Function), user-defined function pair The data is processed.

UDTF (User-Defined Table-Generating Functions) is used to solve the requirement of inputting one line and outputting multiple lines (On-to-many mapping).

UDAF (User Defined Aggregation Function) is a user-defined aggregation function that operates on multiple data rows and generates one data row.

Usage:

1. The UDF function can be directly applied to the select statement, format the query structure, and then output the content.

2. When writing UDF functions, you need to pay attention to the following points:

a) Custom UDF needs to inherit org.apache.hadoop.hive.ql.UDF.

b) Need to implement the evaluate function.

c) The evaluate function supports overloading.

hive’s local mode:

Most Hadoop jobs require the complete scalability provided by hadoop to process big data. However, sometimes the amount of input data to hive is very small. In this case, the time consumed to execute the task for the query may be much longer than the actual job execution time. For most of these situations, Hive can handle all tasks on a single machine through local mode. For small data sets, the execution time is significantly reduced.

In this way, operations with a relatively small amount of data can be executed locally, which is much faster than submitting tasks to the cluster for execution.

Configure the following parameters to enable Hive’s local mode:

hive> set hive.exec.mode.local.auto=true;(默认为false)

What are hive built-in functions?

Only when a job meets the following conditions can it truly use local mode:

1. The input data size of the job must be smaller than the parameter: hive.exec.mode.local.auto.inputbytes.max (default 128MB)

2. The number of maps of the job must be smaller than the parameter: hive.exec.mode .local.auto.tasks.max (default 4)

  3. The reduce number of job must be 0 or 1

Related free learning recommendations: php programming(Video)

The above is the detailed content of What are hive built-in functions?. For more information, please follow other related articles on the PHP Chinese website!

Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

MantisBT

MantisBT

Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

PhpStorm Mac version

PhpStorm Mac version

The latest (2018.2.1) professional PHP integrated development tool

MinGW - Minimalist GNU for Windows

MinGW - Minimalist GNU for Windows

This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

ZendStudio 13.5.1 Mac

ZendStudio 13.5.1 Mac

Powerful PHP integrated development environment