What is the number of audio samples collected per second-Common Problem-php.cn

青灯夜游

Sep 01, 2022 pm 03:39 PM

The number of audio samples collected per second is called the "sampling rate" (or "sampling frequency") and is measured in samples per second or hertz (Hz). A lower sampling rate means fewer samples per second and therefore less audio data, because fewer sample points represent the same duration of audio; a higher sampling rate requires more storage space and processing power.

The operating environment of this tutorial: Windows 7 system, Dell G3 computer.

When it comes to audio processing, there are many terms that most people have heard but don't really understand. I was one of those people until I had to work in audio processing. For that reason, I want to go through some of these terms, describe what they are, and show what they mean for the quality of your audio recording or stream. For the remainder of this article, we will assume that we are dealing with only one channel of uncompressed audio.

1. Sampling rate/sampling frequency

The first term we often hear is sampling rate or sampling frequency, both of which refer to the same thing. Some values you may have come across are 8kHz, 44.1kHz and 48kHz. What exactly is the sample rate of an audio file?

The sampling rate is the number of audio samples recorded per second. It is measured in samples per second or hertz (abbreviated Hz or kHz; 1 kHz is 1000 Hz). An audio sample is simply a number that represents the measured value of the sound wave at a specific point in time. It is very important that these samples are equally spaced in time. For example, if the sampling rate is 8000 Hz, it is not enough to have 8000 samples somewhere within one second; consecutive samples must be exactly 1/8000 of a second apart. In this case, 1/8000 is called the sampling interval (in seconds), and the sampling rate is simply the reciprocal of that interval.
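The relationship between sampling rate and sampling interval can be sketched in a few lines of Python. The function names below are illustrative and not part of any audio library; exact fractions are used only to avoid rounding noise.

```python
from fractions import Fraction

def sampling_interval(sample_rate_hz):
    """Return the time between consecutive samples, in seconds."""
    return Fraction(1, sample_rate_hz)

def sample_times(sample_rate_hz, count):
    """Return the timestamps (in seconds) of the first `count` samples."""
    interval = sampling_interval(sample_rate_hz)
    return [n * interval for n in range(count)]

interval = sampling_interval(8000)
print(float(interval))                              # 0.000125
print([float(t) for t in sample_times(8000, 4)])    # [0.0, 0.000125, 0.00025, 0.000375]
print(1 / interval)                                 # 8000, the rate is the reciprocal
```

At 8000 Hz, the 8000 timestamps of one second run from 0 up to, but not including, 1 second, each 0.000125 s after the previous one.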

Sampling rate is similar to a video's frame rate or FPS (frames per second) measurement. A video is simply a series of pictures, often called "frames" here, displayed back to back very quickly, giving the illusion of continuous uninterrupted motion or movement (at least to us humans).

While audio sample rates and video frame rates are similar, the usual minimum values that guarantee usability are very different. For video, at least 24 frames per second are generally needed to portray motion accurately; below that, motion may look choppy and the illusion of continuous, uninterrupted motion cannot be maintained. The more motion occurs between frames, the more noticeable this becomes. In addition, at 1 or 2 frames per second, brief "momentary" events that happen entirely between two frames are missed altogether.

For audio, the minimum sampling rate needed to represent English speech unambiguously is 8000 Hz. A lower sampling rate makes speech unintelligible for a variety of reasons, one of which is that similar utterances become indistinguishable from each other. A sampled signal can only represent frequencies below half the sampling rate, so lower sampling rates confuse phonemes (the sounds of a language) that have significant high-frequency energy; for example, at 5000 Hz it is difficult to distinguish /s/ from /sh/ or /f/.
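One way to see why a low sampling rate makes different sounds indistinguishable is aliasing: a frequency above half the sampling rate produces the same samples as a lower one. The sketch below is an illustration using pure cosine tones, not speech, and the tone frequencies are chosen only for the demonstration.

```python
import math

def sample_tone(freq_hz, sample_rate_hz, count):
    """Sample a cosine tone of `freq_hz` at `sample_rate_hz`, returning `count` samples."""
    return [math.cos(2 * math.pi * freq_hz * n / sample_rate_hz) for n in range(count)]

def same_samples(a, b):
    """True if two sample lists are equal up to floating-point rounding."""
    return all(math.isclose(x, y, abs_tol=1e-9) for x, y in zip(a, b))

# At 5000 Hz, a 3000 Hz tone is above half the sampling rate (2500 Hz)
# and yields the same samples as a 2000 Hz tone.
print(same_samples(sample_tone(2000, 5000, 10), sample_tone(3000, 5000, 10)))  # True

# At 8000 Hz, both tones are below 4000 Hz and remain distinguishable.
print(same_samples(sample_tone(2000, 8000, 10), sample_tone(3000, 8000, 10)))  # False
```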

Now that we have mentioned video frames, another term worth explaining is the audio frame. Although the audio sample rate and the audio frame rate are both measured in hertz, samples and frames are not the same thing. An audio frame is the group of audio samples, one from each audio channel, that belong to the same instant in time.
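The difference between samples and frames can be illustrated with a short sketch. It assumes the samples are stored interleaved (channel 0, channel 1, channel 0, ...), which is a common layout but an assumption here; `group_into_frames` is a hypothetical helper.

```python
def group_into_frames(interleaved_samples, channels):
    """Group a flat, interleaved list of samples into one tuple per time instant."""
    if len(interleaved_samples) % channels != 0:
        raise ValueError("sample count must be a multiple of the channel count")
    return [
        tuple(interleaved_samples[i:i + channels])
        for i in range(0, len(interleaved_samples), channels)
    ]

stereo = [10, -10, 20, -20, 30, -30]        # 6 samples, 2 channels
print(group_into_frames(stereo, 2))          # [(10, -10), (20, -20), (30, -30)]

mono = [10, 20, 30]                          # with one channel, a frame is a single sample
print(group_into_frames(mono, 1))            # [(10,), (20,), (30,)]
```

With a single channel, as assumed in the rest of this article, the number of frames equals the number of samples.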

The most common sample rates are the aforementioned 8kHz (most common in telephone communications), 44.1kHz (most common in music CDs), and 48kHz (most common in movie soundtracks). A lower sample rate means fewer samples per second, which in turn means less audio data, because fewer sample points represent the same duration of audio. The choice of sampling rate depends on which acoustic features need to be captured. Some, such as speech intonation, require a lower sampling rate than others, such as the music on a CD. It's worth noting that higher sample rates require more storage space and processing power, although this is less of an issue today than it was in the past, when digital storage and processing power were the primary concern.

2. Sampling depth/sampling precision/sampling size

In addition to the sampling rate, which tells us how many audio data points we have, there is also the sampling depth. Measured in bits per sample, sample depth (also called sample precision or sample size) is the second important property of an audio file or audio stream, and represents the level of detail, or "quality", of each sample. As mentioned above, each audio sample is just a number, and while having many numbers helps represent audio, the range or "quality" of each individual number must also be large enough to represent each sample or data point accurately. What does "quality" mean? For an audio sample, it simply means that the sample can take on more distinct amplitude values. A sampling depth of 8 bits means we have 2^8 = 256 different amplitudes, a sampling depth of 16 bits means we have 2^16 = 65,536 different amplitudes, and so on for higher sampling depths. The most common sample depths for phone audio are 16-bit and 32-bit. In a digital recording, the more different amplitudes there are, the closer the recording will sound to the original acoustic event.
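The effect of sampling depth can be sketched by counting the available amplitude levels and rounding a value to the nearest one. The quantization scheme below (a symmetric signed integer range, input normalized to [-1.0, 1.0]) is an assumption chosen for illustration; real formats differ in detail.

```python
def amplitude_levels(bit_depth):
    """Number of distinct amplitude values available at a given sampling depth."""
    return 2 ** bit_depth

def quantize(value, bit_depth):
    """Round a value in [-1.0, 1.0] to the nearest signed integer level (assumed scheme)."""
    max_level = 2 ** (bit_depth - 1) - 1
    clipped = max(-1.0, min(1.0, value))
    return round(clipped * max_level)

def reconstruct(level, bit_depth):
    """Map an integer level back to the [-1.0, 1.0] range."""
    return level / (2 ** (bit_depth - 1) - 1)

print(amplitude_levels(8), amplitude_levels(16))   # 256 65536

for depth in (8, 16):
    level = quantize(0.3, depth)
    error = abs(reconstruct(level, depth) - 0.3)
    print(depth, level, error)   # the error is much smaller at 16 bits than at 8 bits
```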

Again, this is similar to the 8-bit or 16-bit figures we might hear about for image quality. For images or videos, each pixel in the image or video frame also has a certain number of bits to represent its color. The higher the bit depth of a pixel, the more accurate the resulting color, because the pixel has more bits to "describe" the color shown on the screen, and the pixel or the image as a whole looks more like what people would see in real life. Technically, a pixel's bit depth indicates how many different colors that pixel can represent. If each of R, G, and B is represented by 8 bits, then each pixel is represented by 3 x 8 = 24 bits. This means that pixel can represent 2^24 = 16,777,216 (roughly 17 million) different colors.

3. Bit rate

What links the sampling rate and sampling depth is the bit rate, which is simply the product of the two. Since sampling rate is measured in samples per second and sampling depth is measured in bits per sample, the bit rate is given by (samples per second) x (bits per sample) = bits per second, abbreviated as bps or kbps. It's worth noting that because sample depth and bit rate are related, the two terms are often used interchangeably, albeit incorrectly.
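As a quick worked example of this product, the sketch below computes the bit rate for two combinations. Pairing 8 kHz with 16 bits is an assumption for illustration; the `channels` parameter defaults to 1 to match the single-channel assumption of this article.

```python
def bit_rate_bps(sample_rate_hz, bit_depth, channels=1):
    """Uncompressed bit rate in bits per second: samples/s x bits/sample x channels."""
    return sample_rate_hz * bit_depth * channels

print(bit_rate_bps(8000, 16))     # 128000  (128 kbps)
print(bit_rate_bps(44100, 16))    # 705600  (705.6 kbps)
```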

The bitrate in audio varies from application to application. Applications that require high audio quality, such as music, typically have a higher bitrate, producing higher quality, or "clearer" audio. Telephone audio, including call center audio, does not require a high bitrate, so the bitrate of a regular phone call is usually much lower than that of a music CD. Whether it's sample rate or bit rate, lower values may sound worse, but again, depending on the application, lower values may save storage space and/or processing power.
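To make the storage trade-off concrete, the following sketch estimates how many bytes one minute of uncompressed, single-channel audio occupies. The chosen rate and depth pairs are illustrative assumptions, and file-format headers are ignored.

```python
def storage_bytes(sample_rate_hz, bit_depth, seconds, channels=1):
    """Size in bytes of uncompressed audio, ignoring any container or header overhead."""
    total_bits = sample_rate_hz * bit_depth * channels * seconds
    return total_bits // 8

print(storage_bytes(8000, 16, 60))     # 960000   (about 0.96 MB per minute)
print(storage_bytes(44100, 16, 60))    # 5292000  (about 5.3 MB per minute)
```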

Finally, what exactly does compression mean when it comes to audio? Compressed audio formats, such as AAC or MP3, have bit rates that are smaller than the true product of sample rate and sample depth. These formats work by "surgically" removing information from the bitstream: frequencies or amplitudes that the human ear cannot perceive in a given context are not stored, resulting in smaller overall file sizes.
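The size reduction can be expressed as a ratio between the uncompressed bit rate and the bit rate of the compressed stream. In the sketch below, the 128 kbps figure for the compressed stream is an assumed example value, not something stated in this article.

```python
def uncompressed_bit_rate_bps(sample_rate_hz, bit_depth, channels=1):
    """Bit rate of uncompressed audio: samples/s x bits/sample x channels."""
    return sample_rate_hz * bit_depth * channels

def compression_ratio(uncompressed_bps, compressed_bps):
    """How many times smaller the compressed stream is than the uncompressed one."""
    return uncompressed_bps / compressed_bps

cd_mono_bps = uncompressed_bit_rate_bps(44100, 16)   # 705600
assumed_compressed_bps = 128_000                      # assumption: a 128 kbps compressed stream

ratio = compression_ratio(cd_mono_bps, assumed_compressed_bps)
print(round(ratio, 2))    # 5.51
```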

