

Converting a source program written in a high-level language into an executable program requires "compilation and linking". Source programs written in high-level languages cannot be directly executed on the machine and must be compiled and linked.
To run a program, it must go through four steps: preprocessing, compilation, assembly and linking. Next, we will explain these processes in detail through a few simple examples.
We need to explain some of the options used above.
If you use the gcc command without any options, the entire process of preprocessing, compilation, assembly, and linking will be performed by default. If the program is correct, you will get an executable file, which defaults to a.out
-E option: prompts the compiler to stop after performing preprocessing, and subsequent compilation, assembly, and linking will not be executed.
-S option: prompts the compiler to stop after compilation and not to perform assembly and linking.
-c option: prompts the compiler to stop after executing assembly.
So, these three options are equivalent to limiting the stop time of the compiler execution operation, rather than taking out a certain step separately for execution.
#Everyone should be familiar with the execution process of the above program, so I won’t waste any time.
1. Preprocessing:
Use the -E option to indicate that only precompilation will be performed, and a .i file will be generated accordingly.
Operations performed during the preprocessing process:
- Delete all "#define" and expand all macro definitions
- Process all conditional compilation instructions, For example, "#if", "#ifdef", "#elif", "#else", "#endif"
- processes the "#include" precompilation directive and inserts the included header file into the compilation The location of the instruction. (This process is recursive, because the included file may also contain other files)
- Remove all comments "//" and "/* */".
- Add line number and file name identifiers to facilitate the compiler to generate line number ideas for debugging later during compilation and to display the line number when compilation errors or warnings occur during compilation.
- Keep all #pragma pragmas as the compiler needs them.
Use a simple program to verify whether the facts are as mentioned above
Write a simple program, and then use the -E option to perform the preprocessing process and open the generated Compare the .i file with the source file, and the result is clear at a glance
Adding line numbers to the code will not be demonstrated here. We will not do it manually when writing code When adding line numbers, the line numbers we see are automatically added by the editing tools we use, and these line numbers cannot be seen by the compilation system. However, we find that if there is a problem with any line of our code, When compiling, a prompt will be given to tell which line of code has a problem. This has proven that the compiler will automatically add line numbers.
2. Compilation:
Use the -S option to indicate that the compilation operation will end after execution. A .s file is generated accordingly.
The compilation process is the core part of the entire program construction. If the compilation is successful, the source code will be converted from text form into machine language. The compilation process is to perform a series of lexical analysis, syntax analysis, and semantic analysis on the preprocessed files. After analysis and optimization, the corresponding assembly code file is generated.
- Lexical analysis:
Lexical analysis uses a program called lex to implement lexical scanning. It will analyze the input string according to the lexical rules previously described by the user. Divide it into individual tokens. The generated tokens are generally divided into: keywords, identifiers, literals (including numbers, strings, etc.) and special symbols (operators, equal signs, etc.), and then they are placed in the corresponding tables.
- Grammar analysis: The grammar analyzer parses the token sequence generated by lexical analysis according to the grammar rules given by the user, and then forms a grammar tree from them. For different languages, only their grammatical rules are different. There is also a ready-made tool for syntax analysis called: yacc.
- Semantic analysis:
Grammatical analysis completes the analysis of the syntax level of the expression, but it does not understand whether the statement is truly meaningful. Some statements are grammatically legal, but have no practical meaning. For example, when two pointers are multiplied, semantic analysis is required. However, the only semantics that the compiler can analyze are static semantics.
Static semantics: Semantics that can be determined at compile time. Usually includes declaration and type matching and type conversion. For example, when a floating-point expression is assigned to an integer expression, it implies a conversion from floating-point to integer, and semantic analysis needs to complete this conversion. For another example, converting a floating-point type into Assigning an expression to a pointer is definitely not possible. During semantic analysis, it will be found that the two types do not match, and the compiler will report an error.
Dynamic semantics: Semantics that can only be determined at runtime. For example, if you divide two integers, there is no problem with the syntax and the types match. It sounds like there is nothing wrong with it. However, if the divisor is 0, there will be a problem. This problem is not known in advance and can only be done during operation. Only when the time comes can we find out that there is something wrong with him. This is dynamic semantics.
- Intermediate code generation
Our code can be optimized. For some values that can be determined during compilation, they will be optimized, such as Speaking of 2 6 in the above example, its value can be determined to be 8 during compilation, but it is more difficult to directly optimize the syntax. In this case, the optimizer will first convert the syntax tree into intermediate code. Intermediate code is generally independent of the target machine and operating environment. (Does not include data size, variable address, register name, etc.). Intermediate codes have different forms in different compilers. The more common ones are three-address code and P-code.
The intermediate code allows the compiler to be divided into front-end and back-end. The compiler front-end is responsible for generating machine-independent intermediate code, and the compiler back-end converts the intermediate code into machine code.
- Target code generation and optimization
The code generator converts the intermediate code into machine code. This process depends on the target machine, because different machines have different Word length, register, data type, etc.
Finally, the target code optimizer optimizes the target code, such as selecting appropriate addressing methods, using unique ones to replace multiplication and division, and deleting redundant instructions.
3. Assembly
The assembly process is completed by calling the assembler as, which is used to convert the assembly code into instructions that the machine can execute. Almost every assembly statement Corresponds to a machine instruction.
Use the command as hello.s -o hello.o or use gcc -c hello.s -o hello.o to execute until the end of the assembly process, and the corresponding generated file is an .o file.
4. Links
The main content of the link is to correctly connect the parts that reference each other between the modules. Its job is to correct the references of some instructions to other symbol addresses. The linking process mainly includes address and space allocation, symbol resolution and redirection
Symbol resolution: sometimes also called symbol binding, name binding, name resolution, or address binding, it actually refers to the use of symbols Come and go to identify an address.
For example, Int A = 6; such a code, use A to identify a 4 -byte size space in the space. The content stored in the space is 4.
The process of addressing each target is called relocation.
The most basic link is called static linking, which is to compile the source code file of each module into a target file (Linux: .o Windows: .obj), and then link the target file and library together to form the final executable file. A library is actually a package of a set of target files. Some of the most commonly used codes are mutated into target files and then packaged and stored. The most common library is the runtime library, which is a collection of basic functions that support program running.
For more related knowledge, please visit: PHP Chinese website!
The above is the detailed content of What does it take to convert a source program written in a high-level language into an executable program?. For more information, please follow other related articles on the PHP Chinese website!

机器语言的特点:难学、难懂、难理解;无通用性;需要人为分配内存;运行速度最快。汇编语言的特点:程序的执行效率非常高、占用存储空间小、运行速度快;缺乏通用性,程序不易移植。高级语言的特点:容易、有一定通用性、计算机不能直接识别和执行。

能够把高级语言编写的源程序翻译成目标程序的系统软件是“编译程序”。编译程序属于采用生成性实现途径实现的翻译程序;它以高级程序设计语言书写的源程序作为输入,而以汇编语言或机器语言表示的目标程序作为输出。

汇编语言不是高级语言;它和机器语言一样,都属于低级语言。汇编语言和高级语言的区别:1、汇编语言的编程效率不高,而高级语言的编程效率高于汇编语言;2、高级语言的可读性比汇编语言高;3、汇编语言是一种面向机器的语言,而高级语言是简化靠近人的自然语言。

把高级程序设计语言编写的程序转换成等价的可执行程序需要经过编译和连接。高级程序设计语言可摆脱计算机指令系统和机器语言随机器不同的约束,把源程序转换为CPU能识别的目标代码。

快科技11月17日消息,今天上午,深开鸿宣布,其和乐聚机器人共同研发的,首款基于开源鸿蒙的KaihongOS人形机器人正式发布。根据官方介绍,这是一款搭载了KaihongOS以机器人为载体的万物智联教学系统,涵盖工业、服务等多场景。这个机器人装备了血氧心率传感器、温湿度传感器、红外测温传感器、人体感应传感器、NFC、OLED显示屏、LED灯。通过多元联合感知,它可以智能地执行决策同时,该机器人还可以与手机、平板、电脑等设备实时互联,摆脱传统线束连接方式,提升教学效率。同时,这款机器人还拥有17个

机器语言不是高级语言,是一种用于计算机硬件的低级语言,它直接与计算机的底层硬件进行交互,机器语言使用二进制编码表示指令和数据,每个指令对应着计算机硬件的一种操作或功能,高级语言是一种更接近人类语言的编程语言,提供了更高的抽象能力和开发效率,机器语言和高级语言在编程中扮演着不同的角色,各自有着各自的优势和适用场景。

C语言规定,在一个源程序中,main函数的位置:可以是任意的;在执行一个c语言编写的程序时,main函数就相当于是执行程序的入口;不论main函数在整个过程中的位置如何,一个C程序总是从mam函数开始执行的。

Go语言的特性分析:它是否属于高级语言?Go语言是一种由Google开发的静态强类型编程语言,它设计简洁,易于阅读和编写,同时具备高效的并发特性。那么,根据高级语言的定义,我们来分析一下Go语言是否属于高级语言。高级语言的定义高级语言是一种相对于机器语言更接近人类语言的编程语言。它具有语法简洁、易读易写、抽象层次高等特点,能够让程序员更专注于解决问题本身,而

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

SublimeText3 Chinese version
Chinese version, very easy to use

WebStorm Mac version
Useful JavaScript development tools

Zend Studio 13.0.1
Powerful PHP integrated development environment

SublimeText3 Linux new version
SublimeText3 Linux latest version

Safe Exam Browser
Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.
