Author:
Innokrea Team

Publication Date: 2023-09-14

Compilation vs. Interpretation (part 2)

Programming

In this article, we want to continue developing the topic of compilation, file formats, and the low-level aspects related to assembly language and processor architecture. If you're curious, then stick around with us.

Binary File Formats

These formats define how the structure of an executable binary, object, or library file should look. It depends on the system within which we compile a given program. The ELF format (Executable and Linkable Format) is a file format popular on Unix-like systems (Unix, Linux). You can recognize it by extensions such as .bin, .o, .elf, .ko, .so, or no extension at all.

https://linuxhint.com/understanding_elf_file_format/

The most popular binary file format on Windows is PE (Portable Executable). If you want to delve deeper into this topic, we invite you to visit the following pages.

https://en.wikipedia.org/wiki/Portable_Executable

We encourage you to read a comparison of executable file formats at this link:

https://en.wikipedia.org/wiki/Comparison_of_executable_file_formats

File ELF

An ELF (Executable and Linkable Format) file consists primarily of the following elements:

Header, which contains metadata and information about the file format.
Program segments, which determine how the program should be loaded into memory (only for executable files).
Sections, which store various program elements such as code, data, symbols, and more.

Each of these elements has its place and function within the ELF file structure, enabling the correct loading, management, and execution of program contents.

Figure 1 - Metadata Content for an ELF Executable File.

In the diagram, we can observe, among others:

Magic bytes - These are bytes at the beginning of the ELF file, serving as a unique identifier. They are used by the operating system to recognize and differentiate the ELF format from other file formats.

Class - The class field specifies the target system's architecture, determining whether the ELF file is intended for a 32-bit or 64-bit system. Two common values for this field are:

ELF32 (32-bit class): Indicates a 32-bit system.
ELF64 (64-bit class): Indicates a 64-bit system.

Data - This field indicates how binary data is ordered in memory. Computers can store multi-byte data with the most significant byte (MSB) first (big-endian) or the least significant byte (LSB) first (little-endian). If you're unsure about the encoding, attempting reverse engineering could lead to incorrect addresses. In the pwntools library, when reading an ELF file, you can specify the encoding.

Figure 2 - Little vs Big Endian, Data stored in reverse order depending on how we want to encode it. Source: iar.com.

Entry Point Address - The initial address for program entry.

If you're interested in delving deeper into the details of the other sections of the ELF format, you can visit the following blog:

https://blog.k3170makan.com/2018/09/introduction-to-elf-format-elf-header.html?fbclid=IwAR3F_O3PCi_y_Svj-EUxnes7WWHPOxt_mnYndCUXthsMtpJL3mj8BwJqu-0

What happens when we execute a program?

The following are the sequential steps:

Checking the "magic number"
Checking the ELF header
Checking the program header
Loading segments
Memory allocation
Copying segments to allocated memory
Jumping to the entry point (program start)

Figure 3 - Representation of program start after its execution, Source: EngMicroLectures.

At the lowest level, the processor communicates with memory, performs logical operations, and translates them into electrical impulses. If the processor needs to process data, it performs operations in registers, which are very fast memory cells located within the processor.

Ones and zeros read by the processor trigger specific operations at the electrical level, causing certain circuits to connect and perform logical operations.

All loops, conditionals (ifs), and objects are actually abstract constructs that assist humans.

Operation Codes, Assembly Language

It should be understood that the processor in our computers has access to only a limited number of mathematical/logical operations represented by operation codes -> https://en.wikipedia.org/wiki/Operation_code

Computers, as we have already mentioned, understand only machine code. So, how does assembly language relate to a sequence of these zeros and ones as shown in the image below?

Figure 4 - Contents of a Binary File.

Well, in assembly language, the aforementioned operation codes appear. These are numbers that constitute a part of an instruction sent to the processor for execution, indicating which operation is to be performed. Each assembly command such as add, sub, etc., has its own number, which gets translated into machine code during compilation. The set of codes for a specific processor is determined by its programming model.

Figure 5 - Operation Codes for the x86 Architecture, Source: Fraunhofer, FKIE.

What is ISA?

The Instruction Set Architecture (ISA) is a component of the abstract computer model that defines how software controls a processor. ISA serves as the interface between hardware and software, determining both what operations the processor can perform and how to execute them.

ISA provides the only way for users to communicate with the hardware.

Processors with the same programming model are compatible, meaning they can execute the same programs and produce the same results. In the early history of processors, the processor's programming model depended on the physical implementation of the processor and often emerged entirely from it. Currently, the trend is the opposite, where various physical implementations (microarchitectures) from different manufacturers adhere to the same ISA. For example, an AMD and an Intel processor, despite having different physical designs, can have the same set of instructions – somewhat like an API independent of physical implementation.

Figure 6 - Operation Codes for MIPS32, Source: Wikipedia.

In summary, there are various types of assembly languages tailored for different architectures with distinct instruction sets. An assembler program translates assembly language into machine code for a specific processor. Consequently, each CPU architecture has at least one assembler capable of converting assembly language into machine code for that processor.

This implies that:

There are many types of assembly languages like x86, ARM, MIPS.
Assembly language represents syntax.
Each processor has an ISA (Instruction Set Architecture) that specifies which assembly language instructions can be used.

If you'd like to learn more on this topic, we encourage you to visit the following links:

Compilation - Continued

Let's begin by recalling the entire compilation process using this diagram.

Figure 7 - Compilation process - Source: LearningLad.

Initially, the source code goes through the preprocessor, where header files are attached, and constants are resolved into values. Then, the compiler transforms the source code into assembly language code. Subsequently, the assembler (a program that compiles assembly language into binary format) generates object files (.obj, .o). Finally, the linker resolves dependencies between individual files and combines everything into a single binary file.

Figure 8 - Compilation of Multiple Files Simultaneously - Source: EngMicroLectures.

We can create programs consisting of multiple files and recompile only those that we modify, rather than the entire project created.

Syntactic, Lexical, and Semantic Analysis

It's also worth mentioning that the above diagrams do not fully cover the topic, as they omit both the parsing stage of source code and its optimizations. The initial stages can be described more precisely as follows:

Figure 9 - Initial Stage of Compiling Source Code to Assembly.

At the start, the compiler performs lexical analysis (identifies tokens) and then removes white spaces. You can think of this simplistically as finding words in a sentence. Syntactic analysis, also known as parsing, involves analyzing a sequence of characters in a natural or programming language according to the rules of a formal grammar. Parsing data involves processing information, organizing it, and providing structured data. Semantic analysis uses syntax trees and symbol tables to check whether the code is semantically correct. It checks things like type compatibility. Following this, code optimization may occur, which makes it execute faster and with fewer assembly language instructions.

Summary

We hope that today's post has shed light on the workings of processors and the compilation process. Next week, we will discuss what an interpreter actually is and why it significantly differs from the classical approach.

Sources:

NEW!

Neural networks – introduction

Discover the depths of deep learning with us! In this article, you will learn what secrets neural networks hide.

Innovation

2024-04-25

NEW!

Kubernetes – what is orchestration and why Docker is not enough?

Discover what Kubernetes is, its fundamental capabilities, and why Docker alone might not be enough for serious production environments.

Programming

2024-04-18

NEW!

Neural Networks Training with Limited Resources

Neural networks and their training on graphics processing units (GPUs). Learn why GPUs are becoming a popular choice.

Information

Innovation

2024-04-04

NEW!

CI/CD, SDLC, and other concepts in the DevOps culture

Learn how introducing CI/CD increases the efficiency of teams and the frequency and quality of your company's software deployments.

Programming

2024-03-28

NEW!

IT Strategic Analysis

Odkryj proces analizy strategicznej w obszarze IT: definiowanie celów, strategii, konkurencji i potrzeb klientów.

Strategy

2024-03-21

NEW!

Organization of the IT technology transfer process in the enterprise

The IT technology transfer process: planning, implementation, evaluation - key phases in business challenges and successes.

Strategy

2024-03-07

NEW!

Types of IT technology strategies

Overview of possible approaches and challenges when optimising IT strategy - types of technology strategies.

Strategy

2024-02-29

NEW!

Docker – Do It Right and Securely Part 2

Another portion of good practice in containerisation using Docker software.

Administration

2024-02-22

NEW!

Docker – Do It Right and Securely Part 1

Discover Docker best practices: official images, versioning, image minimisation, multi-stage build and more!

Administration

2024-02-15

NEW!

SDLC and DevOps Culture

Analysis of the impact of DevOps on the software development process, differences from traditional methods and integration in different stages of the software development lifecycle.

Programming

2024-02-08

NEW!

Linux – About Shells, Scripts, and Permissions

Discover the world of Linux, learn the differences between shells, how to create bash scripts, and the secrets of granting permissions on the system.

Administration

2024-02-01

NEW!

Internet of Things (IoT) – Part 4: CoAP protocol

Discover an alternative to MQTT - the CoAP protocol in IoT. Similar to HTTP, efficient on limited resources, secure with DTLS.

Information

2024-01-25

NEW!

INNOKREA recognized as a Top Development Company in 2024 by Techreviewer.co

We are delighted to have been recognized as one of the top software development companies in 2024.

Information

2024-01-24

NEW!

Internet Of Things (IoT) – Part 3

IOT - available technologies, software packages and what are the differences between the protocols specific to IOT?

Information

2024-01-18

NEW!

Internet Of Things (IoT) – Part 2

Technical aspects of the operation of Internet of Things (IOT) devices, protocol stacks and challenges that engineers have to face.

Information

2024-01-11

NEW!

Internet Of Things (IoT) – part 1

IOT - how rapidly the number of devices on the Internet is growing and how the number of sensors we use directly or indirectly is increasing.

Information

2024-01-04

NEW!

Machine Learning – Part 3: Data Representation Methods

Several widely used data representation methods - how to effectively transfer data to the data processing algorithm.

Information

2023-12-21

NEW!

Machine Learning – Part 2: Different Approaches

About the approaches used in this field, their advantages and disadvantages, as well as possible applications of some algorithms.

Information

2023-12-14

NEW!

Machine Learning – Part 1: Is It Worth It?

Machine learning is a field that is developing at an incredible pace and is penetrating virtually all sectors of the economy.

Information

2023-12-07

NEW!

Cryptography – stream ciphers

Low-level operation of encryption algorithms - what are LSFR registers and where stream ciphers are used.

Security

2023-11-30

NEW!

Cryptography – good password and good practices

About storing, saving and password policy.

Security

2023-11-23

NEW!

Cryptography – hash functions, hashes and passwords part 2

Passwords and their security on the Internet.

Security

2023-11-16

NEW!

Cryptography – hash functions and passwords

Cryptography - discover the secrets of hash functions, hashes and passwords.

Security

2023-11-09

NEW!

Cryptography – randomness in cybersecurity

What is randomness, how to properly estimate it, and how important is it in the field of cryptography.

Security

2023-11-02

NEW!

Cryptography – basic concepts and definitions

Discover encryption, decryption, and hashing while gaining professional knowledge from our cryptography series

Security

2023-10-26

NEW!

Authentication, Second Factor, and Session Hijacking

What is authentication, what is the second factor and what options do we have to confirm the user's identity

Security

2023-10-19

NEW!

Virtualization and containerization – good design principles

Virtualization, its types and good design principles for containers.

Programming

2023-10-12

NEW!

Ransomware – What It Is and How It Can Impact Your Company ? (part 2)

Real cases of attacks and good defense practices in cyberspace

Security

2023-10-05

NEW!

Ransomware – What It Is and How It Can Impact Your Company ?

Threats and good practices for counteracting ransomware

Security

2023-09-28

NEW!

Compilation vs. Interpretation (part 3)

Disadvantages of compiled languages and how interpreters respond to modern needs

Programming

2023-09-21

NEW!

Compilation vs. Interpretation (part 2)

File formats, and the low-level aspects related to assembly language and processor architecture

Programming

2023-09-14

NEW!

Compilation vs. Interpretation (part 1)

Understanding differences and similarities of code compilation and interpretation

Programming

2023-09-08

NEW!

Terraform – Infrastructure Management Automation (part 2)

The basics of managing state in Terraform and setting up a Github repository

Administration

2023-08-31

NEW!

Terraform – Infrastructure Management Automation (part 1)

Fundamentals of infrastructure automation with Terraform

Administration

2023-08-24

NEW!

Improve your online security with Innokrea – don’t let yourself be robbed (part 4)

The last episode of the cybersecurity series

Security

2023-08-17

NEW!

Enhance your online safety with Innokrea – don’t let yourself be robbed (part 3)

Tips #21 - #30 on cybersecurity

Security

2023-08-10

NEW!

Increase your Internet security with Innokrea – don’t let yourself be robbed (part 2)

Tips #11 - #20 to increase your safety on the Internet

Security

2023-08-03

NEW!

Increase your online security with Innokrea – don’t let yourself get robbed (part 1)

10 tips that will increase your online security

Security

2023-07-27

NEW!

Supercomputers (part 4)

Ways to increase the efficiency of supercomputers

Innovation

2023-07-17

NEW!

Supercomputers (part 3)

Network architecture of cluster systems and network topologies

Innovation

2023-06-27

NEW!

Top 100 iOS Development Companies 2023

INNOKREA Included in Techreviewer's Top 100 iOS Development Companies for 2023

Information

2023-06-26

NEW!

Supercomputers (part 2)

Technologies and cooling of supercomputers

Innovation

2023-06-19

NEW!

Supercomputers (part 1)

Introduction to the topic of supercomputers

Innovation

2023-06-01

NEW!

Masscan (part 2)

Specifics, advanced options and reflection attack.

Security

2023-05-25

NEW!

Masscan (part 1)

How to scan the entire Internet in minutes?

Security

2023-05-18

NEW!

SOLID – clean code in object-oriented programming

Descrption of rules for object-oriented programming

Programming

2023-05-11

NEW!

Clean code

How to write safe, high-quality code ?

Programming

Security

2023-04-27

NEW!

Hannover Messe 2023

Report on the participation of INNOKREA in the fair

Information

Innovation

2023-04-26

NEW!

Docker – how to simplify running and deploying applications ? (part 4)

Differences between ARG and ENV, docker-compose, orchestration and docker API

Administration

2023-04-13

NEW!

Docker – how to simplify running and deploying applications ? (part 3)

We present the basic problems that a novice Docker user may encounter

Administration

2023-04-06

NEW!

Docker – how to simplify running and deploying applications ? (part 2)

Basic Docker commands

Administration

2023-03-30

NEW!

Docker – how to simplify running and deploying applications ? (part 1)

We present the subject of virtualization, containerization and Docker.

Administration

2023-03-23

NEW!

Zabbix – Increase security, monitor your servers (part 3)

Description of the admin panel and selected problems with database monitoring in Zabbix

Administration

Security

2023-03-16

NEW!

Zabbix – Increase security, monitor your servers (part 2)

Ways to configure Zabbix software

Administration

Security

2023-03-09

NEW!

Zabbix – Increase security, monitor your servers (part 1)

Article about the popular server management tool

Administration

Security

2023-03-02

NEW!

What is BadUSB and why can it be a vector of an attack on your company? Part 2

How to protect a company against attacks through Rubber Ducky devices?

Security

2023-02-23

NEW!

What is BadUSB and why can it be a vector of an attack on your company? Part 1

How an inconspicuous flash drive can lead to taking control of your computers?

Security

2023-02-16

NEW!

IT technology replacement

The most important questions that need to be answered before replacing an IT technology.

Management

Strategy

2023-02-08

NEW!

Evaluation of IT strategy

Every IT strategy (even the best one) requires periodic review and reflection

Strategy

2023-01-20

NEW!

Deloitte Technology Fast 50 CE 2022

We are delighted to have been recognized in Deloitte's prestigious ranking as the fastest growing technology company in Central Europe.

Information

2022-12-01

NEW!

How to optimize business processes using the process mapping method?

In this article, you will learn how to optimize your business processes using the process mapping method to increase the efficiency.

Management

Strategy

2022-11-24

NEW!

Digitization of company processes – what is worth knowing?

Digitization of processes involves bringing technological solutions into a company and enables it to keep pace with changes in the economy.

Strategy

2022-11-15

NEW!

Pros and cons of digitization in manufacturing processes

Digitization of production processes has become one of the most important trends in the manufacturing industry. Learn about the pros and cons of digitization.

Strategy

2022-10-26

NEW!

Industry 4.0 and its impact on manufacturing companies

Industry 4.0 What impact does digitization have on manufacturing companies? What opportunities do digital technologies offer? What are the benefits of the digital revolution?

Management

2022-10-03

NEW!

How to increase operational efficiency in a manufacturing company?

What changes have occurred over the past few years? How do you create value with BPM? How does BPM improve operational efficiency in manufacturing companies?

Innovation

2022-09-20

NEW!

The Manifest Names Innokrea Among Poland’s Most Reviewed Software Developers

INNOKREA among top development companies in Poland

Information

2022-09-14

NEW!

Evaluation of IT projects

How to know if our projects are going in the right direction and will achieve their goals?

Management

2022-08-01

NEW!

Main challenges and risks of IT projects

Proper and pragmatic risk management can save a lot of money

Management

2022-06-27

NEW!

The impact of implementations of IT projects on the company’s production processes

IT project implementation might be challenging. It is important to take into considerations potential impact of the deployment on all business processes.

Innovation

Strategy

2022-05-12

NEW!

Planning of IT projects – constraints

Imagine an IT project without any constraints ?

Innovation

2022-02-15

NEW!

Planning of IT projects – smart goals

“If one does not know to which port one is sailing, no wind is favourable.” — Seneca

Strategy

2022-01-10

NEW!

Planning and selection of technologies – Assessment of investments in IT technology

How to assess the financial effectiveness of investments in IT technology ?

Financial

2021-12-06

NEW!

Planning and selection of IT technologies – readiness and ability to absorb it in a company

When considering the transfer of a given IT technology to a company, we should take into account its readiness and the company's ability to absorb it.

Strategy

2021-10-25

NEW!

Planning and selection of IT technologies in companies

General principles of technology selection

Strategy

2021-09-16

NEW!

Methods of creating an IT technological strategy

A key instrument for developing a targeted technological strategy (including IT) is to define its goals.

Strategy

2021-08-24

NEW!

The concept and formulation of a technological strategy

A competitive advantage might be built by mastering many different technologies. That process requires strategic analysis, choice, planning and implementation.

Strategy

2021-07-21

NEW!

Strategic conditions for the development of IT technology

Business strategy and IT strategy must be united to conquer targeted markets.

Strategy

2021-07-06

NEW!

Sources of IT technologies

Enterprises interested in building a competitive advantage on the market through technology development can use internal, external and/or combined sources. Which model suits the best your company ?

Strategy

2021-06-14

NEW!

Trajectories of Technological Development

IT technologies, like other technologies and products of human thought and work, in different periods of their existence have different meanings and value for their users.

Innovation

2021-06-02

Contact

Got a project in mind?

Fill the form and get a free consultation!

At INNOKREA, we take pride in crafting bespoke solutions and custom software tailored to our clients' unique needs. With a team of over 25 skilled engineers proficient in 100+ technologies, we ensure top-notch quality, earning the trust of 100+ clients worldwide, all backed by our ISO 9001 badge of excellence. At our core, we are dedicated to pushing the boundaries of product engineering, delivering cutting-edge solutions that drive success for businesses across industries.

Tomasz Klajbor

CEO at INNOKREA