Assembly to C

Starter code: asm-to-c-package.tar
Upload via Moodle as: asm-to-c.tar

Goals

Rubric

1 - author name(s) in each of your source files 0 - puzzle0 (we'll do this one in class) 4 - puzzle1 (2.5 points code, 1.5 points explanation) 4 - puzzle2 4 - puzzle3 4 - puzzle4 4 - puzzle5

Background

What does a compiler do?

That question has a long, complicated answer. But in brief, our compiler (gcc) takes C sources as input, and produces an executable program as output. The executable program contains, among other things, machine language instructions whose behavior implements the computations articulated in the original C code.

Machine language is just bits, and is thus hard to read. So if we want to understand the correspondence between C code and its corresponding machine language, we're better off asking gcc to output assembly language code instead. Assembly isn't particularly easy to read, but it's a lot easier than machine language. And as a general rule, each assembly language instruction corresponds to exactly one machine language instruction, and vice versa. There are some exceptions (e.g. sometimes one assembly language instruction is an alias for a sequence of two or three machine language instructions), but as a rough guide, you can think of assembly and machine language instructions as being in one-to-one correspondence. As a result, by understanding the assembly language generated by gcc, we will be very close to understanding the machine language as well.

For this assignment, you are going to practice understanding the correspondence between simple C code and its equivalent assembly language by solving a sequence of puzzles. For each puzzle, you will read some given assembly language and try to come up with the original C code that generated it. This is a form of reverse engineering, and it's pretty fun.

Though we could use gcc on mantis, for this assignment we're instead going to use an extremely handy tool called the Compiler Explorer. You'll put some C code into the input panel, and the output panel will show you the assembly language generated by the selected compiler. As you adjust your C code, you'll be able to watch the changes in the assembly language, and then compare your assembly code to the puzzle's code.

Your assignment

In the asm-to-c-package.tar package, you will find several files named puzzle0.asm, puzzle1.asm, etc. For each puzzle, your job will go like this:

Compiler Explorer settings

To create the puzzles, I used Compiler Explorer with the following settings. You should use the same settings.

Here's how all of that looks.

compiler explorer settings

Have fun!