[PPT] - assembly language source assembler code "machine code" PowerPoint Presentation

SLIDE 1

1

These slides may be freely used, distributed, and incorporated into other works.

assembly language source code "machine code" assembler

SLIDE 2

2

These slides may be freely used, distributed, and incorporated into other works.

To execute a program: 1 put the "machine code" into memory 2 jal __start (the OS does this)

memory "machine code"

SLIDE 3

3

These slides may be freely used, distributed, and incorporated into other works.

assembler's task

1.

assign addresses

2.

generate "machine code"

3.

(architecture dependent) further translation of assembly language source code

SLIDE 4

4

These slides may be freely used, distributed, and incorporated into other works.

previous architectures MIPS architecture

1 assembly language instruction 1 assembly language instruction 1 machine code instruction 1 or more machine code instructions

SLIDE 5

5

These slides may be freely used, distributed, and incorporated into other works.

This further translation is also called synthesis. MIPS example of synthesis: add $8, $9, -16 becomes addi $8, $9, -16

SLIDE 6

6

These slides may be freely used, distributed, and incorporated into other works.

Two operands in the source code add $8, $9 are expanded back out to become add $8, $8, $9

SLIDE 7

7

These slides may be freely used, distributed, and incorporated into other works.

integer multiplication and division each produce 2 32-bit results integer division produces

 quotient  remainder

integer multiplication of 2 32-bit operands produces a 64-bit result

SLIDE 8

8

These slides may be freely used, distributed, and incorporated into other works.

MIPS hardware implements 2 extra registers (called HI and LO) to hold these results. Here are 4 more MIPS instructions: mflo R mtlo R mfhi R mthi R

m move f from t to lo register LO hi register HI

SLIDE 9

9

These slides may be freely used, distributed, and incorporated into other works.

multiplication

mul $8, $9, $10 becomes mult $9, $10 mflo $8

HI LO X

SLIDE 10

10

These slides may be freely used, distributed, and incorporated into other works.

division

div $8, $9, $10 becomes div $9, $10 mflo $8 # quotient in LO rem $12, $13, $14 becomes div $13, $14 mfhi $12 # remainder in HI

SLIDE 11

11

These slides may be freely used, distributed, and incorporated into other works.

puts, putc, getc, and done are not TAL ! I/O is accomplished by requesting service from the operating system (OS). All architectures do this with a single instruction. On MIPS, this instruction is syscall (no operands)

SLIDE 12

12

These slides may be freely used, distributed, and incorporated into other works.

(note that this is specific to our simulator) To help the OS distinguish what service is required, $v0 ($2) is set:

$v0 I/O operation 11 putc 12 getc 4 puts 10 done

SLIDE 13

13

These slides may be freely used, distributed, and incorporated into other works.

synthesis of puts $8

first pass final synthesis li $2, 4 addi $2, $0, 4 move $4, $8 add $4, $8, $0 syscall syscall

SLIDE 14

14

These slides may be freely used, distributed, and incorporated into other works.

lw $8, X becomes la $8, X lw $8, 0($8) Oops! la must be synthesized.

SLIDE 15

15

These slides may be freely used, distributed, and incorporated into other works.

synthesis of la $8, my_label

 requires the address assigned for

my_label

 every address is assigned by the

assembler

MS part LS part

16 16 32

SLIDE 16

16

These slides may be freely used, distributed, and incorporated into other works.

la $8, my_label becomes lui $8, 0xMS part

ri $8, $8, 0xLS part

SLIDE 17

17

These slides may be freely used, distributed, and incorporated into other works.

after lui $8, 0xMS part $8

MS part 000 . . . 0

this is then logically ORed with

LS part 000 . . . 0

due to the instruction ori $8, $8, 0xLS part resulting contents of $8: $8

LS part MS part

SLIDE 18

18

These slides may be freely used, distributed, and incorporated into other works.

Synthesize lw $8, X Assume X is assigned address 0xaaee0018. first try: la $8, X lw $8, 0($8) with synthesis of the la instruction: lui $8, 0xaaee

ri $8, $8, 0x0018

lw $8, 0($8)

SLIDE 19

19

These slides may be freely used, distributed, and incorporated into other works.

Synthesize sb $12, X Assume X is assigned address 0x080001a0.

SLIDE 20

20

These slides may be freely used, distributed, and incorporated into other works.

Generate machine code for addi $8, $20, 15 From the TAL table: addi Rt, Rs, I Rt is $8 Rs is $20 I is 0000 0000 0000 1111 0010 00ss ssst tttt ii .. ii sssss is 10100 (for $20) ttttt is 01000 (for $8) 0010 0010 1000 1000 0000 0000 0000 1111 in hex 0x2288000f

16 bits

p code

SLIDE 21

21

These slides may be freely used, distributed, and incorporated into other works.

Generate machine code for lw $8, 12($sp) lw Rt, I(Rb) Rt is $8 Rb is $sp (which is $29) I is 12 1000 11bb bbbt tttt ii .. ii bbbbb is 11101 ttttt is 01000 1000 1111 1010 1000 0000 0000 0000 1100 in hex 0x8fa8000c

16

p code

value 12

SLIDE 22

22

These slides may be freely used, distributed, and incorporated into other works.

assembly language source code memory image assembler

assign addresses produce machine code

SLIDE 23

23

These slides may be freely used, distributed, and incorporated into other works.

Problem: forward references

.text beq $8, $11, later_in_code later_in_code: lw $20, X .data X: .word 16

SLIDE 24

24

These slides may be freely used, distributed, and incorporated into other works.

Simple solution: 2-pass assembler

 first pass:

 (MIPS-only) MAL  TAL synthesis  assign all addresses

 second pass:

 produce all machine code

More complex and more efficient: 1-pass assembler

 Keep a list of instructions that cannot

be completed due to yet-to-be-assigned

addresses. As addresses are assigned,

check the list and complete instructions.

SLIDE 25

25

These slides may be freely used, distributed, and incorporated into other works.

assign all addresses (and remember them) implies the use of a table holding the mapping of addresses to labels called a symbol table

SLIDE 26

26

These slides may be freely used, distributed, and incorporated into other works.

 As the assembler works on the source

code, it scans the characters in the file.

 Scanner (a SW module)

 breaks a set of characters into significant

groups known as tokens

 often, tokens are separated by white space

r special punctuation

.data a1: .word 3 loop: lw $7, 4($6)

SLIDE 27

27

These slides may be freely used, distributed, and incorporated into other works.

.data a1: .word 3 a2: .word 16:4 a3: .word 5 .text __start: la $6, a2 loop: lw $7, 4($6) mult $9, $10 b loop done

SLIDE 28

28

These slides may be freely used, distributed, and incorporated into other works.

2 segments: code and data

 The assembler places items into these 2

segments. So, it needs addresses.

 Use starting addresses of

data 0x0040 0000 code 0x0080 0000

 The variable internal to the assembler

that represents the next address to be assigned is the location counter.

SLIDE 29

29

These slides may be freely used, distributed, and incorporated into other works.

TAL equivalent of code: .text __start: lui $6, 0x0040 # la $6, a2

ri $6, $6, 0x0004

loop: lw $7, 4($6) mult $9, $10 beq $0, $0, loop # b loop

ri $2, $0, 10 # done

syscall

SLIDE 30

30

These slides may be freely used, distributed, and incorporated into other works.

As a result of processing the entire .data section, the memory image will be address contents 0x0040 0000 0x0000 0003 0x0040 0004 0x0000 0010 0x0040 0008 0x0000 0010 0x0040 000c 0x0000 0010 0x0040 0010 0x0000 0010 0x0040 0014 0x0000 0005

SLIDE 31

31

These slides may be freely used, distributed, and incorporated into other works.

.data a1: .word 3 a2: .word 16:4 a3: .word 5 .text __start: la $6, a2 loop: lw $7, 4($6) mult $9, $10 b loop done

SLIDE 32

32

These slides may be freely used, distributed, and incorporated into other works.

Machine code for la $6, a2 Synthesized: lui $6, 0x0040 (address from symbol table)

ri $6, $6, 0x0004

lui Rt, I Rt is $6 0011 1100 000t tttt ii .. ii ttttt is 00110 0011 1100 0000 0110 0000 0000 0100 0000 in hex 0x3c060040

16

p code

(1) (2) (1)

SLIDE 33

33

These slides may be freely used, distributed, and incorporated into other works.

Add to the memory image address contents 0x0080 0000 0x3c06 0040

SLIDE 34

34

These slides may be freely used, distributed, and incorporated into other works.

Machine code for ori $6, $6, 0x0004

ri Rt, Rs, I

Rt is $6 Rs is $6 0011 01ss ssst tttt ii .. ii ttttt is 00110 sssss is 00110 0011 0100 1100 0110 0000 0000 0000 0100 in hex 0x34c60004

16

p code

(2)

SLIDE 35

35

These slides may be freely used, distributed, and incorporated into other works.

Add it to the memory image as well, updating the location counter address contents 0x0080 0000 0x3c06 0040 0x0080 0004 0x34c6 0004

SLIDE 36

36

These slides may be freely used, distributed, and incorporated into other works.

.data a1: .word 3 a2: .word 16:4 a3: .word 5 .text __start: la $6, a2 loop: lw $7, 4($6) mult $9, $10 b loop done

SLIDE 37

37

These slides may be freely used, distributed, and incorporated into other works.

Scanning on, machine code for lw $7, 4($6)

lw Rt, I(Rb) Rt is $7 Rb is $6 I is 4 1000 11bb bbbt tttt ii .. ii 1000 1100 1100 0111 0000 0000 0000 0100 in hex 0x8cc70004

16

p code

SLIDE 38

38

These slides may be freely used, distributed, and incorporated into other works.

Add it to the memory image as well, updating the location counter address contents 0x0080 0000 0x3c06 0040 (lui) 0x0080 0004 0x34c6 0004 (ori) 0x0080 0008 0x8cc7 0004 (lw)

SLIDE 39

39

These slides may be freely used, distributed, and incorporated into other works.

.data a1: .word 3 a2: .word 16:4 a3: .word 5 .text __start: la $6, a2 loop: lw $7, 4($6) mult $9, $10 b loop done

SLIDE 40

40

These slides may be freely used, distributed, and incorporated into other works.

next comes mult $9, $10

0000 00ss ssst tttt 0000 0000 0001 1000 0000 0001 0010 1010 0000 0000 0001 1000

in hex 0x012a0018

p code

Rs Rt

01001 01010 Rd

SLIDE 41

41

These slides may be freely used, distributed, and incorporated into other works.

p code 000000 is used for

any arithmetic or logical instruction with 3 register operands

0000 00ss ssst tttt dddd d??? ???? ????

which

peration

SLIDE 42

42

These slides may be freely used, distributed, and incorporated into other works.

Add mult to the memory image as well, updating the location counter address contents 0x0080 0000 0x3c06 0040 (lui) 0x0080 0004 0x34c6 0004 (ori) 0x0080 0008 0x8cc7 0004 (lw) 0x0080 000c 0x012a 0018 (mult)

SLIDE 43

43

These slides may be freely used, distributed, and incorporated into other works.

.data a1: .word 3 a2: .word 16:4 a3: .word 5 .text __start: la $6, a2 loop: lw $7, 4($6) mult $9, $10 b loop done

SLIDE 44

44

These slides may be freely used, distributed, and incorporated into other works.

b loop is a pseudoinstruction (must be synthesized) Many translations: beq $0, $0, loop bgez $0, loop blez $0, loop j loop

SLIDE 45

45

These slides may be freely used, distributed, and incorporated into other works.

beq $0, $0, loop

0001 00ss ssst tttt iii ... ii

p code

Rs Rt

00000 00000

I

I is a derivation of an offset.

SLIDE 46

46

These slides may be freely used, distributed, and incorporated into other works.

At run (execution) time, for a taken branch I (from instruction) I || 00 (concatenate) I || 00 (sign extend to 32 bits)

+ PC  PC

SLIDE 47

47

These slides may be freely used, distributed, and incorporated into other works.

I computed by the assembler relies on except, when the PC (the branch address!) is used (at execution time), the PC update step (of the fetch and execute cycle) has already been completed. So,

byte difference target address branch address

=

byte

difference target address branch address

=

+ 4

SLIDE 48

48

These slides may be freely used, distributed, and incorporated into other works.

from the symbol table target is loop 0x0080 0008 beq is at 0x0080 0010

byte offset = 0x00800008 – ( 0x00800010 + 4 )

(can't do this in unsigned, so convert to 2's complement)

0000 0000 1000 0000 0000 0000 0000 1000

0000 0000 1000 0000 0000 0000 0001 0100

1111 1111 0111 1111 1111 1111 1110 1100

additive inverse of

SLIDE 49

49

These slides may be freely used, distributed, and incorporated into other works.

0000 0000 1000 0000 0000 0000 0000 1000 + 1111 1111 0111 1111 1111 1111 1110 1100 1111 1111 1111 1111 1111 1111 1111 0100 this represents -12

12 is the byte offset to be added

to the PC to form the new (correct) target PC

SLIDE 50

50

These slides may be freely used, distributed, and incorporated into other works.

Recall that

At run (execution) time, for a taken branch I (from instruction) I || 00 (concatenate) I || 00 (sign extend to 32 bits)

+ PC  PC

SLIDE 51

51

These slides may be freely used, distributed, and incorporated into other works.

instructions are all 32 bits = 4 bytes addresses of all instructions xx ... xx00 for example, 0, 4, 8, 12, 16 ... So, remove the zeros at assembly time and put them back in at execution time. 18 bits of offset for 16 bits of instruction space

SLIDE 52

52

These slides may be freely used, distributed, and incorporated into other works.

back to the beq instruction:

12 is

1111 1111 1111 1111 1111 0100 1111 1111 1111 1111 1111 01 0001 00ss ssst tttt ii ... ii in hex 0x1000 fffd

eliminated 16 bit I field of instruction

00 0000 0000

1111 1111 1111 1101

SLIDE 53

53

These slides may be freely used, distributed, and incorporated into other works.

Add beq to the memory image, updating the location counter address contents 0x0080 0000 0x3c06 0040 (lui) 0x0080 0004 0x34c6 0004 (ori) 0x0080 0008 0x8cc7 0004 (lw) 0x0080 000c 0x012a 0018 (mult) 0x0080 0010 0x1000 fffd (beq)

SLIDE 54

54

These slides may be freely used, distributed, and incorporated into other works.

If the I field is a 2's complement value, it is 1111 1101  0000 0010 + 1 = 0000 0011 (+3) So, -3 is represented. The TAL code loop: lw mult beq next instr

3 instructions

SLIDE 55

55

These slides may be freely used, distributed, and incorporated into other works.

.data a1: .word 3 a2: .word 16:4 a3: .word 5 .text __start: la $6, a2 loop: lw $7, 4($6) mult $9, $10 b loop done

SLIDE 56

56

These slides may be freely used, distributed, and incorporated into other works.

done is a pseudoinstruction in TAL,

ri $2, $0, 10

syscall

SLIDE 57

57

These slides may be freely used, distributed, and incorporated into other works.

ri $2, $0, 10

0011 01ss ssst tttt ii ... ii

0011 0100 0000 0010 0000 0000 0000 1010

in hex 0x3402000a

p code

Rt Rs

00010 00000

I

16

SLIDE 58

58

These slides may be freely used, distributed, and incorporated into other works.

Add ori to the memory image, updating the location counter address contents 0x0080 0000 0x3c06 0040 (lui) 0x0080 0004 0x34c6 0004 (ori) 0x0080 0008 0x8cc7 0004 (lw) 0x0080 000c 0x012a 0018 (mult) 0x0080 0010 0x1000 fffd (beq) 0x0080 0014 0x3402 000a (ori)

SLIDE 59

59

These slides may be freely used, distributed, and incorporated into other works.

syscall is the easiest instruction of all (all bits are defined)

0000 0000 0000 0000 0000 0000 0000 1100

in hex 0x0000000c

SLIDE 60

60

These slides may be freely used, distributed, and incorporated into other works.

The complete memory image (of the code) address contents 0x0080 0000 0x3c06 0040 (lui) 0x0080 0004 0x34c6 0004 (ori) 0x0080 0008 0x8cc7 0004 (lw) 0x0080 000c 0x012a 0018 (mult) 0x0080 0010 0x1000 fffd (beq) 0x0080 0014 0x3402 000a (ori) 0x0080 0018 0x0000 000c (syscall)

SLIDE 61

61

These slides may be freely used, distributed, and incorporated into other works.

j L1 . . . L1: # more code here machine code for j

0000 10

most of an address 26 bits 6 bits

SLIDE 62

62

These slides may be freely used, distributed, and incorporated into other works.

target address calculation at run time

0000 10

most of an address

26 bits

PC 31..28 || 26 bits || 00

32-bit target address

SLIDE 63

63

These slides may be freely used, distributed, and incorporated into other works.

memory

1 16th of memory

addresses to that 1

16th of memory are all of the form

XXXX?? ● ● ● ??00

26 bits fixed

SLIDE 64

64

These slides may be freely used, distributed, and incorporated into other works.

How big is 228 bytes or 226 words is 64 M words Is it big enough?

1 16th of memory ?

SLIDE 65

65

These slides may be freely used, distributed, and incorporated into other works.

So, if we have address L1:

31 28 00

26 bits of L1

If L1 is assigned address 0xa460 005c, L1 in binary: Then machine code for j L1 is

1010 0100 0110 0000 0000 0000 0101 1100 000010 0100 0110 0000 0000 0000 0101 11 in hex 0x09180017

SLIDE 66

66

These slides may be freely used, distributed, and incorporated into other works.

Summary, on the MIPS architecture: branch instructions use an offset from the current PC at execution time to calculate the target address for a taken branch jump instructions use part of an address together with implied other bits to form an address