Assembly Language Programmer's Guide



Document number: 007-0730-010

# Assembly Language Programmer's Guide

Version 1.0

Document Number 007-0730-010

## Technical Publications:

Robert Reimann

#### **Engineering:**

Greg Boyd

#### © Copyright 1987, Silicon Graphics, Inc.

All rights reserved.

This document contains proprietary information of Silicon Graphics, Inc., and is protected by Federal copyright law. The information may not be disclosed to third parties or copied or duplicated in any form, in whole or in part, without prior written consent of Silicon Graphics, Inc.

The information in this document is subject to change without notice.

Assembly Language Programmer's Guide Version 1.0 Document Number 007-0730-010

Silicon Graphics, Inc. Mountain View, California

UNIX is a registered trademark of AT&T.

## Preface

This book describes the assembly language supported by the compiler system, it's syntax rules, and shows how to write some assembly programs. For information about assembling and linking programs written in assembler language, see the *IRIS-4D Series Compiler Guide*.

The assembler converts assembly language statements into machine code. In most assembly languages, each instruction corresponds to a single machine instruction; however, some assembly language instructions can generate several machine instructions. This feature results in assembly programs that can run without modification on future machines, which might have different machine instructions. See Appendix B for more information about assembler instructions that generate multiple machine instructions.

### Who Should Read This Book?

This book assumes that you are an experienced assembly language programmer.

The assembler exists primarily to produce object modules from the assembly instructions that the C and Fortran 77 compilers generate. It therefore lacks many functions normally present in assemblers. Therefore, we recommend that you use the assembler only when you need to:

- maximize the efficiency of a routine, which might not be possible in C or Fortran 77-for example, to write low-level I/O drivers
- access machine functions unavailable from high-level languages or satisfy special constraints such as restricted register usage
- change the operating system
- change the compiler system

### What Does This Book Cover?

This book has these chapters:

- Chapter 1-Registers describes the format for the general registers, the special registers, and the floating point registers.
- Chapter 2-Addressing describes how addressing works.
- Chapter 3-Exceptions describes exceptions you might encounter with assembly programs.
- Chapter 4-Lexical Conventions describes the lexical conventions that the assembler follows.
- Chapter 5–Instruction Set describes the main processor's instruction set, including notation, load and store instructions, computational instructions, and jump and branch instructions.
- Chapter 6-Coprocessor Instruction Set describes the coprocessor instruction sets.
- Chapter 7-Linkage Conventions describes linkage conventions for all supported high-level languages. It also discusses memory allocation and register use.
- Chapter 8–Pseudo-Operations describes the assembler's pseudo-operations (directives).
- Chapter 9-Object File Format provides an overview of the components comprising the object file and describes the headers and sections of the object file.
- Chapter 10—The Symbol Table describes the purpose of the Symbol Table and the format of entries in the table. This chapter also lists the symbol table routines that are supplied.
- Appendix A summarizes all instructions.
- Appendix B describes instructions that generate more than one machine instruction.

## Contents

| 1. | Registers       1-1         1.1 Register Format       1-1         1.1.1 Big–Endian Machines       1-2         1.2 General Registers       1-2         1.3 Special Registers       1-4         1.4 Floating Point Registers       1-4                                                                                                                                                                                                                                     |
|----|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 2. | Addressing2-12.1 Address Formats2-22.2 Address Descriptions2-3                                                                                                                                                                                                                                                                                                                                                                                                           |
| 3. | Exceptions3-13.1 Main Processor Exceptions3-13.2 Floating Point Exceptions3-2                                                                                                                                                                                                                                                                                                                                                                                            |
| 4. | Lexical Conventions4-14.1 Tokens4-24.2 Comments4-24.3 Identifiers4-24.4 Constants4-24.4 Constants4-24.4.1 Scalar Constants4-24.4.2 Floating Point Constants4-24.4.3 String Constants4-24.5 Multiple Lines Per Physical Line4-24.6 Sections and Location Counters4-24.7 Statements4-24.7.1 Label Definitions4-24.7.3 Keyword Statements4-24.8 Expressions4-24.8.1 Precedence4-24.8.2 Expression Operators4-24.8.3 Data Types4-104.8.4 Type Propagation in Expressions4-10 |

| 5. | Instruction Set5–15.1 Instruction Notation5–25.2 Load and Store Instructions5–25.2.1 Load and Store Formats5–25.2.2 Load Instruction Descriptions5–45.2.3 Store Instruction Descriptions5–85.3 Computational Instructions5–11                                                                                                                                                                                                                                                          |   |
|----|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---|
|    | 5.3.1 Computational Formats       5–11         5.3.2 Computational Instruction Descriptions       5–13         5.4 Jump and Branch Instructions       5–25         5.4.1 Jump and Branch Formats       5–25         5.4.2 Jump and Branch Instruction Descriptions       5–27         5.5 Special Instructions       5–31         5.5.1 Special Formats       5–31         5.5.2 Special Instruction Descriptions       5–31         5.5.2 Special Instruction Descriptions       5–32 |   |
|    | 5.6 Coprocessor Interface Instructions                                                                                                                                                                                                                                                                                                                                                                                                                                                 |   |
| 6. | Coprocessor Instruction Set       6-1         6.1 Instruction Notation       6-1         6.2 Floating Point Instructions       6-2         6.2.1 Floating Point Formats       6-3         6.2.2 Floating Point Load and Store Formats       6-3         6.2.3 Floating Point Load and Store Descriptions       6-4         6.2.4 Floating Point Computational Formats       6-5         6.2.5 Floating Point Computational Instruction       6-7                                       |   |
|    | <ul> <li>6.3 Floating Point Relational Operations</li></ul>                                                                                                                                                                                                                                                                                                                                                                                                                            |   |
|    | Instruction Descriptions6–196.5 Control and Status Register6–206.5.1 Exception Trap Processing6–236.5.2 Floating Point Rounding6–26                                                                                                                                                                                                                                                                                                                                                    |   |
| 7. | Linkage Conventions         7-1           7.1 Introduction         7-1           7.2 Program Design         7-2           7.2.1 Register Use and Linkage         7-2           7.2.2 The Stack Frame         7-4           7.2.3 The Shape of Data         7-9                                                                                                                                                                                                                         | 1 |

|    | 7.3 Examples7–97.4 Learning by Doing7–137.4.1 Calling a High-Level Language Routine7–137.4.2 Calling an Assembly Language Routine7–167.5 Memory Allocation7–18                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
|----|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 8. | Pseudo-Op Codes 8-1                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                |
| 9. | Object File Format9-19.1 Overview9-19.2 The File Header9-49.2.1 File Header Magic Field (f_magic)9-59.2.2 Flags (f_flags)9-69.3 Optional Header9-89.3.1 Optional Header Magic Field (magic)9-99.4 Section Headers9-99.4.1 Section Name (s_name)9-109.4.2 Line Number Entries (s_Innoptr and s_nInno)9-119.4.3 Flags (s_flags)9-129.6 Section Data9-129.6 Section Relocation Information9-139.6.1 Relocation Table Entry9-139.6.2 Assembler and Link Editor Processing9-159.7 Object Files9-209.7.1 Impure Format (OMAGIC) Files9-219.7.2 Shared Text (NMAGIC) Files9-239.7.3 Demand Paged (ZMAGIC) Files9-239.7.4 Loading Object Files9-259.8 Archive Files9-269.9 Link Editor Defined Symbols9-26 |
| 10 | The Symbol Table       10–1         10.1 Overview       10–1         10.2 Format of Symbol Table Entries       10–8         10.2.1 Symbolic Header       10–8         10.2.2 Line Numbers       10–10         10.2.3 Procedure Descriptor Table       10–15         10.2.4 Local Symbols       10–16         10.2.5 Optimization Symbols       10–20         10.2.6 Auxiliary Symbols       10–21         10.2.7 File Descriptor Table       10–24         10.2.8 External Symbols       11–26                                                                                                                                                                                                     |

| Α. | Instruction Summaries A-                                                                                                                                                     | -1             |
|----|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------|
| Β. | Basic Machine DefinitionB-B.1 Load and Store InstructionsB-B.2 Computational InstructionsB-B.3 Branch InstructionsB-B.4 Coprocessor InstructionsB-B.5 Special InstructionsB- | -1<br>-2<br>-3 |

## 1. Registers

Chapter 1 discusses the registers and describes how memory organization affects them. Refer to Chapter 7 for information regarding register use and linkage.

The machine has these registers:

- general registers, which are always one word wide
- coprocessor registers (for example, floating point registers)
- two special registers that hold the results of multiplication and division instructions

You must use general registers where the assembly instructions expect general registers and floating point registers where the assembly instructions expect floating point registers. If you confuse the two, the assembler issues an error message.

### 1.1 Register Format

A machine's byte ordering scheme (or endian issues) affects memory organization and defines the relationship between address and byte position of data in memory. R2000 machines can be big-endian or little-endian. Big-endian machines store the sign bit in the lowest address byte. Littleendian machines store the sign bit in the highest address byte. The R2000 processors used in the IRIS-4D Series workstations are configured as bigendian.

#### 1.1.1 Big–Endian Machines

Big-endian machines number the bytes of a word from 0 to 3. Byte 0 holds the sign and most significant bits.

For halfwords, big-endian machines number the bytes from 0 to 1. Byte 0 holds the sign and most significant bits.

Big-endian machines number the bits of each byte from 0 to 7, using this format:

- Bit 0 holds the most significant bit.
- Bit 7 holds the least significant bit.

#### **1.2 General Registers**

Each general register has 32 bits. The assembler reserves all register names, and you must use lowercase for the names. All register names start with a dollar sign (\$).

The general registers have the names \$0..\$31. By including the file *reg-def.h* (use #include regdef.h) in your program, you can use software names for some general registers. The operating system and the assembler use the general registers \$1, \$26, \$27, \$28, and \$29 for specific purposes. (NOTE: Attempts to use these general registers in other ways can produce unexpected results.) If a program uses the names \$1, \$26, \$27, \$28, respectively, the assembler issues warning messages.

General register \$0 always contains the value 0. All other general registers are equivalent, except that general register \$31 also serves as the implicit link register for jump and link instructions. See Chapter 7 for a description of register assignments.

1-2 Assembly Language Programmer's Guide

| Register Name   | Software Name<br>(from regdef.h) | Use and Linkage                                                                                                                                           |
|-----------------|----------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------|
| \$0             |                                  | always has the value 0                                                                                                                                    |
| \$at            |                                  | reserved for the assembler                                                                                                                                |
| \$2\$3          | v0-v1                            | used for expression evaluations and to hold<br>the integer type function results. Also used<br>to pass the static link when calling nested<br>procedures. |
| \$4\$7          | a0-a3                            | used to pass the first 4 words of integer<br>type actual arguments, their values are<br>not preserved across procedure calls                              |
| \$8\$1 <i>5</i> | t0-t7                            | temporary registers used for expression evalu-<br>ations; their values aren't preserved across<br>procedure calls.                                        |
| \$16\$23        | s0-s7                            | saved registers. Their values must be pre-<br>served across procedure calls.                                                                              |
| \$24\$25        | t8-t9                            | temporary registers used for expression evalu-<br>ations; their values aren't preserved across<br>procedure calls.                                        |
| \$kt0\$kt1      | k0-k1                            | reserved for the operating system kernel                                                                                                                  |
| \$28 or \$gp    | gp                               | contains the global pointer                                                                                                                               |
| \$29 or \$sp    | sp                               | contains the stack pointer                                                                                                                                |
| \$30 or \$fp    | fp                               | contains the frame pointer (if needed);<br>otherwise a saved register (like s0-s7)                                                                        |
| \$31            | ra                               | contains the return address and used for expression evaluation                                                                                            |

Table 1-1. General Registers

### **1.3 Special Registers**

The machine has two 32 bit special registers. The hi and lo special registers hold the results of the multiplication (mult and multu) and division (div and divu) instructions.

You usually do not need to refer explicitly to these special registers. Instructions that use the special registers refer to them automatically.

| Name | Description                                                                                          |
|------|------------------------------------------------------------------------------------------------------|
| hi   | Multiply/Divide special register holds the most significant 32 bits of multiply, remainder of divide |
| lo   | Multiply/Divide special register holds the least significant 32 bits of multiply, quotient of divide |

Table 1-2. Special Registers

#### **1.4 Floating Point Registers**

There are 32 32-bit (or 16 64-bit) floating point registers on the R2000 processor, numbered \$f0..\$f31. All references to these registers by floating point instructions must be to an *even* register, so most applications should use even-based register pairs as well as double-precision floating point values. Chapter 7 further describes floating point register use.

1-4 Assembly Language Programmer's Guide

( )

| \$f2:       \$f2       \$f3         \$f4:       \$f4       \$f5         \$f6:       \$f6       \$f7         \$f8:       \$f8       \$f9         \$f10:       \$f10       \$f11         \$g12:       \$g12       \$f13         \$f14:       \$f14       \$f15         \$f16:       \$f14       \$f15         \$f16:       \$f16       \$f17         \$f18:       \$f18       \$f17         \$f18:       \$f18       \$f19         \$f20:       \$f20       \$f21         \$f22:       \$f20       \$f21         \$f22:       \$f24       \$f23         \$f24:       \$f24       \$f25         \$f28:       \$f28       \$f29                                                                                  | \$f0: | Single Precision                                                                                         | \$f1  |   |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------|----------------------------------------------------------------------------------------------------------|-------|---|
| \$f4:       \$f4       \$f5         \$f6:       \$f6       \$f7         \$f8:       \$f8       \$f9         \$f10:       \$f10       \$f11         \$g12:       \$g12       \$f13         \$f14:       \$f14       \$f15         \$f16:       \$f16       \$f17         \$f18:       \$f16       \$f17         \$f20:       \$f20       \$f21         \$f20:       \$f20       \$f21         \$f22:       \$f22       \$f23         \$f24:       \$f24       \$f25         \$f26:       \$f26       \$f27         \$f28:       \$f28       \$f29 |       |                                                                                                          |       |   |
| \$f6:       \$f6       \$f7         \$f8:       \$f8       \$f9         \$f10:       \$f10       \$f11         \$g12:       \$g12       \$f13         \$f14:       \$f14       \$f15         \$f18:       \$f16       \$f17         \$f18:       \$f16       \$f17         \$f20:       \$f20       \$f21         \$f22:       \$f22       \$f23         \$f24:       \$f24       \$f25         \$f28:       \$f28       \$f29                                                                                                                                                                                                                                                                               |       |                                                                                                          |       | - |
| \$f8:       \$f8       \$f9         \$f10:       \$f10       \$f11         \$g12:       \$g12       \$f13         \$f14:       \$f14       \$f15         \$f16:       \$f16       \$f17         \$f18:       \$f18       \$f19         \$f20:       \$f20       \$f21         \$f22:       \$f22       \$f23         \$f24:       \$f24       \$f25         \$f28:       \$f28       \$f29                                                                                                                                                                                                                                                                                                                   |       | محصور المراقعة المراجع ومعتملة فلنسب يهيم المستعين ألم المحصور وستقدم الكريا والشرافية المشروع والفاديات |       |   |
| ig12:       \$g12       \$f13         if14:       \$f14       \$f15         if16:       \$f16       \$f17         if18:       \$f18       \$f19         if20:       \$f20       \$f21         if22:       \$f22       \$f23         if24:       \$f24       \$f25         if26:       \$f26       \$f27         if28:       \$f28       \$f29                                                                                                                                                                                                                                                                                                                                                                | \$f8: | \$f8                                                                                                     | \$f9  |   |
| sf14:       \$f14       \$f15         sf16:       \$f16       \$f17         sf18:       \$f18       \$f19         sf20:       \$f20       \$f21         sf22:       \$f22       \$f23         sf24:       \$f24       \$f25         sf26:       \$f28:       \$f28                                                                                                                                                                                                                                                                                                                                                                                                                                           | f10:  | \$f10                                                                                                    | \$f11 |   |
| if16:       \$f16       \$f17         if18:       \$f18       \$f19         if20:       \$f20       \$f21         if22:       \$f22       \$f23         if24:       \$f24       \$f25         if26:       \$f26       \$f27         if28:       \$f28       \$f29                                                                                                                                                                                                                                                                                                                                                                                                                                            | g12:  | \$g12                                                                                                    | \$f13 |   |
| \$f16       \$f17         \$f18:       \$f18         \$f20:       \$f20         \$f22:       \$f22         \$f24:       \$f24         \$f26:       \$f26         \$f28:       \$f28                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          | f14:  | \$f14                                                                                                    | \$f15 |   |
| if20:       \$f20       \$f21         if22:       \$f22       \$f23         if24:       \$f24       \$f25         if26:       \$f26       \$f27         if28:       \$f28       \$f29                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        | f16:  | \$f16 `                                                                                                  | \$f17 |   |
| \$f22:       \$f22       \$f23         \$f24:       \$f24       \$f25         \$f26:       \$f26       \$f27         \$f28:       \$f28       \$f29                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          | f18:  | \$f18                                                                                                    | \$f19 |   |
| \$f24:       \$f24       \$f25         \$f26:       \$f26       \$f27         \$f28:       \$f28       \$f29                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | f20:  | \$f20                                                                                                    | \$f21 |   |
| \$f26:     \$f26     \$f27       \$f28:     \$f28     \$f29                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  | 6f22: | \$f22                                                                                                    | \$f23 |   |
| s <b>f28:</b> \$f28\$\$f29                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   | 6f24: | \$f24                                                                                                    | \$f25 |   |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              | 6f26: | \$f26                                                                                                    | \$f27 |   |
| Sf30: \$f30 \$f31                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            | 6f28: | \$f28                                                                                                    | \$f29 |   |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              | 6f30: | \$f30                                                                                                    | \$f31 |   |

Figure 1-1. Floating Point Register Set



## 2. Addressing

Chapter 2 describes the formats that you can use to specify addresses. The machine uses a byte addressing scheme. Access to halfwords requires alignment on even byte boundaries, and access to words requires alignment on byte boundaries that are divisible by four. Any attempt to address a data item that does not have the proper alignment causes an alignment exception.

The unaligned assembler load and store instructions may generate multiple machine language instructions. They do not raise alignment exceptions. These instructions load and store unaligned data:

- load word left (lwl)
- load word right (lwr)
- store word left (swl)
- store word right (swr)
- unaligned load word (ulw)
- unaligned load halfword (ulh)
- unaligned load halfword unsigned (ulhu)
- unaligned store word (usw)
- unaligned store halfword (ush)
- unaligned store halfword unsigned (ushu)

These instructions load and store aligned data:

- load word (lw)
- load halfword (lh)
- load halfword unsigned (lhu)
- load byte (lb)
- load byte unsigned (lbu)
- store word (sw)
- store halfword (sh)
- store byte (sb)

### 2.1 Address Formats

The assembler accepts these formats for addresses:

| Format                                           | Address                            |
|--------------------------------------------------|------------------------------------|
| (base register)                                  | base address (zero offset assumed) |
| expression                                       | absolute address                   |
| expression (base register)                       | based address                      |
| relocatable-symbol                               | relocatable<br>address             |
| relocatable-symbol ± expression                  | relocatable address                |
| relocatable-symbol ± expression (index register) | indexed relocatable<br>address     |

Table 2-1. Formats for Addresses

### 2.2 Address Descriptions

The assembler accepts any combination of the constants and operations described in Chapter 4 for expressions in address descriptions. Table 2-2 describes expressions and their address descriptions.

| Expression                          | Address Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
|-------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| ( base-register )                   | Specifies an indexed address,<br>which assumes a zero offset. The<br>base-register's contents specify the<br>address.                                                                                                                                                                                                                                                                                                                                                                  |
| expression                          | Specifies an absolute address. The assembler generates the most lo-<br>cally efficient code for referencing<br>a value at the specified address.                                                                                                                                                                                                                                                                                                                                       |
| expression ( base-register )        | Specifies a based address. To get<br>the address, the machine adds the<br>value of the expression to the con-<br>tents of the base-register.                                                                                                                                                                                                                                                                                                                                           |
| relocatable–symbol                  | Specifies a relocatable address.<br>The assembler generates the neces-<br>sary instruction(s) to address the<br>item and generates relocatable in-<br>formation for the link editor.                                                                                                                                                                                                                                                                                                   |
| relocatable-symbol $\pm$ expression | Specifies a relocatable address. To<br>get the address, the assembler adds<br>or subtracts the value of the ex-<br>pression, which has an absolute<br>value, from the relocatable symbol.<br>The assembler generates the neces-<br>sary instruction(s) to address the<br>item and generates relocatable in-<br>formation for the link editor. If<br>the symbol name does not appear<br>as a label anywhere in the assem-<br>bly, the assembler assumes that the<br>symbol is external. |

Table 2-2. Address Descriptions

Addressing 2-3

Expression

#### Address Description

relocatable-symbol ( base-register ) Specifies an indexed relocatable address. To get the address, the machine adds the index-register to the relocatable symbol's address. The assembler generates the necessary instruction(s) to address the item and generates relocatable information for the link editor. If the symbol name does not appear as a label anywhere in the assembly, the assembler assumes that the symbol is external.

#### relocatable-symbol $\pm$ expression ( base-register )

Specifies an indexed relocatable address. To get the address, the assembler adds or subtracts the relocatable symbol, the expression, and the contents of the index-register. The assembler generates the necessary instruction(s) to address the item and generates relocation information for the link editor. If the symbol does not appear as a label anywhere in the assembly, the assembler assumes that the symbol is external.

Table 2-2. Address Descriptions (continued)

2-4 Assembly Language Programmer's Guide

**IRIS-4D Series** 

## 3. Exceptions

**Chapter 3** describes the exceptions that you can encounter while running assembly programs. The machine detects some exceptions directly, and the assembler inserts specific tests that signal other exceptions. This chapter lists only those exceptions that occur most frequently.

### **3.1 Main Processor Exceptions**

For the assembly language programmer, these are the most common main processor exceptions:

- address error exceptions, which occur when the machine references a data item that is not on its proper memory alignment or when an address is invalid for the executing process
- overflow exceptions, which occur when arithmetic operations compute signed values and the destination lacks the precision to store the result
- bus exceptions, which occur when an address is invalid for the executing process
- divide-by-zero exceptions, which occur when a divisor is zero

### **3.2 Floating Point Exceptions**

These are the floating point exceptions (not implemented for first release of the IRIS-4D Series):

- invalid operation exceptions
  - magnitude subtraction of infinities, for example:  $+\infty -\infty$
  - multiplication of 0 by  $\infty$  with any signs
  - division of 0/0 or  $\infty/\infty$  with any signs
  - conversion of a binary floating-point number to an integer format when an overflow or the operand value for the infinity or NaN precludes a faithful representation in the format (see Chapter 6)
  - comparison of predicates that have unordered operands, and that involve Greater Than or Less Than without Unordered.
  - any operation on a signaling NaN
- divide-by-zero exceptions
- overflow exceptions—these occur when a rounded floating point result exceeds the destination format's largest finite number
- underflow exceptions—these occur when a result has lost accuracy and also when a nonzero result is between  $\pm 2^{Emin}$  (plus or minus 2 to the minimum expressable exponent).
- inexact exceptions

## 4. Lexical Conventions

Chapter 4 discusses lexical conventions for these topics:

- tokens
- comments
- identifiers
- constants
- multiple lines per physical line
- sections and location counters
- statements
- expressions

This chapter uses the following notation to describe syntax:

- (vertical bar) means "or"
- [] (square brackets) enclose options
- $\pm$  indicates both addition and subtraction operations

### 4.1 Tokens

The assembler has these tokens:

- identifiers
- constants
- operators

The assembler lets you put blank characters and tab characters anywhere between tokens; however, it does not allow these characters within tokens (except for character constants). A blank or tab must separate adjacent identifiers or constants that are not otherwise separated.

### 4.2 Comments

The pound sign character (#) introduces a comment. Comments that start with a # extend through the end of the line on which they appear. You can also use C-language notation /\*...\*/ to delimit comments.

The assembler uses **cpp** (the C language preprocessor) to preprocess assembler code. Because **cpp** interprets #s in the first column as pragmas (compiler directives), do not start a # comment in the first column.

### 4.3 Identifiers

An identifier consists of a case-sensitive sequence of alphanumeric characters, including these:

• . (period)

• \_ (underscore)

• \$ (dollar sign)

Identifiers can be up to 31 characters long, and the first character cannot be numeric.

If an identifier is not defined to the assembler (only referenced), the assembler assumes that the identifier is an external symbol. The assembler treats the identifier as if a .globl pseudo-operation was encountered (see Chapter 8). If the identifier is defined to the assembler and the identifier has not been specified as global, the assembler assumes that the identifier is a local symbol.

### 4.4 Constants

The assembler has these constants:

- scalar constants
- floating point constants
- string constants

#### 4.4.1 Scalar Constants

The assembler interprets all scalar constants as twos complement numbers. Scalar constants can be any of the digits **0123456789abcdefABCDEF**.

Scalar constants can be one of these constants:

- decimal constants, which consist of a sequence of decimal digits without a leading zero
- hexadecimal constants, which consist of the characters 0x (or 0X) followed by a sequence of digits
- octal constants, which consist of a leading zero followed by a sequence of digits in the range 0..7

#### 4.4.2 Floating Point Constants

Floating point constants can appear only in *.float* and *.double* pseudo-operations (directives)—see Chapter 8. Floating point constants follow this form:

±d1[.d2][e|E±d3]

Where:

- *d1* is written as a decimal integer and denotes the integral part of the floating point value
- *d2* is written as a decimal integer and denotes the fractional part of the floating point value
- d3 is written as a decimal integer and denotes a power of 10
- the "+" symbol is optional

For example:

21.73E-3

represents the number .02173.

#### 4.4.3 String Constants

String constants begin and end with double quotation marks (").

The assembler observes C language backslash conventions. For octal notation, the backslash conventions require three characters when the next character could be confused with the octal number. For hexadecimal notation, the backslash conventions require two characters when the next character could be confused with the hexadecimal number (i.e., use a 0 for the first character of a single character hex number).

The assembler follows the backslash conventions listed in Table 4-1:

4-4 Assembly Language Programmer's Guide

| Convention | Meaning                                 |
|------------|-----------------------------------------|
| \a         | alert (0x07)                            |
| \b         | backspace (0x08)                        |
| \f         | form feed (0x0c)                        |
| \n         | newline (0x0a)                          |
| \r         | carriage return (0x0d)                  |
| \t         | horizontal tab (0x09)                   |
| \v         | vertical feed (0x0b)                    |
| \\         | backslash (0x5c)                        |
| \"         | quotation mark (0x22)                   |
| \'         | single quote (0x27)                     |
| \000       | character whose octal value is 000      |
| \Xnn       | character whose hexadecimal value is nn |

Table 4-1. Backslash Conventions

### 4.5 Multiple Lines Per Physical Line

You can include multiple statements on the same line by separating the statements with semicolons. The assembler does not recognize semicolons as separators when they follow comment symbols (# or /\*).

### 4.6 Sections and Location Counters

Assembled code and data fall in one of six sections as illustrated in Figure 4-1:



Figure 4-1. Location Counters

(For more information on section data, see Chapter 9 of this manual.)

The assembler always generates the text section before other sections. Additions to the text section happen in four-byte units. Each section has

**IRIS-4D Series** 

an implicit location counter, which begins at zero and increments by one for each byte assembled in the section.

The bss section holds zero-initialized data. If a *.lcomm* pseudo-op defines a variable (see **Chapter 8**), the assembler assigns that variable to the **bss** (block started by storage) section or to the **sbss** (short block started by storage) section depending on the variable's size. The default variable size for **sbss** is 512 or fewer bytes.

The command line option -G for each compiler (C, Pascal, Fortran 77, or the assembler), can increase the size of **sbss** to cover all but extremely large data items. The link editor issues an error message when the -G value gets too large. If a -G value is not specified to the compiler, 512 is the default. Items smaller than, or equal to, the specified size go in sbss. Items greater than the specified size go in bss.

Because you can address items much more quickly through \$gp than through a more general method, put as many items as possible in sdata or sbss. The size of sdata and sbss combined must not exceed 64K bytes.

#### 4.7 Statements

Each statement consists of an optional *label*, an operation code, and the operand(s). The machine allows these statements:

- null statements
- keyword statements

#### 4.7.1 Label Definitions

A label definition consists of an identifier followed by a colon. Label definitions assign the current value and type of the location counter to the name. An error results when the name is already defined, the assigned value changes the label definition, or both conditions exists.

Label definitions always end with a colon. You can put a label definition on a line by itself.

A generated label is a single numeric value (1...255). To reference a generated label, put an **f** (forward) or a **b** (backward) immediately after

the digits. The reference tells the assembler to look for the nearest generated label that corresponds to the number in the lexically forward or backward direction.

#### 4.7.2 Null Statements

A null statement is an empty statement that the assembler ignores. Null statements can have label definitions. For example, this line has three null statements in it:

label1: ; ;

#### 4.7.3 Keyword Statements

A keyword statement begins with a predefined keyword. The syntax for the rest of the statement depends on the keyword. All instruction opcodes are keywords. All other keywords are assembler pseudo-operations (directives).

#### 4.8 Expressions

An expression is a sequence of symbols that represent a value. Each expression and its result have data types. The assembler does arithmetic in twos complement integers with 32 bits of precision. Expressions follow precedence rules and consist of:

- operators
- identifiers
- constants

Also, you may use a single character string in place of an integer within an expression. Thus:

.byte "a" ; .word "a"+0x19

is equivalent to

.byte 0x61 ; .word 0x7a

4-8 Assembly Language Programmer's Guide

**IRIS-4D Series** 

#### 4.8.1 Precedence

Unless parentheses enforce precedence, the assembler evaluates all operators of the same precedence strictly from left to right. Because parentheses also designate index-registers, ambiguity can arise from parentheses in expressions. To resolve this ambiguity, put a unary + in front of parentheses in expressions.

The assembler has three precedence levels, which are listed in Table 4-2 from lowest to highest precedence:

| least binding,      |        |                        |
|---------------------|--------|------------------------|
| lowest precedence:  | binary | .+                     |
| •                   |        |                        |
|                     | binary | *, /, %, <<, >>, ^, &, |
| most binding        |        |                        |
| highest precedence: | unary  | -, +, ~                |

| Table 4-2. Precedence Level | Table | 4–2. | Precedence | Levels |
|-----------------------------|-------|------|------------|--------|
|-----------------------------|-------|------|------------|--------|

**NOTE:** The assembler's precedence scheme differs from that of the C language.

#### 4.8.2 Expression Operators

For expressions, you can rely on the precedence rules, or you can group expressions with parentheses. The assembler has these operators:

| Operator | Meaning                         |
|----------|---------------------------------|
| +        | addition                        |
| _        | subtraction                     |
| *        | multiplication                  |
| /        | division                        |
| %        | remainder                       |
| <<       | shift left                      |
| >>       | shift right (sign NOT extended) |
| ^        | bitwise EXCLUSIVE OR            |
| &        | bitwise AND                     |
| I        | bitwise OR                      |
| -        | minus (unary)                   |
| +        | identity (unary)                |
| ~        | complement                      |

Table 4-3. Operators

#### 4.8.3 Data Types

The assembler manipulates several types of expressions. Each symbol you reference or define belongs to one of the categories listed in Table 4-4:

| Туре       | Description                                                                                                                                                                                                                                                                                                                                                                                                     |  |
|------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|
|            |                                                                                                                                                                                                                                                                                                                                                                                                                 |  |
| undefined  | Any symbol that is referenced but not defined be-<br>comes global undefined, and this module will at-<br>tempt to import it. The assembler uses 32-bit ad-<br>dressing to access these symbols. (Declaring such<br>a symbol in a .globl pseudo-op merely makes its<br>status clearer).                                                                                                                          |  |
| sundefined | A symbol defined by a .extern pseudo-op be-<br>comes global small undefined if its size is greater<br>than zero but less than the number of bytes speci-<br>fied by the -G option on the command line (which<br>defaults to 512). The linker places these symbols<br>within a 64k byte region pointed to by the \$gp<br>register, so that the assembler can use economical<br>16-bit addressing to access them. |  |
| absolute   | A constant defined in an "=" expression.                                                                                                                                                                                                                                                                                                                                                                        |  |
| text       | The text section contains the program's instruc-<br>tions, which are not modifiable during execution.<br>Any symbol defined while the .text pseudo-op is<br>in effect belongs to the text section.                                                                                                                                                                                                              |  |
| data       | The <b>data</b> section contains memory which the<br>linker can initialize to nonzero values before your<br>program begins to execute. Any symbol defined<br>while the .data pseudo-op is in effect belongs to<br>the data section. The assembler uses 32-bit ad-<br>dressing to access these symbols.                                                                                                          |  |
| sdata      | This category is similar to data, except that defin-<br>ing a symbol while the .sdata ("small data")<br>pseudo-op is in effect causes the linker to place it<br>within a 64k byte region pointed to by the \$gp<br>register, so that the assembler can use economical<br>16-bit addressing to access it.                                                                                                        |  |
|            | Table 4-4. Data Types                                                                                                                                                                                                                                                                                                                                                                                           |  |
|            |                                                                                                                                                                                                                                                                                                                                                                                                                 |  |

| Туре                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                   | Description<br>Any symbol defined while the .rdata pseudo-op is<br>in effect belongs to this category, which is similar<br>to data, but may not be modified during execu-<br>tion.                                                                                                             |  |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|
| rdata                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |                                                                                                                                                                                                                                                                                                |  |
| tion.<br>bss and sbss<br>The bss and sbss sections consist of memor<br>which the kernel loader initializes to zero be<br>your program begins to execute. Any symbolistic<br>fined in a .comm or .lcomm pseudo-op belistic<br>these sections (except that a .data, .sdata,<br>.rdata pseudo-op can override a .comm dia<br>tive). If its size is less than the number of<br>specified by the -G option on the command<br>(which defaults to 512), it belongs to sbss (bss"), and the linker places it within a 64k<br>region pointed to by the \$gp register so that<br>assembler can use economical 16-bit address<br>access it. Otherwise, it belongs to bss and<br>assembler uses 32-bit addressing. |                                                                                                                                                                                                                                                                                                |  |
|                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        | Local symbols in bss or sbss defined by .lcomm<br>are allocated memory by the assembler; global<br>symbols are allocated memory by the link editor;<br>and symbols defined by .comm are overlaid upon<br>like-named symbols (in the fashion of Fortran<br>"COMMON" blocks) by the link editor. |  |

#### Table 4-4. Data Types (continued)

Symbols in the undefined and small undefined categories are always global (that is, they are visible to the link editor and can be shared with other modules of your program). Symbols in the absolute, text, data, sdata, rdata, bss, and sbss categories are local unless declared in a .globl pseudo-op.

### 4.8.4 Type Propagation in Expressions

When expression operators combine expression operands, the result's type depends on the types of the operands and on the operator. Expressions follow these type propagation rules:

- If an operand is undefined, the result is undefined.
- If both operands are absolute, the result is absolute.
- If the operator is + and the first operand refers to a relocatable textsection, data-section, bss-section, or an undefined external, the result has the postulated type and the other operand must be absolute.
- If the operator is and the first operand refers to a relocatable text-section, data-section, or bss-section symbol, the second operand can be absolute and the result has the first operand's type; or the second operand can have the same type as the first operand and the result is absolute. If the first operand is external undefined, the second operand must be absolute.
- The operators \* , /, % , << , >> , ~, ^ , & , and | apply only to absolute symbols.



## 5. Instruction Set

Chapter 5 describes instruction notation and discusses assembler instructions for the main processor. Chapter 6 describes coprocessor notation and instructions.

The assembler has the classes of instructions for the main processor listed in Table 5-1:

| Instruction                  | Description                                                                                  |
|------------------------------|----------------------------------------------------------------------------------------------|
|                              |                                                                                              |
| Load and Store Instructions  | These instructions load immediate values and move data between memory and general registers. |
| Computational Instructions   | These instructions do arithmetic<br>and logical operations for values in<br>registers.       |
| Jump and Branch Instructions | These instructions change program control flow.                                              |
| Coprocessor Interface        | These instructions provide standard interfaces to the coprocessors.                          |
| Special Instructions         | These instructions do miscellaneous tasks.                                                   |

Table 5-1. Instruction Classes

### **5.1 Instruction Notation**

The tables in **Chapter 5** list the assembler format for each load, store, computational, jump, branch, coprocessor, and special instruction. The format consists of an op-code and a list of operand formats. The tables list groups of closely related instructions; for those instructions, you can use any op-code with any specified operand. Operands can take any of these formats:

- memory references—for example a relocatable symbol +/- an expression(register)
- expressions (for immediate values)
- two or three operands—for example, add \$3,\$4 is the same as add \$3,\$3,\$4

### 5.2 Load and Store Instructions

The machine has general-purpose load and store instructions.

### 5.2.1 Load and Store Formats

Table 5-2 lists operands and their descriptions. Table 5-3 shows the formats of available load and store instructions.

| Operand     | Description                |            |
|-------------|----------------------------|------------|
| destination | the destination register   |            |
| address     | a symbolic expression (see | Chapter 2) |
| source      | the source register        |            |
| expression  | an absolute value          |            |

Table 5-2. Load and Store Operands

| Description                      | Op-code | Operands                |
|----------------------------------|---------|-------------------------|
| Load Address                     | la      | destination, address    |
| Load Byte                        | lb      |                         |
| Load Byte Unsigned               | lbu     |                         |
| Load Halfword                    | lh      |                         |
| Load Halfword Unsigned           | lhu     |                         |
| Load Word                        | lw      |                         |
| Load Word Left                   | lwl     |                         |
| Load Word Right                  | lwr     |                         |
| Load Double                      | ld      |                         |
| Unaligned Load Halfword          | ulh     |                         |
| Unaligned Load Halfword Unsigned | i ulhu  |                         |
| Unaligned Load Word              | ulw     |                         |
| Load Immediate                   | li      | destination, expression |
| Load Upper Immediate             | lui     | -                       |
| Store Byte                       | sb      | source, address         |
| Store Double                     | sd      |                         |
| Store Halfword                   | sh      |                         |
| Store Word Left                  | swl     |                         |
| Store Word Right                 | swr     |                         |
| Store Word                       | sw      |                         |
| Unaligned Store Halfword         | ush     |                         |
| Unaligned Store Word             | usw     |                         |

Table 5-3. Load and Store Instruction Formats

### 5.2.2 Load Instruction Descriptions

For all machine load instructions, the effective address is the 32-bit twos-complement sum of the contents of the index-register and the (signextended) 16-bit offset. Instructions that have symbolic labels imply an index-register, which the assembler determines. The assembler supports additional load instructions, which can produce multiple machine instructions. NOTE: Load instructions can generate many code sequences for which the link editor must fix the address by resolving external data items.

| Instruction Name         | Description                                                                                                                                                                                                                                                                                                                                  |  |
|--------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|
| Load Address (la)        | Loads the destination register with<br>the effective address of the speci-<br>fied data item.                                                                                                                                                                                                                                                |  |
| Load Byte (lb)           | Loads the least significant byte of<br>the destination register with the<br>contents of the byte that is at the<br>memory location specified by the<br>effective address. The machine<br>treats the loaded byte as a signed<br>value: bit seven is extended to fill<br>the three most significant bytes.                                     |  |
| Load Byte Unsigned (lbu) | Loads the least significant byte of<br>the destination register with the<br>contents of the byte that is at the<br>memory location specified by the<br>effective address. Because the ma-<br>chine treats the loaded byte as an<br>unsigned value, it fills the three<br>most significant bytes of the desti-<br>nation register with zeros. |  |

#### Table 5-4. Load Instruction Descriptors

5-4 Assembly Language Programmer's Guide

#### Description

Load Double (ld)

Load Halfword (lh)

Load Halfword Unsigned (lhu)

Loads the register pair (destination and destination + 1) with the two successive words specified by the address. The destination register must be the even register of the pair. When the address is not on a word boundary, the machine signals an address error exception. **NOTE:** For compatibility with future machines, we recommend the use of double word alignment for all double word operands.

Loads the two least significant bytes of the destination register with the contents of the halfword that is at the memory location specified by the effective address. The machine treats the loaded halfword as a signed value. If the effective address is not even, the machine signals an address error exception.

Loads the least significant bits of the destination register with the contents of the halfword that is at the memory location specified by the effective address. Because the machine treats the loaded halfword as an unsigned value, it fills the two most significant bytes of the destination register with zeros. If the effective address is not even, the machine signals an address error exception.

Table 5-4. Load Instruction Descriptors (continued)

Instruction Name

#### Description

Load Immediate (li)

Loads the destination register with the value of an expression that can be computed at assembly time.

**NOTE:** Load Immediate can generate any efficient code sequence to put a desired value in the register.

Load Upper Immediate (lui)

Load Word (lw)

Loads the most significant half of a register with the expression's value, The machine fills the least significant half of the register with zeros. The expression's value must be in the range -32768...65535.

Loads the destination register with the contents of the word that is at the memory location. The machine replaces all bytes of the register with the contents of the loaded word.

The machine signals an address error exception when the effective address is not divisible by four.

Table 5-4. Load Instruction Descriptors (continued)

5-6 Assembly Language Programmer's Guide

| Instruction Name      | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |
|-----------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Load Word Left (lwl)  | Loads the sign—that is, Load<br>Word Left loads the destination<br>register with the most significant<br>bytes of the word specified by the<br>effective address. The effective<br>address must specify the byte con-<br>taining the sign. In a big-endian<br>machine, the effective address<br>specifies the lowest numbered byte,<br>and in a little-endian machine the<br>effective address specifies the high-<br>est numbered byte.                                               |
|                       | Only the bytes which share the<br>same aligned word in memory are<br>merged into the destination regis-<br>ter.                                                                                                                                                                                                                                                                                                                                                                        |
| Load Word Right (lwr) | Loads the lowest precision bytes—<br>that is, Load Word Right loads<br>the destination register with the<br>least significant bytes of the word<br>specified by the effective address.<br>The effective address must specify<br>the byte containing the least signifi-<br>cant bits. In a big-endian ma-<br>chine, the effective address speci-<br>fies the highest numbered byte,<br>and in a little-endian machine the<br>effective address specifies the low-<br>est numbered byte. |
|                       | Only the bytes which share the<br>same aligned word in memory are<br>merged into the destination regis-<br>ter.                                                                                                                                                                                                                                                                                                                                                                        |

Table 5-4. Load Instruction Descriptors (continued)

Instruction Name

Unaligned Load Halfword (ulh)

#### Description

Loads a halfword into the destination register from the specified address and extends the sign of the halfword. **Unaligned Load Halfword** loads a halfword regardless of the halfword's alignment in memory.

Loads a halfword into the destination register from the specified address and zero extends the halfword. Unaligned Load Halfword Unsigned loads a halfword regardless of the halfword's alignment in memory.

Loads a word into the destination register from the specified address. Unaligned Load Word loads a word regardless of the word's alignment in memory.

Unaligned Load Word (ulw)

Unaligned Load Halfword

Unsigned (ulhu)

 Table 5-4.
 Load Instruction Descriptors (continued)

### 5.2.3 Store Instruction Descriptions

For all machine store instructions, the effective address is the 32-bit twoscomplement sum of the contents of the index-register and the (sign-extended) 16-bit offset. The assembler supports additional store instructions, which can produce multiple machine instructions. Instructions that have symbolic labels imply an index-register, which the assembler determines.

5-8 Assembly Language Programmer's Guide

| Instruction Name    | Description                                                                                                                                                                                                                                                                                                                           |
|---------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Store Byte (sb)     | Stores the contents of the source<br>register's least significant byte in<br>the byte specified by the effective<br>address.                                                                                                                                                                                                          |
| Store Halfword (sh) | Stores the two least significant<br>bytes of the source register in the<br>halfword that is at the memory lo-<br>cation specified by the effective ad-<br>dress. The effective address must<br>be divisible by two, otherwise the<br>machine signals an address error<br>exception.                                                   |
| Store Word (sw)     | Stores the contents of a word from<br>the source register in the memory<br>location specified by the effective<br>address. The effective address<br>must be divisible by four, otherwise<br>the machine signals an address er-<br>ror exception.                                                                                      |
| Store Double (sd)   | Stores the contents of the register<br>pair in successive words, which the<br>address specifies. The source reg-<br>ister must be the even register of<br>the pair, and the storage address<br>must be word aligned. <b>NOTE:</b><br>For compatibility with future ma-<br>chines, we recommend that you<br>use double word alignment. |

### Table 5-5. Store Instruction Descriptors

Store Word Left (swl)

#### Description

Stores the most significant bytes of a word in the memory location specified by the effective address. The contents of the word at the memory location, specified by the effective address, are shifted right so that the leftmost byte of the unaligned word is in the addressed byte position. The stored bytes replace the corresponding bytes of the effective address. The effective address's last two bits determine how many bytes are involved.

Stores the least significant bytes of a word in the memory location specified by the effective address. The contents of the word at the memory location, specified by the effective address, are shifted left so that the right byte of the unaligned word is in the addressed byte position. The stored bytes replace the corresponding bytes of the effective address. The effective address's last two bits determine how many bytes are involved.

Stores the contents of the two least significant bytes of the source register in a halfword that the address specifies. The machine does not require alignment for the storage address.

Stores the contents of the source register in a word specified by the address. The machine does not require alignment for the storage address.

Table 5-5. Store Instruction Descriptors (continued)

5-10 Assembly Language Programmer's Guide

**IRIS-4D Series** 

Store Word Right (swr)

Unaligned Store Halfword (ush)

Unaligned Store Word (usw)

-

## **5.3 Computational Instructions**

The machine has general-purpose and coprocessor-specific computational instructions (for example, the floating point coprocessor). This part of the book describes general-purpose computational instructions.

### 5.3.1 Computational Formats

Table 5-6 shows computational operands and their descriptions. Table 5-7 shows the formats of computational instructions.

| Operand          | Description                                        |
|------------------|----------------------------------------------------|
| destination/src1 | the destination register is also source register 1 |
| destination      | the destination register                           |
| immediate        | the immediate value                                |
| src1,src2        | the source registers                               |

Table 5-6. Computational Operands

| Description                     | Op-code | Operand                               |
|---------------------------------|---------|---------------------------------------|
| Add (with overflow)             | add     | destination, src1, src2               |
| Add (without overflow)          | addu    | destination/src1,src2                 |
| AND                             | and     | destination, src1,                    |
| Divide (signed)                 | div     | immediate                             |
| Divide (unsigned)               | divu    | destination/src1,                     |
| EXCLUSIVE OR                    | xor     | immediate                             |
| Multiply                        | mul     |                                       |
| Multiply with Overflow          | mulo    |                                       |
| Multiply with Overflow Unsigned | mulou   |                                       |
| NOT OR                          | nor     |                                       |
| OR                              | or      |                                       |
| Set Equal                       | seq     |                                       |
| Set Greater                     | sgt     |                                       |
| Set Greater/Equal               | sge     |                                       |
| Set Greater/Equal Unsigned      | sgeu    |                                       |
| Set Greater Unsigned            | sgtu    |                                       |
| Set Less                        | slt     |                                       |
| Set Less/Equal                  | sle     |                                       |
| Set Less/Equal Unsigned         | sleu    |                                       |
| Set Less Unsigned               | sltu    |                                       |
| Set Not Equal                   | sne     |                                       |
| Subtract (with overflow)        | sub     |                                       |
| Subtract (without overflow)     | subu    | · · · · · · · · · · · · · · · · · · · |

Table 5-7. Computational Instruction Formats

5-12 Assembly Language Programmer's Guide

IRIS-4D Series

(

| Description               | Op-code | Operand                 |
|---------------------------|---------|-------------------------|
| Remainder (signed)        | rem     | destination, src1, src2 |
| Remainder (unsigned)      | remu    | destination/src1,src2   |
| Rotate Left               | rol     | destination, src1,      |
| Rotate Right              | ror     | immediate               |
| Shift Right Arithmetic    | sra     | destination/src1,       |
| Shift Left Logical        | sll     | immediate               |
| Shift Right Logical       | srl     |                         |
| Absolute Value            | abs     | destination, src1       |
| Negate (with overflow)    | neg     | destination/src1        |
| Negate (without overflow) | negu    |                         |
| NOT                       | not     |                         |
| Move                      | move    | destination, src1       |
| Multiply                  | mult    | src1,src2               |
| Multiply (unsigned)       | multu   |                         |

 Table 5-7.
 Computational Instruction Formats (continued)

### 5.3.2 Computational Instruction Descriptions

Table 5–8 shows the descriptions of computational instructions.

#### Description

Absolute Value (abs)

Add (with overflow) (add)

Add (without overflow) (addu)

AND (and)

Computes the absolute value of the contents of src1 and puts the result in the destination register. If the value in src1 is -2147483648, the machine signals an overflow exception.

Computes the twos complement sum of two signed values. This instruction adds the contents of src1 to the contents of src2, or it can add the contents of src1 to the immediate value. Add (with overflow) puts the result in the destination register. When the result cannot be extended as a 32-bit number, the machine signals an overflow exception.

Computes the twos complement sum of two 32-bit values. This instruction adds the contents of src1 to the contents of src2, or it can add the contents of src1 to the immediate value. Add (without overflow) puts the result in the destination register. Overflow exceptions never occur.

Computes the Logical AND of two values. This instruction ANDs (bit-wise) the contents of src1 with the contents of src2, or it can AND the contents of src1 with the immediate value. The immediate value is not sign extended. AND puts the result in the destination register.

Table 5-8. Computational Instruction Descriptions

5-14 Assembly Language Programmer's Guide

| Instruction Name      | Description                                                                                                                                                                                                                                                                                                                                          |
|-----------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Divide (signed) (div) | Computes the quotient of two val-<br>ues. <b>Divide (with overflow)</b> treats<br>src1 as the dividend. The divisor<br>can be src2 or the immediate<br>value. The instruction divides the<br>contents of src1 by the contents of<br>src2, or it can divide src1 by the<br>immediate value. It puts the quo-<br>tient in the destination register. If |
|                       | the divisor is zero, the machine                                                                                                                                                                                                                                                                                                                     |
|                       | signals an error. The <b>div</b> instruc-<br>tion rounds toward zero. Overflow                                                                                                                                                                                                                                                                       |

**NOTE:** The special case

is signaled when dividing -2147483648 by -1.

div \$0,src1,src2

generates the real machine divide instruction and leaves the result in the hi/lo register. The hi register contains the remainder and the lo register contains the quotient. No checking for divide by zero is performed.

Computes the quotient of two unsigned 32-bit values. **Divide** (without overflow) treats src1 as the dividend. The divisor can be src2 or the immediate value. This instruction divides the contents of src1 by the contents of src2, or it can divide the contents of src1 by the immediate value. **Divide** (without overflow) puts the quotient in the destination register. If the divisor is zero, the machine signals an exception.

 Table 5-8.
 Computational Instruction Descriptions (continued)

Divide (unsigned) (divu)

| Instruction Name   | Description                                                                                                                                                                                                                                                                                                                   |
|--------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|                    | See the note for div concerning \$0 as a destination.                                                                                                                                                                                                                                                                         |
|                    | Overflow exceptions never occur.                                                                                                                                                                                                                                                                                              |
| EXCLUSIVE OR (xor) | Computes the XOR of two values.<br>This instruction XORs (bit-wise)<br>the contents of src1 with the con-<br>tents of src2, or it can XOR the<br>contents of src1 with the immedi-<br>ate value. The immediate value is<br>not sign extended. <b>EXCLUSIVE</b><br><b>OR</b> puts the result in the destina-<br>tion register. |
| Move (move)        | Moves the contents of src1 to the destination register.                                                                                                                                                                                                                                                                       |
| Multiply (mul)     | Computes the product of two val-<br>ues. This instruction puts the<br>32-bit product of src1 and src2, or<br>the 32-bit product of src1 and the<br>immediate value, in the destination<br>register. The machine does not<br>report overflow.                                                                                  |
|                    | <b>NOTE:</b> Use <b>mul</b> when you do noneed overflow protection: it's often faster than <b>mulo</b> and <b>mulou</b> . For multiplication by a constant, the <b>mul</b> instruction produces faster matchine instruction sequences than <b>mult</b> or <b>multu</b> instructions can produce.                              |
| Multiply (mult)    | Computes the 64-bit product of<br>two 32-bit signed values. This in-<br>struction multiplies the contents of<br>src1 by the contents of src2 and<br>puts the result in the hi and lo<br>registers (see Chapter 1). No<br>overflow is possible.                                                                                |

 Table 5-8.
 Computational Instruction Descriptions (continued)

5-16 Assembly Language Programmer's Guide

| Instruction Name                           | Description                                                                                                                                                                                                                                                                                                                                                                                           |  |
|--------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|
|                                            | <b>NOTE:</b> The <b>mult</b> instruction is a real machine language instruction.                                                                                                                                                                                                                                                                                                                      |  |
| Multiply Unsigned (multu)                  | Computes the product of two un-<br>signed 32-bit values. It multiplies<br>the contents of src1 and the con-<br>tents of src2 and puts the result in<br>the hi and lo registers (see Chap-<br>ter 1). No overflow is possible.                                                                                                                                                                         |  |
|                                            | <b>NOTE:</b> The <b>multu</b> instruction is a real machine language instruction.                                                                                                                                                                                                                                                                                                                     |  |
| Multiply with Overflow (mulo)              | Computes the product of two<br>32-bit signed values. Multiply<br>with Overflow puts the 32-bit<br>product of src1 and src2, or the<br>32-bit product of src1 and the im-<br>mediate value, in the destination<br>register. When a overflow occurs,<br>the machine signals an overflow<br>exception.                                                                                                   |  |
|                                            | <b>NOTE:</b> For multiplication by a constant, <b>mulo</b> produces faster machine instruction sequences than <b>mult</b> or <b>multu</b> can produce; however, if you do not need overflow detection, use the <b>mul</b> instruction. It's often faster than <b>mulo</b> .                                                                                                                           |  |
| Multiply with Overflow Unsigned<br>(mulou) | Computes the product of two<br>32-bit unsigned values. Multiply<br>with Overflow Unsigned puts the<br>32-bit product of src1 and src2, on<br>the product of src1 and the imme-<br>diate value, in the destination reg-<br>ister. This instruction treats the<br>multiplier and multiplicand as<br>32-bit unsigned values. When an<br>overflow occurs, the machine sig-<br>nals an overflow exception. |  |

| Instruction Name                    | Description                                                                                                                                                                                                                                                                   |  |
|-------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|
|                                     | <b>NOTE:</b> For multiplication by a constant, <b>mulou</b> produces faster machine instruction sequences than <b>mult</b> or <b>multu</b> can produce; however, if you do not need overflow detection, use the <b>mul</b> instruction. It's often faster than <b>mulou</b> . |  |
| Negate (with overflow) (neg)        | Computes the negative of a value. This instruction negates the contents of src1 and puts the result in the destination register. If the value in src1 is $-2147483648$ , the machine signals an overflow exception.                                                           |  |
| Negate (without overflow)<br>(negu) | Negates the integer contents of src1 and puts the result in the des-<br>tination register. The machine does not report overflows.                                                                                                                                             |  |
| NOT (not)                           | Computes the Logical NOT of a value. This instruction comple-<br>ments (bit-wise) the contents of src1 and puts the result in the des tination register.                                                                                                                      |  |
| NOT OR (nor)                        | Computes the NOT OR of two val-<br>ues. This instruction combines the<br>the contents of src1 with the con-<br>tents of src2 (or the immediate<br>value). NOT OR complements the<br>result and puts it in the destination<br>register.                                        |  |

OR (or)

Remainder (signed) (rem)

Computes the Logical OR of two values. This instruction ORs (bitwise) the contents of src1 with the contents of src2, or it can OR the contents of src1 with the immediate value. The immediate value is not sign extended. OR puts the result in the destination register.

Computes the remainder of the division of two unsigned 32-bit values. The machine defines the remainder rem(i,j) as i-(j\*div(i,j))where  $i \neq 0$ . Remainder (with overflow) treats src1 as the dividend. The divisor can be src2 or the immediate value. This instruction divides the contents of src1 by the contents of src2, or it can divide the contents of src1 by the immediate value. It puts the remainder in the destination register. The rem instruction rounds toward zero, rather than toward negative infinity. For example, div(5,-3)=-1, and rem(5,-3)=2. If the divisor is zero, the machine signals an error.

| Instruction Name            | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
|-----------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Remainder (unsigned) (remu) | Computes the remainder of the division of two unsigned 32-bit values. The machine defines the remainder rem(i,j) as $i-(j*div(i,j))$ where $j \neq 0$ . Remainder Unsigned treats src1 as the dividend. The divisor can be src2 or the immediate value. This instruction divides the contents of src1 by the contents of src2, or it can divide the contents of src1 by the immediate value. Remainder Unsigned puts the remainder in the destination register. If the divisor is zero, the machine signals an error. |
| Rotate Left (rol)           | Rotates the contents of a register<br>left (toward the sign bit). This in-<br>struction inserts in the least signifi-<br>cant bit any bits that were shifted<br>out of the sign bit. The contents<br>of src1 specify the value to shift,<br>and the contents of src2 (or the<br>immediate value) specify the<br>amount to shift. <b>Rotate Left</b> puts<br>the result in the destination regis-<br>ter. If src2 (or the immediate                                                                                    |

Table 5-8. Computational Instruction Descriptions (continued)

value) is greater than 31, src1 shifts by (src2 MOD 32).

#### Description

Rotate Right (ror)

Rotates the contents of a register right (toward the least significant bit). This instruction inserts in the sign bit any bits that were shifted out of the least significant bit. The contents of src1 specify the value to shift, and the the contents of src2 (or the immediate value) specify the amount to shift. **Rotate Right** puts the result in the destination register. If src2 (or the immediate value) is greater than 32, src1 shifts by src2 MOD 32.

Compares two 32-bit values. If the contents of src1 equal the contents of src2 (or src1 equals the immediate value) this instruction sets the destination register to one; otherwise, it sets the destination register to zero.

Compares two signed 32-bit values. If the contents of src1 are greater than the contents of src2 (or src1 is greater than the immediate value), this instruction sets the destination register to one; otherwise, it sets the destination register to zero.

Compares two signed 32-bit values. If the contents of src1 are greater than or equal to the contents of src2 (or src1 is greater than or equal to the immediate value), this instruction sets the destination register to one; otherwise, it sets the destination register to zero.

Table 5-8. Computational Instruction Descriptions (continued)

Instruction Set 5-21

Set Greater (sgt)

Set Equal (seq)

Set Greater/Equal (sge)

#### Description

Set Greater/Equal Unsigned (sgeu)

Set Greater Unsigned (sgtu)

Set Less (slt)

Set Less/Equal (sle)

Compares two unsigned 32-bit values. If the contents of src1 are greater than or equal to the contents of src2 (or src1 is greater than or equal to the immediate value), this instruction sets the destination register to one; otherwise, it sets the destination register to zero.

Compares two unsigned 32-bit values. If the contents of src1 are greater than the contents of src2 (or src1 is greater than the immediate value), this instruction sets the destination register to one; otherwise, it sets the destination register to zero.

Compares two signed 32-bit values. If the contents of src1 are less than the contents of src2 (or src1 is less than the immediate value), this instruction sets the destination register to one; otherwise, it sets the destination register to zero.

Compares two signed 32-bit values. If the contents of src1 are less than or equal to the contents of src2 (or src1 is less than or equal to the immediate value), this instruction sets the destination register to one; otherwise, it sets the destination register to zero.

Table 5-8. Computational Instruction Descriptions (continued)

| Instruction Name               | Description                                                                                                                                                                                                                                                                                                                                                           |
|--------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Set Less/Equal Unsigned (sleu) | Compares two unsigned 32-bit values. If the contents of src1 are less than or equal to the contents of src2 (or src1 is less than or equal to the immediate value) this instruction sets the destination register to one; otherwise, it sets the destination register to zero.                                                                                        |
| Set Less Unsigned (sltu)       | Compares two unsigned 32-bit values. If the contents of src1 are less than the contents of src2 (or src1 is less than the immediate value), this instruction sets the destination register to one; otherwise, it sets the destination register to zero.                                                                                                               |
| Set Not Equal (sne)            | Compares two 32-bit values. If<br>the contents of scr1 do not equal<br>the contents of src2 (or src1 does<br>not equal the immediate value),<br>this instruction sets the destination<br>register to one; otherwise, it sets<br>the destination register to zero.                                                                                                     |
| Shift Left Logical (sll)       | Shifts the contents of a register left<br>(toward the sign bit) and inserts<br>zeros at the least significant bit.<br>The contents of src1 specify the<br>value to shift, and the contents of<br>src2 or the immediate value specify<br>the amount to shift. If src2 (or<br>the immediate value) is greater<br>than 31 or less than 0, src1 shifts<br>by src2 MOD 32. |

| Instruct | ion N | lame |
|----------|-------|------|
|----------|-------|------|

#### Description

#### Shift Right Arithmetic (sra)

Shift Right Logical (srl)

#### Subtract (with overflow) (sub)

Shifts the contents of a register right (toward the least significant bit) and inserts the sign bit at the most significant bit. The contents of src1 specify the value to shift, and the contents of src2 (or the immediate value) specify the amount to shift. If src2 (or the immediate value) is greater than 31 or less than 0, src1 shifts by the result of src2 MOD 32.

Shifts the contents of a register right (toward the least significant bit) and inserts zeros at the most significant bit. The contents of src1 specify the value to shift, and the contents of src2 (or the immediate value) specify the amount to shift. If src2 (or the immediate value) is greater than 31 or less than 0, src1 shifts by the result of src2 MOD 32.

Computes the twos complement difference for two signed values. This instruction subtracts the contents of src2 from the contents of src1, or it can subtract the contents of the immediate from the src1 value. **Subtract** puts the result in the destination register. When the true result's sign differs from the destination register's sign, the machine signals an overflow exception.

| Instruction Name            | Description                                                                                                                                                                                                                                                                                                                                            |
|-----------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Subtract (without overflow) |                                                                                                                                                                                                                                                                                                                                                        |
| (subu)                      | Computes the twos complement<br>difference for two 32-bit values.<br>This instruction subtracts the con-<br>tents of src2 from the contents of<br>src1, or it can subtract the con-<br>tents of the immediate from the<br>src1 value. <b>Subtract Unsigned</b><br>puts the result in the destination<br>register. Overflow exceptions<br>never happen. |

Table 5-8. Computational Instruction Descriptions (continued)

### 5.4 Jump and Branch Instructions

The jump and branch instructions let you change an assembly program's control flow.

### 5.4.1 Jump and Branch Formats

Table 5-9 shows jump and branch operands and their descriptions. Table 5-10 shows the formats of jump and branch instructions.

| Operand   | Description                          |
|-----------|--------------------------------------|
| address   | an expression                        |
| src1,src2 | the source registers                 |
| label     | a symbol label                       |
| immediate | an expression with an absolute value |

Table 5-9. Jump and Branch Operands

| Description                                                                                                                                                                                                                                           | Op-code                                                         | Operand                                  |
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------|------------------------------------------|
| Jump<br>Jump and Link                                                                                                                                                                                                                                 | j<br>jal                                                        | address<br>src1                          |
| Branch on Equal<br>Branch on Greater<br>Branch on Greater/Equal<br>Branch on Greater/Equal Unsigned<br>Branch on Greater Unsigned<br>Branch on Less<br>Branch on Less/Equal<br>Branch on Less/Equal<br>Branch on Less Unsigned<br>Branch on Not Equal | beq<br>bgt<br>bgeu<br>bgtu<br>blt<br>ble<br>bleu<br>bltu<br>bne | src1,src2,label<br>src1, immediate,label |
| Branch on Equal to Zero                                                                                                                                                                                                                               | beqz                                                            | src1,label                               |
| Branch on Greater/Equal Zero<br>Branch on Greater Than Zero<br>Branch on Less/Equal Zero<br>Branch on Less Than Zero                                                                                                                                  | bgez<br>bgtz<br>blez<br>bltz<br>bnez                            |                                          |
| Branch on Not Equal to Zero<br>Branch                                                                                                                                                                                                                 |                                                                 |                                          |
| Branch and Link                                                                                                                                                                                                                                       | b<br>bal                                                        | label                                    |
| Branch on Less Than Zero and Link                                                                                                                                                                                                                     | bltzal                                                          |                                          |
| Branch on Greater or Equal to Zero<br>and Link                                                                                                                                                                                                        | bgezal                                                          |                                          |

Table 5-10. Jump and Branch Instruction Formats

### 5.4.2 Jump and Branch Instruction Descriptions

In the following branch instructions, branch destinations must be defined in the source being assembled. Table 5-11 shows descriptions of jump and branch instructions.

| Instruction Name                           | Description                                                                                                                                                                                                                                                                                           |
|--------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Branch (b)                                 | Branches unconditionally to the specified label.                                                                                                                                                                                                                                                      |
| Branch and Link (bal)                      | Branches unconditionally to the specified label and puts the return address in general register \$31.                                                                                                                                                                                                 |
| Branch on Equal (beq)                      | Branches to the specified label<br>when the contents of src1 equal<br>the contents of src2, or it can<br>branch when the contents of src1<br>equal the immediate value.                                                                                                                               |
| Branch on Equal to Zero (beqz)             | Branches to the specified label when the contents of src1 equal zero.                                                                                                                                                                                                                                 |
| Branch on Greater (bgt)                    | Branches to the specified label<br>when the contents of src1 are<br>greater than the contents of src2,<br>or it can branch when the contents<br>of src1 are greater than the imme-<br>diate value. The comparison treats<br>the comparands as signed 32-bit<br>values.                                |
| Branch on Greater/Equal Unsigned<br>(bgeu) | Branches to the specified label<br>when the contents of src1 are<br>greater than or equal to the con-<br>tents of src2, or it can branch<br>when the contents of src1 are<br>greater than or equal to the imme-<br>diate value. The comparison treats<br>the comparands as unsigned 32-bit<br>values. |

Table 5-11. Jump and Branch Instruction Descriptions

| Instruction Name                    | Description                                                                                                                                                                                                                                                                                         |
|-------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Branch on Greater/Equal<br>(bgez)   | <b>Zero</b><br>Branches to the specified label<br>when the contents of src1 are<br>greater than or equal to zero.                                                                                                                                                                                   |
| Branch on Greater/Equal<br>(bgezal) | Zero and Link<br>Branches to the specified label<br>when the contents of src1 are<br>greater than or equal to zero and<br>puts the return address in general<br>register \$31.                                                                                                                      |
| Branch on Greater or Eq<br>(bge)    | Branches to the specified label<br>when the contents of src1 are<br>greater than or equal to the con-<br>tents of src2, or it can branch<br>when the contents of src1 are<br>greater than or equal to the imme-<br>diate value. The comparison treats<br>the comparands as signed 32-bit<br>values. |
| Branch on Greater Than<br>(bgtu)    | Unsigned<br>Branches to the specified label<br>when the contents of src1 are<br>greater than the contents of src2,<br>or it can branch when the contents<br>of src1 are greater than the imme-<br>diate value. The comparison treats                                                                |

**Branch on Greater Than Zero** (bgtz)

Branches to the specified label when the contents of src1 are greater than zero.

the comparands as unsigned 32-bit

Table 5-11. Jump and Branch Instruction Descriptions (continued)

values.

| Instruction Name              | Description                                                                                                                                                                                                                                                                              |
|-------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|                               |                                                                                                                                                                                                                                                                                          |
| Branch on Less (blt)          | Branches to the specified label<br>when the contents of src1 are less<br>than the contents of src2, or it can<br>branch when the contents of src1<br>are less than the immediate value.<br>The comparison treats the com-<br>parands as signed 32-bit values.                            |
| Branch on Less/Equal Unsigned |                                                                                                                                                                                                                                                                                          |
| (bleu)                        | Branches to the specified label<br>when the contents of src1 are less<br>than or equal to the contents of<br>src2, or it can branch when the<br>contents of src1 are less than or<br>equal to the immediate value. The<br>comparison treats the comparands<br>as unsigned 32-bit values. |
| Branch on Less/Equal Zero     |                                                                                                                                                                                                                                                                                          |
| (blez)                        | Branches to the specified label<br>when the contents of src1 are less<br>than or equal to zero. The pro-<br>gram must define the destination.                                                                                                                                            |
| Branch on Less or Equal (ble) | Branches to the specified label                                                                                                                                                                                                                                                          |
|                               | when the contents of src1 are less<br>than or equal to the contents of<br>src2, or it can branch when the<br>contents of src1 are less than or<br>equal to the immediate value. The<br>comparison treats the comparands<br>as signed 32-bit values.                                      |
| Branch on Less Than Unsigned  |                                                                                                                                                                                                                                                                                          |
| (bltu)                        | Branches to the specified label<br>when the contents of src1 are less<br>than the contents of src2, or it can<br>branch when the contents of src1<br>are less than the immediate value.<br>The comparison treats the com-<br>parands as unsigned 32-bit values.                          |

Table 5-11. Jump and Branch Instruction Descriptions (continued)

| Instruction Name                           | Description                                                                                                                                                                                                         |
|--------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Branch on Less Than Zero<br>(bltz)         | Branches to the specified label<br>when the contents of src1 are less<br>than zero. The program must de-<br>fine the destination.                                                                                   |
| Branch on Less Than Zero and L<br>(bltzal) | <b>Jink</b><br>Branches to the specified label<br>when the contents of src1 are less<br>than zero and puts the return ad-<br>dress in general register \$31.                                                        |
| Branch on Not Equal (bne)                  | Branches to the specified label<br>when the contents of src1 do not<br>equal the contents of src2, or it<br>can branch when the contents of<br>src1 do not equal the immediate<br>value.                            |
| Branch on Not Equal to Zero (bnez)         | Branches to the specified label<br>when the contents of src1 do not<br>equal zero.                                                                                                                                  |
| Jump (j)                                   | Unconditionally jumps to a speci-<br>fied location. A symbolic address<br>or a general register specifies the<br>destination. The instruction <b>j</b> \$31<br>returns from the a <b>jal</b> call instruc-<br>tion. |

Table 5-11. Jump and Branch Instruction Descriptions (continued)

5-30 Assembly Language Programmer's Guide

| Instruction Name    | Description                                                                                                                                                                                                                                                                              |
|---------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Jump And Link (jal) | Unconditionally jumps to a speci-<br>fied location and puts the return<br>address in general register \$31. A<br>symbolic address or a general regis-<br>ter specifies the destination. The<br>instruction <b>jal procname</b> transfers<br>to procname and saves the return<br>address. |
|                     | The machine does not allow a <b>Jump and Link</b> to register \$31.                                                                                                                                                                                                                      |

Table 5-11. Jump and Branch Instruction Descriptions (continued)

### 5.5 Special Instructions

The main processor's special instructions do miscellaneous tasks.

### 5.5.1 Special Formats

Table 5-12 shows special format operands. Table 5-13 shows the formats of special instructions.

| Operand   | Description                          |
|-----------|--------------------------------------|
| register  | destination or source register       |
| breakcode | value that determines the break type |

Table 5-12. Special Operands Instructions

| Description                                                                                  | Op-code                      | Operand   |
|----------------------------------------------------------------------------------------------|------------------------------|-----------|
| Break                                                                                        | break                        | breakcode |
| Restore From Exception<br>Syscall                                                            | rfe<br>syscall               |           |
| Move From HI Register<br>Move To HI Register<br>Move From LO Register<br>Move To LO Register | mfhi<br>mthi<br>mflo<br>mtlo | register  |

Table 5-13. Special Instruction Formats

### 5.5.2 Special Instruction Descriptions

Table 5–14 shows descriptions of special instructions.

| Instruction Name             | Description                                                                                                                          |  |
|------------------------------|--------------------------------------------------------------------------------------------------------------------------------------|--|
| Break (break)                | Unconditionally transfers control to<br>the exception handler. The break-<br>code operand is interpreted by<br>software conventions. |  |
| Move From HI Register (mfhi) | Moves the contents of the HI reg-<br>ister to a general purpose register.                                                            |  |
| Move From LO Register (mflo) | Moves the contents of the LO reg-<br>ister to a general purpose register.                                                            |  |
| Move To HI Register (mthi)   | Moves the contents of a general purpose register to the HI register.                                                                 |  |
| Move To LO Register (mtlo)   | Moves the contents of a general purpose register to the LO register.                                                                 |  |

Table 5-14. Special Instruction Descriptions

5-32 Assembly Language Programmer's Guide

| Instruction Name             | Description                                                                                                                                                    |  |
|------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------|--|
| Restore From Exception (rfe) | Restores the previous interrupt<br>callee and user/kernel state. This<br>instruction can execute only in ker-<br>nel state and is unavailable in user<br>mode. |  |
| Syscall (syscall)            | Causes a system call trap. The op-<br>erating system interprets the infor-<br>mation set in registers to determine<br>what system call to do.                  |  |

Table 5-14. Special Instruction Descriptions (continued)

### **5.6 Coprocessor Interface Instructions**

The coprocessor interface instructions provide standard ways to access the machine's coprocessors.

Note: You cannot use coprocessor load and store instructions with the system control coprocessor (cp0).

### 5.6.1 Coprocessor Interface Formats

Table 5-15 shows coprocessor interface operands. Table 5-16 shows the formats of coprocessor interface descriptions.

| Operand     | Description                                           |
|-------------|-------------------------------------------------------|
| Z           | a coprocessor number in the range 03                  |
| destination | the destination coprocessor register                  |
| dest-gpr    | the destination general register                      |
| address     | a symbolic expression                                 |
| source      | a coprocessor register from which values are assigned |
| src-gpr     | a general register from which values are assigned     |
| operation   | the coprocessor specific operation                    |
| label       | a symbolic label                                      |

Table 5-15. Coprocessor Interface Operands

| Description                                             | Op-code      | Operand                                  |
|---------------------------------------------------------|--------------|------------------------------------------|
| Load Word Coprocessor z                                 | lwcz         | destination, address                     |
| Store Word Coprocessor z                                | SWCZ         | source, address                          |
| Move From Coprocessor z<br>Move To Coprocessor z        | mfcz<br>mtcz | dest–gpr, source<br>src–gpr, destination |
| Branch Coprocessor z False<br>Branch Coprocessor z True | bczf<br>bczt | label                                    |
| Coprocessor z Operation                                 | CZ           | expression                               |
| Control From Coprocessor z<br>Control To Coprocessor z  | cfcz<br>ctcz | dest–gpr, source<br>src–gpr, destination |

Table 5-16. Coprocessor Interface Instruction Formats

5-34 Assembly Language Programmer's Guide

### 5.6.2 Coprocessor Interface Instruction Descriptions

Table 5-17 shows descriptions of coprocessor interface instructions.

| Instruction Name                       | Description                                                                                                                                                                                               |
|----------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Branch Coprocessor z True<br>(bczt)    | Branches to the specified label<br>when the specified coprocessor as-<br>serts a true condition. The $z$ se-<br>lects one of the coprocessors. A<br>previous coprocessor operation sets<br>the condition. |
| Branch Coprocessor z False<br>(bczf)   | Branches to the specified label<br>when the specified coprocessor as-<br>serts a false condition. The z se-<br>lects one of the coprocessors. A<br>previous coprocessor operation sets<br>the condition.  |
| Control From Coprocessor z (cfcz)      | Stores the contents of the coprocessor control register speci-<br>fied by the source in the general register specified by dest-gpr.                                                                       |
| <b>Control To Coprocessor</b> z (ctcz) | Stores the contents of the general register specified by src-gpr in the coprocessor control register specified by the destination.                                                                        |
| <b>Coprocessor z Operation</b> (cz)    | Executes a coprocessor-specific operation on the specified coprocessor. The $z$ selects one of four distinct coprocessors.                                                                                |

Table 5-17. Coprocessor Interface Instruction Descriptions

#### Description

Load Word Coprocessor z (lwcz)

Loads the destination with the contents of a word that is at the memory location specified by the effective address. The z selects one of four distinct coprocessors. Load Word Coprocessor replaces all register bytes with the contents of the loaded word. If bits 0 and 1 of the effective address are not zero, the machine signals an address exception.

Move From Coprocessor z (mfcz)

Move To Coprocessor z (mtcz)

Store Word Coprocessor z (swcz)

Stores the contents of the coprocessor register specified by the source in the general register specified by **dest-gpr**.

Stores the contents of the general register specified by **src-gpr** in the coprocessor register specified by the destination.

Stores the contents of the coprocessor register in the memory location specified by the effective address. The z selects one of four distinct coprocessors. If bits 0 and 1 of the effective address are not zero, the machine signals an address error exception.

Table 5–17. Coprocessor Interface Instruction Descriptions (continued)

5-36 Assembly Language Programmer's Guide

# 6. Coprocessor Instruction Set

Chapter 6 describes the coprocessor instructions for these coprocessors:

- system control coprocessor (cp0) instructions
- floating point coprocessor instructions

See Chapter 5 for a description of the main processor's instructions and the coprocessor interface instructions.

### **6.1 Instruction Notation**

The tables in this chapter list the assembler format for each coprocessor's load, store, computational, jump, branch, and special instructions. The format consists of an op-code and a list of operand formats. The tables list groups of closely related instructions; for those instructions, you can use any op-code with any specified operand. **NOTE:** The system control coprocessor instructions do not have operands. Operands can have any of these formats:

- memory references—for example a relocatable symbol +/- an expression(register)
- expressions (for immediate values)
- two or three operands—for example, add \$3,\$4 is the same as add \$3,\$3,\$4

The following terms are used to discuss floating point operations:

- infinite—A value of  $+\infty$  or  $-\infty$ .
- infinity—A symbolic entity that represents values with magnitudes greater than the largest value in that format.
- ordered—The usual result from a comparison, namely: <,=, or >
- NaN-Symbolic entities that represent values not otherwise available in floating point formats. There are two kinds of NaNs. Quiet NaNs represent unknown or uninitialized values. Signaling NaNs represent symbolic values and values that are too big or too precise for the format. Signaling NaNs raise an invalid operation exception whenever an operation is attempted on them.
- **unordered**—The condition that results from a floating-point comparison when one or both operands are NaNs.

## **6.2 Floating Point Instructions**

Table 6-1 shows the classes of the floating point coprocessor instructions.

| Instruction                    | Description                                                                        |  |  |
|--------------------------------|------------------------------------------------------------------------------------|--|--|
| Load and Store Instructions    | Load values and move data be-<br>tween memory and coprocessor<br>registers.        |  |  |
| Computational Instructions     | Do arithmetic and logical opera-<br>tions on values in coprocessor reg-<br>isters. |  |  |
| <b>Relational Instructions</b> | Compare two floating point values.                                                 |  |  |
| Move Instructions              | Move data between registers.                                                       |  |  |

 Table 6-1.
 Floating Point Coprocessor Instructions

A particular floating point instruction may be implemented in hardware, software, or a combination of hardware and software.

6-2 Assembly Language Programmer's Guide

### **6.2.1 Floating Point Formats**

The formats for the single and double precision floating point constants are shown below.



Figure 6-1. Floating Point Formats

#### 6.2.2 Floating Point Load and Store Formats

Floating point load and store instructions must use even registers. Table 6-2 shows floating point operands. Table 6-3 shows the floating point load and store formats.

| Operand     | Meaning                  |  |
|-------------|--------------------------|--|
| destination | the destination register |  |
| address     | offset (base)            |  |
| source      | the source register      |  |

Table 6-2. Floating Point Load and Store Operands

| Description | Op-code | Operand              |
|-------------|---------|----------------------|
| Load Fp     |         |                      |
| Double      | l.d     | destination, address |
| Single      | l.s     |                      |
| Store Fp    |         |                      |
| Double      | s.d     | source, address      |
| Single      | S.S     |                      |

Table 6-3. Floating Point Load and Store Formats

#### 6.2.3 Floating Point Load and Store Descriptions

Table 6-4 groups the instructions by function. Please consult Table 6-3 for the op-codes.

| Instruction           | Description                                                                                                                                                                                                                                                                                                                                                                                      |  |  |
|-----------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|
|                       |                                                                                                                                                                                                                                                                                                                                                                                                  |  |  |
| Load Fp Instructions  | Load eight bytes for double preci-<br>sion and four bytes for single preci<br>sion from the specified effective<br>address into the destination regis-<br>ter, which must be an even regis-<br>ter. The bytes must be word<br>aligned. <b>NOTE:</b> To ensure com-<br>patibility with future machines, we<br>recommend that you use double<br>word alignment for double preci-<br>sion operands. |  |  |
| Store Fp Instructions | Stores eight bytes for double preci-<br>sion and four bytes for single preci-<br>sion from the source floating point<br>register in the destination register,<br>which must be an even register.<br><b>NOTE:</b> To ensure compatibility<br>with future machines, we recom-<br>mend that you use double word<br>alignment for double precision op-<br>erands.                                    |  |  |

Table 6-4. Floating Point Load and Store Descriptors

## 6.2.4 Floating Point Computational Formats

This part of **Chapter 6** describes floating point computational instructions. Table 6-5 shows floating point computational operands. Table 6-6 shows the formats of floating point computational instructions.

| Operand     | Operand Meaning          |  |
|-------------|--------------------------|--|
| destination | the destination register |  |
| source      | the source register      |  |
| gpr         | general purpose register |  |

Table 6-5. Floating Point Computational Operands

| Op-code | Operand                                                                                         |
|---------|-------------------------------------------------------------------------------------------------|
|         |                                                                                                 |
|         |                                                                                                 |
| abs.d   | destination, src1                                                                               |
| abs.s   |                                                                                                 |
|         |                                                                                                 |
| neg.d   |                                                                                                 |
| neg.s   |                                                                                                 |
|         |                                                                                                 |
|         |                                                                                                 |
|         | destination, src1, src2                                                                         |
| add.s   |                                                                                                 |
|         |                                                                                                 |
| div.d   |                                                                                                 |
| div.s   |                                                                                                 |
|         |                                                                                                 |
| mul.d   |                                                                                                 |
| mul.s   |                                                                                                 |
|         |                                                                                                 |
| sub.d   |                                                                                                 |
| sub.s   |                                                                                                 |
|         | abs.d<br>abs.s<br>neg.d<br>neg.s<br>add.d<br>add.s<br>div.d<br>div.s<br>mul.d<br>mul.s<br>sub.d |

Table 6-6. Floating Point Computational Instruction Formats

6-6 Assembly Language Programmer's Guide

| Description                                                                      | Op-code                       | Operand               |
|----------------------------------------------------------------------------------|-------------------------------|-----------------------|
| Convert Source to<br>Specified Fp Precision                                      |                               |                       |
| Double to Single Fp<br>Fixed Point to Single Fp<br>Single to Double Fp           | cvt.s.d<br>cvt.s.w<br>cvt.d.s | destination, src1     |
| Fixed Point to Double Fp<br>Single to Fixed Point Fp<br>Double to Fixed Point Fp | cvt.d.w<br>cvt.w.s<br>cvt.w.d |                       |
| Truncate and Round<br>Operations                                                 |                               | destination, src, gpr |
| Truncate to Single Fp<br>Truncate to Double Fp                                   | trunc.w.s<br>trunc.w.d        |                       |
| Round to Single Fp<br>Round to Double Fp                                         | round.w.s<br>round.w.d        |                       |

 Table 6-6.
 Floating Point Computational Instruction Formats (continued)

## 6.2.5 Floating Point Computational Instruction Descriptions

Table 6–7 groups the instructions by function. Please consult Table 6–6 for the op–code names.

Instruction

#### Description

**Absolute Value Fp Instructions** 

Add Fp Single Instructions

Compute the absolute value of the contents of src1 and put the specified precision floating point result in the destination register.

Add the contents of src1 (or the destination) to the contents of src2 and put the result in the destination register. When the sum of two operands with opposite signs is exactly zero, the sum has a positive sign for all rounding modes except round toward  $-\infty$ . For that rounding mode, the sum has a negative sign.

**Convert Source to Another Precision Fp Instructions** 

Truncate and Round Instructions

the result in the destination register. The trunc instructions truncate the value in the source floating-point register and put the resulting integer in the destination floating-point register, using the third (general-

Convert the contents of src1 to the specified precision, round according to the rounding mode, and put

register, using the third (generalpurpose) register to hold a temporary value. (This is a macro-instruction.) The round instructions work like trunc, but round the floating-point value to an integer instead of truncating it.

Table 6-7. Floating Point Computational Instruction Descriptions

| Instruction              | DescriptionCompute the quotient of two values. These instructions treat src1as the dividend and src2 as the divisor. Divide Fp instructions divide the contents of src1 by thecontents of src2 and put the resultin the destination register. If thedivisor is a zero, the machine signals a error if the divide-by-zeroexception is enabled.Multiplies the contents of src1 (orthe destination) with the contentsof src2 and puts the result in the |  |  |
|--------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|
| Divide Fp Instructions   |                                                                                                                                                                                                                                                                                                                                                                                                                                                      |  |  |
| Multiply Fp Instructions |                                                                                                                                                                                                                                                                                                                                                                                                                                                      |  |  |
| Negate FP Instructions   | Compute the negative value of the contents of src1 and put the specified precision floating point result in the destination register.                                                                                                                                                                                                                                                                                                                |  |  |
| Subtract Fp Instructions | Subtract the contents of src2 from<br>the contents of src1 (or the desti-<br>nation). These instructions put the<br>result in the destination register.<br>When the difference of two oper-<br>ands with the same signs is exactly<br>zero, the difference has a positive<br>sign for all rounding modes except<br>round toward $-\infty$ . For that<br>rounding mode, the sum has a<br>negative sign.                                               |  |  |

# Table 6-7. Floating Point Computational Instruction Descriptions (continued)

## 6.3 Floating Point Relational Operations

Table 6-8 summarizes the floating point relational instructions. The first column under *Condition* gives a mnemonic for the condition tested. As the "branch on true/false" condition can be used to logically negate any condition, the second column supplies a mnemonic for the logical negation of the condition in the first column. This provides a total of 32 possible conditions. The four columns under *Relations* give the result of the comparison based on each condition. The final column states if an invalid operation is signaled for each condition.

For example, with an equal condition (EQ mnemonic in the True column), the logical negation of the condition is not equal (NEQ), and a comparison that is equal is True for equal and False for greater than, less than, and unordered, and no Invalid Operation Exception is given if the relation is unordered.

| Condi | ition | Relations |      |       | Invalid    |              |
|-------|-------|-----------|------|-------|------------|--------------|
| Mnen  | nonic |           |      |       |            | Operation    |
|       |       | Greater   | Less | Faul  | Ilmondonod | Exception if |
| True  | False | Than      | Than | Equal | Unordered  | Unordered    |
| F     | Т     | F         | F    | F     | F          | no           |
| UN    | OR    | F         | F    | F     | Ť          | no           |
| EQ    | NEQ   | F         | F    | Т     | F          | no           |
| UEQ   | OLG   | F         | F    | Т     | Т          | no           |
| OLT   | UGE   | F         | T a  | F     | F          | no           |
| ULT   | OGE   | F         | Т    | F     | Т          | no           |
| OLE   | UGT   | F         | Т    | Т     | F          | no           |
| ULE   | OGT   | F         | Т    | Т     | Т          | no           |
|       |       |           |      |       |            |              |
| SF    | ST    | F         | F    | F     | F          | yes          |
| NGLE  | GLE   | F         | F    | F     | T          | yes          |
| SEQ   | SNE   | F         | F    | Т     | F          | yes          |
| NGL   | GL    | F         | F    | T     | T          | yes          |
| LT    | NLT   | F         | T    | F     | F          | yes          |
| NGE   | GE    | F         | Т    | F     | T          | yes          |
| LE    | NLE   | F         | T    | T     | F          | yes          |
| NGT   | GT    | F         | Т    | Т     | Т          | yes          |

Table 6-8. Floating Point Relational Operators

The mnemonics in Table 6-8 have the following meanings:

| F<br>UN<br>EQ<br>UEQ | False<br>Unordered<br>Equal<br>Unordered or Equal | T<br>OR<br>NEQ<br>OLG | True<br>Ordered<br>Not Equal<br>Ordered or<br>Less Than<br>or Greater<br>Than |
|----------------------|---------------------------------------------------|-----------------------|-------------------------------------------------------------------------------|
| OLT                  | Ordered Less Than                                 | UGE                   | Unordered or<br>Greater Than<br>or Equal                                      |
| ULT                  | Unordered or Less Than                            | OGE                   | Ordered<br>Greater Than                                                       |
| OLE                  | Ordered Less Than<br>or Equal                     | UGT                   | Unordered or<br>Greater Than                                                  |
| ULE                  | Unordered or Less<br>Than or Equal                | OGT                   | Ordered<br>Greater Than                                                       |
| SF                   | Signaling False                                   | ST                    | Signaling True                                                                |
| NGLE                 | Not Greater Than or<br>Less Than or Equal         | GLE                   | Greater Than,<br>or Less<br>Than or Equal                                     |
| SEQ                  | Signaling Equal                                   | SNE                   | Signaling Not<br>Equal                                                        |
| NGL                  | Not Greater than or                               | GL                    | Greater Than<br>or Less Than                                                  |
| LT                   | Less Than                                         | NLT                   | Not Less Than                                                                 |
| NGE                  | Not Greater Than<br>or Equal                      | GE                    | Greater Than<br>or Equal                                                      |
| LE                   | Less Than or Equal                                | NLE                   | Not Less Than<br>or Equal                                                     |
| NGT                  | Not Greater Than                                  | GT                    | Greater Than                                                                  |

To branch on the result of a relational:

/\* branching on a compare result \*/

| c.eq.s | \$f1,\$f2 | <pre>/* compare the single precision values *</pre>  | 1  |
|--------|-----------|------------------------------------------------------|----|
| bc1t   | true      | <pre>/* if \$f1 equals \$f2, branch to true */</pre> |    |
| bc1f   | false     | <pre>/* if \$f1 does not equal \$f2, branch to</pre> | */ |
|        |           | /* false */                                          |    |

## 6.3.1 Floating Point Relational Formats

| Description           | Op-code | Operand   |
|-----------------------|---------|-----------|
| Compare F             |         |           |
| Double                | c.f.d   | src1,src2 |
| Single                | c.f.s   |           |
| Compare UN            |         |           |
| Double                | c.un.d  |           |
| Single                | c.un.s  |           |
| *Compare EQ           |         |           |
| Double                | c.eq.d  |           |
| Single                | c.eq.s  |           |
| Compare UEQ<br>Double | c.ueg.d |           |
| Single                | -       |           |
| Compare OLT           | c.ueq.s |           |
| Double                | c.olt.d |           |
| Single                | c.olt.s |           |
| Compare ULT           | ciontis |           |
| Double                | c.ult.d |           |
| Single                | c.ult.s |           |
| Compare OLE           |         |           |
| Double                | c.ole.d |           |
| Single                | c.ole.s |           |
| Compare ULE           |         |           |
| Double                | c.ule.d |           |
| Single                | c.ule.s |           |
| Compare SF            |         |           |
| Double                | c.sf.d  |           |
| Single                | c.sf.s  |           |

In Table 6-9 src1 and src2 refer to the source registers.

\* These are the most common **Compare** instructions. The machine provides other **Compare** instructions for IEEE compatibility.

Table 6-9. Floating Point Relational Instruction Formats

| Description        | Op-code  | Operand    |
|--------------------|----------|------------|
| Compare NGLE       |          |            |
| Double             | c.ngle.d | src1, src2 |
| Single             | c.ngle.s |            |
| Compare SEQ        | -        |            |
| Double             | c.seq.d  |            |
| Single             | c.seq.s  |            |
| Compare NGL        |          |            |
| Double             | c.ngl.d  |            |
| Single             | c.ngl.s  |            |
| *Compare LT        |          |            |
| Double             | c.lt.d   |            |
| Single             | c.lt.s   |            |
| Compare NGE        |          |            |
| Double             | c.nge.d  |            |
| Single             | c.nge.s  |            |
| *Compare LE        |          |            |
| Double             | c.le.d   |            |
| Single             | c.le.s   |            |
| <b>Compare NGT</b> |          |            |
| Double             | c.ngt.d  |            |
| Single             | c.ngt.s  |            |

\* These are the most common **Compare** instructions. The machine provides other **Compare** instructions for IEEE compatibility.

 
 Table 6-9.
 Floating Point Relational Instruction Formats (continued)

#### 6.3.2 Floating Point Relational Instruction Descriptions

This part of Chapter 6 describes the relational instruction descriptions by function. Refer to Chapter 1 for information regarding registers. Please consult Table 6-9 for the op-code names.

| h | c | tr |   | ^ | ŧi | 2 | n |  |
|---|---|----|---|---|----|---|---|--|
|   | 9 | •• | ÷ | v | •  | v |   |  |

#### Description

**Compare EQ Instructions** Compare the contents of src1 with the contents of src2. If src1 equals src2 a true condition results; otherwise, a false condition results. The machine does not signal an exception for unordered values. **Compare F Instructions** Compare the contents of src1 with the contents of src2. These instructions always produce a false condition. The machine does not signal an exception for unordered values. **Compare LE Instructions** Compare the contents of src1 with the contents of src2. If src1 is less than or equal to src2, a true condition results; otherwise, a false condition results. The machine signals an exception for unordered values. **Compare LT Instructions** Compare the contents of src1 with the contents of src2. If src1 is less than src2, a true condition results: otherwise, a false condition results. The machine signals an exception for unordered values. **Compare NGE Instructions** Compare the contents of src1 with the contents of src2. If src1 is less than src2 (or the contents are unordered), a true condition results; otherwise, a false condition results. The machine signals an exception

Table 6-10. Floating Point Relational Instruction Descriptions

for unordered values.

| Instruction               | Description                                                                                                                                                                                                                                                                 |
|---------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Compare NGL Instructions  | Compare the contents of src1 with<br>the contents of src2. If src1<br>equals src2 or the contents are un-<br>ordered, a true condition results;<br>otherwise, a false condition results.<br>The machine signals an exception<br>for unordered values.                       |
| Compare NGLE Instructions | Compare the contents of src1 with<br>the contents of src2. If src1 is un-<br>ordered, a true condition results;<br>otherwise, a false condition results.<br>The machine signals an exception<br>for unordered values.                                                       |
| Compare NGT Instructions  | Compare the contents of src1 with<br>the contents of src2. If src1 is less<br>than or equal to src2 or the con-<br>tents are unordered, a true condi-<br>tion results; otherwise, a false con-<br>dition results. The machine signals<br>an exception for unordered values. |
| Compare OLE Instructions  | Compare the contents of src1 with<br>the contents of src2. If src1 is les<br>than or equal to src2, a true condi-<br>tion results; otherwise, a false con-<br>dition results. The machine does<br>not signal an exception for unor-<br>dered values.                        |
| Compare OLT Instructions  | Compare the contents of src1 with<br>the contents of src2. If src1 is les<br>than src2, a true condition results;<br>otherwise, a false condition results<br>The machine does not signal an<br>exception for unordered values.                                              |

 Table 6-10.
 Floating Point Relational Instruction Descriptions (continued)

#### **Compare SEQ Instructions**

**Compare SF Instructions** 

**Compare ULE Instructions** 

#### **Compare UEQ Instructions**

**Compare ULT Instructions** 

#### Description

Compare the contents of src1 with the contents of src2. If src1 equals src2, a true condition results; otherwise, a false condition results. The machine signals an exception for unordered values.

Compare the contents of src1 with the contents of src2. This always produces a false condition. The machine signals an exception for unordered values.

Compare the contents of src1 with the contents of src2. If src1 is less than or equal to src2 (or src1 is unordered), a true condition results; otherwise, a false condition results. The machine does not signal an exception for unordered values.

Compare the contents of src1 with the contents of src2. If src1 equals src2 (or src1 and src2 are unordered), a true condition results; otherwise, a false condition results. The machine does not signal an exception for unordered values.

Compare the contents of src1 with the contents of src2. If src1 is less than src2 (or the contents are unordered), a true condition results; otherwise, a false condition results. The machine does not signal an exception for unordered values.

 Table 6-10.
 Floating Point Relational Instruction Descriptions (continued)

| Instruction             | Description                                                                                                                                                                                                                                         |
|-------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Compare UN Instructions | Compare the contents of src1 with<br>the contents of src2. If either src1<br>or src2 is unordered, a true condi-<br>tion results; otherwise, a false con-<br>dition results. The machine does<br>not signal an exception for unor-<br>dered values. |

 
 Table 6-10.
 Floating Point Relational Instruction Descriptions (continued)

#### 6.3.3 Floating Point Move Formats

The floating point coprocessor's **move** instructions move data from source to destination registers (only floating point registers are allowed).

| Description | Op-code                                | Operand           |
|-------------|----------------------------------------|-------------------|
| Move Fp     |                                        |                   |
| Double      | mov.d                                  | destination, src1 |
| Single      | mov.s                                  |                   |
| -           | ······································ |                   |

Table 6-11. Floating Point Move Instruction Formats

#### 6.3.4 Floating Point Move Instruction Descriptions

This part of Chapter 6 describes the floating point move instructions. Please consult Table 6-11 for the op-code names.

| Instruction          | Description                                                                                                                         |
|----------------------|-------------------------------------------------------------------------------------------------------------------------------------|
| Move Fp Instructions | Move the double or single preci-<br>sion contents of src1 to the desti-<br>nation register, maintaining the<br>specified precision. |

Table 6-12. Floating Point Move Instruction Descriptions

## 6.4 System Control Coprocessor Instructions

The system control coprocessor (cp0) handles all functions and special and privileged registers for the virtual memory and exception handling subsystems. The system control coprocessor translates addresses from a large virtual address space into the machine's physical memory space. The coprocessor uses a translation lookaside buffer (TLB) to translate virtual addresses to physical addresses.

Primarily, only people who need to change the operating system need these instructions.

## 6.4.1 System Control Coprocessor Formats

These coprocessor system control instructions do not have operands:

| Description                               | Op-code | Operand |
|-------------------------------------------|---------|---------|
| Translation Lookaside Buffer Probe        | tlbp    |         |
| Translation Lookaside Buffer Read         | tlbr    |         |
| Translation Lookaside Buffer Write Random | tlbwr   |         |
| Translation Lookaside Write Index         | tlbwi   |         |

Table 6-13. System Control Instruction Formats

#### 6.4.2 System Control Coprocessor Instruction Descriptions

This part of Chapter 6 describes the system control coprocessor instructions.

| Instruction                                      | Description                                                                                                                                                                                                                                                                                                                                                                 |
|--------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <b>Translation Lookaside Buffer Probe</b> (tlbp) | Probes the translation lookaside<br>buffer (TLB) to see if the TLB has<br>an entry that matches the contents<br>of the EntryHi register. If a<br>match occurs, the machine loads<br>the Index register with the number<br>of the entry that matches the<br>EntryHi register. If no TLB entry<br>matches, the machine sets the<br>high-order bit of the Index regis-<br>ter. |
| Translation Lookaside Buffer Read<br>(tlbr)      | Loads the EntryHi and EntryLo<br>registers with the contents of the<br>translation lookaside buffer (TLB)<br>entry specified in the TLB Index<br>register.                                                                                                                                                                                                                  |

 Table 6–14.
 System Control Coprocessor Instruction Descriptions

| Instruction                 | Description                        |
|-----------------------------|------------------------------------|
| Translation Lookaside Buffe | r Write Random                     |
| (tlbwr)                     | Loads the specified translation    |
|                             | lookaside buffer (TLB) entry with  |
|                             | the contents of the EntryHi and    |
|                             | EntryLo registers. The contents of |
|                             | the TLB Random register specify    |
|                             | the TLB entry to be loaded.        |
| Translation Lookaside Buffe | r Write Index                      |
| (tlbwi)                     | Loads the specified translation    |
|                             | lookaside buffer (TLB) entry with  |
|                             | the contents of the EntryHi and    |
|                             | EntryLo registers. The contents of |
|                             | the TLB Index register specify the |
|                             | TLB entry to be loaded.            |

 Table 6-14.
 System Control Coprocessor Instruction Descriptions (continued)

## 6.5 Control and Status Register

Floating-point coprocessor control register 31 contains status and control information. It controls the arithmetic rounding mode and the enabling of user-level traps, and indicates exceptions that occurred in the most recently executed instruction, and any exceptions that may have occurred without being trapped.

6-20 Assembly Language Programmer's Guide



Figure 6-2. Floating Point Control and Status Register 31

The exception bits are set for instructions that cause an IEEE standard exception or an optional exception used to emulate some of the more hardware-intensive features of the IEEE standard.

The exception field is loaded as a side-effect of each floating-point operation (excluding loads, stores, and unformatted moves). The exceptions which were caused by the immediately previous floating-point operation can be determined by reading the exception field.

The meaning of each bit in the exception field is given in Table 6-15. If two exceptions occur together on one instruction, the field will contain the inclusive OR of the bits for each exception.

| Exception<br>Field Bit | Description             |  |
|------------------------|-------------------------|--|
| E                      | Unimplemented Operation |  |
| V                      | Invalid Operation       |  |
| Z                      | Division by Zero        |  |
| Ι                      | Inexact Exception       |  |
| 0                      | Overflow Exception      |  |
| U                      | Underflow Exception     |  |

Table 6-15. Exception Field Bit Descriptions

The unimplemented operation exception is normally invisible to user-level code. It is provided to maintain IEEE compatibility for non-standard implementations.

The five IEEE standard exceptions are listed below:

| Field | Description         |  |
|-------|---------------------|--|
| V     | Invalid Operation   |  |
| Z     | Division by Zero    |  |
| I     | Inexact Exception   |  |
| Ο     | Overflow Exception  |  |
| U     | Underflow Exception |  |

Table 6-16. IEEE Standard Exceptions

Each of the five exceptions is associated with a trap under user control, which is enabled by setting one of the five bits of the enable field, shown above.

When an exception occurs, both the corresponding exception and status bits are set. If the corresponding enable flag bit is set, a trap is taken. In some cases the result of an operation is different if a trap is enabled.

**IRIS-4D Series** 

The status flags are never cleared as a side effect of floating-point operations, but may be set or cleared by writing a new value into the status register, using a "move to coprocessor control" instruction.

The floating-point compare instruction places the condition which was detected into the "c" bit of the control and status register, so that the state of the condition line may be saved and restored. The "c" bit is set if the condition is true, and cleared if the condition is false, and is affected only by compare and move to control register instructions.

#### 6.5.1 Exception Trap Processing

For each IEEE standard exception, a status flag is provided that is set on any occurrence of the corresponding exception condition with no corresponding exception trap signaled. It may be reset by writing a new value into the status register. The flags may be saved and restored individually, or as a group, by software. When no exception trap is signaled, a default action is taken by the floating-point coprocessor, which provides a substitute value for the original, exceptional, result of the floating-point operation. The default action taken depends on the type of exception, and in the case of the Overflow exception, the current rounding mode.

#### Invalid operation exception

The invalid operation exception is signaled if one or both of the operands are invalid for an implemented operation. The result, when the exception occurs without a trap, is a quiet NaN when the destination has a floatingpoint format, and is indeterminate if the result has a fixed-point format. The invalid operations are

- 1. Addition or subtraction: magnitude subtraction of infinities, such as  $(+\infty) (-\infty)$
- 2. Multiplication: 0 times  $\infty$ , with any signs
- 3. Division: 0 over 0 or  $\infty$  over  $\infty$ , with any signs
- 4. Square root:  $\mathbf{r}$  x, where x is less than zero
- 5. Conversion of a floating-point number to a fixed-point format when an overflow, or operand value of infinity or NaN, precludes a faithful representation in that format

- 6. Comparison of predicates involving < or > without ?, when the operands are "unordered"
- 7. Any operation on a signaling NaN.

Software may simulate this exception for other operations that are invalid for the given source operands. Examples of these operations include IEEE-specified functions implemented in software, such as Remainder: x REM y, where y is zero or x is infinite; conversion of a floating-point number to a decimal format whose value causes and overflow or is infinity of NaN; and trancendental functions, such as ln (-5) or cos  $^{-1}(3)$ .

#### Division-by-zero exception

The division by zero exception is signaled on an implemented divide operation if the divisor is zero and the dividend is a finite nonzero number. The result, when no trap occurs, is a correctly signed infinity.

If division by zero traps are enabled, the result register is not modified, and the source registers are preserved.

Software may simulate this exception for other operations that produce a signed infinity, such as  $\ln(0)$ ,  $\sec(\pi/2)$ ,  $\csc(0)$  or  $0^{-1}$ .

#### **Overflow exception**

The overflow exception is signaled when what would have been the magnitude of the rounded floating-point result, were the exponent range unbounded, is larger than the destination format's largest finite number. The result, when no trap occurs, is determined by the rounding mode and the sign of the intermediate result.

If overflow traps are enabled, the result register is not modified, and the source registers are preserved.

#### Underflow exception

Two related events contribute to underflow. One is the creation of a tiny non-zero result between + or  $-2 E \min$  (minimum expressable exponent) which, because it is tiny, may cause some other exception later. The

other is extraordinary loss of accuracy during the approximation of such tiny numbers by denormalized numbers.

The IEEE standard permits a choice in how these events are detected, but requires that they must be detected the same way for all operations.

The IEEE standard specifies that "tininess" may be detected either: "after rounding" (when a nonzero result computed as though the exponent range were unbounded would lie strictly between + or -2 E min, or "before rounding" (when a nonzero result computed as though the exponent range and the precision were unbounded would lie strictly between + or -

2~E~min. The architecture requires that tininess be detected after rounding.

Loss of accuracy may be detected as either "denormalization loss" (when the delivered result differs from what would have been computed if the exponent range were unbounded), or "inexact result" (when the delivered result differs from what would have been computed if the exponent range and precision were both unbounded). The architecture requires that loss of accuracy be detected as inexact result.

When an underflow trap is not enabled, underflow is signaled (via the underflow flag) only when both tininess and loss of accuracy have been detected. The delivered result might be zero, denormalized, or + or -

 $2 E \min$ . When an underflow trap is enabled, underflow is signaled when tininess is detected regardless of loss of accuracy.

If underflow traps are enabled, the result register is not modified, and the source registers are preserved.

#### Inexact exception

If the rounded result of an operation is not exact or if it overflows without an overflow trap, then the inexact exception is signaled. The rounded or overflowed result is delivered to the destination register, when no inexact trap occurs. If inexact exception traps are enabled, the result register is not modified, and the source registers are preserved.

### Unimplemented operation exception

If an operation is specified that the hardware may not perform, due to an implementation restriction on the supported operations or supported for-

mats, an unimplemented operation exception may be signaled, which always causes a trap, for which there are no corresponding enable or flag bits. The trap cannot be disabled.

This exception is raised at the execution of the unimplemented instruction. The instruction may be emulated in software, possibly using implemented floating-point unit instructions to accomplish the emulation. Normal instruction execution may then be restarted.

This exception is also raised when an attempt is made to execute an instruction with an operation code or format code which has been reserved for future architectural definition. The unimplemented instruction trap is not optional, since the current definition contains codes of this kind.

This exception may be signaled when unusual operands or result conditions are detected, for which the implemented hardware cannot properly handle the condition. These may include (but are not limited to), denormalized operands or results, NaN operands, trapped overflow or underflow conditions. The use of this exception for such conditions is optional.

#### 6.5.2 Floating Point Rounding

Bits 0 and 1 of the coprocessor control register 31 sets the rounding mode for floating point. The machine allows four rounding modes:

- Round to nearest rounds the result to the nearest representable value. When the two nearest representable values are equally near, this mode rounds to the value with the least significant bit zero. To select this mode, set bits 1..0 of control register 31 to 0.
- Round toward zero rounds toward zero. It rounds to the value that is closest to and not greater in magnitude than the infinitely precise result. To select this mode, set bits 1..0 of control register 31 to 1.
- Round toward positive infinity rounds to the value that is closest to and not less than the infinitely precise result. To select this mode, set bits 1..0 of control register 31 to 2.
- Round toward negative infinity rounds toward negative infinity. It rounds to the value that is closest to and not greater than the infinitely precise result. To select this mode, set bits 1..0 of control register 31 to 3.

IRIS-4D Series

To set the rounding mode:



## 7. Linkage Conventions

This chapter gives rules and examples to follow when designing an assembly language program. The chapter concludes with a "learn by doing" technique that you can use if you still have any doubts about how a particular calling sequence should work. This involves writing a skeleton version of your prospective assembly routine using a high level language, and then compiling it with the -S option to generate a human-readable assembly language file. The assembly language file can then be used as the starting point for coding your routine.

## 7.1 Introduction

When you write assembly language routines, you should follow the same calling conventions that the compilers observe, for two reasons:

- Often your code must interact with compiler-generated code, accepting and returning arguments or accessing shared global data.
- The symbolic debugger gives better assistance in debugging programs using standard calling conventions.

The conventions for the compiler system are a bit more complicated than some, mostly to enhance the speed of each procedure call. Specifically:

- The compilers use the full, general calling sequence only when necessary; where possible, they omit unneeded portions of it. For example, the compilers don't use a register as a frame pointer whenever possible.
- The compilers and debugger observe certain implicit rules rather than communicating via instructions or data at execution time. For example, the debugger looks at information placed in the symbol table by a ".frame" directive at compilation time, so that it can tolerate the lack of a register containing a frame pointer at execution time.

## 7.2 Program Design

This section describes three general areas of concern to the assembly language programmer:

- usable and restricted registers
- stack frame requirements on entering and exiting a routine
- the "shape" of data (scalars, arrays, records, sets) laid out by the various high level languages

#### 7.2.1 Register Use and Linkage

The main processor has 32 32-bit integer registers. The uses and restrictions of these registers are described in Table 7-1.

The floating point coprocessor has 16 floating point registers. Each register can hold either a single precision (32 bit) or a double precision (64 bit) value. All references to these registers uses an even register number (e.g., f4). Refer to Table 7-2.

7-2 Assembly Language Programmer's Guide

**IRIS-4D Series** 

| register name | software name<br>(from regdef.h) | use and linkage                                                                                                                                            |
|---------------|----------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------|
| \$0           |                                  | always has the value 0                                                                                                                                     |
| \$at          |                                  | reserved for the assembler                                                                                                                                 |
| \$2\$3        | v0-v1                            | used for expression evaluations and to<br>hold the integer type functions results.<br>Also used to pass the static link when<br>calling nested procedures. |
| \$4\$7        | a0-a3                            | used to pass the first 4 words of integer<br>type actual arguments, whose values are<br>not preserved across procedure calls                               |
| \$8\$15       | 10-17                            | temporary registers, used for expression<br>evaluations, whose values are not preserved<br>across procedure calls.                                         |
| \$16\$23      | s0-s7                            | saved registers, whose values must be preserved across procedure calls.                                                                                    |
| \$24\$25      | t8-t9                            | temporary registers, used for expression<br>evaluations, whose values are not pre-<br>served across procedure calls.                                       |
| \$kt0\$kt1    | k0-k1                            | reserved for the operating system kernel                                                                                                                   |
| \$28 or \$gp  | gp                               | contains the global pointer                                                                                                                                |
| \$29 or \$sp  | sp                               | contains the stack pointer                                                                                                                                 |
| \$30 or \$fp  | fp                               | contains the frame pointer (if needed)                                                                                                                     |
| \$31          | ra                               | contains the return address; used for expression evaluation.                                                                                               |

Table 7-1. Integer Registers

| egister name | use and linkage                                                                                                                                           |
|--------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------|
| \$f0f3       | used to hold floating point type function<br>results (\$f0) and complex type function<br>results (\$f0 has the real part, \$f2 has<br>the imaginary part) |
| \$f4f10      | temporary registers, used for expression<br>evaluation, whose values are not pre-<br>served across procedure calls.                                       |
| \$f12\$f14   | used to pass the first 2 single or double<br>precision actual arguments, whose values<br>are not preserved across procedure calls.                        |
| \$f16\$f18   | temporary registers, used for expression<br>evaluations, whose values are not pre-<br>served across procedure calls.                                      |
| \$f20\$f30   | saved registers, whose values must be preserved across procedure calls.                                                                                   |

Table 7-2. Floating Point Registers

#### 7.2.2 The Stack Frame

The compilers classify each routine into one of of the following categories:

- non-leaf routines, that is, routines that call other procedures
- leaf routines, that is, routines that do not themselves execute any procedure calls. Leaf routines are of two types:

 $\circ$  leaf routines that require stack storage for local variables

 $\circ$  leaf routines that do not require stack storage for local variables.

You must decide the routine category before determining the calling sequence. To write a program with proper stack frame usage and debugging capabilities, use the following procedure:

1. Regardless of the type of routine, you should include a .ent pseudoop and an entry label for the procedure. The .ent pseudo-op is for use by the debugger, and the entry label is the procedure name. The syntax is:

.ent procedure\_name procedure\_name:

2. If you are writing a leaf procedure that does not use the stack, skip to step 3. For leaf procedure that uses the stack or non-leaf procedures, you must allocate all the stack space that the routine requires. The syntax to adjust the stack size is:

subu \$sp,framesize

where *framesize* is the size of frame required. Space must be allocated for:

- local variables
- saved general registers. Space should be allocated only for those registers saved. For non-leaf procedures, you must save \$31, which is used in the calls to other procedures from this routine. If you use registers \$16-\$23, you must also save them.
- saved floating point registers. Space should be allocated only for those registers saved. If you use registers \$f20-\$f30 you must also save them.
- Procedure call argument area. You must allocate the maximum number of bytes for arguments of any procedure that you call from this routine.

**NOTE:** Once you have modified \$sp, you should not modify it again for the rest of the routine.

3. Now include a .frame pseudo-op:

frame framereg, framesize, returnreg

The virtual frame pointer is a frame pointer as used in other compiler systems but has no register allocated for it. It consists of the *framereg* (sp, in most cases) added to the *framesize* (see step 2 above). Figure 7-1 illustrates the stack components.



Figure 7–1. Stack Organization

The returned specifies the register the return address is in (usually \$31). These usual values may change if you use a varying stack pointer or are specifying a kernel trap routine.

4. If the procedure is a leaf procedure that does not use the stack, skip to step 7. Otherwise you must save the registers you allocated space for in step 2.

To save the general registers, use the following operations:

.mask bitmask,frameoffset
sw reg,framesize+frameoffset-N(\$sp)

The .mask directive specifies the registers to be stored and where they are stored. A bit should be on in *bitmask* for each register saved (for example, if register \$31 is saved, bit 31 should be '1' in *bitmask*. Bits are set in bitmask in little-endian order, even if the machine configuration is big-endian). The *frameoffset* is the offset from the virtual frame pointer (this number is usually negative). N should be 0 for the highest numbered register saved and then incremented by four for each subsequently lower numbered register saved. For example:

| sw | <pre>\$31,framesize+frameoffset(\$sp)</pre>   |
|----|-----------------------------------------------|
| sw | <pre>\$17,framesize+frameoffset-4(\$sp)</pre> |
| sw | <pre>\$16,framesize+frameoffset-8(\$sp)</pre> |

Figure 7–2 illustrates this example.



Figure 7-2. Stack Example

Now save any floating point registers that you allocated space for in step 2 as follows:

.fmask bitmask,frameoffset
s.[sd] reg,framesize+frameoffset-N(\$sp)

Notice that saving floating point registers is identical to saving general registers except we use the .fmask pseudo-op instead of .mask, and the stores are of floating point singles or doubles. The discussion regarding saving general registers applies here as well, but remember that N should be incremented by 8 for doubles.

5. This step describes parameter passing: how to access arguments passed into your routine and passing arguments correctly to other procedures.

As specified in step 2, space must be allocated on the stack for all arguments even though they may be passed in registers. This provides a saving area if their registers are needed for other variables.

General registers 4-7 and float registers f12, f14 must be used for passing the first four arguments (if possible). You must allocate a pair of registers (even if it's a single precision argument) that start with an even register for floating point arguments appearing in registers.

In the table below, the 'fN' arguments are considered single and double precision floating point arguments, and 'nN' arguments are everything else. The elipses (...) mean that the rest of the arguments do not go in registers regardless of their type. The 'stack' assignment means that you do not put this argument in a register. The register assignments occur in the order shown in order to satisfy optimizing compiler protocols.

| Arguments         | Register Assignments                         |
|-------------------|----------------------------------------------|
| (f1, f2,)         | f1 -> \$f12, f2 -> \$f14                     |
| (f1, n1, f2,)     | f1 -> \$f12, n1 -> \$6, f2 -> stack          |
| (f1, n1, n2,)     | f1 -> \$f12, n1 -> \$6, n2 -> \$7            |
| (n1, n2, n3, n4,) | n1 -> \$4, n2 -> \$5, n3 -> \$6, n4 -> \$7   |
| (n1, n2, n3, f1,) | n1 -> \$4, n2 -> \$5, n3 -> \$6, f1 -> stack |
| (n1, n2, f1,)     | n1 -> \$4, n2 -> \$5, f1 -> (\$6, \$6)       |
| (n1, f1,)         | n1 -> \$4, f1 -> (\$6,\$7)                   |



6. Next, you must restore registers that were saved in step 4. To restore general purpose registers:

lw reg,framesize+frameoffset-N(\$sp)

To restore the floating point registers:

l.[sd] reg,framesize+frameoffset-N(\$sp)

(Refer to step 4 for a discussion of the value of N.)

7. Get the return address:

lw \$31,framesize+frameoffset(\$sp)

8. Clean up the stack:

addu \$sp, framesize

9. Return:

j \$31

10. To end the procedure:

.end procedurename

#### 7.2.3 The Shape of Data

In most cases, high-level language routine and assembly routines communicate via simple variables: pointers, integers, booleans, and single- and double-precision real numbers. Describing the details of the various highlevel data structures (arrays, records, sets, and so on) is beyond our scope here. If you need to access such a structure as an argument or as a shared global variable, refer to the "Learn by Doing" technique described at the end of this section.

### 7.3 Examples

This section contains the examples that illustrate program design rules; each example shows a procedure written and C and its equivalent written in assembly language. Figure 7–3 shows a non-leaf procedure. Notice that it creates a stack-frame, and also saves its return address since it must put a new return address into register 31 when it invokes its callee:

```
float
nonleaf(i, j)
  int i, *j;
  double atof();
  int temp;
  temp = i - *j;
  if (i < *j) temp = -temp;
  return atof(temp);
  }
          .globl
                     nonleaf
          float
 #
     1
 #
     2
          nonleaf(i, j)
 #
     3
             int i, *j;
 #
     4
             {
                     nonleaf 2
           .ent
nonleaf:
           subu
                     $sp, 24
                                        ## Create stackframe
                     $31, 20($sp)
                                        ## Save the return address
           sw
           .mask
                     0x80000000, -4
           .frame
                     $sp, 24, $31
 #
     5
            double atof();
 #
     6
            int temp;
 #
     7
 #
             temp = i - *j;
     8
                     $2, 0($5)
           lw
                                       ## Arguments are in $4 and $5
                     $3, $4, $2
           subu
            if (i < *j) temp = -temp;
 #
     9
                     $4, $2, $32 ## Note: $32 is a label, not a register
           bge
                     $3, $3
           negu
$32:
 # 10
             return atof(temp);
                     $4, $3
           move
           jal
                     atof
           cvt.s.d
                     $f0, $f0
                                        ## Return value goes in $f0
           lw
                     $31, 20($sp)
                                        ## Restore return address
           addu
                     $sp, 24
                                        ## Delete stackframe
                     $31
                                        ## Return to caller
           j
                     nonleaf
           .end
```



#### 7-10 Assembly Language Programmer's Guide

Figure 7-4 shows a leaf procedure that does not require stack space for local variables. Notice that it creates no stackframe, and saves no return address:

```
int
leaf(p1, p2)
  int p1, p2;
  {
  return (p1 > p2) ? p1 : p2;
  }
              .globl
                            leaf
              int
     1
              leaf(p1, p2)
     2
 Ħ
     3
                int p1, p2;
     4
                {
              .ent
                            leaf 2
leaf:
              .frame
                            $sp, 0, $31
                 return (p1 > p2) ? p1 : p2;
     5
              ble
                            $4, $5, $32
                                            ## Arguments in $4 and $5
              move
                            $3, $4
                            $33
              b
$32:
                            $3, $5
              move
$33:
              move
                            $2, $3
                                         ## Return value goes in $2
                            #31
                                           ## Return to caller
              j
                 }
  #
     6
              .end leaf
```

Figure 7-4. Leaf Procedure Without Stack Space for Local Variables

Figure 7-5 shows a leaf procedure that requires stack space for local variables. Notice that it creates a stack frame, but does not save a return address.



Figure 7-5. Leaf Procedure With Stack Space for Local Variables

# 7.4 Learning by Doing

The rules and parameter requirements required between assembly language and other languages are varied and complex. The simplest approach to coding an interface between an assembly routine and a routine written in a high-level language is to do the following:

- Use the high-level language to write a skeletal version of the routine that you plan to code in assembly language.
- Compile the program using the -S option, which creates an assembly language (.s) version of the compiled source file.
- Study the assembly-language listing and then, imitating the rules and conventions used by the compiler, write your assembly language code.

The next two sections illustrate techniques to use in creating an interface between assembly language and high-level language routines. The examples shown are merely to illustrate what to look for in creating your interface. Details such as register numbers will vary according to the number, order, and data types of the arguments. You should write and compile realistic examples of your own code in writing your particular interface.

### 7.4.1 Calling a High–Level Language Routine

The following steps show a technique to follow in writing an assembly language routine that calls *atof*, a routine written in C that converts ASCII characters to numbers; for more information, see the **atof(3)** in the *IRIS-4D Programmer's Reference Manual*.

1. Write a C program that calls *atof*. Pass global rather than local variables; this makes them to recognize in the assembly language version of the C program. (and ensures that optimization doesn't remove any of the code on the grounds that it has no effect.)

Figure 7-6 is an example of a C program that calls atof.





2. Compile the program using the using the compiler options shown below:

cc -S -O caller.c

The -S option causes the compiler to produce the assembly-language listing; the -O option, though not required, reduces the amount of code generated, making the listing easier to read.

3. After compilation, look at the file caller.s (shown below). The highlighted section of the listing shows how the parameters are passed, the execution of the call, and how the returned values are retrieved.



Figure 7-7. Compilation Listing

#### 7.4.2 Calling an Assembly Language Routine

This section shows a technique to follow in writing an assembly language routine that calls a routine written in a high-level language (Pascal is used in this example).

1. Write a facsimile of the assembly language routine you wish to call. In the body of the routine, write statements that use the same arguments you intend to use in the final assembly language routine. Copy the arguments to global variables rather than local variables to make it easy for you to read the resulting assembly language listing.

Figure 7-8 shows the Pascal facsimile of the assembly language program.

```
type
  str = packed array [1 .. 10] of char;
  subr = 2 .. 5;
var
  global_r: real;
  global_c: subr;
  global_s: str;
  global_b: boolean;
function callee(var r: real; c: subr; s: str): boolean;
  begin
  global_r := r;
  global_r := r;
  global_c := c;
  global_s := s;
  callee := c = 3;
  end;
```

Figure 7-8. Pascal Facsimile

2. Compile the program using the using the compiler options shown below:

cc -S -O caller.c

The -S option causes the compiler to produce the assembly-language listing; the -O option, though not required, reduces the amount of code generated, making the listing easier to read.

3. After compilation, look at the file caller.s (shown below). The highlighted section of the listing shows how the parameters are passed, the execution of the call, and how the returned values are retrieved.

|              | .lcom                  | m \$dat 0                                |                                            |
|--------------|------------------------|------------------------------------------|--------------------------------------------|
|              | .comm                  |                                          |                                            |
|              | .comm                  |                                          |                                            |
|              | .comm                  | global_s 10                              |                                            |
|              | .comm                  | global_b 1                               |                                            |
|              | .text                  |                                          |                                            |
|              |                        | l callee                                 |                                            |
| # 1          |                        |                                          | r: real; c: subr; s: str): boolean;        |
|              | .ent                   | callee 2                                 |                                            |
| calle        |                        | - #~~ - #21                              |                                            |
|              | .iram<br>SW            | e \$sp, 0, \$31<br>\$5, 4(\$sp)          |                                            |
|              | sw<br>sw               | \$6, 8(\$sp)                             |                                            |
|              |                        |                                          |                                            |
|              | lbu                    | \$3, 4(\$sp)                             | ## Get subrange c, masking it to 8 bits    |
|              | and                    | \$3, \$3, 255                            |                                            |
| # 1          |                        | 43, 43, 200<br>begin                     |                                            |
|              |                        | global_r := r;                           |                                            |
|              | 1.s                    | \$f4, 0(\$4)                             | ## The pointer to "r" is in O(\$4)         |
|              |                        |                                          |                                            |
| # 1          | s.s<br>3               | <pre>\$f4, global_r global c := c;</pre> |                                            |
| 7 1          | sb.                    | \$3, global_c                            |                                            |
| # 1          | .4                     | global_s := s;                           |                                            |
|              | la                     | \$14, global s                           | ## For array "s", the caller gives you a   |
|              | addu                   | \$15, \$sp, 8                            | ## pointer at 8(\$sp). If you want to use  |
|              | .set                   | noat                                     | ## it as a call-by-value argument just as  |
|              | addu                   | \$24, \$15, 10                           | ## Pascal does (that is, if you want to    |
| \$32:        |                        |                                          | ## be able to modify a local copy without  |
|              | lbu                    | \$1, 0(\$15)                             | ## affecting the global copy) then you     |
|              | addu                   | \$15, \$15, 2                            | ## must copy it into your stack frame as   |
|              | sb                     | \$1, 0(\$14)                             | ## shown here (the code enclosed in ".set  |
| l            | lbu                    | \$1, -1(\$15)                            | ## noat" is a tight byte-copying loop).    |
|              | addu                   | \$14, \$14, 2                            | ## Otherwise, you may simply use the       |
|              | sb                     | \$1, -1(\$14)                            | ## pointer provided to you.                |
|              | bne                    | \$15, \$24, \$32                         |                                            |
|              | .set                   | at at                                    |                                            |
| # 1          | L5                     | callee := c = 3;                         |                                            |
|              | seq                    | \$5, \$3, 3                              |                                            |
|              | and                    | <b>\$5, \$5, 255</b>                     |                                            |
| # 1          | 16                     | end;                                     |                                            |
|              | and                    | \$2, \$5, 255                            | ## Return the boolean by leaving it in \$2 |
|              | j                      | \$31                                     |                                            |
|              | .end                   |                                          |                                            |
| 200000000000 | <u>and dialage ang</u> |                                          |                                            |

Figure 7-9. Compilation Listing

# 7.5 Memory Allocation

The machine's default memory allocation scheme gives every process two storage areas, that can grow without bound. A process exceeds virtual storage only when the sum of the two areas exceeds virtual storage space. The link editor and assembler use the scheme shown in Figure 7–10; an explanation of each area in the alocation scheme follows the figure.





- 1 Reserved for kernel operations.
- 2 Reserved for operating system use.
- 3 Used for local data in C programs.
- 4 Not allocated until a user requests it, as in System V shared memory regions.
- 5 The heap is reserved for sbrk and break system calls, and it not always present.
- 6 The machine divides all data into one of five sections:
  - bss —Uninitialized data with a size greater than the value specified by the –G command line option.
  - sbss —Data less than or equal to the –G command line option. (512 is the default value for the –G option.)
  - sdata (small data)—Data initialized and specified for the sdata section.
  - data (data)—Data initialized and specified for the data section.
  - rdata (read-only data)—Data initialized and specified for the rdata section.
- 7 Reserved for any shared libraries.
- 8 Contains the .text section
- 9 Reserved.

)

# 8. Pseudo Op-Codes

The keywords in Chapter 8 describe pseudo op-codes (directives). These pseudo op-codes influence the assembler's later behavior. In the text, boldface type specifies a keyword and italics represents an operand that you define.

| Pseudo-Op         | Description                                                                                                                                                                                                                                                                                                                                                               |  |
|-------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|
| .align expression | Advance the location counter to make the <i>expression</i> low order bits of the counter zero.                                                                                                                                                                                                                                                                            |  |
|                   | Normally, the .half, .word, .float, and<br>.double directives automatically align their<br>data appropriately. For example, .word<br>does an implicit .align 2 (.double does a<br>.align 3). You disable the automatic<br>alignment feature with .align 0. The as-<br>sembler reinstates automatic alignment at<br>the next .text, .data, .rdata, or .sdata<br>directive. |  |
|                   | Labels immediately preceding an automatic<br>or explicit alignment are also realigned.<br>For example, foo: .align 3; .word 0 is the<br>same as .align 3; foo: .word0.                                                                                                                                                                                                    |  |

The assembler has the pseudo op-codes shown in Table 8-1.

Table 8-1. Pseudo Op-Codes

| Pseudo-Op                                           | Description                                                                                                                                                                                                                                                                                                                                                                                      |
|-----------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| .ascii string [, string]                            | Assembles each <i>string</i> from the list into successive locations. The <b>.ascii</b> directive does not null pad the string. You MUST put quotation marks (") around each string. You can use the backslash escape characters. For a list of the backslash characters, see Chapter 4.                                                                                                         |
| asciiz string [, string]                            | Assembles each <i>string</i> in the list into successive locations and adds a null. You can use the backslash escape characters. For a list of the backslash characters, see Chapter 4.                                                                                                                                                                                                          |
| .asm0                                               | Tells the assembler's second pass that this assembly came from the first pass. (For use by compilers.)                                                                                                                                                                                                                                                                                           |
| .bgnb symno                                         | (For use by compilers.) Sets the beginning<br>of a language block. The .bgnb and<br>.endb directives delimit the scope of a<br>variable set. The scope can be an entire<br>procedure, or it can be a nested scope (for<br>example a "{}" block in the C language).<br>The symbol number <i>symno</i> refers to a<br>dense number in a .T file. To set the end<br>of a language block, see .endb. |
| .byte expression1 [, expression]<br>[, expressionN] | on2 ]<br>Truncates the expressions from the<br>comma-separated list to 8-bit values, and<br>assembles the values in successive loca-<br>tions. The expressions must be absolute.<br>The operands can optionally have the<br>form: expression1 [: expression2]. The<br>expression2 replicates expression1's value<br>expression2 times.                                                           |

8-2 Assembly Language Programmer's Guide

| Pseudo-Op                                        | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
|--------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| .comm name, expression                           | Unless defined elsewhere, <i>name</i> becomes a global common symbol at the head of a block of <i>expression</i> bytes of storage. The linker overlays like-named common blocks, using the maximum of the <i>expression</i> s.                                                                                                                                                                                                                                                                                                                                                                  |
| .data                                            | Tells the assembler to add all subsequent data to the <b>data</b> section.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |
| .double expression [ , express<br>[, expressionN | <i>tion2 ]</i><br>Initializes memory to 64-bit floating point<br>numbers. The operands can optionally<br>have the form: <i>expression1</i> [ : <i>expression2</i> ]. The <i>expression1</i> is the floating point<br>value. The optional <i>expression2</i> is a non-<br>negative expression that specifies a repeti-<br>tion count. The <i>expression2</i> replicates <i>ex-<br/>pression1</i> 's value <i>expression2</i> times. This<br>directive automatically aligns its data and<br>any preceding labels to a double-word<br>boundary. You can disable this feature by<br>using .align 0. |
| .end [proc_name]                                 | Sets the end of a procedure. Use this di-<br>rective when you want to generate informa<br>tion for the debugger. To set the begin-<br>ning of a procedure, see .ent.                                                                                                                                                                                                                                                                                                                                                                                                                            |
| .endb symno                                      | Sets the end of a language block. To set the beginning of a language block, see .bgnb.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |
| .endr                                            | Signals the end of a repeat block. To state a repeat block, see .repeat.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |
| .ent proc_name                                   | Sets the beginning of the procedure <i>proc_name</i> . Use this directive when you want to generate information for the debugger. To set the end of a procedure, see .end.                                                                                                                                                                                                                                                                                                                                                                                                                      |

Table 8-1. Pseudo Op-Codes (continued)

Description

name is a global undefined symbol whose .extern name expression size is assumed to be *expression* bytes. The advantage of using this directive, instead of permitting an undefined symbol to become global by default, is that the assembler can decide whether to use the economical \$gp-relative addressing mode. depending on the value of the -G option. As a special case, if *expression* is zero, the assembler refrains from using \$gp to address this symbol regardless of the size specified by -G. .err Signals an error. Any compiler front-end that detects an error condition puts this directive in the input stream. When the assembler encounters a .err, it quietly ceases to assemble the source file. This prevents the assembler from continuing to process a program that is incorrect. (For use by compilers.) .file file number file name string Specifies the source file corresponding to the assembly instructions that follow. (For use by compilers.) .float expression1 [, expression2 ]... [, expressionN ] Initializes memory to single precision 32-bit floating point numbers. The operands can optionally have the form: expression1 [: expression2]. The optional expression2 is a non-negative expression that specifies a repetition count. This optional form replicates expression1's value expression2 times. This directive automatically aligns its data and preceding labels to a word boundary. You can disable this fea-

Table 8-1. Pseudo Op-Codes (continued)

ture by using .align 0.

8-4 Assembly Language Programmer's Guide

Description

.fmask mask offset

Sets a mask with a bit turned on for each floating point register that the current routine saved. The least-significant bit corresponds to register \$f0. The offset is the distance in bytes from the virtual frame pointer at which the floating point registers are saved. The assembler saves higher register numbers closer to the virtual frame pointer. You must use .ent before .fmask and only one .fmask may be used per .ent. Space should be allocated for those registers specified in the .fmask.

.frame frame-register offset return\_pc\_register

.globl name

Describes a stack frame. The first register is the frame-register, the offset is the distance from the frame register to the virtual frame pointer, and the second register is the return program counter (or, if the first register is \$0, this directive shows that the return program counter is saved four bytes from the virtual frame pointer). You must use .ent before .frame and only one .frame may be used per .ent. No stack traces can be done in the debugger without .frame.

Makes the *name* external. If the name is otherwise defined (by its appearance as a label), the assembler will export the symbol; otherwise it will import the symbol. In general, the assembler imports undefined symbols (that is, it gives them the UNIX storage class "global undefined" and requires the linker to resolve them).

Table 8-1. Pseudo Op-Codes (continued)

Pseudo-Op

#### Description

| .half expression1 [, expressio                                                                                                                                                                         | n2 1 [. expressionN]                                                                                                                                                                                                                                                                                                                                                                   |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|                                                                                                                                                                                                        | Truncates the <i>expressions</i> in the comma-<br>separated list to 16-bit values and assem-<br>bles the values in successive locations.<br>The <i>expressions</i> must be absolute. This                                                                                                                                                                                              |
|                                                                                                                                                                                                        | directive can optionally have the form: <i>expression1</i> [ : <i>expression2</i> ]. The <i>expression2</i> replicates <i>expression1</i> 's value <i>expression2</i> times. This directive automatically                                                                                                                                                                              |
| an an an an an an an ann an Arainn<br>An an an Arainn an Arainn an Arainn an Arainn<br>An an Arainn an Arain | aligns its data appropriately. You can dis-<br>able this feature by using <b>.align 0</b> .                                                                                                                                                                                                                                                                                            |
| .lab label_name                                                                                                                                                                                        | Associates a named label with the current location in the program text. (For use by compilers).                                                                                                                                                                                                                                                                                        |
| .lcomm name, expression                                                                                                                                                                                | Makes the <i>name</i> 's data type bss. The assembler allocates the named symbol to the bss area, and the expression defines the named symbol's length. If a .globl directive also specifies the name, the assembler allocates the named symbol to external bss. The assembler puts bss symbols in one of two bss areas. If the defined size is smaller than the size specified by the |
|                                                                                                                                                                                                        | assembler or compiler's -G command line<br>option, the assembler puts the symbols in<br>the sbss area and uses <b>\$gp</b> to address the<br>data.                                                                                                                                                                                                                                     |

Table 8-1. Pseudo Op-Codes (continued)

8-6 Assembly Language Programmer's Guide

.loc file\_number line\_number

Specifies the source file and the line within that file that corresponds to the assembly instructions that follow. The assembler ignores the file number when this directive appears in the assembly source file. Then, the assembler assumes that the directive refers to the most recent .file directive. When a .loc directive appears in the binary assembly language .G file, the file number is a dense number pointing at a file symbol in the symbol table .T file. For more information about .G and .T files, see the *IRIS-4D Series Compiler Guide*.

.mask mask offset

nop

Sets a mask with a bit turned on for each general purpose register that the current routine saved. Bit one corresponds to register \$1. The offset is the distance in bytes from the virtual frame pointer where the registers are saved. The assembler saves higher register numbers closer to the the virtual frame pointer. Space should be allocated for those registers appearing in the mask. If bit zero is set it is assumed that space is allocated for all 31 registers regardless of whether they appear in the mask. (For use by compilers).

Tells the assembler to put in an instruction that has no effect on the machine state. While several instructions cause no-operation, the assembler only considers the ones generated by the **nop** directive to be wait instructions. This directive puts an explicit delay in the instruction stream.

Table 8-1. Pseudo Op-Codes (continued)

| Pseudo-Op          | Description                                                                                                                                                                                                                                                                                                    |  |
|--------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|
| .option options    | Tells the assembler that certain options<br>were in effect during compilation. (These<br>options can, for example, limit the assem-<br>bler's freedom to perform branch optimiza-<br>tions.) This option is intended for com-<br>piler-generated .s files rather than for<br>hand-coded ones.                  |  |
| .repeat expression | Repeats all instructions or data between the .repeat directive and the .endr directive.<br>The <i>expression</i> defines how many times the data repeats. With the .repeat directive, you CANNOT use labels, branch instructions, or values that require relocation in the block. To end a .repeat, see .endr. |  |
| .rdata             | Tells the assembler to add subsequent data into the rdata section.                                                                                                                                                                                                                                             |  |
| .sdata             | Tells the assembler to add subsequent data to the sdata section.                                                                                                                                                                                                                                               |  |

 Table 8–1.
 Pseudo Op-Codes (continued)

#### Description

.set option

Instructs the assembler to enable or to disable certain options. Use set options only for hand-crafted assembly routines. The assembler has these default options: **reorder**, **macro**, and **at**. You can specify only one option for each .set directive. You can specify these .set options:

- The reorder option lets the assembler reorder machine language instructions to improve performance.
- The noreorder option prevents the assembler from reordering machine language instructions. If a machine language instruction violates the hardware pipeline constraints, the assembler issues a warning message.
- The macro option lets the assembler generate multiple machine instructions from a single assembler instruction.
  - The nomacro option causes the assembler to print a warning whenever an assembler operation generates more than one machine language instruction. You must select the noreorder option before using the nomacro option; otherwise, an error results.
- The at option lets the assembler use the **\$at** register for macros, but generates warnings if the source program uses **\$at**.

Table 8-1. Pseudo Op-Codes (continued)

| seudo-Op           | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
|--------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|                    | • When you use the <b>noat</b> option and an assembler operation requires the <b>\$at</b> register, the assembler issues a warning message; however, the <b>noat</b> option does let source programs use <b>\$at</b> without issuing warnings.                                                                                                                                                                                                                                                                                                                                                                                                            |
|                    | <ul> <li>The nomove options tells the assembler to mark each subsequent instruction so that it cannot be moved during reorganization. Because the assembler can still insert nop instructions where necessary for pipeline constraints, this option is less stringent than noreorder. The assembler can still move instructions from below the nomove region to fill delay slots above the region or vice versa. The nomove option has part of the effect of the "volatile" C declaration; it prevents otherwise independent loads or stores from occurring in a different order than intended.</li> <li>The move option cancels the effect of</li> </ul> |
|                    | • The move option cancels the effect of nomove.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
| space expression   | Advances the location counter by the value<br>of the specified <i>expression</i> bytes. The as-<br>sembler fills the space with zeros.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
| .struct expression | This permits you to lay out a structure us-<br>ing labels plus directives like .word, .byte,<br>and so forth. It ends at the next segment<br>directive (.data, .text, etc.). It does not<br>emit any code or data, but defines the la-<br>bels within it to have values which are the<br>sum of <i>expression</i> plus their offsets from<br>the .struct itself.                                                                                                                                                                                                                                                                                          |

8-10 Assembly Language Programmer's Guide

| Pseudo-Op                       | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
|---------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| (symbolic equate)               | Takes one of these forms: $name = expression$ or $name = register$ . You must define the name only once in the assembly, and you CANNOT redefine the name. The expression must be computable when you assemble the program, and the expression must involve operators, constants, and equated symbols. You can use the name as a constant in any later statement.                                                                                                                              |
| .text                           | Tells the assembler to add subsequent code to the text section. (This is the default.)                                                                                                                                                                                                                                                                                                                                                                                                         |
| .verstamp major minor           | Specifies the major and minor version<br>numbers (for example, version 0.15 would<br>be .verstamp 0 15).                                                                                                                                                                                                                                                                                                                                                                                       |
| .vreg register offset symno     | (For use by compilers). Describes a register variable by giving the offset from the virtual frame pointer and the symbol number <i>symno</i> (the dense number) of the surrounding procedure.                                                                                                                                                                                                                                                                                                  |
| .word expression1 [, expression | on2 ] [, expressionN]<br>Truncates the expressions in the comma-<br>separated list to 32-bits and assembles the<br>values in successive locations. The expres-<br>sions must be absolute. The operands can<br>optionally have the form: expression1 [:<br>expression2]. The expression2 replicates<br>expression1's value expression2 times.<br>This directive automatically aligns its data<br>and preceding labels to a word boundary.<br>You can disable this feature by using<br>.align 0. |

Table 8-1. Pseudo Op-Codes (continued)



# 9. Object File Format

This chapter provides information on the object file format and has the following major topics:

- An overview of the components that make up the object file, and the differences between the object-file format and the UNIX System V common object file format (COFF).
- A description of the headers and sections of the object file. Detailed information is given on the logic followed by the assembler and link editor in handling relocation entries.
- The format of the three types of object files (OMAGIC, NMAGIC, and ZMAGIC), and information used by the system loader in loading object files at run-time.
- Archive files and link editor defined symbols.

## 9.1 Overview

The assembler and the link editor generate object files in the order shown in Figure 9-1. Any areas empty of data are omitted, except that the Optional Header is always present.

The fields of the Symbol table portion (indicated in Figure 9-1) that appear in the final object file format vary, as follows:

- The Line Numbers, Optimization Symbols, and Auxiliary Symbols subtables appear only when debugging is on (when the user specifies one of the compiler -g1, -g2 or -g3 options ).
- When the user specifies the -x option (strip non-globals) for the link edit phase, the link editor strips the Local Symbols, Optimization Symbols, Auxiliary Symbols subtables from the object file, and updates the Procedure Descriptor table.
- The link editor strips the entire Symbol table from the object file when the user specifies the -s option (strip) for the link edit phase.

Any new assembler or link editor designed to work with the compiler system should lay out the object file in the order shown in Figure 9-1. The link editor can process object files that are ordered differently, but performance may be degraded.

#### 9-2 Assembly Language Programmer's Guide



Figure 9-1. Object Files

Readers already familiar with standard UNIX System V COFF (common object file format) may be interested in the difference between it and the compiler system format, as described next.

The compiler system File Header definition is based on UNIX System V header file *filehdr.h* with the following modifications.

- The symbol table file pointer and the number of symbol table entries now specify the file pointer and the size of the Symbolic Header respectively (described in Chapter 10).
- All tables that specify symbolic information have their file pointers and number of entries in this Symbolic Header.

The Optional Header definition has the same format as the UNIX System V header file *aouthdr.h* (the standard UNIX system a.outheader) except the following fields have been added: *bss\_start, gprmask, cprmask,* and  $gp\_value$ . See Table 9-4.

The Section Header definition has the same format as the UNIX System V's header file *scnhdr.h.* except the line number fields (*s\_lnnoptr and s\_nlnno*) aren't used. See Table 9-6.

The relocation information definition is similar to Berkeley UNIX, which has "local" relocation types; however, you should read the topic Section Relocation Information in this chapter to be aware of differences that do exist.

## 9.2 The File Header

Table 9–1 shows the format of the File Header; the header file filehdr.h contains its definition.

9-4 Assembly Language Programmer's Guide

**IRIS-4D Series** 

( )

| Declaration                                                                          | Field                                                                               | Description                                                                                                                                                |
|--------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------|
| unsigned short<br>unsigned short<br>long<br>long<br>unsigned short<br>unsigned short | f_magic;<br>f_nscns;<br>f_timdat;<br>f_symptr;<br>f_nsyms;<br>f_opthdr;<br>f_flags; | magic number<br>number of section<br>time and date stamp<br>file pointer to symbolic header<br>size of symbolic header<br>size of optional header<br>flags |

Table 9-1. File Header Format

The  $f\_symptr$  points to the Symbolic Header of the Symbol table, and the  $f\_nsyms$  gives the size of the header. For a description of the Symbolic Header, see Chapter 10. Other fields in the File Header are described in the sections that follow.

### 9.2.1 File Header Magic Field (f\_magic)

The magic number in the  $f_{magic}$  entry in the File Header specifies the target machine on which an object file can execute. Table 9-2 shows the values and mnemonics for the magic numbers; the header file *filehdr.h* contain the preprocessor macro definitions.

| Symbol       | Value  | Description                                                                       |
|--------------|--------|-----------------------------------------------------------------------------------|
|              |        |                                                                                   |
| MIPSEBMAGIC  | 0x0160 | big-endian target (headers and tables have same byte sex as host machine.         |
| MIPSELMAGIC  | 0x0162 | little-endian target (headers and tables have same byte sex as host machine.)     |
| SMIPSEBMAGIC | 0x6001 | big–endian target (headers and tables<br>have opposite byte sex as host machine.) |
| SMIPSELMAGIC | 0x6201 | little-endian target (headers and tables have opposite byte sex as host machine)  |
|              |        |                                                                                   |

Table 9-2. File Header Magic Numbers

## 9.2.2 Flags (f\_flags)

The  $f_flags$  field describes the object file characteristics. Table 9-3 describes the flags and gives their hexadecimal bit patterns. The table notes those flags that don't apply to compiler system object files.

| Symbol   | Value  | Description                                                                                                                    |
|----------|--------|--------------------------------------------------------------------------------------------------------------------------------|
| F_RELFLG | 0x0001 | relocation information stripped from file                                                                                      |
| F_EXEC   | 0x0002 | file is executable (i.e. no unresolved external references)                                                                    |
| F_LNNO   | 0x0004 | line numbers stripped from file                                                                                                |
| F_LSYMS  | 0x0008 | local symbols stripped from file                                                                                               |
| F_MINMAL | 0x0010 | <sup>1</sup> minimal object file (".m") output of fextract                                                                     |
| F_UPDATE | 0x0020 | <sup>1</sup> fully bound update file, output of ogen                                                                           |
| F_SWABD  | 0x0040 | <sup>1</sup> file whose bytes were swabbed (in names)                                                                          |
| F_AR16WR | 0x0080 | <sup>1</sup> file has the byte ordering of an AR16WR<br>(e.g.11/70) machine (it was created there,<br>or was produced by conv) |
| F_AR32WR | 0x0100 | file has the byte ordering of an AR32WR machine (e.g. vax)                                                                     |
| F_AR32W  | 0x0200 | <sup>1</sup> file has the byte ordering of an AR32W machine (e.g. 3b,maxi,MC68000)                                             |
| F_PATCH  | 0x0400 | <sup>1</sup> file contains "patch" list in Optional Header                                                                     |
| F_NODF   | 0x0400 | <sup>1</sup> (minimal file only) no decision functions for replaced functions.                                                 |

<sup>1</sup>Not used by compiler system object modules.

Table 9-3. File Header Flags

# 9.3 Optional Header

The link editor and the assembler fill in the Optional Header, and the system (kernel) loader (or other program that loads the object module at run-time) uses the information it contains, as described in the section **Loading Object Files** in this chapter.

| Symbol | Value       | Description                                                 |
|--------|-------------|-------------------------------------------------------------|
| short  | magic;      | See Table 9.5.                                              |
| short  | vstamp;     | version stamp                                               |
| long   | tsize;      | text size in bytes, padded to double-word boundary          |
| long   | dsize;      | initialized data in bytes, padded to double-word boundary   |
| long   | bsize;      | uninitialized data in bytes, padded to double-word boundary |
| long   | entry;      | entry point                                                 |
| long   | text_start; | base of text used for this file                             |
| long   | data_start; | base of data used for this file                             |
| long   | bss_start;  | base of bss used for this file                              |
| long   | gprmask;    | general purpose register mask                               |
| long   | cprmask[4]; | co-processor register masks                                 |
| long   | gp_value;   | the gp value used for this object                           |

Table 9-4 shows the format of the Optional Header; the header file *aouthdr.h* contains its definition.

Table 9-4. Optional Header Definition

The next section describes the magic field in the Optional Header.

9-8 Assembly Language Programmer's Guide

### 9.3.1 Optional Header Magic Field (magic)

Table 9-5 shows the values of the *magic* field for the Optional Header; the header file *aouthdr.h* contains the preprocessor macro definitions.

| Symbol | Value | Description                                                                                                                                                                                                                                                                                                     |
|--------|-------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| OMAGIC | 0407  | Impure Format. The text is not write-protected or sharable; the data segment is contiguous with the text segment.                                                                                                                                                                                               |
| NMAGIC | 0410  | Shared Text. The data segment starts at the next page following the text segment and the text segment is write-protected.                                                                                                                                                                                       |
| ZMAGIC | 0413  | Demand Paged. The object file is to be demand<br>loaded (each page is copied into memory directly<br>from load module on the first reference to that<br>page), the file has a special format, and the text<br>and data segments start at fixed addresses. The<br>text segment is write-protected (the default). |

Table 9-5. UNIX Magic Numbers

See the **Object Files** section in this chapter for information on the format of OMAGIC, NMAGIC, and ZMAGIC files.

# 9.4 Section Headers

Table 9–6 shows the format of the Section Header; the header file *scnhdr.h.* contains the definition.

| Declaration                                          | Field                                                                   | Description                                                                                                                               |   |
|------------------------------------------------------|-------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------|---|
| char<br>long<br>long<br>long<br>long<br>long<br>long | s_name[8];<br>s_paddr;<br>s_vaddr;<br>s_size;<br>s_scnptr;<br>s_relptr; | section name<br>physical address<br>virtual address<br>section size<br>file pointer to raw data for section<br>file pointer to relocation | ( |
| long<br>unsigned short<br>unsigned short<br>long     | s_lnnoptr;<br>s_nreloc;<br>s_nlnno;<br>s_flags;                         | file pointer to line numbers (not used)<br>number of relocation entries<br>number of line number entries (not used)<br>flags              |   |

| Table 9-6. Section Header Forma |
|---------------------------------|
|---------------------------------|

The sections that follow describe in detail some of the entries in the Section Header.

### 9.4.1 Section Name (s\_name)

Table 9-7 shows the constants for section names that can appear in the  $s_name$  field of the Section Header; the header file  $s_nhdr.h$  contains the preprocessor macro definitions.

| Declaration | Field    | Description            |
|-------------|----------|------------------------|
| _TEXT       | ".text"  | text section           |
| _RDATA      | ".rdata" | read only data section |
| _DATA       | ".data"  | large data section     |
| _SDATA      | ".sdata" | small data section     |
| _BSS        | ".bss"   | large bss section      |
| _SBSS       | ".sbss"  | small bss section      |

Table 9-7. Section Header Constants for Section Names

### 9.4.2 Line Number Entries (s\_Innoptr and s\_nInno)

The link editor fills the line number fields with zeros. The compiler and link editor don't use the line number entries in the Section Header, because line number information appears only once in the object file, not once per section. The compiler instead places line numbers in the Line Numbers subtables of the Symbol table, as described in Chapter 10.

#### 9.4.3 Flags (s\_flags)

Table 9-8 shows the flags that appear in s\_flags and notes those flags that apply to compiler system object files; the header file scnhdr.h contains their definition.

| Symbol                                 | Value | Description                                              |
|----------------------------------------|-------|----------------------------------------------------------|
| •••••••••••••••••••••••••••••••••••••• |       |                                                          |
| STYP_REG                               | 0x00  | regular section; allocated, relocated, loaded            |
| STYP_DSECT                             | 0x01  | <sup>1</sup> dummy; not allocated, relocated, not loaded |
| STYP_NOLOAD                            | 0x02  | <sup>1</sup> noload; allocated, relocated, not loaded    |
| STYP_GROUP                             | 0x04  | <sup>1</sup> grouped; formed of input sections           |
| STYP_PAD                               | 0x08  | <sup>1</sup> padding; not allocated, not relocated,      |
|                                        |       | loaded                                                   |
| STYP_COPY                              | 0x10  | <sup>1</sup> copy; for decision function used by field   |
|                                        |       | update; not allocated, not relocated,                    |
|                                        |       | loaded; relocated, and line number entries               |
|                                        |       | processed normally                                       |
| STYP_TEXT                              | 0x20  | text only                                                |
| STYP_RDATA                             | 0x100 | read only data only                                      |
| STYP_DATA                              | 0x40  | data only                                                |
| STYP_SDATA                             | 0x200 | small data only                                          |
| STYP_SBSS                              | 0x80  | contains small bss only                                  |
| STYP_BSS                               | 0x400 | bss only                                                 |

<sup>1</sup>Not used by compiler system object modules.

Table 9-8. Format of s\_flags Section Header Entry

**NOTE:** For performance reasons, the link editor uses the *s\_flags* entry instead of *s\_name* to determine the type of section. However, the link editor does correctly fill in the *s\_name* entry.

## 9.5 Section Data

A section represents the smallest portion of the object file; compiler systme files are represented in six sections: .text, .rdata (read-only data), .data (data)<sup>1</sup> .sdata (small data), .sbss (small block started by storage), and .bss (block started by storage). The .text section contains the machine instructions that are to be executed; the .rdata, .data, and .sdata contain initialized data; and the .sbss, and .bss sections reserve space for uninitialized data that is created by the kernel loader for the program before execution and filled with zeros.

Figure 9-2 shows the layout of the six sections.



Figure 9-2. Organization of Section Data

As noted in Figure 9-2, the sections are grouped into the text segment (containing the *.text* section), the data segment (*.rdata*, *.data*, and *s.data*) and the bss segment (*.sbss* and *.bss*). A section is described in and referenced through the Section Header, and segments through the Optional Header.

#### 9-12 Assembly Language Programmer's Guide

The link editor references the data shown in Figure 9-2 both as sections and segments, through the Section Header and Optional Header respectively. However, the system (kernel) loader, when loading the object file at run-time, references the same data only by segment, through the Optional Header.

# 9.6 Section Relocation Information

This portion of the chapter is divided into the following parts:

- The format of a relocation table entry and an explanation of its fields.
- The logic followed by the assembler and the link editor is creating and updating an entry.

#### 9.6.1 Relocation Table Entry

Table 9–9 shows the format of an entry in the Relocation Table; the header file reloc.h contains the definition.

| Declaration Field |                                           | Description                                                                                                                                                                                                                                      |
|-------------------|-------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| long<br>unsigned  | r_vaddr;<br>r_symndx:24,                  | (virtual) address of an item to be relocated.<br>index into external symbols or section<br>number; see r_extern below.                                                                                                                           |
|                   | r_reserved:3,<br>r_type:4,<br>r_extern:1; | <ul> <li>relocation type</li> <li>= 1 for an external relocation entry;</li> <li>r_symndx is an index into External Symbols.</li> <li>= 0 for a local, relocation entry; r_symndx is the number of the section containing the symbol.</li> </ul> |

Table 9-9. Format of a Relocation Table Entry

The sections that follow describe some of the fields shown in Table 9-9.

### Symbol Index (r\_symndx) and Extern Field (r\_extern)

For external relocation entries,  $r\_extern$  is set to 1 and  $r\_symnndx$  is the index into External Symbols for this entry. In this case, the value of the symbol is used as the value for relocation.

For local relocation entries,  $r\_extern$  is set to 0, and  $r\_symndx$  contains a constant that refers to a section. In this case, the starting address of the section to which the constant refers is used as the value for relocation.

Table 9–10 gives the section numbers for  $r_symndx$ ; the *reloc.h* file contains their preprocessor macro definitions.

| Symbol                  | Value  | Description                     |
|-------------------------|--------|---------------------------------|
| R_SN_TEXT               | 1      | .text section                   |
| R_SN_RDATA<br>R_SN_DATA | 2<br>3 | .rdata section<br>.data section |
| R_SN_SDATA<br>R_SN_SBSS | 4<br>5 | sdata section.                  |
| R_SN_BSS                | 6      | .bss section                    |

Table 9-10. Section Numbers for Local Relocation Entries

### Relocation Type (r\_type)

Table 9–11 shows valid symbolic entries for the relocation type  $(r_type)$  field; the header file *reloc.h* contains their preprocessor macro definitions.

| Symbol    | Value | Description                                                                        |
|-----------|-------|------------------------------------------------------------------------------------|
| R_ABS     | 0x0   | relocation already performed.                                                      |
| R_REFHALF | 0x1   | 16-bit reference to the symbol's virtual address                                   |
| R_REFWORD | 0x2   | 32-bit reference to the symbol's virtual address                                   |
| R_JMPADDR | 0x3   | 26-bit jump reference to the symbol's virtual address                              |
| R_REFHI   | 0x4   | reference to the high 16-bits of symbol's virtual address                          |
| R_REFLO   | 0x5   | reference to the low 16-bits of symbol's virtual address                           |
| R_GPREL   | 0x6   | reference to the offset from the global pointer<br>to the symbol's virtual address |

Table 9-11. Relocation Types

### 9.6.2 Assembler and Link Editor Processing

Compiler system executable object modules with all external references defined, have the same format as relocatable modules and are executable without re-link editing.

Local relocation entries must be used for symbols that are defined. Therefore, external relocations are used only for undefined symbols. Figure 9-3 gives an overview of the Relocation Table entry for an undefined external symbol.



Figure 9-3. Relocation Table Entry for Undefined External Symbols

The assembler creates this entry as follows:

- Sets *r\_vaddr* to point to the item to be relocated.
- Places a constant to be added to the value for relocation at the address for the item to be relocated  $(r_vaddr)$ .
- If the width of the constant is less than 32 bits, sign-extends the constant to 32 bits.
- Adds the value for relocation (the value of the symbol) to the constant and places it back in the address to be relocated.
- Sets *r\_symndx* to the index of the External Symbols entry that contains the symbol value.
- Sets *r\_type* to the constant for the type of relocation types. Table 9-11 shows the valid constants for the relocation type.
- Sets *r\_extern* to 1.

**NOTE:** The assembler always sets the value of the undefined entry in External Symbols to 0. It may assign a constant value to be added to the relocated value at the address where the location is to be done. If the

width of the constant is less than a full word, and an overflow occurs after relocation, the link editor flags this as an error.

When the link editor determines that an external symbol is defined, it changes the Relocation Table entry for the symbol to a local relocation entry. Figure 9-4 gives an overview of the new entry.



Figure 9-4. Relocation Table Entry for a Local Relocation Entry

To change this entry from an external relocation entry to a local relocation entry, the link editor:

- Picks up the constant from the address to be relocated  $(r_vaddr)$ .
- If the width of the constant is less than 32 bits, sign-extends the constant to 32 bits.
- Adds the value for relocation (the value of the symbol) to the constant and places it back in the address to be relocated.
- Sets *r\_symndx* to the section number that contains the external symbol.
- Sets  $r_extern$  to 0.

### Examples

The examples that follow use external relocation entries.

**Example 1:** 32-Bit Reference—R\_REFWORD. This example shows assembly statements that set the value at location b to the global data value y.

```
.globl y
.data
b: .word y #R_REFWORD relocation type at address b for symbol y
```

In processing this statement, the assembler generates a relocation entry of type R\_REFWORD for the address b and the symbol y. After determining the address for the symbol y, the loader adds the 32-bit address of y to the 32-bit value at location b, and places the sum in location b. The loader handles 16-bit addresses (R\_REFHALF) in the same manner, except it checks for overflow after determining the relocation value.

**Example 2:** 26-Bit Jump-R\_JMPADDR. This example shows assembly statements that call routine x from location c.

.text x: ;routine x ...

jal x #R\_JMPADDR relocation type at address c for symbol x

In processing these statements, the assembler generates a relocation entry of type  $R_JMPADDR$  for the address c and the symbol x. After determining the address for the routine, the loader shifts the address right two bits, adds the low 26 bits of the result to the low 26 bits of the instruction at address c, and places the results back into the low 26 bits at address c.

**R\_JMPADDR** relocation entries are produced for the assembler's j (Jump) and *jal* (Jump and Link) instructions. These instructions take the high four bits of the target address from the address of their instruction. The link editor makes sure that the same four bits are in the target address after relocation; if not, it generates an error message.

If the entries are local relocation types, the target of the Jump instruction is assembled in the instruction at the address to be relocated. The high four bits of the jump target are taken from the high 4 bits of the address of the instruction to be relocated. **Example 3:** High/Low Reference-R\_REFHI/R\_REFLO. This example shows an assembler macro that loads the absolute address y, plus a constant, into Register 6:

lw \$r6,constant+y

In processing this statement, the assembler generates a 0 as the value y, and the following machine language statements:

| f: | lui   | \$at,constant>>16                         | #R_REFHI relocation type at |
|----|-------|-------------------------------------------|-----------------------------|
|    |       |                                           | address f for symbol y      |
| g: | addiu | <pre>\$r6,constant&amp;0xffff(\$at)</pre> | #R_REFLO relocation type at |
|    |       |                                           | address g for symbol y      |

In this example, the assembler produces two relocation entries.

**NOTE:** When a R\_REFHI relocation entry appears, the next relocation entry must always be the corresponding R\_REFLO entry. This is required in order to reconstruct the constant that is to be added to the value for relocation.

In determining the final constant values for the two instructions, the assembler must take into account that the **addiu** instruction of the  $R\_REFLO$  relocation entry, sign-extends the immediate value of the constant.

In determining the sum of the address for the symbol y and the constant, the assembler does the following:

- It uses the low 16 bits of this sum for the immediate value of the R\_REFLO relocation address.
- Because all instructions that are marked with a R\_REFLO perform a signed operation, the assembler adjusts the high portion of the sum if Bit 15 is set. Then it uses the high 16 bits of the sum for the immediate value of the R\_REFHI instruction at the relocation address. For example:



#### Figure 9-5.

**Example 4:** Offset Reference—R\_GPREL. This example shows an assembly macro that loads a global pointer relative value y into Register 6:

lw \$r6,y

In processing this statement, the assembler generates a 0 as the value y and the following machine language statement:

h: lw \$r6,0(\$gp) #R\_GPREL relocation type at address h for symbol y

and a R\_GPREL relocation entry would be produced. The assembler then uses the difference between the address for the symbol y and the address of the global pointer, as the immediate value for the instruction. The link editor gets the value of the global pointer used by the assembler from  $gp\_value$  in the Optional Header (Table 9-4).

### 9.7 Object Files

This section describes the three object-file formats created by the link editor, namely the Impure (OMAGIC), Shared Text (NMAGIC), and Demand Paged (ZMAGIC) formats. Before reading this section, you should be familiar in the format and contents of the text, data, and bss segments as described in the **Section Data** section of this chapter.

### 9.7.1 Impure Format (OMAGIC) Files

An OMAGIC file has the format shown in Figure 9-6.



Figure 9-6. Layout of OMAGIC Files in Virtual Memory

The OMAGIC format has the following characteristics:

- Each section follows the other in virtual address space aligned on an 8-byte boundary.
- No blocking of sections.
- Text, data and bss segments can be placed anywhere in the virtual address space using the link editor's -T, -D and -B options.
- The addresses specified for the segments must be rounded to 8-byte boundaries.
- The text segment contains only the *.text* section.
- The sections in the data segment are ordered as follows: .rdata, .data and .sdata
- The sections in the bss segment are ordered as follows: .sbss and .bss.

### 9.7.2 Shared Text (NMAGIC) Files

An NMAGIC file has the format shown in Figure 9–7.



Figure 9-7. Layout of NMAGIC Files in Virtual Memory

An NMAGIC file has the following characteristics:

- The virtual address of the data segment is rounded up to the next *pagesize* boundary.
- No blocking of sections.
- Each section follows the other in virtual address space aligned on an 8-byte boundary.
- Only the start of the text section, using the link editor's -T option, can be specified for a shared text format file; the start of the text section must be a multiple of the pagesize.

### 9.7.3 Demand Paged (ZMAGIC) Files

A ZMAGIC file is a demand, paged file in the format shown in Figure 9-8.

A ZMAGIC file has the following characteristics:

• The text segment and the data segment are blocked, with *pagesize* as the blocking factor. Blocking reduces the complexity of paging in the files.



Figure 9-8. Layout of ZMAGIC Files in Virtual Memory

- The size of the sum of the of the File, Section, and Optional Headers (Tables 9-1, 9-4, and 9-6) rounded to 8 bytes is included in blocking of the text segment.
- The .text section must start at 0x8000 (32K) or higher, plus the size of the sum of the headers again rounded to 8 bytes. With the standard software, the .text section starts at 0x400000 + header size.

**NOTE:** This is required because the first 32K bytes of memory are reserved for future use by the compiler system to allow data access relative to the constant register 0.

• Only the start of the text section, using the link editor's -T option can be specified for a demand paged format file and must be a multiple of the pagesize.

Figure 9-9 shows a ZMAGIC file as it appears in a disk file.



Figure 9-9. Layout of a ZMAGIC File on Disk

### 9.7.4 Loading Object Files

The link editor produces object files with their sections in a fixed order similar to UNIX system object files that existed before COFF. See Figure 9-2 for the a description of the sections and how they are formatted.

The sections are grouped into segments, which are described in the Optional Header. In loading the object module at run-time, the system (kernel) loader needs only the magic number in the File Header and the Optional Header to load an object file for execution.

The starting addresses and sizes of the segments for all types of object files are specified similarly, and they are loaded in the same manner.

After reading in the File Header and the Optional Header, the system (kernel) loader must examine the file magic number to determine if the program can be loaded. Then, the system (kernel) loader loads the text and data segments.

The starting offset in the file for the text segment is given by the macro

N\_TXTOFF(f,a)

in the header file a.out.h, where f is the File Header structure and a is the option header structure for the object file to be loaded. The tsize field in the Optional Header (Table 9-4) contains the size of the text segment and text\_start contains the address at which it is to be loaded.

The starting offset of the data segment follows the text segment. The  $d\_size$  field in the Section Header (Table 9-6) contains the size of the data segment;  $data\_start$  contains the address at which it is to be loaded.

The system (kernel) loader must fill the **bss** segment with zeros. The *bss\_start* field in the Optional Header specifies the starting address; *bsize* specifies the number of bytes to be filled with zeros. In ZMAGIC files, the link editor adjusts *bsize* to account for the zero filled area it created in the **data** segment that is part of of the .sbss or .bss sections.

If the object file itself does not load the global pointer register it must be set to the  $gp_value$  field in the Optional Header (Table 9-4).

The other fields in the Optional Header are *gprmask* and *cprmask[4]*, whose bits show the registers used in the .text section. They can be used by the operating system, if desired, to avoid save register relocations on context-switch.

## 9.8 Archive Files

The link editor can link object files in archives created by the archiver. The archiver and the format of the archives are based on the UNIX System V portable archive format. To improve performance, the format of the archives symbol table was changed so that it is a hash table, not a linear list.

The archive hash table is accessed through the *ranhashinit* and *ranlookup()* library routines in *libmld.a*, which are documented in the manual page ranhash (3x). The archive format definition is in the header file *ar.h.* 

# 9.9 Link Editor Defined Symbols

Certain symbols are reserved and their values are defined by the link editor. A user program can reference these symbols, but not define one, or else an error is generated. Table 9-12 lists the names and values of these symbols; the header file sym.h contains their preprocessor macro definitions.

| Symbol | Value    | Description                                                    |
|--------|----------|----------------------------------------------------------------|
| _etext | "etext"  | first location after the .text section                         |
| _EDATA | "edata"  | first location after the .sdata section (all initialized data) |
| _END   | "end"    | first location after the .bss section (all data)               |
| FTEXT  | "_ftext" | <sup>1</sup> first location of the .text section               |
| FDATA  | "_fdata" | <sup>1</sup> first location of the .data section               |
|        | "_fbss"  | <sup>1</sup> first location of the .bss section                |
| _GP    | "_gp"    | <sup>1</sup> the value of the global pointer                   |

<sup>1</sup>compiler system only

Table 9-12. Link Editor Defined Symbols

The first three symbols come from the standard UNIX system link editors and the rest are compiler system specific. The last symbol is used by the start up routine to set the value of the global pointer, as shown in the following assembly language statements:

globl \_GP la \$gp,\_GP

The assembler generates the following machine instructions for these statements:

a: lui gp,0 # R\_REFHI relocation type at address a for symbol \_GP b: add gp,0 # R\_REFLO relocation type at address b for symbol \_GP

which would cause the correct value of the global pointer to be loaded.



# 10. The Symbol Table

This chapter describes the symbol table and symbol table routines used to create and make entries in the table. The chapter contains the following major sections:

- Overview, which gives the purpose of the Symbol table, a summary of its components, and their relationship to each other.
- Format of Symbol Table Entries, which shows the structures of Symbol table entries and the values you assign them through the Symbol Table routines.
- Symbol Table Routine Reference, which lists the symbol table routines supplied with the compiler and summarizes the function of each.

**NOTE:** Third Eye Software, Inc. owns the copyright (dated 1984) to the format and nomenclature of the Symbol Table used by the compiler system as documented in this chapter.

Third Eye Software, Inc. grants reproduction and use rights to all parties, PROVIDED that this comment is maintained in the copy.

Third Eye makes no claims about the applicability of this symbol table to a particular use.

### 10.1 Overview

The symbol table in created by the compiler front-end as a stand-alone file. The purpose of the table is to provide information to the link editor and the debugger in performing their respective functions. At the option of the user, the link editor includes information from the Symbol table in the final object file for use by the debugger. See Figure 9-1 in Chapter 9 for details.



Figure 10-1. The Symbol Table - Overview

The elements that make up the Symbol table are shown in Figure 10-1. The front-end creates one group of tables (the shaded areas in Figure 10.1) that contain global information relative to the entire compilation. It also creates a unique group of tables (the unshaded areas in the figure) for the source file and each of its include files.

Compiler front-ends, the assembler, and the link editor interact with the symbol table as summarized below:

- The front-end, using calls to routines supplied with the compiler system, enters symbols and their descriptions in the table.
- The assembler fills in line numbers, optimization symbols, updates Local Symbols and External Symbols, and updates the Procedure Descriptor table.
- The link editor eliminates duplicate information in the External Symbols and the External Strings tables, removes tables with duplicate information, updates Local Symbols with relocation information, and creates the Relative File Descriptor table.

The major elements of the table are summarized in the paragraphs that follow. Some of these elements are explored in more detail later in the chapter.

Symbolic Header. The Symbolic Header (HDRR for HeadDeR Record) contains the sizes and locations (as an offset from the beginning of the file) of the subtables that make up the Symbol Table. Figure 10-2 shows the symbolic relationship of the header to the other tables.





Line Numbers. The assembler creates the Line Number table. It creates an entry for every instruction. Internally, the information is stored in an encoded form. The debugger uses the entries to map instruction to the source lines and vice versa.

**Dense Numbers.** The Dense Number table is an array of pairs. An index into this table is called a dense number. Each pair consists of a file table index (ifd) and an index (isym) into Local Symbols. The table facilitates symbol look-up for the assembler, optimizer, and code generator by allowing direct table access rather than hashing.

**Procedure Descriptor Table.** The Procedure Descriptor table contains register and frame information, and offsets into other tables that provide detailed information on the procedure. The front-end creates the table and links it to the Local Symbols table. The assembler enters information on registers and frames. The debugger uses the entries in determining the line numbers for procedures and frame information for stack traces.

Local Symbols. The Local Symbols table contains descriptions of program variables, types, and structures, which the debugger uses to locate and interpret runtime values. The table gives the symbol type, storage class, and offsets into other tables that further define the symbol.

A unique Local Symbols table exists for every source and include file; the compiler locates the table through an offset from the file descriptor entry that exists for every file. The entries in Local Symbols can reference related information in the Local Strings and Auxiliary Symbols subtables. This relationship is shown in Figure 10-3.



Figure 10-3. Logical Relationship between the File Descriptor Table and Local Symbols

Optimization Symbols. To be defined at a future date.

Auxiliary Symbols. The Auxiliary Symbols tables contain data type information specific to one language. Each entry is linked to an entry in Local Symbols. The entry in Local Symbols can have multiple, contiguous entries. The format of an auxiliary entry depends on the symbol type and storage class. Table entries are required only when the compiler debugging option is ON.

Local Strings. The Local Strings subtables contain the names of local symbols.

External Strings. The External Strings table contains the names of external symbols.

File Descriptor. The File Descriptor table contains one entry each for each source file and each of its include files. (The structure of an entry is given in Table 10-12 later in this chapter.) The entry is composed of pointers to a group of subtables related to the file. The physical layout of the subtables is shown in Figure 10-4.



Figure 10-4. Physical Relationship of a File Descriptor Entry to Other Tables

The file descriptor entry allows the compiler to access a group of subtables unique to one file. The logical relationship between entries in this table and in its subtables is shown in 10-5.

10-6 Assembly Language Programmer's Guide





Relative File Descriptor. See the section Link Editor Processing later in this chapter.

**External Symbols.** The External Symbols contains global symbols entered by the front-end. The symbols are defined in one module and referenced in one or more other modules. The assembler updates the entries, and the link editor merges the symbols and resolves their addresses.

### **10.2 Format of Symbol Table Entries**

### **10.2.1 Symbolic Header**

The structure of the Symbolic Header is shown below in Table 10-1; the sym.h header file contains the header declaration.

| Declaration | Name          | Description                             |
|-------------|---------------|-----------------------------------------|
|             |               |                                         |
| short       | magic         | to verify validity of the table         |
| short       | vstamp        | version stamp                           |
| long        | ilineMax      | number of line number entries           |
| long        | cbLine        | number of bytes for line number entries |
| long        | cbLineOffset  | index to start of line numbers          |
| long        | idnMax        | max index into dense numbers            |
| long        | cbDnOffset    | index to start dense numbers            |
| long        | ipdMax        | number of procedures                    |
| long        | cbPdOffset    | index to procedure descriptors          |
| long        | isymMax       | number of local symbols                 |
| long        | cbSymOffset   | index to start of local symbols         |
| long        | ioptMax       | maximum index into optimization entries |
| long        | cbOptOffset   | index to start of optimization entries  |
| long        | iauxMax       | number of auxiliary symbols             |
| long        | cbAuxOffset   | index to the start of auxiliary symbols |
| long        | issMax        | max index into local strings            |
| long        | cbSsOffset    | index to start of local strings         |
| long        | issExtMax     | max index into external strings         |
| long        | cbSsExtOffset | index to the start of external strings  |
| long        | ifdMax        | number of file descriptors              |
| long        | cbFdOffset    | index to file descriptor                |
| long        | crfd          | number of relative file descriptors     |
| long        | cbRfdOffset   | index to relative file descriptors      |
| long        | iextMax       | maximum index into external symbols     |
| long        | cbExtOffset   | index to the start of external symbols  |

#### Table 10-1. Format of the Symbolic Header

The lower byte of the vstamp field contains LS\_STAMP and the upper byte MS\_STAMP (see the stamp.h header file). These values are defined in the stamp.h file. The iMax fields and the cbOffset field must be set to 0 if one of the tables shown in Table 10-1 isn't present. The magic field must contain the constant *magicSym*, also defined in longsymconst.h.

### 10.2.2 Line Numbers

Table 10-2 shows the format of an entry in the Line Numbers table; the sym.h header file contains its declaration.

| Declaration  | Name           |  |
|--------------|----------------|--|
| typedef long | LINER, *PLINER |  |

Table 10-2. Format of a Line Number Entry

The line number section in the Symbol table is rounded to the nearest four-byte boundary.

Line numbers map executable instructions to source lines; one line number is stored for each instruction associated with a source line. It is stored as a long integer in memory and in packed format on disk.

The layout on disk is as follows:



Figure 10-6. Disk Layout of Line Numbers

The compiler assigns a line number to only those lines of source code that generate one or more executable instructions.

Delta is a four-bit value in the range -7...7, defining the number of source lines between the current source line, and the previous line generating executable instructions. The Delta of the first line number entry is the displacement from the *lnLow* field in the Procedure Descriptor Table.

Count is a four-bit field with a value in the range 0...15 indicating the number (1...16) of executable instructions associated with a source line.

If more than 16 instructions (15+1) are associated with a source line, new line number entries are generated with Delta = 0.

An extended format of the line number entry is used when Delta is outside the range of -7...7.

The layout of the extended field on disk is as follows:



Figure 10-7. Disk Layout of Extended Field

**NOTE:** The compiler allows a maximum of 32,767 comment lines, blank lines, continuation lines and other lines *not* producing executable instructions, between two source lines that do.

Line number example. This section gives an example of how the compiler assigns line numbers. For the source listing shown in Figure 10-8, the compiler generates line numbers only for the highlighted lines (6, 7, 17, 18, and 19); the other lines are either blank or contain comments.



Figure 10-8. Source Listing for Line Number Example

Figure 10-9 (on the next page) shows the instructions generated for lines 3, 7, 17, 18, and 19. Figure 10-10) shows the compiler-generated liner entries for each source line.

| Source                                                                                              | Li       | ner                                 |
|-----------------------------------------------------------------------------------------------------|----------|-------------------------------------|
| Line                                                                                                | Contents | Meaning                             |
| 3                                                                                                   | 02       | delta 0, count 2                    |
| 6                                                                                                   | 31       | delta 3, count 1                    |
| 7                                                                                                   | 1f       | delta 1, count 15                   |
| 7                                                                                                   | 03       | delta 0, count 3                    |
| 17 <sup>1</sup>                                                                                     | 82 00 0a | -8 <sup>1</sup> , count 2, delta 10 |
| 18                                                                                                  | 1f       | delta 1, count 15                   |
| 18 <sup>2</sup>                                                                                     | 03       | delta 0 <sup>2</sup> , count 3      |
| 19                                                                                                  | 15       | delta 1, count 5                    |
| <ol> <li>Extended format (count is greater than<br/>seven lines).</li> <li>Continuation.</li> </ol> |          |                                     |

Figure 10-9. Source Listing for Line Number Example

| [main:3, 0x4001a0]                         | addiu | sp,sp,-32       |                                         |
|--------------------------------------------|-------|-----------------|-----------------------------------------|
| [main:3, 0x4001a4]                         | sw    | r31,20(sp)      |                                         |
| [main:3, 0x4001a8]                         | SW    | r16, 16(sp)     | (3,6)                                   |
| [main:6, 0x4001ac]                         | jal   | printf          |                                         |
| [main:6, 0x4001b0]                         | addiu | r4,gp,-32752    | ~                                       |
| [main:7, 0x4001b0]                         | lw    | r14,-32552(gp)  |                                         |
|                                            |       | 114,-32332(gp)  |                                         |
| [main:7, 0x4001b8]                         | nop   |                 |                                         |
| [main:7, 0x4001bc]                         | addiu | r15,r14,-1      |                                         |
| [main:7, 0x4001c0]                         | bltz  | r15,0x4001e4    |                                         |
| [main:7, 0x4001c4]                         | sw    | r15, -32552(gp) |                                         |
| [main:7, 0x4001c8]                         | lw    | r24,-32548(gp)  |                                         |
| [main:7, 0x4001cc]                         | nop   |                 | _                                       |
| [main:7, 0x4001d0]                         | lbu   | r25,0(r24)      |                                         |
| [main:7, 0x4001d4]                         | addiu | r8,r24,1        | (7)                                     |
| [main:7, 0x4001d8]                         | sb    | r25,31(sp)      |                                         |
| [main:7, 0x4001dc]                         | b     | 0x4001f4        |                                         |
| [main:7, 0x4001e0]                         | SW    | r8,-32548(gp)   |                                         |
| [main:7, 0x4001e4]                         | jal   | filbuf          |                                         |
| [main:7, 0x4001e8]                         | addiu | r4,gp,-32552    | ,                                       |
| [main:7, 0x4001ec]                         | move  | r16,r2          |                                         |
| [main:7, 0x4001f0]                         | sb    | r16,31(sp)      |                                         |
| [main:7, 0x400110]<br>[main:7, 0x4001f4]   | lbu   | r9,31(sp)       | · *                                     |
| [main:7, 0x400114]<br>[main:7, 0x4001f8]   | li    |                 | 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - |
|                                            | _     | r1,-1           |                                         |
| [main:7, 0x4001fc]                         | beq   | r9,r1,0x400260  |                                         |
| [main:7. 0x400200]                         | nop   |                 |                                         |
| [main:17, 0x400204]                        | lbu   | r5,31(sp)       |                                         |
| [main:17, 0x400208]                        | jal   | printf          | (17)                                    |
| [main:17, 0x40020c]                        | addiu | r4,gp,-32716    |                                         |
| [main:18, 0x400210]                        | lw    | r10,-32552(gp)  |                                         |
| [main:18, 0x400214]                        | nop   |                 |                                         |
| [main:18, 0x400218]                        | addiu | r11,r10,-1      |                                         |
| [main:18, 0x40021c]                        | bltz  | r11,0x400240    |                                         |
| [main:18, 0x400220]                        | SW    | r11, -32552(gp) |                                         |
| [main:18, 0x400224]                        | lw    | r12, -32548(gp) |                                         |
| [main:18, 0x400228]                        | nop   | , ;===(op)      |                                         |
| [main:18, 0x40022c]                        | lbu   | r13,0(r12)      |                                         |
| [main:18, 0x400220]<br>[main:18, 0x400230] | addiu | r14,r12,1       | (18)                                    |
|                                            | sb    |                 |                                         |
| [main:18, 0x400234]                        |       | r13,31(sp)      |                                         |
| [main:18, 0x400238]                        | b     | 0x400250        |                                         |
| [main:18, 0x40023c]                        | sw    | r14,-32548(gp)  |                                         |
| [main:18, 0x400240]                        | jal   | _filbuf         |                                         |
| [main:18, 0x400244]                        | addiu | r4,gp,-32552    |                                         |
| [main:18, 0x400248]                        | move  | r16,r2          |                                         |
| [main:18, 0x40024c]                        | sb    | r16,31(sp)      |                                         |
| [main:18, 0x400250]                        | lbu   | r15,31(sp)      |                                         |
| [main:18, 0x400254]                        | li    | r1,-1           |                                         |
| [main:18, 0x400258]                        | bne   | r15,r1,0x400204 |                                         |
| [main:18. 0x40025c]                        | non   |                 |                                         |
| [main:19, 0x400260]                        | b     | 0x400268        |                                         |
| [main:19, 0x400264]                        | nop   | 0.100200        |                                         |
| [main:19, 0x400264]<br>[main:19, 0x400268] | lw    | r31,20(sp)      |                                         |
|                                            |       |                 | (19)                                    |
| [main:19, 0x40026c]                        | lw    | r16,16(sp)      | $\mathbf{v}$                            |
|                                            | jr    | r31             |                                         |
| [main:19, 0x400270]<br>[main:19, 0x400274] | addiu | sp, sp, 32      | 1                                       |

Figure 10-10. Source Listing for Line Number Example

10-14 Assembly Language Programmer's Guide

### **10.2.3 Procedure Descriptor Table**

Table 10-3 shows the format of an entry in the Procedure Descriptor table; the sym.h header file contains its declaration.

| Declaration    | Name         | Description                          |  |
|----------------|--------------|--------------------------------------|--|
|                |              |                                      |  |
| unsigned, long | adr          | memory address of start of procedure |  |
| long           | isym         | start of local symbols               |  |
| long           | iline        | procedure's line numbers             |  |
| long           | regmask      | saved register mask                  |  |
| long           | regoffset    | saved register offset                |  |
| long           | iopt         | procedure's optimization symbol      |  |
|                |              | entries                              |  |
| long           | fregmask     | save floating point register mask    |  |
| long           | fregoffset   | save floating point register offset  |  |
| long           | frameoffset  | frame size                           |  |
| long           | framereg     | frame pointer register               |  |
| long           | pcreg        | index or reg of return program       |  |
|                |              | counter                              |  |
| long           | lnLow        | lowest line in the procedure         |  |
| long           | lnHigh       | highest line in the procedure        |  |
| long           | cbLineOffset | byte offset for this procedure from  |  |
| 0              |              | the base of the file descriptor      |  |
|                |              | entry.                               |  |

Table 10-3. Format of a Procedure Descriptor Table Entry

### 10.2.4 Local Symbols

Table 10-4 shows the format of an entry in the Local Symbols table; the sym.h header file contains its declaration.

| Declaration | Name         | Description                             |
|-------------|--------------|-----------------------------------------|
| long        | iss          | index into local strings of symbol name |
| long        | value        | value of symbol. See Table 10-5.        |
| unsigned    | st : 6       | symbol type. See Table 10–6.            |
| unsigned    | sc : 5       | storage class. See Table 10-7.          |
| unsigned    | reserved : 1 |                                         |
| unsigned    | index : 20   | index into local or auxiliary symbols   |
| -           |              | See Table 3-5.                          |

Table 10-4. Format of a Local Symbols Entry

The meanings of the fields in a local symbol entry are explained in the following paragraphs.

iss. The iss (for index into string space) is an offset from the issBase field of an entry in the file descriptor table, to the name of the symbol.

value. An integer representing an address, size, offset from a frame pointer. The value is determined by the symbol type, as illustrated in Table 10-5.

st and sc. The symbol type (st) defines the symbol; the storage class (sc), where applicable explains how to access the symbol type in memory. The valid st and sc constants are given in Tables 10-6 and 10-7. These constants are defined in symconst.h.

index. The index is an offset into either Local Symbols or Auxiliary Symbols, depending of the storage type (st) as shown in Table 10-5. The compiler uses *isymBase* in the file descriptor entry as the base for a Local Symbol entry and *iauxBase* for an Auxiliary Symbols entry.

| Symbol Type  | Storage Class       | Index    | Value                     |
|--------------|---------------------|----------|---------------------------|
| etEile       |                     |          |                           |
| stFile       | scText              | isymMac  | address                   |
| stLabel      | scText <sub>1</sub> | indexNil | address                   |
| stGlobal     | scD/B <sup>1</sup>  | iaux     | address                   |
| stStatic     | scD/B <sup>1</sup>  | iaux     | address                   |
| stParam      | scAbs               | iaux     | frame offset <sup>1</sup> |
|              | scRegister          | iaux     | register number           |
|              | scVar               | iaux     | frame offset <sup>2</sup> |
|              | scVarRegister       | iaux     | register number           |
| stLocal      | scAbs               | iaux     | frame offset <sup>2</sup> |
|              | scRegister          | iaux     | register number           |
| stProc       | scText              | iaux     | address                   |
|              | scNil               | iaux     | address                   |
|              | scUndefined         | iaux     | address                   |
| stStaticProc | scText              | iaux     | address                   |
| stMember     |                     |          |                           |
| enumeration  | scInfo              | indexNil | ordinal 3                 |
| structure    | scInfo              | iaux     | bit offset                |
| union        | scInfo              | iaux     | bit offset                |

 $^{1}scD/B$  is the storage class determined by the assembler, either large/small or data/bss.

 $^2$ frame offset is the offset from the virtual frame pointer.

 $^{3}\mbox{bit offset is computed from the beginning of the procedure.}$ 

# Table 10-5 (Part 1 of 2).Index and Value as a Function of SymbolType and Storage Class

| Symbol Type  | Storage Class                                                                                                   | Index                                                            | Value                         |
|--------------|-----------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------|-------------------------------|
| at Dia alt   | na mangang kangkan pangkan kana kangkan | ge di nan saya nga gening na na na haring nga Panana ng pagaring |                               |
| stBlock      | T C.                                                                                                            | 1                                                                |                               |
| enumeration  | scInfo                                                                                                          | isymMac                                                          | max enumeration               |
| structure    | scInfo                                                                                                          | isymMac                                                          | size                          |
| text bock    | scText                                                                                                          | isymMac                                                          | relative address <sup>2</sup> |
| common block | scCommon                                                                                                        | isymMac                                                          | size                          |
| variant      | scVariant                                                                                                       | isymMac                                                          | isymTag <sup>3</sup>          |
| variant arm  | scInfo                                                                                                          | isymMac                                                          | iauxRanges <sup>4</sup>       |
| union        | scInfo                                                                                                          | isymMac                                                          | size                          |
| stEnd        |                                                                                                                 | _                                                                |                               |
| enumeration  | scInfo                                                                                                          | isymStart <sup>5</sup>                                           | 0 2                           |
| file         | scText                                                                                                          | isymStart                                                        | relative address              |
| procedure    | scText                                                                                                          | isymStart                                                        | relative address <sup>2</sup> |
| structure    | scInfo                                                                                                          | isymStart                                                        | 0                             |
| text block   | scText                                                                                                          | isymStart                                                        | relative $address^2$          |
| union        | scInfo                                                                                                          | isymStart                                                        | 0                             |
| common block | scCommon                                                                                                        | isymStart                                                        | 0                             |
| variant      | scVariant                                                                                                       | isymStart                                                        | 0                             |
| variant arm  | scInfo                                                                                                          | isymStart                                                        | 0                             |
| stTypedef    | scInfo                                                                                                          | iaux                                                             | 0                             |

<sup>1</sup>isymMac is the isym of the corresponding stEnd symbol plus 1.

 $^2\mbox{relative}$  address is the relative displacement from the beginning of the procedure.

 $^{3}$ isymTab is the isym to the symbol that is the tag for the variant.

<sup>4</sup>iauxRanges is the iaux to ranges for the variant arm.

<sup>5</sup>isymStart is the isym of the corresponding begin block (stBlock, stFile, stProc, etc.)

 Table 10-5 (Part 2 of 2).
 Index and Value as a Function of Symbol

 Type and Storage Class

The link editor ignores all symbols except the types stProc, stStatic, stLabel, stStaticProc, which it will relocate. Other symbols are used only by the debugger, and need be entered in the table only when the compiler debugger option is ON.

Symbol Type (st). Table 10-6 gives the allowable constants that can be specified in the st field of Local Symbols entries; the symconst.h header file contains the declaration for the constants.

| Constant     | Value | Description                    |
|--------------|-------|--------------------------------|
| ·····        |       |                                |
| stNil        | 0     | Dummy entry                    |
| stGlobal     | 1     | external symbol                |
| stStatic     | 2     | static                         |
| stParam      | 3     | procedure argument             |
| stLocal      | 4     | local variable                 |
| stLabel      | 5     | label                          |
| stProc       | 6     | Procedure                      |
| stBlock      | 7     | start of block                 |
| stEnd        | 8     | end block, file, or procedures |
| stMember     | 9     | member of structure, union, or |
|              |       | enumeration.                   |
| stTypedef    | 10    | type definition                |
| stFile       | 11    | file name                      |
| stStaticProc | 14    | load time only static procs    |
| stConstant   | 15    | const                          |

Table 10-6. Symbol Type (st) Constants Supported by the Compiler

Storage Class (st) Constants. Table 10-7 gives the allowable constants that can be specified in the sc field of Local Symbols entries; the symconst.h header file contains the declaration for the constants.

| Constant      | Value | Description                                |
|---------------|-------|--------------------------------------------|
| scNil         | 0     | dummy entry.                               |
| scText        | 1     | text symbol                                |
| scData        | 2     | initialized data symbol                    |
| scBss         | 3     | un-initialized data symbol                 |
| scRegister    | 4     | value of symbol is register number         |
| scAbs         | 5     | symbol value is absolute; not to be        |
|               |       | relocated.                                 |
| scUndefined   | 6     | Used but undefined in the current module   |
| reserved      | 7     |                                            |
| scBits        | 8     | this is a bit field                        |
| scDbx         | 9     | dbx internal use                           |
| scRegImage    | 10    | register value saved on stack              |
| scInfo        | 11    | symbol contains debugger information       |
| scUserStruct  | 12    | address in struct user for current process |
| scSData       | 13    | (load time only) small data                |
| scSBss        | 14    | (load time only) small common              |
| scRData       | 15    | (load time only) read only data            |
| scVar         | 16    | Var parameter (Fortran or Pascal)          |
| scCommon      | 17    | common variable                            |
| scSCommon     | 18    | small common                               |
| scVarRegister | 19    | var parameter in a register                |
| scVariant     | 20    | variant records                            |
| scUndefined   | 21    | small undefined                            |

Table 10-7. Storage Class Constants Supported by the Compiler

### 10.2.5 Optimization Symbols

Reserved for future use.

### **10.2.6 Auxiliary Symbols**

| Name   | Description                                           |
|--------|-------------------------------------------------------|
|        |                                                       |
| ti     | type information record                               |
| rndx   | relative index into local symbols                     |
| dnLow  | low dimension                                         |
| dnHigh | high dimension                                        |
| isym   | index into local symbols for stEnd                    |
| iss    | index into local strings (not used)                   |
| width  | width of a structure field not                        |
|        | declared with the default value                       |
|        | for size.                                             |
| count  | count of ranges for variant arm                       |
|        | ti<br>rndx<br>dnLow<br>dnHigh<br>isym<br>iss<br>width |

Table 10-8 shows the format of an entry, which is a union, in Auxiliary Symbols; the sym.h file contains its declaration.

Table 10-8. Storage Class Constants Supported by the Compiler

All of the fields except the ti field are explained in the order they appear in the above layout. The ti field is explained last.

**rndx.** Relative File Index. The front-end fills this field in describing structures, enumerations, and other complex types. The relative file index is a pair of indexes. One index is an offset from the start of the File Descriptor table to one of its entries. The second is an offset from the file descriptor entry to an entry in the Local Symbols or Auxiliary Symbols table.

dnLow. Low Dimension of Array.

dnHigh. High Dimension of Array.

isym. Index into Local Symbols. This index is always an offset to an stEnd entry denoting the end of a procedure.

#### width. Width of Structured Fields.

count. Range Count. Used in describing case variants. Gives how many elements are separated by commas in a case variant.

ti. Type Information Record. Table 10-9 shows the format of a ti entry; the sym.h file contains its declaration.

| Declaration | Name          | Description                         |  |
|-------------|---------------|-------------------------------------|--|
| unsigned    | fBitfield : 1 | setif bit width is specified        |  |
| unsigned    | continued : 1 | next auxiliary entry has<br>tq info |  |
| unsigned    | bt : 6        | basic type                          |  |
| unsigned    | tq4 : 4       | Type qualifier.                     |  |
| unsigned    | tq5 : 4       |                                     |  |
| unsigned    | tq0 : 4       |                                     |  |
| unsigned    | tq1 : 4       |                                     |  |
| unsigned    | tq2 : 4       |                                     |  |
| unsigned    | tq3 : 4       |                                     |  |

Table 10-9. Format of a ti Type Information Record Entry

All groups of auxiliary entries have a type information record with the following entries:

- *fbitfield*. Set if the basic type (bt) is of non-strandard width.
- *bt* (for basic type) specifies if the symbol is integer, real complex, numbers, a structure, etc. The valid entries for this field are shown in Table 10-10; the sym.h file contains its declaration.
- tq (for type qualifier) defines whether the basic type (bt) has an array of, function returning, or pointer to qualifier. The valid entries for this field are shown in Table 10-11; the sym.h file contains its declaration.

|                |       | Default |                           |
|----------------|-------|---------|---------------------------|
| Constant       | Value | Size*   | Description               |
|                |       |         |                           |
| btNil          | 0     | 0       | undefined, void           |
| btAdr          | 1     | 32      | address – same size as    |
| pointer        |       |         |                           |
| btChar         | 2     | 8       | symbol character          |
| btUChar        | 3     | 8       | unsigned character        |
| btShort        | 4     | 16      | short (16 bits)           |
| btUShort       | 5     | 16      | unsigned short            |
| btInt          | 6     | 32      | integer                   |
| btUInt         | 7     | 32      | unsigned integer          |
| btLong         | 8     | 32      | long (32 bits)            |
| btULong        | 9     | 32      | unsigned long             |
| btFloat        | 10    | 32      | floating point (real)     |
| btDouble       | 11    | 64      | double-precision floating |
| point real     |       |         |                           |
| btStruct       | 12    | n/a     | structure (Record)        |
| btUnion        | 13    | n/a     | union (variant)           |
| btEnum         | 14    | 32      | enumerated                |
| btTypedef      | 15    | n/a     | defined via a typedef;    |
|                |       |         | rndx points at a          |
|                |       |         | stTypedef symbol.         |
| btRange        | 16    | 32      | subrange of integer       |
| btSet          | 17    | 32      | pascal sets               |
| btComplex      | 18    | 64      | fortran complex           |
| btDComplex     | 19    | 128     | fortran double complex    |
| btIndirect     | 20    |         | Indirect definition;rndx  |
|                |       |         | points to TIR aux         |
| btMax          | 64    |         | -                         |
| *Size in bits. |       |         |                           |

Table 10-10. Basic Type (bt) Constants

i

| Constant | Value | Description                 |                                               |
|----------|-------|-----------------------------|-----------------------------------------------|
| tqNil    | 0     | Place holder. No qualifier. | 4 <u>400000000000000000000000000000000000</u> |
| tqPtr    | 1     | pointer to                  |                                               |
| tqProc   | 2     | function returning          |                                               |
| tqArray  | 3     | array of                    |                                               |
| tqVol    | 5     | volatile                    |                                               |
| tqMax    | 8     |                             |                                               |

Table 10-11. Type Qualifier (tq) Constants

### 10.2.7 File Descriptor Table

Table 10-12 shows the format of an entry in the File Descriptor table; the sym.h file contains its declaration.

| Declaration   | Name           | Description                          |  |
|---------------|----------------|--------------------------------------|--|
| <u> </u>      |                |                                      |  |
| unsigned,long | adr            | memory address of start of file      |  |
| long          | rss            | source file name                     |  |
| long          | issBase        | start of local strings               |  |
| long          | cbSs           | number of bytes in local strings     |  |
| long          | isymBase       | start of local symbol entries        |  |
| long          | csym           | count of local symbol entries        |  |
| long          | ilineBase      | start of line number entries         |  |
| long          | cline          | count of line number entries         |  |
| long          | ioptBase       | start of optimization symbol entries |  |
| long          | copt           | count of optimization symbol entries |  |
| short         | ipdFirst       | start of procedure descriptor table  |  |
| short         | cpd            | count of procedures descriptors      |  |
| long          | iauxBase       | start of auxiliary symbol entries    |  |
| long          | caux           | count of auxiliary symbol entries    |  |
| long          | rfdBase        | index into relative file descriptors |  |
| long          | crfd           | relative file descriptor count       |  |
| unsigned      | lang : 5       | language for this file               |  |
| unsigned      | fMerge : 1     | whether this file can be merged      |  |
| unsigned      | fReadin : 1    | true if it was read in (not          |  |
|               |                | just created)                        |  |
| unsigned      | fBigendian : 1 | if set, was compiled on big endian   |  |
|               |                | machine aux's is in compile          |  |
|               |                | host's sex                           |  |
| unsigned      | reserved : 22  | reserved for future use              |  |
| long          | cbLineOffset   | byte offset from header or file ln's |  |
| long          | cbLine         |                                      |  |

Table 10-12. Format of File Descriptor Entry

#### 10.2.8 External Symbols

The External Symbols table has the same format as Local Symbols, except an offset (ifd) field into the File Descriptor table has been added. This field is used to locate information associated with the symbol in an Auxiliary Symbols table. Table 10-13 shows the format of an entry in External Symbols; the sym.h file contains its declaration.

| Declaration    | Name            | Description                                              |  |
|----------------|-----------------|----------------------------------------------------------|--|
| short<br>short | reserved<br>ifd | reserved for future use pointer to file descriptor entry |  |
| SYMR           | asym            | Same as Local Symbols                                    |  |

Table 10-13. Format of an Entry in External Symbols

**IRIS-4D Series** 

# **Appendix A: Instruction Summaries**

In the tables in this appendix, the operand terms have the following meanings:

| Operand     | Description                     |
|-------------|---------------------------------|
| destination | destination register            |
| destination | destination register            |
| address     | expression                      |
| source      | source register                 |
| expression  | aboslute value                  |
| immediate   | immediate value                 |
| label       | symbol label                    |
| breakcode   | value that determines the break |

Table A-1. Operand Meanings

| Description                      | Op-code | Operand              |
|----------------------------------|---------|----------------------|
| Load Address                     | la      | destination, address |
| Load Byte                        | lb      |                      |
| Load Byte Unsigned               | lbu     |                      |
| Load Halfword                    | lh      |                      |
| Load Halfword Unsigned           | lhu     |                      |
| Load Word                        | lw      |                      |
| Load Coprocessor z               | lwcz    |                      |
| Load Word Left                   | lwl     |                      |
| Load Word Right                  | lwr     |                      |
| Store Byte                       | sb      | source, address      |
| Store Halfword                   | sh      |                      |
| Store Word                       | sw      |                      |
| Store Word Coprocessor z         | swcz    |                      |
| Store Word Left                  | swl     |                      |
| Store Word Right                 | swr     |                      |
| Unaligned Load Halfword          | ulh     |                      |
| Unaligned Load Halfword Unsigned | ulhu    |                      |
| Unaligned Load Word              | ulw     |                      |
| Unaligned Store Halfword         | ush     |                      |
| Unaligned Store Word             | usw     |                      |

Table A-2. Main Processor Instruction Summary

A-2 Assembly Language Programmer's Guide

| Description                          | Op-code | Operand                      |
|--------------------------------------|---------|------------------------------|
|                                      |         |                              |
| Load Immediate                       | 1i      | destination, expression      |
| Load Upper Immediate                 | lui     |                              |
| Restore From Exception               | rfe     |                              |
| Syscall                              | syscall |                              |
| Absolute Value                       | abs     | destination, src1            |
| Negate (with overflow)               | neg     | destination/src1             |
| Negate (without overflow)            | negu    |                              |
| NOT                                  | not     |                              |
| Add (with overflow)                  | add     | destination, src1, src2      |
| Add (without overflow)               | addu    | destination/src1,src2        |
| AND                                  | and     | destination, src1, immediate |
| Divide (with overflow)               | div     | destination/src1, immediate  |
| Divide (without overflow)            | divu    |                              |
| EXCLUSIVE OR                         | xor     |                              |
| Multiply                             | mul     |                              |
| Multiply (with overflow)             | mulo    |                              |
| Multiply (with overflow)<br>Unsigned | mulou   |                              |
| NOT OR                               | nor     |                              |
| OR                                   | or      |                              |
| Remainder                            | rem     |                              |
| Remainder Unsigned                   | remu    |                              |
| Rotate Left                          | rol     |                              |
| Rotate Right                         | ror     |                              |
| Set Equal                            | seq     |                              |
| SEt Less Than                        | slt     |                              |
| Set Less Than Unsigned               | sltu    |                              |
| Set Less/Equal                       | sle     |                              |
| Set Less/Equal Unsigned              | sleu    |                              |
| Set Greater Than                     | sgt     |                              |
| Set Greater Than Unsigned            | sgut    |                              |
| Set Greater/Equal                    | sge     |                              |
| Set Greater/Equal Unsigned           | sgeu    |                              |
| Set Not Equal                        | sne     |                              |
| Shift Left Logical                   | sll     |                              |
| Shift Right Arithmetic               | sra     |                              |
| Shift Right Logical                  | srl     |                              |
| Subtract (with overflow)             | sub     |                              |
| Subtract (without overflow)          | subu    |                              |
| Multiply                             | mult    | src1, src2                   |
| Multiply Unsigned                    | multu   |                              |

Table A-2. Main Processor Instruction Summary (continued)

| Description                        | Op-code | Operand                |
|------------------------------------|---------|------------------------|
| Branch                             | b       | label                  |
| Branch Coprocessor z True          | bczt    |                        |
| Branch Coprocessor z False         | bczf    |                        |
| Branch on Equal                    | beq     | src1, src2, label      |
| Branch on Greater                  | bgt     | src1, immediate, label |
| Branch on Greater/Equal            | bge     |                        |
| Branch on Greater/Equal Unsigned   | bgeu    |                        |
| Branch on Greater Than Unsigned    | bgtu    |                        |
| Branch on Less                     | blt     |                        |
| Branch on Less/Equal               | ble     |                        |
| Branch on Less/Equal Unsigned      | bleu    |                        |
| Branch on Less Than Unsigned       | bltu    | 4                      |
| Branch on Not Equal                | bne     |                        |
| Branch and Link                    | bal     | label                  |
| Branch on Less Than Zero and Link  | bltzal  |                        |
| Branch on Greater or Equal to zero | bgezal  | , f                    |
| and Link                           |         |                        |
| Branch on Equal Zero               | beqz    | src1, label            |
| Branch on Greater/Equal Zero       | bgez    | 1                      |
| Branch on Greater Than Zero        | bgtz    |                        |
| Branch on Less/Equal Zero          | blez    |                        |
| Branch on Less Than Zero           | bltz    |                        |
| Branch on Not Equal Zero           | bnqz    |                        |
| Jump                               | j       | address                |
| Jump and Link                      | jal     | src1                   |
| Decel                              |         |                        |
| Break                              | break   | breakcode              |
| Coprocessor z Operation            | cz      | expression             |
| Move                               | move    | destination, src1      |
| Move From HI Register              | mfhi    | register               |
| Move To HI Register                | mthi    |                        |
| Move From LO Register              | mflo    |                        |
| Move To LO Register                | mtlo    |                        |
| Move From Coprocessor z            | mfcz    | dest-gpr, source       |
| Move To Coprocessor z              | mtcz    | src-gpr, destination   |
| Control From Coprocessor z         | cfcz    | src-gpr, destination   |
| Control to Coprocessor z           | ctcz    | dest-gpr, source       |
| Control to Coprocessor 2           | CICZ    | uest gpi, source       |

 Table A-2.
 Main Processor Instruction Summary (continued)

A-4 Assembly Language Programmer's Guide

IRIS-4D Series

| Description                               | Op-code Operan |  |
|-------------------------------------------|----------------|--|
| Translation Lookaside Buffer Probe        | tlbp           |  |
| Translation Lookaside Buffer Read         | tlbr           |  |
| Translation Lookaside Buffer Write Random | tlbwr          |  |
| Translation Lookaside Write Index         | tlbwi          |  |

#### Table A-3. System Coprocessor Instruction Summary

| Description       | Op-code    | Operand                    |
|-------------------|------------|----------------------------|
| Load Fp           |            |                            |
| Double            | 1. d       | destination, offset (base) |
| Single            | 1.s        |                            |
| Store FP          |            |                            |
| Double            | s.d        | source, offset (base)      |
| Single            | <b>S.S</b> |                            |
| Absolute Value Fp |            |                            |
| Double            | abs.d      | destination, src1          |
| Single            | abs.s      |                            |
| Add Fp            |            |                            |
| Double            | add.d      | destination, src1, src2    |
| Single            | add.s      |                            |
| Divide Fp         |            |                            |
| Double            | div.d      |                            |
| Single            | div.s      |                            |
| Multiply          |            |                            |
| Double            | mul.d      |                            |
| Single            | mul.s      |                            |
| Subtract Fp       |            |                            |
| Double            | sub.d      |                            |
| Single            | sub.s      |                            |

#### Table A-4. Floating Point Instruction Summary

| Description            | Op-code | Operand           |
|------------------------|---------|-------------------|
| Convert Source to      |         |                   |
| Specified Precision Fp |         |                   |
| Double to Single       | cvt.s.d | destination, src2 |
| Fixed Point to Single  | cvt.s.w |                   |
| Fixed Point to Double  | cvt.d.w |                   |
| Single to Double       | cvt.d.s |                   |
| Double to Fixed Point  | cvt.w.d |                   |
| Single to Fixed Point  | cvt.w.s |                   |
| Negate Floating Point  |         |                   |
| Double                 | neg.d   |                   |
| Single                 | neg.s   |                   |

 Table A-4.
 Floating Point Instruction Summary (continued)

| Description | Op-code  | Operand   |
|-------------|----------|-----------|
| Compare Fp  |          |           |
| F Single    | c.f.s    | src1,src2 |
| F Double    | c.f.d    |           |
| UN Single   | c.un.s   |           |
| UN Double   | c.un.d   |           |
| *EQ Single  | c.eq.s   |           |
| *EQ Double  | c.eq.d   |           |
| UEQ Single  | c.ueq.s  |           |
| UEQ Double  | c.ueq.d  |           |
| OLT Single  | c.olt.s  |           |
| OLT Double  | c.olt.d  |           |
| ULT Single  | c.ult.s  |           |
| ULT Double  | c.ult.d  |           |
| OLE Single  | c.ole.s  |           |
| OLE Double  | c.ole.d  |           |
| ULE Single  | c.ule.s  |           |
| ULE Double  | c.ule.d  |           |
| SF Single   | c.sf.s   |           |
| SF Double   | c.sf.d   |           |
| NGLE Single | c.ngle.s |           |
| NGLE Double | c.ngle.d |           |
| SEQ Single  | c.deq.s  |           |
| SEQ Double  | c.seq.d  |           |
| NGL Single  | c.ngl.s  |           |
| NGL Double  | c.ngl.d  |           |

 Table A-4.
 Floating Point Instruction Summary (continued)

**NOTE:** Starred items (\*) are the most common Compare instructions. The machine has the other Compare instructions for IEEE compatibility.

| Description | Op-code | Operand           |  |
|-------------|---------|-------------------|--|
| Compare Fp  |         |                   |  |
| *LT Single  | c.lt.s  | src1, src2        |  |
| *LT Double  | c.lt.d  |                   |  |
| NGE Single  | c.nge.s |                   |  |
| NGE Double  | c.nge.d |                   |  |
| *LE Single  | c.le.s  |                   |  |
| *LE Double  | c.le.d  |                   |  |
| NGT Single  | c.ngt.s |                   |  |
| NGT Double  | c.ngt.d |                   |  |
| Move Fp     |         |                   |  |
| Single      | mov.s   | destination, src1 |  |
| Double      | mov.d   |                   |  |

Table A-4. Floating Point Instruction Summary (continued)

**NOTE:** Starred items (\*) are the most common **Compare** instructions. The machine has the other **Compare** instructions for IEEE compatibility.

A-8 Assembly Language Programmer's Guide

# **Appendix B:** Basic Machine Definition

The assembly language instructions described in this book are a superset of the actual machine instructions. Generally, the assembly language instructions match the machine instructions; however, in some cases the assembly language instruction are macros that generate more than one machine instruction (the assembly language multiplication instructions are examples).

You can, in most instances, consider the assembly instructions as machine instructions; however, for routines that require tight coding for performance reasons, you must be aware of the assembly instructions that generate more than one machine language instruction, as described in this appendix.

### **B.1 Load and Store Instructions**

If you use an *address* as an operand in an assembler Load or Store instruction and the address references a data item that is not addressable through register gp or the data item does not have an absolute address in the range -32768...32767, the assembler instruction generates a lui (load upper immediate) machine instruction and generates the appropriate offset to at. The assembler then uses at as the index address for the reference. This condition occurs when the address has a relocatable external name offset (or index) from where the offset began.

The assembler's **la** (load address) instruction generates an **addiu** (add unsigned immediate) machine instruction. If the address requires it, the **la** instruction also generates a **lui** (load upper immediate) machine instruction. The machine requires the **la** instruction because **la** couples relocatable information with the instruction for symbolic addresses. Depending on the expression's value, the assembler's li (load immediate) instruction can generate one or two machine instructions. For values in the -32768...65535 range or for values that have zeros as the 16 least significant bits, the li instruction generates a single machine instruction;, otherwise it generates two machine instructions.

### **B.2** Computational Instructions

If a computational instruction immediate value falls outside the 0...65535 range for Logical ANDs, Logical ORs, or Logical XORs (exclusive or), the immediate field causes the machine to explicitly load a constant to a temporary register. Other instructions generate a single machine instruction when a value falls in the -32768...32767 range.

The assembler's seq (set equal) and sne (set not equal) instructions generate three machine instructions each.

If one operand is a literal outside the range -32768...32767, the assembler's sge (set greater than or equal to) and sle (set less/equal) instructions generate two machine instructions each.

The assembler's **mulo** and **mulou** (multiply) instructions generate machine instructions to test for overflow and to move the result to a general register; if the destination register is **\$0**, the check and move are not generated.

The assembler's **mul** (multiply unsigned) instruction generates a machine instruction to move the result to a general register; if the destination register is \$0, the move and divide-by-zero checking is not generated. The assembler's divide instructions, div (divide with overflow) and divu (divide without overflow), generate machine instructions to check for division by zero and to move the quotient into a general register; if the destination register is \$0, the move is not generated.

The assembler's **rem** (signed) and **remu** (unsigned) instructions also generate multiple instructions.

The rotate instructions **ror** (rotate right) and **rol** (rotate left) generate three machine instructions each.

The abs (absolute value) instruction generates three machine instructions.

B-2 Assembly Language Programmer's Guide

**IRIS-4D Series** 

## **B.3 Branch Instructions**

If the immediate value is not zero, the branch instructions **beq** (branch on equal) and **bne** (branch on not equal), each generate a load literal machine instruction. The relational instructions generate a **slt** (set less than) machine instruction to determine whether one register is less than or greater than another. Relational instructions can reorder the operands and branch on either zero or not zero as required to do an operation.

# **B.4 Coprocessor Instructions**

For symbolic addresses, the coprocessor interface Load and Store instructions, lcz (load coprocessor z) and scz (store coprocessor z) can generate a lui (load upper immediate) machine instruction.

### **B.5 Special Instructions**

The assembler's **break** instruction packs the **breakcode** operand in unused register fields. An operating system convention determines the position.





# Silicon Graphics, Inc.

#### COMMENTS

| Date       | Manual title and version                                                         |
|------------|----------------------------------------------------------------------------------|
| Your name  |                                                                                  |
| Title      | Please list any errors, inaccuracies, or omissions you have found in this manual |
| Department |                                                                                  |
| Company    |                                                                                  |
| Address    |                                                                                  |
|            |                                                                                  |
|            |                                                                                  |
| Phone      | Please list any suggestions you may have for improving this manual               |
|            |                                                                                  |



FIRST CLASS

MOUNTAIN VIEW, CA

NO POSTAGE NECESSARY IF MAILED IN THE UNITED STATES

POSTAGE WILL BE PAID BY ADDRESSEE

Silicon Graphics, Inc. Attention: Technical Publications 2011 Stierlin Road Mountain View, CA 94043-1321