August 1978

The Design of the Parallel Arithmetic Unit in PEPE

Mark Camillo Divecchio

University of Pennsylvania

Follow this and additional works at: http://repository.upenn.edu/cis_reports

Recommended Citation
Mark Camillo Divecchio, "The Design of the Parallel Arithmetic Unit in PEPE", August 1978.


This paper is posted at ScholarlyCommons. http://repository.upenn.edu/cis_reports/721
For more information, please contact libraryrepository@pobox.upenn.edu.
The Design of the Parallel Arithmetic Unit in PEPE

Abstract
As the need for solutions to high speed processing problems increases, new and novel architectures will be developed. PEPE is an attempt to solve a specific real-time problem with use of special purpose hardware.

The Ballistic Missile Defense Advanced Technology Center in Huntsville, Alabama started development studies in the late 1960s for a processor to offload the ever increasing demands on a serial processor operating in a ballistic missile defense (BMD) environment.

PEPE’s design specifically addresses the BMD problem. It was designed to perform three basic functions previously allocated to a CDC 7700 system.

They are:

Correlation of new radar returns with current tracks

Track prediction on all current tracks

Scheduling on a radar time/power line, pulses to acquire more data on current or new tracks.

Comments
UNIVERSITY OF PENNSYLVANIA

MOORE SCHOOL

THE DESIGN OF THE
PARALLEL ARITHMETIC UNIT
IN PEPE

MARK CAMILLO DIVECCHIO

Presented to the Faculty of the College of Engineering and Applied Science (Department of Computer and Information Sciences) in partial fulfillment of the requirements for the degree of Master of Science in Engineering.

Philadelphia, Pennsylvania
August 1978

Dr. John W. Carr III

Dr. A. K. Joshi
1.0 The Parallel Element Processing Ensemble (PEPE)

1.1 Overview

As the need for solutions to high speed processing problems increases, new and novel architectures will be developed. PEPE is an attempt to solve a specific real-time problem with use of special purpose hardware.

The Ballistic Missile Defense Advanced Technology Center in Huntsville, Alabama started development studies in the late 1960s for a processor to offload the ever increasing demands on a serial processor operating in a ballistic missile defense (BMD) environment.

PEPE's design specifically addresses the BMD problem. It was designed to perform three basic functions previously allocated to a CDC 7700 system. They are:

- Correlation of new radar returns with current tracks
- Track prediction on all current tracks
- Scheduling on a radar time/power line, pulses to acquire more data on current or new tracks.

To solve these three problems, a unique multiprocessor parallel ensemble was proposed. Shown in Figure 1.1, PEPE consists of three independent control processors each commanding an ensemble of up to 288 Processing Elements (PE). The three control processors, housed in a single cabinet called a Control Console (CC), are identical with minor exceptions. Each contains:

- Sequential Control Logic (SCL)
- Program and Data Memories
- I/O Units
- Parallel Instruction Control Unit (PICU)
Figure 1.1 PEPE Block Diagram
The SCL is a stand alone processor with all facilities for program execution from memory. It performs typical arithmetic, logical, comparison, and branching functions of a sequential machine. The I/O units perform high speed input and output to the CDC 7700 using a direct channel to both program and data memories.

The PICU receives parallel instructions from the SCL. As the SCL fetches an instruction from memory, it determines if that instruction is targeted for execution in the SCL or in the parallel ensemble. Instructions for the ensemble are sent to the PICU. Described more fully in Section 4.2, the PICU decodes each parallel instruction and transmits control information to all 288 PE.

The three control units are named for their major function. The Correlation Control Unit (CCU) controls the Correlation Unit (CU) portion of the PE. Second, the Arithmetic Control Unit (ACU) handles the Arithmetic Unit (AU) and lastly, the Associative Output Control Unit (AOCU) controls the Associative Output Unit (AOU).

Other major units in the CC are the Intercommunication Logic (ICL), the Output Data Control (ODC) and the Element Memory Control (EMC). The ICL handles communication between the three SCL's and handles interrupt vectoring and masking. The ODC controls access to the main data bus used for outputting data from the ensemble. This bus is a single path shared between all AU and AOU of the ensemble. EMC controls the granting of cycles of the Element Memory (EM) which is shared between processors of the PE.
Figure 1.2 shows the floor layout of a PEPE installation. The CC is centrally located since it must communicate with each of the 8 Element Bays (EB). A Burroughs B1714 computer is the Test and Maintenance (T&M) unit for PEPE. Figure 1.3 is a picture of PEPE installed in Huntsville, Alabama.

1.1.1 The Element Bay

Each EB contains 36 PE as shown in Figure 1.4. The PE are laid out in 4 rows of 9 PE. Each row also contains a clock distribution card. Figure 1.5 is a close-up picture of the Element Bay with 11 PE.

One PE consists of 6 large 300 DIP boards, allocated as follows:

- AU 2 boards
- EM 1 board
- CU 1 board
- AOU 2 boards

The Bay Signal Distributor (BSD) occupies the left end of the cabinet. The BSD provides connectors and gates to relay control and data signals from the CC to each PE and to gather output from the PE on the Output Data Bus (ODB) and relay that back to the CC. A simplified layout is shown symbolically in Figure 1.6.
Figure 1.5 PE Bay with 11 PE Installed
PE BAY-PARALLEL ORGANIZATION
(SYMBOLIC)

Figure 1.6 Busing Diagram
1.2 Number Formats

Crucial to the understanding of the operation of a complex floating point processor such as the AU, is a detailed knowledge of the number formats processed by the AU.

The basic PEPE word is 32 bits in length. Bits are numbered right to left with bit 31 being the most significant bit (MSB).

BIT # 31 0

1.2.1 Integer Format

PEPE integers are stored in the lower 24 bits of a word. The upper right bits are always zero.

00 00 00 00 S
31 24 23
B.P.

Bit 23 is the sign bit for the two's complement representation of the number in bits 22-0. The range of values for integers is

\[ +2^{23} - 1 \geq \text{integer} \geq -2^{23} \]

A sign of "1" indicates a negative number and the binary point is considered to be to the right of bit zero.

In a two's complement number system, one more negative number may be represented than positive numbers. Consider the following for a four bit system:

<table>
<thead>
<tr>
<th>BINARY</th>
<th>DECIMAL</th>
</tr>
</thead>
<tbody>
<tr>
<td>0 1 1 1</td>
<td>+7</td>
</tr>
<tr>
<td>0 1 1 0</td>
<td>+6</td>
</tr>
<tr>
<td>:</td>
<td>:</td>
</tr>
<tr>
<td>0 0 0 0</td>
<td>0</td>
</tr>
<tr>
<td>1 1 1 1</td>
<td>-1</td>
</tr>
<tr>
<td>:</td>
<td>:</td>
</tr>
<tr>
<td>1 0 0 1</td>
<td>-7</td>
</tr>
<tr>
<td>1 0 0 0</td>
<td>-8</td>
</tr>
</tbody>
</table>
Plus seven is the largest positive integer and -8 is the largest negative integer. Since this -8 value does not have a positive "equivalent" we will see later that it requires special handling during the execution of certain PEPE instructions.

1.2.2 Double Integer Format

PEPE double integers are stored in the lower 24 bits of two PEPE words. The upper 8 bits of both words are always zero. Bit 23 of the lower word is always the same as bit 0 of the upper word. Bit 23 of the upper word is the sign bit.

```
          | S |
BIT 31  24  23  0
00000000
```

The range of values for double integers is

\[ +2^{47} - 1 \geq \text{double integer} \geq -2^{47} \]

The binary point (B.P.) is to right of bit 0 of the lower word. Double integers less than \( +2^{23} - 1 \) or greater than \( -2^{23} \) are completely represented in the lower word as standard PEPE integers.

1.2.3 Logical Format

A PEPE logical word is 32 bits long with no restriction as to its contents.
1.2.4 Floating Point Format

PEPE floating point numbers are represented in one 32 bit word. The fraction (or mantissa) part is stored in bits 23-0. The binary point is to the left of bit 22 and bit 23 is the sign bit of the fraction. The exponent (base 2) is stored in bits 31-24 with bit 31 being the sign bit. Both the fraction and exponent are carried in two's complement notation.

<table>
<thead>
<tr>
<th>BIT #</th>
<th>31</th>
<th>24</th>
<th>23</th>
<th>22</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>SE</td>
<td>SF</td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

B.P.

EXPONENT

FRACTION

SE - Sign Exponent
SF - Sign Fraction

All floating point numbers are normalized and this is maintained by all PEPE floating point instructions. The number is considered normalized if the most significant bit of the fraction (bit 22) is different from the sign of the fraction (bit 23).

The exponent can be considered as an 8 bit integer field. Range of the exponent field is:

\[ 127 \geq \text{exponent} \geq -128 \]

which can represent decimal values from (approximately):

\[ 10^{38} \geq \text{floating point} \geq 10^{-38} \]

and

\[ -10^{-38} \geq \text{floating point} \geq -10^{38}. \]

The range of the fraction is:

\[ 2^0 - 2^{-23} \geq \text{fraction} \geq -2^0 \]

which can represent decimal values from (approximately):

\[ 0.9999998+ \geq \text{fraction} \geq -1.0000000. \]
The smallest magnitude fraction is \(0.1192 \times 10^{-6}\). The fraction is always less than magnitude one except for one special case of \(-1.000000\). This results from the same effect as discussed under integer format. PEPE floating point format allows seven significant decimal digits in the fraction.

1.2.4.1 Floating Point Zero

In any floating point system, a mantissa of zero with any exponent has a value of zero. Thus:

\[0.0 \times 2^{127} = 0.0 \times 2^0 = 0.0 \times 2^{-128}\]

PEPE hardware must, though, be able to determine equality of numbers that may both be zero. For the hardware to know that \(0.0 \times 2^0 = 0 \times 2^{10}\), poses difficulties that can be avoided. In PEPE, we define one floating point zero. It is:

\[0.0 \times 2^{-128}\]

In a 32 bit word, this is a one in bit 31 and zeros in bits 30-0. All PEPE floating point instructions expect operands of zero in this form and will produce results of zero in this form. This representation can be considered the "smallest" zero in floating point format.
2.0 AU Architecture at the Register Level

The Arithmetic Unit is designed to perform floating point and integer arithmetic, boolean operations, shift operations, single-bit memory data packing, overflow detection and normalize operations. Two's complement arithmetic is used. Double precision integer add and subtract instructions can be accomplished through software.

The AU also is designed to perform element activity instructions. Activity instructions are used for selecting a set or subset of PEPE elements. The selected set is said to be active and will have its activity flip-flop set. Elements whose activity flip-flop is not set will be inactive and will not participate in selected parallel instructions until such time as they are set active.

The ACU controls each AU operation through the use of control lines from the PICU. Global operands are transmitted to the AU on 32 of these lines in a time-sharing mode. In addition, element memory data transfer to and from the AU is done on 32 bidirectional lines. All data output for global use is available on 32 output data lines which are time-shared with AU data output under global command from Output Data Control (ODC).

Each AU also provides two outputs for the Select Highest/Lowest logic. For observation of unit faults and as a programming aid, two overflow indicators are sent to the ICL.

The AU contains a fault flip-flop which can be set by the ACU, conditional on element activity. When this flip-flop is set it will not allow the AU, AU, and CU of the element to participate in any instruction other than "clear fault flip-flop", which will re-enable the failed elements.
Shown in block diagram form in Figures 2.1 and 2.2, the AU was built on two boards named AU1 and AU2. With the exception of the input receivers, output drivers, and a data alignment network, the AU can be functionally described as being composed of an arithmetic and an element activity section.

The arithmetic section contains three working registers (A, B, and Q), an adder, and data input switches. The B register is not programmer accessible. The activity section contains a tag register, a stack (shift) register, and an element activity flip-flop. Also included is the necessary logic and switching which operate in conjunction with these components.

The register level diagram of the AU shows all of the main AU data paths and control. Not shown is the details logic of functions such as the normalize decode network, zero and overflow detection, and the element activity. These items will be discussed in Section 4. The design is based on specific arithmetic algorithms described in Section 3.
Figure 2.1 AU Card 1
Figure 2.2 AU Card 2
2.1 AU Registers

The AU contains the following registers and flip-flops visible to the programmer:

A Register (PAAREG) - Implicit operand in most instructions.
32 bits in length.

Q Register (PAQREG) - Quotient register for the divide instruction.
Used as a scratch register in the multiply instruction. 32 bits in length.

Element Activity Flip-Flop (PAEACT) - A one bit register used to indicate the state of the AU. If reset, the AU will not participate in the execution of most instructions.

Activity Stack (PASTAK) - A 21 bit FILO stack used to store the PAEACT bit. An attempt to push a bit out of the bottom of stack will send an error indication to the ICL.

Tag Register (PATAGR) - An Eight bit register used to hold a tag loaded by the control unit. The tag can be used to permit or inhibit the execution of instructions in the AU. Different AU can contain different tags.

Overflow Flip-Flop (PAOVFF) - A one bit register used to indicate if an arithmetic overflow occurred within the AU. This flip-flop is sent as an error indicator to the ICL.

Fault Flip-Flop (PAFALT) - A one bit register used to completely disable the entire Processing Element. Programmatically removes PE from the Ensemble.
Other registers not visible to the programmer are:

**B Register (PABREG)** - Used to hold the explicit operand during most instructions. It holds the operand after receiving it from the control unit or the Element Memory. 32 bits in length.

**Shift Count Register (PASHCR)** - A six bit register used to hold the shift amount. Possible values are +31 to -32 with positive numbers indicating a right shift and negative numbers a left shift.

### 2.2 AU Functional Units

Major functional units of the AU are:

**Adder (PAADMN bits 23-0 and PAADEX for bits 31-24)** - A twos complement adder divided into a mantissa section and an exponent section. It is capable of both arithmetic and logical operations. Output is PAAOUT.

**Alignment Network (PAALNO)** - A 32 bit barrel shift network capable of 32 bit left and right logical shifts and 24 bit left and right arithmetic shifts. Input is PAALNI.

**Shift Count Two's Complement (PA2SCO)** - A six bit subtractor which subtracts PASHCR from zero. It is used to take a positive shift count and negate it to permit left shifts.

**Normalize Decode Network (PANRMD)** - A combinatorial network which counts the number of leading zeros or ones in a floating point fraction. Used to produce a count of 0 to 23 to left shift the fraction for normalization during floating point instructions. Shift count output is PANRMD, a 5 bit positive value.

**Search Conversion Logic (PASRCH)** - Logic used in the conversion micro-step of the Select Highest/Lowest instructions.
3.0 Instruction Set of the AU

The PEPE Arithmetic Unit can execute 71 Instructions that can be classified into six basic types:

1. Activity - those instructions which affect the Element Activity Flip-Flop or Activity Stack.

2. Integer - those instructions involved with the processing of integer data.

3. Logical/Data Transfer - those instructions involved with the processing of logical data.

4. Floating Point - those instructions involved with the processing of floating point data.

5. Output - those instructions which cause the AU to transmit data to the EM or the ACU.

6. Distributed - those instructions which operate over a set of AU's rather than within a single AU.

Appendix C lists the instruction set in several different formats and classes each into one of the six types. The following sections will describe each type in general and will describe in detail the more interesting instructions of each type.
3.1 Activity Instructions

As mentioned in Section 2.0, the Element Activity (EA) Flip-Flop controls whether its particular AU will participate in the currently broadcast parallel instruction. Maximum use of this flip-flop is made to facilitate PEPE's associative nature. The select instructions cause the EA to set or reset depending on the result of an internal comparison between two pieces of data. For example, during Select on Not Equal Global (SNG-107), a comparison is made between a global operand and the AU A Register. The EA is reset in all AU performing an equal comparison.
3.2 Integer Instructions

3.2.1 Single Integer

The AU integer instructions perform addition, subtraction and multiplication on the 24 bit integers described in Section 1.2.1.

The implicit operand is always the A Register. The explicit operand can be global or it can be fetched from the local EM. The Arithmetic Logic Unit (ALU) performs only addition. Subtraction is accomplished by complementing the subtrahend, introducing a carry into the lower order bit and then adding. Integer multiply is described in Section 3.4.3.

Instructions are: ADI
SBI
MLI

All AU integer instructions check for overflow. During addition, overflow is generated if the result sign is different than the two operand signs if they are the same. During subtraction, overflow is generated if the result sign is different than the minuend sign if the minuend sign is different from the subtrahend sign. The overflow signal is sent as an error interrupt to the ICL.

3.2.2 Double Integer

The AU can process double integer values using a sequence of instructions. First, the lower words must be fetched. The operation can then be performed and the result stored. Then the upper words are fetched, operated on and stored. A flip-flop is used to store a borrow or carry between the lower and upper words.

Instructions are: LADI
LSBI
UADI
USBI

Overflow is checked when processing upper words.
3.3 Logical/Data Transfer Instructions

This class of instructions includes both those that perform logical (AND, OR) operations and those that cause the movement of data.

3.3.1 Logical Instructions

The AU performs logical instructions through the ALU. The A Register is one operand and the B Register is loaded with the other operand. As in most instructions the B data can be global or it can be fetched from the local EM. The result is placed in the A Register.

3.3.2 Data Transfer Instructions

These instructions are classed as logical instructions because they handle data as 32 bit words without regard to their internal format. Data transfer can occur over the paths shown in Figure 3.1.
Instructions:  
LDA  STGA  
LDE  STGI  
LDQ  TAQ  
LTAG  TIDA  
SHL  TQA  
STG  

Figure 3.1 User Controlled Data Paths
3.4 Floating Point Instructions

This class of instructions make up the more interesting operations possible on PEPE. They operate on the PEPE floating point (FP) data representation described in section 1.2.4.

3.4.1 Float/Fix

The Float instruction (FLOT) takes a 24 bit PEPE integer and converts it into a floating point number. This is done by normalizing and adding an exponent.

FLOT is performed in three steps as follows:

1. Count leading "zeros" for positive numbers (leading "ones" for negative) starting at bit 22 down.
2. Shift the 24 bit integer the counted number of places left (filling with "zeros") thus normalizing it.
3. Subtract the count from 23 to form the exponent of the new FP number.

The subtraction from 23 is required because in the conversion, the binary point is moved 23 places from the right of bit 0 to the right of bit 23. All integers can be converted to FP.

The Fix (FIX) instruction is more complicated because not all FP numbers can be represented as integers. FIX takes a FP number and converts it to an integer. Consider that the PEPE FP number can be mathematically represented by:

$$\pm M + X/Y$$

where M is a whole part and X/Y is the fraction part. FIX is defined to extract the whole part. For example:

<table>
<thead>
<tr>
<th>Value</th>
<th>FIXed</th>
</tr>
</thead>
<tbody>
<tr>
<td>5.3</td>
<td>5</td>
</tr>
<tr>
<td>5.8</td>
<td>5</td>
</tr>
<tr>
<td>-5.3</td>
<td>-6</td>
</tr>
<tr>
<td>-5.8</td>
<td>-6</td>
</tr>
</tbody>
</table>
The last two examples are so because -5.3 in two's complement form is really -6 + .7.

The FIX algorithm executes as follows:
1. First check the exponent. If it is greater than 23, we cannot fix the number. An overflow indication is sent to the ICL.
2. Subtract the exponent from 23. Save this number as a shift count.
3. Shift the fraction right extending bit 23, the sign bit. Zero out the exponent.
4. If any significant bits were shifted out, add one to the new integer if it was negative.

The fourth step handles the two's complement negative numbers.

3.4.2 Add/Subtract

FP add (ADD) and subtract (SB) are identical except for the actual arithmetic operation performed.

The ADD algorithm is:
1. The two exponents are subtracted and the result is saved as a shift count.
2. The fraction with the smaller exponent is shifted in order to make the exponents equal.
3. The add or subtract is performed.
4. If the fraction sum overflows, an attempt is made to right shift the fraction and increment the exponent to correct the overflow.
5. If the exponent overflows, an error indication is sent to the ICL.

If the exponent difference is greater than 23, one of the fractions will be shifted completely out of the register. This is not an error condition but must be handled specially (because the shifter is not defined to operate with a greater than 23 arithmetic shift). The result of the
exponent subtraction is held in two flip-flops which cause the A or B input to the ALU to be zeroed during step 3.

An ADD example:

\[
\begin{align*}
0.1 \times 2^30 & \text{ becomes: } 0.1 \times 2^30 + 0.001 \times 2^30 = 0.101 \times 2^30 \\
+ 0.1 \times 2^28 & \\
\end{align*}
\]

also:

\[
\begin{align*}
0.1 \times 2^30 & \text{ becomes: } 0.1 \times 2^30 + 0.0 \times 2^30 = 0.1 \times 2^30 \\
+ 0.1 \times 2^1 & \\
\end{align*}
\]

3.4.3 Multiply

Several multiply algorithms were considered for the AU.

The simplest is a bit by bit algorithm. That is, if a bit is "one" in the multiplier, add the multiplicand to a partial sum, shift it left one place and proceed to the next bit.

For example:

\[
\begin{align*}
00101 & = 5 \\
X 00101 & = 5 \\
\hline
1 & \text{ add multiplicand, shift } 00101 \\
0 & \text{ add } 0 , \text{ shift } 00000 \\
1 & \text{ add multiplicand, shift } 00101 \\
0 & \text{ add } 0 , \text{ shift } 00000 \\
0 & \text{ add } 0 , \text{ shift } 0000011001 \\
\hline
& = 25
\end{align*}
\]

This method requires one step per bit in the multiplier. In the AU it would have required 24 steps.

A second method, as an extension of the first, is to take 2 bits of the multiplier at a time.

<table>
<thead>
<tr>
<th>Multiplier Bits</th>
<th>Action</th>
</tr>
</thead>
<tbody>
<tr>
<td>00</td>
<td>add 0</td>
</tr>
<tr>
<td>01</td>
<td>add 1 \times multiplicand, shift two</td>
</tr>
<tr>
<td>10</td>
<td>add 2 \times multiplicand, shift two</td>
</tr>
<tr>
<td>11</td>
<td>add 3 \times multiplicand, shift two</td>
</tr>
</tbody>
</table>

Repeat this for each pair of bits in the multiplier.
For example:

\[
\begin{array}{c}
00101 = 5 \\
\times \quad 01001 = 9 \\
\hline
01 \quad \text{add multiplicand}, \text{ shift two} \\
10 \quad \text{add } 2 \times \text{ multiplicand}, \text{ shift two} \\
00 \quad \text{add } 0, \text{ shift two}
\end{array}
\]

\[
\begin{array}{c}
\text{00100} \\
\text{01010} \\
\text{00000}
\end{array}
\]

\[
\begin{array}{c}
000101101 \\
= 45
\end{array}
\]

This method needs one step per two bits in the multiplier and in the
AU could operate in 12 steps. The major difficulty with this algorithm
is the generation of the 3X multiplicand. 0 and 1X are easy to generate.
2X is just the multiplicand shifted left one place. We can, of course,
get 3X by adding 1X and 2X but this adds a step to the execution.

The algorithm used in the AU is a modified version of the two bit
multiply without the need for a 3X operand. The AU takes the two bits
plus a "look ahead", one bit of the next pair. From these three bits add
to the partial sum:

\[
\begin{array}{c}
000 \quad 0, \text{ shift two} \\
001 \quad 1X \text{ multiplicand, shift two} \\
010 \quad 1X \text{ multiplicand, shift two} \\
011 \quad 2X \text{ multiplicand, shift two} \\
100 \quad -2X \text{ multiplicand, shift two} \\
101 \quad -1X \text{ multiplicand, shift two} \\
110 \quad -1X \text{ multiplicand, shift two} \\
111 \quad 0, \text{ shift two}
\end{array}
\]

Thus, if we had a pair of bit groups as follows:

\[
\text{multiplier } 0110 = 6
\]

First, extend it right one zero bit and extend it to the left to
complete the last group of three

\[
\begin{array}{c}
0001100 \\
\text{1} \\
\text{2} \\
3
\end{array}
\]

\[
\begin{array}{c}
\text{Group 1 } 100 \quad -2 \times 2^0 = -2X \\
\text{Group 2 } 011 \quad 2 \times 2^2 = 8X \\
\text{Group 3 } 000 \quad 0 \times 2^4 = 0X \\
\end{array}
\]

\[
\begin{array}{c}
\hline
\text{6X}
\end{array}
\]

\[
\text{28}
\]
The result is 6 X the multiplicand. The first group served to correct by subtraction, what it knew was an average added by the second group. The 3 X operand becomes:

\[
\begin{align*}
&0011 \\
&\text{extend:} \\
&\quad 00110 \\
&\quad \underline{1} \\
&\quad 2 \\
&\text{Group 1} & 110 & -1 \times 2^0 = -1X \\
&\text{Group 2} & 001 & 1 \times 2^2 = \underline{4X} \\
& & & 3X
\end{align*}
\]

For our earlier example of 5 times 5:

\[
\begin{align*}
&00101 = 5 \\
&X 00101 = 5 \\
&\text{Group 1} & 010 & 1 \times 2^0 = 1X \\
&\text{Group 2} & 010 & 1 \times 2^2 = 4X \\
&\text{Group 3} & 000 & 0 \times 2^4 = 0X \\
& & & 5X
\end{align*}
\]

Another example:

\[
\begin{align*}
&00101 = 5 \\
&X 01110 = 14 \\
&\text{Group 1} & 100 & -2 \times 2^0 = -2X \\
&\text{Group 2} & 111 & 0 \times 2^2 = 0X \\
&\text{Group 3} & 001 & 1 \times 2^4 = \underline{16X} \\
& & & 14X
\end{align*}
\]

This method, used in the AU, requires only 12 steps to perform the actual multiplication. The same algorithm operates correctly on both integer and floating point numbers.
3.4.4 Divide

The AU divide (DV) algorithm resembles standard longhand division. The algorithm divides positive numbers by positive numbers only. Therefore, the first thing the AU does is check if one or the other operand (but not both) are negative. If so, it sets a flip-flop to remember and then takes the two’s complement of any negative operands. The AU also requires that the quotient be less than +1 (definition of normalized positive fractions). The AU checks this by subtracting the divisor from the dividend and examining the sign of the result.

For example:

\[
0.11010 \div 0.10000 = (13/16 \div \frac{1}{2} = 13/8)
\]

\[
0.11010 \rightarrow \text{2's comp} \quad 0.11010
\]

\[
-0.10000 \quad \text{subtraction:} +1.10000
\]

\[
0.01010
\]

A negative result would indicate that the quotient will have a positive sign and a "one" in the MSB of the fraction (thereby less than +1). A positive result would indicate that the quotient will be greater than one (as in the example). To prevent this, the AU will shift the dividend one place right sign extended and increment the exponent by one.

Then:

\[
0.11010 \times 2^0 \text{ becomes } 0.01101 \times 2^1
\]

It is no longer normalized but the algorithm can handle that.
For the divide, the AU subtracts the corrected divisor from the dividend and follows these two rules: (by subtracting the divisor, the AU guesses that the quotient bit for this position is a "one").

1. If a "one" appears as the sign bit of the result of the subtraction, that means the guess of "one" as the quotient bit is incorrect. Therefore, the quotient bit is set to "zero", the result of the subtraction is discarded, and the dividend is shifted left zero fill one place.

2. If a "zero" appears as the sign bit of the result of the subtraction, the guess of "one" as the quotient bit is correct. Therefore, the quotient bit is set to "one" and the result of the subtraction is shift left zero fill one place to become the new dividend.

Continuing our example:

\[
\begin{array}{c|c|c|c|c}
\text{new} & 0.01101 & \text{Quotient} & \text{(First result bit is the sign and will always be "zero").} \\
\text{dividend} & + & \text{bit} & \\
\hline
0.11010 & & \rightarrow & 0. \\
1.10000 & & \rightarrow & 1 \\
1.11101 & & \rightarrow & 1 \\
0.10100 & & \rightarrow & 1 \\
+ & & & \\
1.10000 & & \rightarrow & 0 \\
0.00100 & & \rightarrow & \text{1} \\
+ & & & \\
0.01000 & & \rightarrow & \text{1} \\
1.10000 & & \rightarrow & \\
1.11000 & & \rightarrow & 0 \\
0.10000 & & \rightarrow & \\
+ & & & \\
1.10000 & & \rightarrow & \\
0.00000 & & \rightarrow & 1 \\
+ & & & \\
0.00000 & & \rightarrow & \\
1.10000 & & \rightarrow & 0 \\
\end{array}
\]
Quotient is 0.11010. The result exponent is the dividend exponent minus the divisor exponent. For the example it is 1 - 0 = 1. The final quotient is

\[0.11010 \times 2^1 = 13/16 \times 2^1 = 13/8\]

The AU executes the iterative portion of the DV instruction in 24 steps.

3.4.5 Square Root

In base ten, a square root operation is a set of guesses and subtractions. The guess can be 0 to 9. In longhand decimal

```
\[\begin{array}{c|c|c|c}
\text{find} & \sqrt{104692} \\
\hline
\sqrt{10} & 46 & 92 \\
\hline
3 \times 3 & -9 \\
\hline
1 & 46 \\
62 \times 2 & -1 & 24 \\
\hline
22 & 92 \\
-19 & 39 \\
\hline
3 & 53 \\
\end{array}\]
```

If the guess of the root digit was too high as evidenced by a negative difference, the digit is decremented by one to become a new guess. The AU also guesses at the result, but in binary only two guesses are possible.

A longhand example in binary (Example A):

```
\[\begin{array}{c|c|c|c|c|c|c|c|c}
\text{\sqrt{10} } & 1 & 1 & 0 & 1 & 1 & 0 & 1 & 1 \\
\hline
1 \times 1 & -1 \\
\hline
1 & 11 \\
101 \times 1 & -1 & 01 \\
\hline
10 & 11 \\
1100 \times 0 & -0 & 00 \\
\hline
10 & 11 & 01 \\
11001 \times 1 & -1 & 10 & 01 \\
\hline
1 & 01 & 00 & 10 \\
110101 \times 1 & -0 & 11 & 01 & 01 \\
\hline
01 & 11 & 01 & 10 \\
1101101 \times 1 & -1 & 10 & 11 & 01 \\
\hline
0 & 0 & 10 & 01 \\
\end{array}\]
```

[Reference]
[Line #]
The AU guesses that each root bit is a "one", then performs a subtraction. If the guess turns out to be incorrect, the subtraction result is still used but a correction factor is added during the next subtraction. The AU proceeds as follows:

First, the exponent must be even. If it is not, the fraction is shifted right one place sign extended and one is subtracted from the exponent.

The first two bits of the square are taken. A guess is made that the first bit of the result will be a "one". The guess is subtracted from the operand. This result becomes the new operand and is concatenated with the next two bits of the square. The guess is then shifted left one place and the lower three bits are replaced by:

011 if the subtraction is negative
101 if the subtraction is positive.

This becomes the new guess. If the result of the last subtraction was negative, we add the new guess else we still subtract it. The process continues until all bits of the square are used. The root is contained in the last guess word. It is normalized and the exponent is shifted right one place sign extended to divide it by two and the instruction is complete.
For our earlier example:

\[ \sqrt{10} \]
-01 ← first guess
\[ 01 \quad 11 \]
\[ -01 \quad 01 \]
\[ + 10 \quad 11 \]
\[ 01 \quad 01 \quad 00 \quad 10 \]
\[ -00 \quad 11 \quad 01 \quad 01 \]
\[ 01 \quad 11 \quad 01 \quad 10 \]
\[ -01 \quad 10 \quad 11 \quad 01 \]

last guess → 11 01 11

normalize

0.110111

In line A4 of the longhand method, zero was subtracted because a guess of zero was used as the next bit of the answer. In line B4\(^1\) of the algorithm, a guess of "one" was subtracted (since the algorithm always guesses "one") and as seen in line B5\(^1\), a negative result indicates too large of a guess. Line B6\(^1\) applies the following insertion factor which replace the last three bits of the guess:

011

Two's complement of the new guess

Correct bit of the root.

The two's complement of the new guess serves to correct the over subtraction of this step during the next step. The "correction" field used as a result of a correct guess is:

101

New guess

Correct bit of the root.
In the example, the algorithm subtracted:

\[ 00 \quad 11 \quad 01 \quad [B4^1] \]

and produced a negative result. Therefore it must add this back during the next step while at the same time subtracting the new guess word.

\[
\begin{array}{cccc}
\text{add back to undo subtraction} & 00 & 11 & \text{current guess} \\
\text{subtract new guess} & 00 & 01 & 10 & 01 & B5^1 \\
\text{add for entire correction and new guess} & 00 & 01 & 10 & 11 & B6^1 \\
\end{array}
\]

The AU executes the iterative portion of the square root instruction in 24 steps.
3.5 Output Instructions

3.5.1 Store Instructions (STA)

All of the AU visible registers can be stored in Element Memory. During a single micro-step, the selected data is gated to the EM and written. Storable register are:

- A Register
- Q Register
- Activity Stack
- Tag Register
- EA Flip-Flop
- Overflow Flip-Flop

3.5.2 Output A Register Instruction (OTA)

A special instruction which executes both in the SCL and AU is used to retrieve a 32 bit word from one AU and place it in the SCL A Register. In preparation for the execution of an OTA instruction, appropriate select instructions should have been executed to leave only one AU active. When the SCL decodes an OTA, it sends a copy to the PICU while at the same time requesting use of the Output Data Bus (ODB) from ODC, Figure 3.2. When a bus cycle is granted, the AU gates its A Register onto the bus. After some signal propagation delay, the ODC responds to the SCL with a data ready line and the correct data.
The PAODAT Bus is 32 bits wide

Figure 3.2 Output Data Bus/OTA Instruction Block Diagram
3.6 Distributed Logic Instructions

PEPE also includes a set of logic called distributed logic. This logic is not contained in any one PEPE unit but is spread over the control console and the element bays.

3.6.1 Count AU

The SCL may at any time, request a count of the number of AU is with their EA Flip-Flops on. Special logic exists within each EB and the CC to perform this count. Shown in Figure 3.3, each AU in the ensemble presents the state of its EA to the count logic. The count logic presents a nine bit (0-255) count to the SCL which can be loaded into the SCL A Register during the execution of a Read Activity (RDA) instruction. The AU does not directly participate in this instruction.

Two other signals are generated by the AU Count logic. They are SAMANY and SAACTV. The signal SAMANY is a detection of activity count greater than one or "many" active. The signal SAACTV is also a detection of activity count, here for a count greater than zero. Both of these signals are transmitted to the SCL and SAMANY is also sent to the PICU.

3.6.1 Select Highest/Lowest Instructions

A need was identified in PEPE for a high speed algorithm to perform a maximum or minimum search over a set of data values in the element ensemble. The two search instructions are Select Highest and Select Lowest (SH/SL). The SH/SL instructions execute as follows: in the set of active AU's, reset element activity in each AU whose "A" Register contains a nonmaximal/minimal value relative to the set of active AU. This value comparison is done for the set of PEPE integer or normalized floating point numbers but never for a mix of both types.
Figure 3.3 Count AU Block Diagram
A discussion of valid operands for the SH/SL may be appropriate here. For simplicity, consider a machine with eight bit registers instead of 32 bit registers. Let four bits represent the fraction or integer and four bits represent the exponent. See figure 3.4. Note that the entire discussion following can be directly extrapolated to 24 bits fraction/integer and eight bit exponent.

When doing a search over a set of integers, SH/SL requires that the exponent and the sign of the exponent be zero. The integer value is searched for the largest/smallest value.

A search over a set of floating point numbers requires that the fraction be normalized. Normalized numbers are those whose SF and most significant bit of the fraction are different. As shown in figure 3.4, \( +\frac{1}{8} \) is not a valid floating point number for this reason. If \( +\frac{1}{8} \) were permitted, the numbers 0010 0001 and 0000 0100 would both have the value of \( +\frac{1}{4} \). The search algorithm as described later will report that the first binary pattern is larger than the second when they are actually equal.

Also note that the value of floating point zero can be represented in 16 different ways, that is with any exponent as long as the fraction and SF are all zero. In effect 0111 0000 = 0000 0000 = 1000 0000. Therefore, we define floating point zero to be the "smallest" zero representable or 1000 0000 which is \( 0 \times 2^{-8} \). Note that floating point zero is not normalized.
### Two's Complement Notation

<table>
<thead>
<tr>
<th>SIGN</th>
<th>MAGNITUDE</th>
<th>INTERPRETATION</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td></td>
<td>INTEGER</td>
</tr>
<tr>
<td>0</td>
<td>1 1 1</td>
<td>+7</td>
</tr>
<tr>
<td>0</td>
<td>1 1 0</td>
<td>+6</td>
</tr>
<tr>
<td>0</td>
<td>1 0 1</td>
<td>+5</td>
</tr>
<tr>
<td>0</td>
<td>1 0 0</td>
<td>+4</td>
</tr>
<tr>
<td>0</td>
<td>0 1 1</td>
<td>+3</td>
</tr>
<tr>
<td>0</td>
<td>0 1 0</td>
<td>+2</td>
</tr>
<tr>
<td>0</td>
<td>0 0 1</td>
<td>+1</td>
</tr>
<tr>
<td>0</td>
<td>0 0 0</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>1 1 1</td>
<td>-1</td>
</tr>
<tr>
<td>1</td>
<td>1 1 0</td>
<td>-2</td>
</tr>
<tr>
<td>1</td>
<td>1 0 1</td>
<td>-3</td>
</tr>
<tr>
<td>1</td>
<td>1 0 0</td>
<td>-4</td>
</tr>
<tr>
<td>1</td>
<td>0 1 1</td>
<td>-5</td>
</tr>
<tr>
<td>1</td>
<td>0 1 0</td>
<td>-6</td>
</tr>
<tr>
<td>1</td>
<td>0 0 1</td>
<td>-7</td>
</tr>
<tr>
<td>1</td>
<td>0 0 0</td>
<td>-8</td>
</tr>
</tbody>
</table>

* Not Valid Normalized Floating Point Patterns

**Figure 3.4 A Four Bit Number System**
3.6.2.1 Algorithm

The algorithm to be described executes in the AU and requires a maximum of 35 100 ns micro-steps. Execution consists of two parts, Conversion and Search.

3.6.2.2 Conversion

The first micro-step of SH/SL converts the set of values to be compared from the valid set of PEPE 2's complement number system into an ordered set of operands. This conversion is a mapping of PEPE numbers onto a 32 bit pure magnitude number line. In the case of Select Highest, the largest (most positive) PEPE number is converted to all ones and the smallest (most negative) PEPE number is converted to all zeros. For Select Lowest the mapping is reversed.

For the purposes of this explanation, the set of PEPE floating numbers is broken into 4 classes. Refer to figure 3.5. Column 1 represents the ordered set of 2's complement normalized PEPE operands. Top-most positive, bottom-least positive. Fraction sign (bit 23) is first and exponent sign bit 31 next, for both, 0 is positive. The exponent field (bits 30 to 24) is next. Fraction field is last (bits 22 to 0). The classes are:

Class 1 contains all operands with positive fraction and positive exponent.
Class 2 contains all operands with positive fraction and negative exponent.

Note the zero operand is the least class 2 element. Class 3 contains all operands with negative fraction and negative exponent. Here the exponent field is ordered all zeros to all ones. The fraction field is ordered by normalized negative values (zero with all ones to all zeros). This puts the operands in order (most to least positive) but not in decreasing binary
<table>
<thead>
<tr>
<th>Original Floating Point Number</th>
<th>Select Highest Mapping</th>
<th>Select Lowest Mapping</th>
</tr>
</thead>
<tbody>
<tr>
<td>SF SE EXP FRAC</td>
<td>SF SE EXP FRAC</td>
<td>SF SE EXP FRAC</td>
</tr>
<tr>
<td>0 0 1 - 111 - 1</td>
<td>1 1 1 - 111 - 1</td>
<td>0 0 0 - 000 - 0</td>
</tr>
<tr>
<td>0 0 1 - 110 - 1</td>
<td>1 1 1 - 110 - 0</td>
<td>0 0 0 - 001 - 1</td>
</tr>
<tr>
<td>0 0 0 - 011 - 1</td>
<td>1 1 0 - 011 - 1</td>
<td>0 0 1 - 100 - 0</td>
</tr>
<tr>
<td>0 0 0 - 010 - 0</td>
<td>1 1 0 - 010 - 0</td>
<td>0 0 1 - 101 - 1</td>
</tr>
<tr>
<td>0 1 1 - 111 - 1</td>
<td>0 1 0 - 111 - 1</td>
<td>0 1 0 - 000 - 0</td>
</tr>
<tr>
<td>0 1 1 - 110 - 0</td>
<td>0 1 0 - 110 - 0</td>
<td>0 1 0 - 001 - 1</td>
</tr>
<tr>
<td>0 1 0 - 011 - 1</td>
<td>0 1 0 - 011 - 1</td>
<td>0 1 1 - 100 - 0</td>
</tr>
<tr>
<td>0 1 0 - 010 - 0</td>
<td>0 1 0 - 010 - 0</td>
<td>0 1 1 - 101 - 1</td>
</tr>
<tr>
<td>0 1 0 - 000 - 0</td>
<td>0 1 0 - 000 - 0</td>
<td>0 1 1 - 111 - 1</td>
</tr>
<tr>
<td>1 1 0 - 011 - 1</td>
<td>0 1 1 - 101 - 1</td>
<td>1 0 0 - 010 - 0</td>
</tr>
<tr>
<td>1 1 0 - 000 - 0</td>
<td>0 1 1 - 100 - 0</td>
<td>1 0 0 - 011 - 1</td>
</tr>
<tr>
<td>1 1 1 - 101 - 1</td>
<td>0 1 0 - 011 - 1</td>
<td>1 0 1 - 110 - 0</td>
</tr>
<tr>
<td>1 1 1 - 100 - 0</td>
<td>0 1 0 - 000 - 0</td>
<td>1 0 1 - 111 - 1</td>
</tr>
</tbody>
</table>

**Figure 3.5** Number Classes and Conversion Floating Point

43
sequence. Class 4 contains all operands with negative fraction and positive exponent. It is ordered similarly to class 3. Column 2 shows the converted values for select highest and Column 3 shows the converted values for select lowest.

The set of PEPE integers is broken into 2 classes. Refer to figure 3.6. Column 1 represents the ordered set 2's complement PEPE integers: top-most positive, bottom-least positive. Fraction sign (bit 23) is first, zero is positive. Exponent sign and exponent are all zero. Fraction field is last (bits 22 to 0). The classes are:

Class 1 contains all positive integers.

Class 2 contains all negative integers.

Column 2 shows converted values for SH, and column 3 shows the converted values for SL. Operation of SH/SL on integer values can be considered a subset of floating point operation when the exponent is zero.

Rules for either floating point or integer conversion are:

**Select Highest (SH)**

1. Complement bit 31 if bit 23 equals zero
2. Complement bits (30-24) if bit 23 equals one
3. Bits (22-0) remain unchanged
4. Complement bit 23

**Select Lowest (SL)**

1. Complement bit 31 if bit 23 equals one
2. Complement bits (30-24) if bit 23 equals zero
3. Complement bits (22-0)
4. Bit 23 remains unchanged
<table>
<thead>
<tr>
<th>Original Integer Number</th>
<th>Select Highest Mapping</th>
<th>Select Lowest Mapping</th>
</tr>
</thead>
<tbody>
<tr>
<td><strong>SI XX XXX INT</strong></td>
<td><strong>SI XX XXX INT</strong></td>
<td><strong>SI XX XXX INT</strong></td>
</tr>
<tr>
<td>0 0 0 - 011 - 1</td>
<td>1 1 0 - 011 - 1</td>
<td>0 0 1 - 100 - 0</td>
</tr>
<tr>
<td>0 0 0 - 011 - 0</td>
<td>1 1 0 - 011 - 0</td>
<td>0 0 1 - 130 - 1</td>
</tr>
<tr>
<td>0 0 0 - 000 - 1</td>
<td>1 1 0 - 000 - 1</td>
<td>0 0 1 - 111 - 0</td>
</tr>
<tr>
<td>0 0 0 - 000 - 0</td>
<td>1 1 0 - 000 - 0</td>
<td>0 0 1 - 111 - 1</td>
</tr>
<tr>
<td><strong>CLASS 1</strong></td>
<td></td>
<td>(ZERO)</td>
</tr>
<tr>
<td>1 0 0 - 011 - 1</td>
<td>0 0 1 - 111 - 1</td>
<td>1 1 0 - 000 - 0</td>
</tr>
<tr>
<td>1 0 0 - 011 - 0</td>
<td>0 0 1 - 111 - 0</td>
<td>1 1 0 - 000 - 1</td>
</tr>
<tr>
<td>1 0 0 - 000 - 1</td>
<td>0 0 1 - 100 - 1</td>
<td>1 1 0 - 011 - 0</td>
</tr>
<tr>
<td>1 0 0 - 000 - 0</td>
<td>0 0 1 - 100 - 0</td>
<td>1 1 0 - 011 - 1</td>
</tr>
</tbody>
</table>

**Figure 3.6** Number Classes and Conversion Integer
The conversion is done in the ALU of the element. Following this mapping, each element contains a value in pure 32 bit magnitude representation with the same relative "value" as the original PEPE number. All that remains is to compare this value in all active elements and leave the element(s) with the maximum/minimum value active.

3.6.2.3 Search

Figure 3.7 illustrates the basic principle behind the search function. One bit of the converted value in each active element is gated out of the element and into the Signal Distribution System (SDS). The SDS in PEPE is that portion of logic that enables the control console and the element bays to communicate. The SDS can be considered to encompass the distributed logic. In the SDS, they are "OR"ed together and transmitted back to each element. The line sent back to each element indicates that at least one of the active elements has a "1" in the bit position being examined. A decision is then made in each element to reset the element activity if that element has a "0" in the bit position and at least one element has a "1". It is clear that any element with a "1" in a particular bit position has a number of greater magnitude than an element with a "0" in that position.

Due to PEPE layout constraints, the propagation delay encountered in the "OR" circuit will be on the order of 200 ns. Therefore one bit each 200 ns is the maximum rate in the "OR" circuit and a SH/SL operation over the full 32 bit word would take 6400 ns. In order to reduce this time, the method in Figure 3.8 was used. Two bits each 200 ns are output by the element for a search time of 3200 ns.
Figure 3.7 SH/SL One Bit Per Cycle
Figure 3.8 shows that 2 bits are decoded and sent out of the element on 3 lines. Each of the 3 lines from all of the elements are "OR"ed and returned to the element. If the element has 2 bits that are less than indicated by the 3 reset lines, the element resets its element activity. Figure 3.9 shows the truth table for this reset function. $\alpha$ and $\beta$ are the 2 bits being examined, A, B and C are the 3 output lines of the element, X, Y and Z are the 3 "OR"s of all the elements A, B and C lines, and RESET is the signal to reset the element activity flip-flop. Line A equals "1" when both bits being examined are binary "1". Line B equals "1" when the high order bit is a "1" and the low order bit is a "0". Line C equals 1 when the low order bit is a "1" and the high order bit is a "0". All 3 lines are "0" when both bits are zero. These 3 lines are "OR"ed in the CSD/BSD with the corresponding lines from all the other active elements and these 3 signals are returned to every element. The element then compares its local A, B, C lines with the global X, Y, Z lines and any elements whose ABC lines indicate its 2 bits are less than the XYZ lines show for the entire system, will reset its element activity.
Figure 3.8 SH/SL Two Bits Per Cycle
A, B, C Lines Truth Table

<table>
<thead>
<tr>
<th>ALPHA</th>
<th>BETA</th>
<th>A</th>
<th>B</th>
<th>C</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
</tr>
</tbody>
</table>

\[ A = \text{ALPHA} \cdot \text{BETA} \]
\[ B = \text{ALPHA} \cdot \overline{\text{BETA}} \]
\[ C = \text{ALPHA} \cdot \overline{\text{BETA}} \]

Reset Element Activity Truth Table

<table>
<thead>
<tr>
<th>ALPHA</th>
<th>BETA</th>
<th>A</th>
<th>B</th>
<th>C</th>
<th>X</th>
<th>Y</th>
<th>Z</th>
<th>RESET</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
</tr>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
</tr>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>0</td>
</tr>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>0</td>
</tr>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>0</td>
</tr>
</tbody>
</table>

\[ \text{RESET} = \overline{A}X + \overline{A}B\overline{Y} + ABCZ \]

Figure 3.9 Search: Reset Element Activity Function
The flip-flops $\alpha$ and $\beta$ contents are as follows during execution of SH/SL:

<table>
<thead>
<tr>
<th>u-step</th>
<th>$\alpha$</th>
<th>$\beta$</th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>0</td>
<td>0</td>
<td></td>
</tr>
<tr>
<td>2</td>
<td>A23*</td>
<td>B31</td>
<td></td>
</tr>
<tr>
<td>3</td>
<td>B30</td>
<td>B29</td>
<td></td>
</tr>
<tr>
<td>4</td>
<td>B28</td>
<td>B27</td>
<td></td>
</tr>
<tr>
<td>5</td>
<td>B26</td>
<td>B25</td>
<td></td>
</tr>
<tr>
<td>6</td>
<td>B24</td>
<td>B23*</td>
<td></td>
</tr>
<tr>
<td>7</td>
<td>B22</td>
<td>B21</td>
<td></td>
</tr>
<tr>
<td>8</td>
<td>B20</td>
<td>B19</td>
<td></td>
</tr>
<tr>
<td>9</td>
<td>B18</td>
<td>B17</td>
<td></td>
</tr>
<tr>
<td>10</td>
<td>B16</td>
<td>B15</td>
<td></td>
</tr>
<tr>
<td>11</td>
<td>B14</td>
<td>B13</td>
<td></td>
</tr>
<tr>
<td>12</td>
<td>B12</td>
<td>B11</td>
<td></td>
</tr>
<tr>
<td>13</td>
<td>B10</td>
<td>B09</td>
<td></td>
</tr>
<tr>
<td>14</td>
<td>B08</td>
<td>B07</td>
<td></td>
</tr>
<tr>
<td>15</td>
<td>B06</td>
<td>B05</td>
<td></td>
</tr>
<tr>
<td>16</td>
<td>B04</td>
<td>B03</td>
<td></td>
</tr>
<tr>
<td>17</td>
<td>B02</td>
<td>B01</td>
<td></td>
</tr>
<tr>
<td>18</td>
<td>B00</td>
<td>0</td>
<td></td>
</tr>
</tbody>
</table>

*Note that the effective bit 23 is examined twice. This was done to simplify the SH/SL logic. The second pass of bit 23 through the compare logic is a NO-OP.

**at the end of micro-step 35, $\alpha$ and $\beta$ reset to 0.
The PICU does not directly participate in the actual search. It sends controls to the elements directing the element to generate the A, B and C lines, to do the mapping, to compare the X, Y and Z lines and to reset the element activity if the compare so indicates. The PICU can end the search early. If during the execution of the search instruction, the element activity count goes to one, the PICU terminates execution of the instruction. This early end is possible because, if there is only one element left, it has the highest or lowest valued A register in the set of active elements. This logic is described in detail in section 4.2.

3.6.3 Select First

Following the execution of a SH, SL or other select instruction, many AU may still remain active. If we wish to extract a data item via the ODB, we require only one active AU. The Select First (SF) instruction, will cause all but one AU to reset its EA.

The one AU selected to remain active will be the "first" one found. "First" is defined as the AU within the lowest numbered PE. As shown in Figure 3.10, each AU supplies its EA to the priority logic. This logic then determines the lowest numbered AU and returns a pointer to this AU. The priority logic is always active. During the execution of a SF instruction, the AU is commanded to load the pointer ("one" or "zero") into its EA flip-flop thus selecting one AU.

The priority logic is contained within each Bay and the CC.
Processing Elements are Numbered on the Basis of Physical Location

Figure 3.10 Select First Block Diagram
4.0 Detail Design of the AU

In this section I will describe in detail the implementation of the algorithms explained in Section 3. After a description of the physical hardware, I will show how the PICU controls the AU. Finally, the special hardware which enables the AU to perform as an efficient parallel associative processor is described.

4.1 Hardware

The selection of the physical and electrical parameters of PEPE played an important role in its success.

4.1.1 Circuits - ECL

The AU is designed using readily available 10K series Emitter Coupled Logic (ECL). Appendix D lists the package types used along with the logic symbols and truth tables for the more complex Medium Scale Integration (MSI) chips.

The use of 10K series logic with its two nanosecond typical delay times permits the AU to execute at a speed of 100 ns per micro-step. With the exception of the 10181 each of these chips is mounted in a 16 pin Dual-In-Line Package (DIP). The 10181 is a 24 pin DIP.

In the remainder of this section, the five digit 10K gate type will generally be abbreviated to the last three digits.

4.1.2 Boards

The AU was fabricated on two large printed circuit boards. The boards, which were specially designed for PEPE, contain eight printed circuit layers. Four of these layers carry ECL signals. Each printed line is controlled during manufacture to maintain a 55 ohm impedance. Two of the remaining layers distribute voltages to the DIPs. Voltages used are a
VCC of +2 and a VEE of -3.2. Finally, the last two layers are ground, used both for DC reference and as a ground plane for the signal lines.

In addition to printed circuit, the board can be wirewrapped with open wire and miniature coaxial cable. All of the signal carrying media are controlled impedance (55 ohm) lines. Since these lines are similar to high frequency transmission cables, they are properly terminated with a 55 ohm resistive load to minimize reflections.

Shown in figure 4.1, the board can hold up to 300 16-pin DIPS. Since the ALU (181) is a 24-pin DIP, special provision was made to permit up to 10 24-pin sockets in the center of the board. Each 24 pin socket replaces two 16 pin sockets. In the final AU design, board AU1 contains 201 16-pin ECL DIPS, 3 24-pin 181 DIPS and 73 16 pin 55-terminator modules. Board AU2 contains 219 16-pin ECL DIPS, 6 24-pin 181 DIPS and 67 16 pin terminator modules.

Interconnection into the backplane for signals, power and ground is through four 100 pin connectors. 256 of the 400 pins are available for signals.

4.1.3 Backplane

Each AU board plugs into a 55 slot backplane/card rack assembly. The backplane can accommodate nine processing elements. It is a three layer laminated assembly used not only for physical support but also for power distribution. Power enters the board from the backplane via the two center 100 pin connectors which are mounted on the backplane.

Figure 4.2 shows a rear view of the Element Bay. The Bay houses four backplane/card rack assemblies and power supply for each row. The power bussing scheme is designed as a low inductance distribution network to provide power to the row with less than a 50 milli-volt drop along the five foot backplane.
Figure 4.1 PEPE Printed Circuit Board
Figure 4.2 Element Bay Backplane
All signal interconnections between boards of the PE are via the three bottom connectors of the boards. Miniature coaxial cable is used as the media. The top connector (P1) uses a specially designed signal distribution scheme using belted cable. The belted cable carries control signals from the BSD to the PE in the row. The regular pattern of cables seen in Figure 4.2 serve to outline each PE. The signalling method used on these cables is differential balanced pair. This provides immunity to both resistive voltage losses and common mode injected noise. The Output Data Bus is also carried on these belted cables.

The control signals reach these belts after originating in the PICU and travelling through the Signal Distribution System. Upon reaching the AU, they cause registers, selectors and functional units to execute the instruction decoded in the PICU. These control lines will be described in the remainder of this section. Appendix E will serve as a quick reference for signal names used in this section.
4.2 Parallel Instruction Control Unit

The control lines for the AU are generated by the Parallel Instruction Control Unit (PICU) in the Control Console. The PICU contains a Micro-Program Memory (MPM) that stores the control bits for each micro-step. The PICU also contains control logic for correct sequencing of each instruction.

4.2.1 Purpose/MPM

The ACU-PICU is part of the Arithmetic Control Unit and is supplied parallel instructions and data from the ACU-SCL. The purpose of the PICU is to cause parallel element AUs to properly perform parallel instructions by transmitting, through the Signal Distribution System, global control and data. The PICU must also interface with the Element Memory Control (EMC) to assure proper operation in the event of Element Memory access conflicts with the two other processors in each PE.

4.2.2 Operation

Figure 4.3 shows a block diagram of the ACU-PICU. The following paragraphs describe the operation of each block.

4.2.2.1 PICU Interfaces

4.2.2.1.1 ACU-SCL/PIQ

The PICU receives instructions and data from the SCL/PIQ. These data are presented to the PICU on 47 lines:

- SAIINF(31-0) 32 bit operand
- SAIINF(39-32) 8 bit opcode
- SAIINO(40) R (EM reference) bit
- SAIINF(45-41) not used in ACU-PICU
- SAIINF(46) EM look-ahead bit

The PICU signals with SAIEMT when it has completed execution of its parallel instruction. The SCL/PIQ signals with SAIREQ to the PICU when a new parallel instruction is available. The PICU acknowledges receipt of the instruction with SAIACK.
Figure 4.3 ACU-PICU Block Diagram
4.2.2.1.2 Element Memory Control (EMC)

When the PICU requires EM access during the course of a parallel instruction execution, it signals the EMC by:

1. Placing the EM address on address bus PADAIN
2. Signaling read or write with line PADMDE
3. Requesting access by raising line PADREQ

When the EMC has selected the ACU-PICU for service, it responds with PADSEL. The PICU then utilizes this signal to gate the EM data bus to or from the AU.

4.2.2.1.3 Signal Distribution System (SDS)

The PICU receives one signal from the SDS. Named, SAMANY, it is derived from the AU count logic (see section 3.6.1) and indicates that more than one AU in the ensemble are active.

Since the AU contain little execution control logic (they are composed of registers and selectors), all switching control and data strobe signals are generated by the PICU and transmitted to the ensemble of AU through the SDS.

4.2.2.1.4 Output Data Control (ODC)

The ODC transmits PAOSEL to the ACU-PICU during the execution of an OTA instruction. The PICU then causes the active AU to place the contents of its A Register on the Output Data Bus.

4.2.2.1.5 Intercommunication Logic (ICL)

A special purpose interface exists to the ICL. It does not affect operation of the AU and will not be discussed here.
4.2.2.2 Micro-Program Memory

The MPM consists of 1024 words each 80 bits wide. The memory is addressed with 9 bits, the 8 bit opcode and the R bit. The next address field contains the address of the next micro-step. This field allows jumps within the MPM. The global control line field bits are used to define steering gates and register strobes in the ensemble AU which allow data to pass from one register to another. These lines are discussed in detail in Section 4.3.

The local PICU control line field is used to control operations within the PICU. Some of the local control bits are listed:

- **PADREQ** Request for EM cycle to EMC.
- **PADMDE** R/W indicator to EMC.
- **PAMEOI** End of instruction bit. Causes the control to switch the next parallel instruction opcode from the SCL/PIQ into the memory address register. Otherwise the MPM next address field is used.

- **PAMGDT** A two bit field used to select one of the following as input to bits 31-0 of the PAMORL Output Register:

1. **Operand from the SCL/PIQ**
   
   \[(PAMOBR(31-0))\]

2. **Mask Generation Logic**

3. **MPM Global Control Bits (31-0)**

- **PAMEXT** Set to "1" for each micro-step of a Select Highest/Lowest instruction. It is used to affect an early exit from these instruction when only one AU remains active. Section 3.6.2 describes its use in more detail.

The MPM is constructed of 1024 X 1 bit RAM and is loaded from the Test and Maintenance Computer.
4.2.2.3 Memory Address Register (PAMMAR)

PAMMAR supplies address lines to the MPM. It is loaded from the MPM next address field or from SCL/PIQ lines 40-32.

4.2.2.4 Operand Buffer Register (PAMOBR)

Thirty-two bits from the SCL/PIQ are gated into the PAMOBR when the parallel instruction is transferred into the PICU. The data represents the parallel instruction operand. It may be required during any, or none, of the micro-steps of the instruction. The Output Register selector will determine, based on PAMGDT, if and when the PAMOBR will be selected onto the PAMORL lines.

When a request is made to EMC, the lower ten bits are sent as the EM address.

Bits 15-11 are transferred to the Mask Generate Logic during execution of the instructions RBIT, SBIT, PSEL and SELB.

4.2.2.5 Output Register (PAMORL)

The PAMORL holds the current micro-step data and control being transmitted to the ensemble AU. Input to the register is controlled by PAMGDT (paragraph 4.2.2.2).

4.2.2.6 Mask Generate Logic

In order to make efficient use of EM, instructions are provided which operate on a single bit of memory at a time. The instructions are SBIT, RBIT, PSEL and SELB. They are implemented in the AU by performing logical operations over the entire 32-bit data word. To protect the 31 bits that are not involved in the instruction, the PICU generates a mask.
The mask for RBIT will contain "ones" in every bit position except for the bit indicated by PAMOBR(15-11). The mask for SBIT, PSEL and SELB will be all "zeros" with a single "one".

4.2.2.7 Timing and Control

This block performs all of the control functions of the PICU and handles all of the interfaces.

It generates PAGEN, the general clock-strobe, to the AU, causing the AU to execute the current micro-step.

4.2.2.8 The R bit

The R or routing bit is generated by the SCL when decoding the instruction. If the programmer specified a global operand, the R bit is "zero" and the PAMOBR is passed to the AU. If an EM operand was specified, the R bit is a "one". The PICU requests an EM cycle and causes the AU to use its local EM data bus as the source of the operand. The PICU implements this by using the R bit as an address bit in accessing MPM. The AU is then caused to execute a different micro-sequence based on the operand source.
4.3 Detail Block Diagram

Figure 4.4 shows a detail block diagram of the AU. Depicted are all of the registers, flip-flops, selectors, functional units and data paths. Each of these receives a control line originating in the PICU MPM. That control line or lines causes actions which enable the AU to perform instructions. Table 4.1 relates the bit numbers of the PICU Output Register (PAMORL) to the AU control line names.

4.3.1 Registers/Flip-Flops

Table 4.2 lists the control lines for the registers in the AU and Table 4.3 lists the lines for the flip-flops. A "one" on the control line causes the storage element to load the contents of its input lines under the following conditions:

- Since the instruction stream to the AU may not be continuous, signal PAGEN is sent to all AU by the PICU. If PAGEN is "zero" no storage elements are permitted to change state in the AU.

- If the AU is inactive (PAEACT is reset), no user visible registers are permitted to change state with minor exceptions. The instructions which execute in an inactive AU are only those which can activate the AU.

- If the entire PE is faulted (PAFALT is set), the AU will not respond to any instruction except "clear fault" (CF). No user visible storage element is permitted to change state.

4.3.1.1 A Register Mantissa PAAREG(23-0)

STRODE=PAEACT · PAFALT · PAMORL19
Figure 4.4a PEPE AU Detailed Block Diagram
Figure 4.4b PEPE AU Detail Block Diagram
<table>
<thead>
<tr>
<th>BIT NO.</th>
<th>SIGNAL NAME</th>
<th>DEFINITION</th>
</tr>
</thead>
<tbody>
<tr>
<td>00</td>
<td>#PAADM00</td>
<td>Adder Control (ALU)</td>
</tr>
<tr>
<td>01</td>
<td>#PAADM01</td>
<td>Adder Control</td>
</tr>
<tr>
<td>02</td>
<td>#PAADM02</td>
<td>Adder mantissa, A input select</td>
</tr>
<tr>
<td>03</td>
<td>#PAADM03</td>
<td></td>
</tr>
<tr>
<td>04</td>
<td>#PAAMA00</td>
<td>Adder mantissa, A input select</td>
</tr>
<tr>
<td>05</td>
<td>#PAAMA01</td>
<td>Adder mantissa, B input select</td>
</tr>
<tr>
<td>06</td>
<td>#PAAMA02</td>
<td>Adder mantissa, B input select</td>
</tr>
<tr>
<td>07</td>
<td>#PAAMBO0</td>
<td>Adder exponent, B input select</td>
</tr>
<tr>
<td>08</td>
<td>#PAAMBO1</td>
<td>Adder exponent, B input select</td>
</tr>
<tr>
<td>09</td>
<td>#PAAEBO0</td>
<td>Adder exponent, A input select</td>
</tr>
<tr>
<td>10</td>
<td>#PAAEBO1</td>
<td></td>
</tr>
<tr>
<td>11</td>
<td>#PAAEA00</td>
<td>Adder exponent, A input select</td>
</tr>
<tr>
<td>12</td>
<td>#PAAEA01</td>
<td>A register, mantissa input select</td>
</tr>
<tr>
<td>13</td>
<td>#PAAEA02</td>
<td>A register, mantissa input select</td>
</tr>
<tr>
<td>14</td>
<td>#PAAMS00</td>
<td>A register, exponent input select</td>
</tr>
<tr>
<td>15</td>
<td>#PAAMS01</td>
<td></td>
</tr>
<tr>
<td>16</td>
<td>#PAAES00</td>
<td></td>
</tr>
<tr>
<td>17</td>
<td>#PAAES01</td>
<td></td>
</tr>
<tr>
<td>18</td>
<td>#PAAES02</td>
<td></td>
</tr>
<tr>
<td>19</td>
<td>#PAAMM</td>
<td>A register, exponent input select</td>
</tr>
<tr>
<td>20</td>
<td>#PAEM</td>
<td>Clock enable, A mantissa</td>
</tr>
<tr>
<td>21</td>
<td>#PAQMS00</td>
<td>Clock enable, A exponent</td>
</tr>
<tr>
<td>22</td>
<td>#PAQMS01</td>
<td>Q register, mantissa input select</td>
</tr>
<tr>
<td>23</td>
<td>#PAQES00</td>
<td>Q register, mantissa input select</td>
</tr>
<tr>
<td>24</td>
<td>#PAINS00</td>
<td>Q register, exponent input select</td>
</tr>
<tr>
<td>25</td>
<td>#PAINS01</td>
<td>Local control decode</td>
</tr>
<tr>
<td>26</td>
<td>#PAINS02</td>
<td></td>
</tr>
<tr>
<td>27</td>
<td>#PAINS03</td>
<td></td>
</tr>
<tr>
<td>28</td>
<td>#PAINS04</td>
<td></td>
</tr>
<tr>
<td>29</td>
<td>#PAAL0--</td>
<td>Local control decode</td>
</tr>
<tr>
<td>30</td>
<td>#PAALNO0</td>
<td>Alignment network control</td>
</tr>
<tr>
<td>31</td>
<td>#PAALNO1</td>
<td>Alignment network input select</td>
</tr>
</tbody>
</table>

**FOR**

<table>
<thead>
<tr>
<th>BIT NO.</th>
<th>SIGNAL NAME</th>
<th>DEFINITION</th>
</tr>
</thead>
<tbody>
<tr>
<td>31-00</td>
<td>PAXDIN</td>
<td>Global Data From PICU</td>
</tr>
</tbody>
</table>

**TABLE 4.1, CONTROL SIGNALS, AU**
<table>
<thead>
<tr>
<th>BIT NO.</th>
<th>SIGNAL NAME</th>
<th>DEFINITION</th>
</tr>
</thead>
<tbody>
<tr>
<td>32</td>
<td>#PASTK</td>
<td>Clock enable, stack register</td>
</tr>
<tr>
<td>33</td>
<td>#PAEZ00</td>
<td>Element activity control decoder input</td>
</tr>
<tr>
<td>34</td>
<td>#PAEZ01</td>
<td>Element activity control decoder input</td>
</tr>
<tr>
<td>35</td>
<td>#PAEZ02</td>
<td>Element activity control decoder input</td>
</tr>
<tr>
<td>36</td>
<td>#PAEZ03</td>
<td>Element activity control decoder input</td>
</tr>
<tr>
<td>37</td>
<td>#PAEZ04</td>
<td>Element activity control decoder input</td>
</tr>
<tr>
<td>38</td>
<td>#PATGS00</td>
<td>Tag register, input select</td>
</tr>
<tr>
<td>39</td>
<td>#PATGS01</td>
<td>Tag register, input select</td>
</tr>
<tr>
<td>40</td>
<td>#PAEAC</td>
<td>Clock enable, element activity flip-flop</td>
</tr>
<tr>
<td>41</td>
<td>#PADPC</td>
<td>Clock enable, double precision carry</td>
</tr>
<tr>
<td>42</td>
<td>#PATAG</td>
<td>Clock enable, tag register</td>
</tr>
<tr>
<td>43</td>
<td>#PASTK00</td>
<td>Stack register, input select</td>
</tr>
<tr>
<td>44</td>
<td>#PASTK01</td>
<td>Stack register, input select</td>
</tr>
<tr>
<td></td>
<td><strong>SIGNAL NAME</strong></td>
<td><strong>FOR 49=0</strong></td>
</tr>
<tr>
<td>32</td>
<td>#PANOR</td>
<td>Alignment network control</td>
</tr>
<tr>
<td>33</td>
<td>#PAFIX</td>
<td>Special control inhibit A reg mantissa</td>
</tr>
<tr>
<td></td>
<td></td>
<td>Clock during microstep 5 of FIX instruction</td>
</tr>
<tr>
<td>34</td>
<td>#PATOV</td>
<td>Clock enable, temporary overflow hold FF</td>
</tr>
<tr>
<td>35</td>
<td>#PACO-23</td>
<td>Clock enable, carry out 23 hold FF</td>
</tr>
<tr>
<td>36</td>
<td>#PAQMM</td>
<td>Clock enable, Q mantissa</td>
</tr>
<tr>
<td>37</td>
<td>#PAQEM</td>
<td>Clock enable, Q exponent</td>
</tr>
<tr>
<td>38</td>
<td>#PABMSS00</td>
<td>B register, mantissa input select</td>
</tr>
<tr>
<td>39</td>
<td>#PABMSS01</td>
<td>B register, mantissa input select</td>
</tr>
<tr>
<td>40</td>
<td>#PABES00</td>
<td>B register, exponent input select</td>
</tr>
<tr>
<td>41</td>
<td>#PABES01</td>
<td>B register, exponent input select</td>
</tr>
<tr>
<td>42</td>
<td>#PABM201</td>
<td>Clock enable, B mantissa bits 7-0</td>
</tr>
<tr>
<td>43</td>
<td>#PABM202</td>
<td>Clock enable, B mantissa bits 23-8</td>
</tr>
<tr>
<td>44</td>
<td>#PABEM</td>
<td>Clock enable, B exponent</td>
</tr>
<tr>
<td>45</td>
<td>#PAASC500</td>
<td>Shift control register, input select</td>
</tr>
<tr>
<td>46</td>
<td>#PAASC501</td>
<td>Shift control register, input select</td>
</tr>
<tr>
<td>47</td>
<td>#PAOV</td>
<td>Clock enable, PAOVFF flip-flop</td>
</tr>
<tr>
<td>48</td>
<td>#PADTC</td>
<td>Control signal, data time shared with control</td>
</tr>
<tr>
<td>49</td>
<td>#PACTC</td>
<td>Control signal, select control group</td>
</tr>
<tr>
<td>50</td>
<td>#PASCR</td>
<td>Clock enable, shift count register</td>
</tr>
<tr>
<td>51</td>
<td>#PARVM</td>
<td>PAOXIN input select</td>
</tr>
</tbody>
</table>

a) The meaning of PAMORL (31-0) lines is dependent on PAMORL 48.
b) The meaning of PAMORL (44-32) lines is dependent on PAMORL 49.

**TABLE 4.1, CONTROL SIGNALS, AU**
<table>
<thead>
<tr>
<th>REGISTER (Bit #)</th>
<th>CONTROL LINE</th>
</tr>
</thead>
<tbody>
<tr>
<td>$A_m$ (23-0)</td>
<td>PAMORL19</td>
</tr>
<tr>
<td>$A_x$ (31-24)</td>
<td>PAMORL20</td>
</tr>
<tr>
<td>$B_m$ (7-0)</td>
<td>PAMORL42</td>
</tr>
<tr>
<td>$B_m$ (23-8)</td>
<td>PAMORL43</td>
</tr>
<tr>
<td>$B_x$ (31-24)</td>
<td>PAMORL44</td>
</tr>
<tr>
<td>$B_{xt}$ (Extension bit)</td>
<td>None</td>
</tr>
<tr>
<td>$Q_m$ (23-0)</td>
<td>PAMORL36</td>
</tr>
<tr>
<td>$Q_x$ (31-24)</td>
<td>PAMORL37</td>
</tr>
<tr>
<td>$Q_{xt}$ (Extension bit)</td>
<td>None</td>
</tr>
<tr>
<td>Shift Count Register</td>
<td>PAMORL50</td>
</tr>
<tr>
<td>Tag Register</td>
<td>PAMORL42</td>
</tr>
<tr>
<td>Activity Stack</td>
<td>PAMORL32</td>
</tr>
</tbody>
</table>

Table 4.2  Register Control
<table>
<thead>
<tr>
<th>CONTROL FLIP-FLOP</th>
<th>CONTROL LINE</th>
</tr>
</thead>
<tbody>
<tr>
<td>PAEACT</td>
<td>PARMORL40</td>
</tr>
<tr>
<td>PAFALT</td>
<td>Local Control</td>
</tr>
<tr>
<td>PAZROA</td>
<td>Local Control</td>
</tr>
<tr>
<td>PAZROB</td>
<td>Local Control</td>
</tr>
<tr>
<td>PASIGN</td>
<td>Local Control</td>
</tr>
<tr>
<td>PADPC</td>
<td>PARMORL41</td>
</tr>
<tr>
<td>PATOVF</td>
<td>PARMORL34</td>
</tr>
<tr>
<td>PAOVFF</td>
<td>PARMORL47</td>
</tr>
<tr>
<td>PACO23FF</td>
<td>PARMORL35</td>
</tr>
</tbody>
</table>

Table 4.3 Flip-Flop Control
4.3.1.2 A Register Exponent PAAREG(31-24)

\[ \text{STROBE} = \text{PAMORL20} \cdot \text{PAEACT} \cdot \text{PAFALT} \]

4.3.1.3 B Register Mantissa 1 PABREG(7-0)

\[ \text{STROBE} = \text{PAMORL42} \cdot \text{PAFALT} \]

4.3.1.4 B Register Mantissa 2 PABREG(23-8)

\[ \text{STROBE} = \text{PAMORL43} \cdot \text{PAFALT} \]

4.3.1.5 B Register Exponent PABREG(31-24)

\[ \text{STROBE} = \text{PAMORL44} \cdot \text{PAFALT} \]

4.3.1.6 Q Register Mantissa PAQREG(23-0)

\[ \text{STROBE} = \text{PAMORL36} \cdot \text{PAFALT} \cdot \text{PAEACT} \]

4.3.1.7 Q Register Exponent PAQREG(31-24)

\[ \text{STROBE} = \text{PAMORL37} \cdot \text{PAFALT} \cdot \text{PAEACT} \]

4.3.1.8 Shift Count Register PASHCR(5-0)

\[ \text{STROBE} = \text{PAMORL50} \cdot \text{PAEACT} \cdot \text{PAFALT} \]

4.3.1.9 Tag Register PATAGR(7-0)

\[ \text{STROBE} = \text{PAMORL42} \cdot \text{PAFALT} \]

4.3.1.10 Activity Stack PASTAK(21-0)

\[ \text{STROBE} = \text{PAMORL32} \cdot \text{PAFALT} \]

4.3.1.11 B Register Extension Bit PABEXT

\[ \text{Q Register Extension Bit PAQEXT} \]

These two single bit registers are used during the multiply instructions. They are continuously clocked. Input to PABEXT is always bit 1 of the adder. Input to PAQEXT is the result of Control Point 24. Used during the ML instruction, see 4.3.4.3.
4.3.1.12 Element Activity Flip-Flop PAEACT

\[ \text{STROBE} = (\text{PAMORL40+PAMORL}(28-24)=29) \cdot \overline{\text{PAFALT}} \]

Note: This is a special use of PAMORL(28-24). PAMORL40 is the control bit for the flip-flop only when PAMORL49 = 1. When PAMORL40 = 0, PAMORL(28-24) = 29 can be used in place of the control bit.

4.3.1.13 F.P. Add Zero A Input PAZROA
F.P. Add Zero B Input PAZROB

Set and reset by Local Control, see section 4.7.1. Used during the ADD/SB instruction, see 4.8.4.2.

4.3.1.14 DV Sign Flip-Flop PASIGN

Flip-Flop used during the DV instruction to remember to complement the result following the divide. Set and reset by Local Control, see Section 4.5.

4.3.1.15 Double Precision Carry FF PADPC

\[ \text{STROBE} = \text{PAMORL41} \cdot \text{PAEACT} \cdot \overline{\text{PAFALT}} \]

4.3.1.16 Temporary Overflow FF PATOVF

\[ \text{STROBE} = \text{PAMORL34} \]

4.3.1.17 Overflow FF PAOVFF

\[ \text{STROBE} = \text{PAMORL47} \cdot \text{PAEACT} \cdot \overline{\text{PAFALT}} \]

4.3.1.18 Carry Out of Bit 23 FF PACOFF23

\[ \text{STROBE} = \text{PAMORL35} \]

4.3.1.19 Fault Flip-Flop PAFALT

\[ \text{STROBE} = (\text{PAMORL}(28-24)=30) + (\text{PAMORL}(28-24)=31) \]
4.3.2 Selectors

Selectors or data multiplexers are numbered 1 through 24. Each selector, which may have several control lines, picks one of many possible inputs and presents that input to the register, flip-flop or functional unit attached.

4.3.2.1 B Register Exponent Input CP1

<table>
<thead>
<tr>
<th>PAMORL</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>41 40</td>
<td></td>
</tr>
<tr>
<td>0 0</td>
<td>31-24 = PAALNO(31-24)</td>
</tr>
<tr>
<td>0 1</td>
<td>31-24 = PAXDIN(31-24)</td>
</tr>
<tr>
<td>1 0</td>
<td>31-24 = PAAOUT(31-24)</td>
</tr>
<tr>
<td>1 1</td>
<td>31-24 = PASRCH(31-24)</td>
</tr>
</tbody>
</table>

4.3.2.2 B Register Mantissa Input CP2

<table>
<thead>
<tr>
<th>PAMORL</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>39 38</td>
<td></td>
</tr>
<tr>
<td>0 0</td>
<td>23-0 = PAAOUT(23-0)</td>
</tr>
<tr>
<td>0 1</td>
<td>23-0 = PAALNO(23-0)</td>
</tr>
<tr>
<td>1 0</td>
<td>23-0 = PAAOUT(23, 23, 23-2)</td>
</tr>
<tr>
<td>1 1</td>
<td>23-0 = PAXDIN(23-0)</td>
</tr>
</tbody>
</table>

4.3.2.3 A Register Exponent Input CP3

<table>
<thead>
<tr>
<th>PAMORL</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>18 17 16</td>
<td></td>
</tr>
<tr>
<td>0 0 0</td>
<td>31-24=exponent of -128</td>
</tr>
<tr>
<td>0 0 1</td>
<td>31-24=exponent of -128</td>
</tr>
<tr>
<td>0 1 0</td>
<td>31-24=ZERO</td>
</tr>
<tr>
<td>0 1 1</td>
<td>31-24=ZERO</td>
</tr>
<tr>
<td>1 0 0</td>
<td>31-24=PAAOUT(31-24)</td>
</tr>
<tr>
<td>1 0 1</td>
<td>31-24=PAALNO(31-24)</td>
</tr>
<tr>
<td>1 1 0</td>
<td>31-24=PAAREG(3i, 31-25)</td>
</tr>
<tr>
<td>1 1 1</td>
<td>31-24=PABREG(31-24)</td>
</tr>
</tbody>
</table>
4.3.2.4 A Register Mantissa Input CP4

BIT 22-0 truth table

<table>
<thead>
<tr>
<th></th>
<th></th>
<th>22-0 = PAAOUT(21-0), ZERO</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td></td>
</tr>
<tr>
<td>0</td>
<td>1</td>
<td>22-0 = PAALNO(22-0)</td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>22-0 = PAAOUT(22-0)</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>22-0 = PABREG(22-0)</td>
</tr>
</tbody>
</table>

4.3.2.5 Q Register Exponent Input CP5

<table>
<thead>
<tr>
<th>PAMORL23</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>31-24=PAAOUT(31-24)</td>
</tr>
<tr>
<td>1</td>
<td>31-24=PABREG(31-24)</td>
</tr>
</tbody>
</table>

4.3.2.6 Q Register Mantissa Input CP6

<table>
<thead>
<tr>
<th>PAMORL</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>22</td>
<td>21</td>
</tr>
</tbody>
</table>
| 0      | 0        | 23-0=PAQREG(22-0), "bit 0 input"
| 0      | 1        | 23-0=PAAOUT(0), PASEXT, PAQREG(23-2)
| 1      | 0        | 23-0=PAAOUT(23-0) |
| 1      | 1        | 23-0=PABREG(23-0) |

Q register, mantissa bit 0 input

bit 0 = PAMORL(28-24)≠18 • PAAOUT23

4.3.2.7 Input A Exponent Adder CP7

<table>
<thead>
<tr>
<th>PAMORL</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>13</td>
<td>12</td>
</tr>
</tbody>
</table>
| 0      | 0        | 0        | 31-24=PABREG(312-4)
| 0      | 0        | 1        | 31-24=ZERO            |
| 0      | 1        | 0        | 31-24=00010111        |
| 0      | 1        | 1        | 31-24=PASTAK(20-13)   |
| 1      | 0        | 0        | 31-24=PAQREG(31-24)   |
| 1      | 0        | 1        | 31-24=PAAREG(31-24)   |
| 1      | 1        | 0        | not defined           |
| 1      | 1        | 1        | not defined           |
### 4.3.2.8 Input B Exponent Adder CP8

<table>
<thead>
<tr>
<th>PAMORL</th>
</tr>
</thead>
<tbody>
<tr>
<td>10 09</td>
</tr>
<tr>
<td>0 0</td>
</tr>
<tr>
<td>0 1</td>
</tr>
<tr>
<td>1 0</td>
</tr>
<tr>
<td>1 1</td>
</tr>
</tbody>
</table>

**Function**

- 31-24 = PA2SCD(5,5,5-0)
- 31-24 = PAAREG(31-24)
- 31-24 = PABREG(31-24)
- 31-24 = ZERO

### 4.3.2.9 Input A Mantissa Adder CP9

<table>
<thead>
<tr>
<th>PAMORL</th>
</tr>
</thead>
<tbody>
<tr>
<td>06 05 04</td>
</tr>
<tr>
<td>0 0 0 0</td>
</tr>
<tr>
<td>0 0 1 0</td>
</tr>
<tr>
<td>0 1 1 0</td>
</tr>
<tr>
<td>1 0 1 0</td>
</tr>
<tr>
<td>1 1 1 0</td>
</tr>
</tbody>
</table>

**Function**

- 23-0 = PASTAK(12-0), PAEACT, PAOVFF, PAXDFP, PATAGR(7-0)
- 23-0 = PAQREG(23-0)
- 23-0 = PABREG(23-0)
- 23-0 = ZERO(23-16), PABREG(7-0), ZERO(7-0)
- 23-0 = PAQREG(23-0)
- 23-0 = PAAREG(23-0)
- 23-0 = ZERO
- 23-0 = PAAREG(21-0), PAQREG(22-21)

### 4.3.2.10 Input B Mantissa Adder CP10

<table>
<thead>
<tr>
<th>PAMORL</th>
</tr>
</thead>
<tbody>
<tr>
<td>08 07</td>
</tr>
<tr>
<td>0 0</td>
</tr>
<tr>
<td>0 1</td>
</tr>
<tr>
<td>1 0</td>
</tr>
<tr>
<td>1 1</td>
</tr>
</tbody>
</table>

**Function**

- 23-0 = PAAREG(23-0)
- 23-0 = PAAREG23, PAAREG23, PAAREG(22-1)
- 23-0 = PABREG(23-0)
- 23-0 = ZERO

### 4.3.2.11 Alignment Network/Shifter Mantissa CP11

<table>
<thead>
<tr>
<th>PAMORL</th>
</tr>
</thead>
<tbody>
<tr>
<td>31 30</td>
</tr>
<tr>
<td>0 0</td>
</tr>
<tr>
<td>0 1</td>
</tr>
<tr>
<td>1 0</td>
</tr>
<tr>
<td>1 1</td>
</tr>
</tbody>
</table>

**Function**

- 23-0 = PABREG(23-0)
- 23-0 = PAAREG(23-0)
- 23-0 = PAQREG(23-0)
- 23-0 = Undefined

Inputs (23-0) to alignment network also are gated to a leading zeros/ones counter. Output is a binary count PNRMD(5-0). See Section 4.4.4.1.
4.3.2.12 Alignment Network/Shifter Exponent CP12

Alignment network inputs (31-24) input select truth table

<table>
<thead>
<tr>
<th>PAMORL</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>30</td>
<td>31-24 = PABREG(31-24)</td>
</tr>
<tr>
<td>0</td>
<td>31-24 = PAAREG(31-24)</td>
</tr>
</tbody>
</table>

4.3.2.13 Shift Count Register Input CP13

<table>
<thead>
<tr>
<th>PAMORL</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>46</td>
<td>45</td>
</tr>
<tr>
<td>0 0</td>
<td>5-0 = POSITIVE ONE</td>
</tr>
<tr>
<td>0 1</td>
<td>5-0 = PAAOUT(29-24)</td>
</tr>
<tr>
<td>1 0</td>
<td>5-0 = PANRMD(5-0) See Section 4.4.4.1</td>
</tr>
<tr>
<td>1 1</td>
<td>5-0 = PAXDIN(5-0)</td>
</tr>
</tbody>
</table>

4.3.2.14 Alignment Network/Shifter, Shift Distance Selector CP14

<table>
<thead>
<tr>
<th>PAMORL</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>32</td>
<td></td>
</tr>
<tr>
<td>0</td>
<td>Shift amount = PASHCR(5-0)</td>
</tr>
<tr>
<td>1</td>
<td>Shift amount = PA2SCO(5-0) (Two's complement of PASCHR)</td>
</tr>
</tbody>
</table>

4.3.2.15 Element Activity Input CP15

PAMORL(37-33) are used to select the correct input to the EA FF.

Table 4.4 lists the 32 possible input selections and the instruction during which each is used. The function is the equation which is evaluated and used to set or reset the flip-flop. "EA STATE" is used to indicate that the EA FF has to be on in order for the instruction to execute.
<table>
<thead>
<tr>
<th>DECODE</th>
<th>37 36 35 34 33</th>
<th>USED ON</th>
<th>FUNCTION</th>
<th>EA STATE</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0 0 0 0 0</td>
<td>-</td>
<td>NONE</td>
<td>D.C.</td>
</tr>
<tr>
<td>1</td>
<td>0 0 0 0 1</td>
<td>SNV</td>
<td>IF PAOVFF, EA← 0</td>
<td>1</td>
</tr>
<tr>
<td>2</td>
<td>0 0 0 1 0</td>
<td>SEG</td>
<td>IF PAAOUT(31-0) ≠ EA← 0</td>
<td>1</td>
</tr>
<tr>
<td>3</td>
<td>0 0 0 1 1</td>
<td>SNC</td>
<td>IF PAAOUT(31-0) = 0, EA← 0</td>
<td>1</td>
</tr>
<tr>
<td></td>
<td></td>
<td>SNZL</td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>PSEL</td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>SELB</td>
<td></td>
<td></td>
</tr>
<tr>
<td>4</td>
<td>0 0 1 0 0</td>
<td>SGZ</td>
<td>IF PAAOUT23 OR IF PAAOUT(23-0) = zero, EA← 0</td>
<td>1</td>
</tr>
<tr>
<td>5</td>
<td>0 0 1 0 1</td>
<td>SGE</td>
<td>IF PAAOUT23, EA← 0</td>
<td>1</td>
</tr>
<tr>
<td>6</td>
<td>0 0 1 1 0</td>
<td>SNZ</td>
<td>IF PAAOUT(23-0) = zero, EA← 0</td>
<td>1</td>
</tr>
<tr>
<td>7</td>
<td>0 0 1 1 1</td>
<td>SZR</td>
<td>IF PAAOUT(23-0) ≠ zero, EA← 0</td>
<td>1</td>
</tr>
<tr>
<td>8</td>
<td>0 1 0 0 0</td>
<td>SLE</td>
<td>IF PAAOUT23 AND IF PAAOUT(23-0) ≠ zero, EA← 0</td>
<td>1</td>
</tr>
<tr>
<td>9</td>
<td>0 1 0 0 1</td>
<td>SLZ</td>
<td>IF PAAOUT23, EA← 0</td>
<td>1</td>
</tr>
<tr>
<td>10-11</td>
<td>-</td>
<td>-</td>
<td>NONE</td>
<td></td>
</tr>
<tr>
<td>12</td>
<td>0 1 1 0 0</td>
<td>SF</td>
<td>IF PASFFT, EA← 0</td>
<td>1</td>
</tr>
<tr>
<td>13</td>
<td>0 1 1 0 1</td>
<td>SOV</td>
<td>IF PAOVFF, EA← 0</td>
<td>1</td>
</tr>
<tr>
<td>14-15</td>
<td>-</td>
<td>-</td>
<td>NONE</td>
<td></td>
</tr>
<tr>
<td>16</td>
<td>1 0 0 0 0</td>
<td>LTAG</td>
<td>EA← PAXDIN10</td>
<td>D.C.</td>
</tr>
<tr>
<td>17</td>
<td>1 0 0 0 1</td>
<td>ACT</td>
<td>IF PAAOUT(23-0) = zero, EA← 1</td>
<td>D.C.</td>
</tr>
<tr>
<td>18</td>
<td>1 0 0 1 0</td>
<td>CACT</td>
<td>EA← (PAAOUT(23-0) = zero)</td>
<td>D.C.</td>
</tr>
<tr>
<td>19</td>
<td>-</td>
<td>-</td>
<td>NONE</td>
<td></td>
</tr>
<tr>
<td>20</td>
<td>1 0 1 0 0</td>
<td>COPY</td>
<td>EA← PASTAKOO</td>
<td>D.C.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>POP</td>
<td>1</td>
<td></td>
</tr>
<tr>
<td>21</td>
<td>1 0 1 0 1</td>
<td>ORS</td>
<td>IF PASTAKOO, EA← 1</td>
<td>D.C.</td>
</tr>
<tr>
<td>22</td>
<td>1 0 1 1 0</td>
<td>ANS</td>
<td>IF PASTAKOO, EA← 0</td>
<td>D.C.</td>
</tr>
<tr>
<td>23</td>
<td>1 0 1 1 1</td>
<td>CA</td>
<td>IF PASTAKOO AND IF PAAECT, EA← 1. Also IF PAAECT, EA← 0</td>
<td>D.C.</td>
</tr>
<tr>
<td>24-31</td>
<td>-</td>
<td>-</td>
<td>NONE</td>
<td></td>
</tr>
</tbody>
</table>

D.C. = Don't Care

**TABLE 4.4, ELEMENT ACTIVITY CONTROL**
4.3.2.16 Activity Stack Input CP16

<table>
<thead>
<tr>
<th>PAMORL</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>44-43</td>
<td></td>
</tr>
<tr>
<td>0-0</td>
<td>20-0 = PAXDIN(31-11)</td>
</tr>
<tr>
<td>0-1</td>
<td>20-0 = PASTAK(20-1), 0</td>
</tr>
<tr>
<td>1-0</td>
<td>20-0 = PASTAK(19-0), PAEACT</td>
</tr>
<tr>
<td>1-1</td>
<td>PASTAK(20-1) : NO CHANGE</td>
</tr>
<tr>
<td></td>
<td>PASTAK(0) ←0 IF PAEACT</td>
</tr>
<tr>
<td></td>
<td>ELSE NO CHANGE</td>
</tr>
</tbody>
</table>

Bit 0 is the top of stack.

4.3.2.17 Data Input Selector CP17

<table>
<thead>
<tr>
<th>PAMORL 51</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>31-0 = PAMORL(31-0) i.e. global</td>
</tr>
<tr>
<td>1</td>
<td>31-0 = PXEMOT(31-0) i.e. element memory</td>
</tr>
</tbody>
</table>

4.3.2.18 Tag Register Input CP18

<table>
<thead>
<tr>
<th>PAMORL</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>39-38</td>
<td></td>
</tr>
<tr>
<td>0-0</td>
<td>7-0 = PATGLG(7-0) See Note</td>
</tr>
<tr>
<td>0-1</td>
<td>7-0 = PATGLG(7-0)</td>
</tr>
<tr>
<td>1-0</td>
<td>7-0 = PATGLG(7-0)</td>
</tr>
<tr>
<td>1-1</td>
<td>7-0 = PAXDIN(7-0)</td>
</tr>
</tbody>
</table>

Note: \( \text{PATGLGn} = \text{PAXDIN} \left( n \text{ plus } 8 \right) \cdot \text{PAXDINn} + \left( \text{PAXDIN} \left( n \text{ plus } 8 \right) \cdot \text{PATAGRn} + \text{PAXDINn} \cdot \text{PATAGRn} \right) \) for \( n = 0, 7 \)

\( \text{PATGLG} \) is the Tag Register input logic shown in Figure 4.4.
4.3.2.19 A Register Bit 23 Input CP19

PAMORL

<table>
<thead>
<tr>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>15 14</td>
</tr>
<tr>
<td>0 0</td>
</tr>
<tr>
<td>IF PAMORL(28-24)≠5, 23=PAAOUT(22)</td>
</tr>
<tr>
<td>0 1</td>
</tr>
<tr>
<td>IF PAMORL(28-24)≠5, 23=PAALNO(23)</td>
</tr>
<tr>
<td>1 0</td>
</tr>
<tr>
<td>IF PAMORL(28-24)≠5, 23=PAAOUT(23)</td>
</tr>
<tr>
<td>1 1</td>
</tr>
<tr>
<td>IF PAMORL(28-24)≠5, 23=PABREG(23)</td>
</tr>
</tbody>
</table>

Note: IF PAMORL(28-24)=5, 23=PACOFF23

4.3.2.20 DV/SQ Result Selector CP20

See equation for bit 0 in Section 4.3.2.6

4.3.2.21 B Register Bits 23-22 Input CP21

Control is generated locally, see Section 4.5.13

PAMOVB

<table>
<thead>
<tr>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>01 00</td>
</tr>
<tr>
<td>0 0</td>
</tr>
<tr>
<td>Bits (23-22) of CP2</td>
</tr>
<tr>
<td>0 1</td>
</tr>
<tr>
<td>Adder Carry out signal for both bits</td>
</tr>
<tr>
<td>1 0</td>
</tr>
<tr>
<td>PACOFF23, bit 22 of CP2</td>
</tr>
<tr>
<td>1 1</td>
</tr>
<tr>
<td>don't care</td>
</tr>
</tbody>
</table>

4.3.2.22 SQ Result Selector CP22

Control is generated locally, see Section 4.5.14

PASQRT

<table>
<thead>
<tr>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>01 00</td>
</tr>
<tr>
<td>0 0</td>
</tr>
<tr>
<td>PAALNO(2-0)</td>
</tr>
<tr>
<td>0 1</td>
</tr>
<tr>
<td>don't care</td>
</tr>
<tr>
<td>1 0</td>
</tr>
<tr>
<td>&quot;101&quot;</td>
</tr>
<tr>
<td>1 1</td>
</tr>
<tr>
<td>&quot;011&quot;</td>
</tr>
</tbody>
</table>
4.3.2.23 Q Extension Bit Input CP23

Control is generated locally, see Section 4.5.15.

<table>
<thead>
<tr>
<th>PAQSEL</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>&quot;0&quot;</td>
</tr>
<tr>
<td>1</td>
<td>PAQREG(1)</td>
</tr>
</tbody>
</table>

4.3.2.24 Extension Adder B Input CP24

Control is generated locally, see Section 4.5.16.

<table>
<thead>
<tr>
<th>PAAXB</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>PAAREG(0)</td>
</tr>
<tr>
<td>1</td>
<td>&quot;0&quot;</td>
</tr>
</tbody>
</table>

4.3.2.25 Double Precision Carry Input CP25

Control is generated locally, see Section 4.5.17.

<table>
<thead>
<tr>
<th>PADPIN</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>PAXDIN(8)</td>
</tr>
<tr>
<td>1</td>
<td>Carry out of Adder bit 22</td>
</tr>
</tbody>
</table>
4.3.3 Functional Units

4.3.3.1 Arithmetic Logic Unit

The ALU performs 32 bit logical operations and 24 bit and 8 bit arithmetic operations. The lower 24 bits, the mantissa adder PAADMN, performs all eight functions shown in the following table. The upper eight bits, or exponent adder PAADEX, perform all functions except decrement.

<table>
<thead>
<tr>
<th>PAMORL</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>00</td>
<td>exclusive OR</td>
</tr>
<tr>
<td>01</td>
<td>logical AND-NOT</td>
</tr>
<tr>
<td>02</td>
<td>logical OR</td>
</tr>
<tr>
<td>03</td>
<td>logical AND</td>
</tr>
<tr>
<td>01</td>
<td>decrement</td>
</tr>
<tr>
<td>01</td>
<td>*</td>
</tr>
<tr>
<td>00</td>
<td>*</td>
</tr>
<tr>
<td>00</td>
<td>add</td>
</tr>
<tr>
<td>00</td>
<td>*</td>
</tr>
<tr>
<td>00</td>
<td>*</td>
</tr>
<tr>
<td>00</td>
<td>*</td>
</tr>
<tr>
<td>00</td>
<td>*</td>
</tr>
<tr>
<td>00</td>
<td>*</td>
</tr>
<tr>
<td>00</td>
<td>subtract</td>
</tr>
<tr>
<td>00</td>
<td>*</td>
</tr>
<tr>
<td>00</td>
<td>*</td>
</tr>
<tr>
<td>00</td>
<td>increment</td>
</tr>
</tbody>
</table>

Notes: * = undefined

4.3.3.2 Alignment Network/Shifter PAALNO

The alignment network can perform four different shifts of up to 32 places. The shift type is selected by:

<table>
<thead>
<tr>
<th>PAMORL</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>29</td>
<td>Arithmetic Shift</td>
</tr>
<tr>
<td>0</td>
<td>Logical Shift</td>
</tr>
<tr>
<td>1</td>
<td></td>
</tr>
</tbody>
</table>
Shift amount is controlled by the output of control point 14. The most significant bit (the sign) of CP14 output also acts as a control line.

<table>
<thead>
<tr>
<th>PAMORL</th>
<th>PASCRR(5)</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>(Output of CP14)</td>
<td></td>
</tr>
<tr>
<td>29</td>
<td>0</td>
<td>24 bit arithmetic right shift sign extended</td>
</tr>
<tr>
<td>0</td>
<td>1</td>
<td>24 bit arithmetic left shift zero fill to bit 0</td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>32 bit logical right shift end around</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>32 bit logical left shift zero fill to bit 0</td>
</tr>
</tbody>
</table>
4.4 Micro-Level Implementation

4.4.1 Selectors/ Registers

The AU data selectors or multiplexers are constructed out of standard ECL 10K MSI packages. They are:

10164  8 Input Multiplexer
10173  Quad 2 Input Multiplexer
10174  Dual 4 Input Multiplexer

The Registers are also standard MSI packages. They are:

10135  Dual JK Flip-Flop
10141  4 bit Shift Register
10176  Hex D Flip-Flop

4.4.2 Element Activity Flip-Flop

The EA FF is JK with separate selectors for the J and K inputs. Section 4.3.2.15 shows the functions performed by the selectors.

4.4.3 Arithmetic Logic Unit

4.4.3.1 Logical Unit

The ALU directly performs AND, OR, XOR, and ANDNOT operations. The equations are:

\[
\text{AND:} \quad \text{PAOUT} \leftarrow \text{PAADA} \cdot \text{PAADB}
\]

\[
\text{OR:} \quad \text{PAOUT} \leftarrow \text{PAADA} + \text{PAADB}
\]

\[
\text{XOR:} \quad \text{PAOUT} \leftarrow \text{PAADA} \cdot \overline{\text{PAADB}} + \overline{\text{PAADA}} \cdot \text{PAADB}
\]

\[
\text{ANDNOT:} \quad \text{PAOUT} \leftarrow \text{PAADA} \cdot \overline{\text{PAADB}}
\]

where: PAADA - A input, both exponent and mantissa

PAADB - B input, both exponent and mantissa
4.4.3.2 Arithmetic Unit

The ALU performs two basic arithmetic functions: (using the 10181 chip)

1 - PAAOUT ← PAADMA plus PAADMB plus PACIN

2 - PAAOUT ← PAADMA plus PAADMB plus PACIN

where PACIN = Carry in to bit 0

The four required functions are generated as follows:

ADD - Implicit operand to PAADMA

Explicit operand to PAADMB

"0" to PACIN

perform function 1

SUBTRACT - Implicit operand to PAADMA

Explicit operand to PAADMB

"1" to PACIN

perform function 2

This takes the subtrahend and converts it to its two's complement form and performs an addition.

INCREMENT - Implicit operand to PAADMA

Explicit operand to PAADMB

"1" to PACIN

perform function 1

DECREMENT - Implicit operand to PAADMA

Explicit operand to PAADMB

"0" to PACIN

perform function 2
4.4.3.3 Carry Look Ahead

The basic ALU circuit in the AU performs a 24 bit add. The input to each bit position are two 1 bit operands and 1 bit carry-in. The result at each bit position is a 1 bit sum and a 1 bit carry out. It follows the truth table.

<table>
<thead>
<tr>
<th>A</th>
<th>B</th>
<th>Carry-In</th>
<th>Sum</th>
<th>Carry Out</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>0</td>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
</tr>
<tr>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>0</td>
</tr>
<tr>
<td>0</td>
<td>1</td>
<td>1</td>
<td>0</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
<td>1</td>
</tr>
</tbody>
</table>

Take, as an example two 4 bit numbers:

\[
\begin{align*}
\text{0110} & \quad + \quad \text{1011} \quad \text{C}_{\text{IN}} = 0 \\
\hline
\text{1010} & \quad \text{Carry-Out}
\end{align*}
\]

The ALU takes them one bit position at a time with the \(C_{\text{IN}}\) to that position and generates a sum and \(C_{\text{IN}}\) for the next bit position:

\[
\begin{align*}
&\quad 0 \quad \text{existing} \\
&\quad \downarrow \quad \downarrow \\
&1 \quad 1 \quad \text{carry} \\
&\quad \downarrow \quad \downarrow \\
&1 \quad 1 \quad 0 \\
&\quad \downarrow \quad \downarrow \\
&1 \quad 1 \quad 0 \\
&\quad \downarrow \quad \downarrow \\
&1 \quad 1 \quad 0 \\
&\quad \downarrow \quad \downarrow \\
&1 \quad 1 \quad 0
\end{align*}
\]
This effectively takes four operations which must be done sequentially before the final sum and carry-out are known. This is the common ripple carry adder. When implemented in logic, the delay time required is that to let each bit add and to let the carry move down each bit position until it is available at the end of the carry chain.

As an improvement to this adder, consider that much can be inferred about the carry out of a particular stage if the bit patterns of the input to that stage are known. Consider:

\[
\begin{array}{c}
+ 0 \\
0 \\
\hline
0 \\
\end{array}
\]

Carry-out must be "0"
no matter what carry-in is.

\[
\begin{array}{c}
+ 1 \\
1 \\
\hline
1 \\
\end{array}
\]

Carry-out must be "1"
no matter what carry-in is.

\[
\begin{array}{c}
+ 0 \\
1 \\
\hline
1 \\
\end{array}
\]

Carry-out is the same as carry-in.

So, adding two bits together requires a carry-in. Look at the previous bit position inputs and from above, the carry-in can or cannot be determined. If it is not determined, look at the next previous bit position and continue until carry-in is known. This could be carry-in to the very first bit position - \( C_{IN} \). Since all of the required inputs are available at the start of the add, the logic can proceed immediately not having to wait for any ripple.

Define:

\[
\text{Generate: } G_i = A_i \cdot B_i
\]

\[
\text{Transmit: } T_i = A_i \oplus B_i \text{ (Exclusive OR)}
\]

where \( A_i \) and \( B_i \) are the \( i \)th bits of the two operands.
Generate indicates that carry-out of this pair is a 1 no matter what carry-in is (Carry is "generated").

Transmit indicates that carry-out of this pair is the same as the carry-out of the last pair (Carry is "transmitted").

So we have:

\[
\text{CARRY-IN}_i = G_{i-1} + T_{i-1} (G_{i-2} + T_{i-2} (G_{i-3} + T_{i-3} \ldots \\
\quad \ldots (C_0 + T_0 (C_{IN}) \ldots ))
\]

From this equation we can get any Carry-in by only looking at the 2 operands and the carry-in bit 0. This is called a carry look ahead adder.

The truth table for carry into stage i-from stage i-1 in a four bit ALU is:

<table>
<thead>
<tr>
<th>(T_{i-1})</th>
<th>(G_{i-1})</th>
<th>(T_{i-2})</th>
<th>(G_{i-2})</th>
<th>(T_{i-3})</th>
<th>(G_{i-3})</th>
<th>(C_{INo})</th>
<th>(C_{INi})</th>
</tr>
</thead>
<tbody>
<tr>
<td>(\emptyset)</td>
<td>1</td>
<td>(\emptyset)</td>
<td>(\emptyset)</td>
<td>(\emptyset)</td>
<td>(\emptyset)</td>
<td>(\emptyset)</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>(\emptyset)</td>
<td>(\emptyset)</td>
<td>(\emptyset)</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
<td>1</td>
<td>(\emptyset)</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>1</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>0</td>
<td>0</td>
<td>(\emptyset)</td>
<td>(\emptyset)</td>
<td>(\emptyset)</td>
<td>(\emptyset)</td>
<td>(\emptyset)</td>
<td>0</td>
</tr>
</tbody>
</table>

\(\emptyset\) = don't care

Actual implementation of this equation can vary drastically. In the AU a 24-bit look ahead adder takes less than 30 ns and uses only 8 ECL chips (6 10181s and 2 10179s).

For simplicity, transmit is often reduced to a simple OR circuit instead of an XOR. It is then referred to as:

\[
\text{Propagate: } P_i = A_i + B_i
\]

The \(A_i = B_i = 1\) case is a don't care, as generate is a higher priority term.
4.4.3.4  Overflow Detectors

4.4.3.4.1 Mantissa Overflow

To detect the overflow conditions described in Section 3.2.1, this circuit is used:

\[
\begin{array}{c|c}
\text{Add} = 1, \text{Subtract} = 0 & \text{PAMOF (Mantissa Overflow)} \\
\hline
1 1 1 1 & 0 \\
1 1 1 0 & 1 \\
1 1 0 1 & 0 \\
1 1 0 0 & 0 \\
1 0 1 1 & 0 \\
1 0 1 0 & 0 \\
1 0 0 1 & 1 \\
1 0 0 0 & 0 \\
0 1 1 1 & 0 \\
0 1 1 0 & 0 \\
0 1 0 1 & 0 \\
0 1 0 0 & 1 \\
0 0 1 1 & 1 \\
0 0 1 0 & 0 \\
0 0 0 1 & 0 \\
0 0 0 0 & 0 \\
\end{array}
\]
4.4.3.4.2 Exponent Overflow

During FP instructions, the four different types of overflow are detected separately from the exponent adder.

- Add underflow - addition of two negative numbers with a positive result
  \[ \text{PAEXUF} = \text{PAADEA31} \cdot \text{PAADEB31} \cdot \text{PAAOUT31} \]

- Add overflow - addition of two positive numbers with a negative result
  \[ \text{PAEXOF} = \overline{\text{PAADEA31}} \cdot \overline{\text{PAADEB31}} \cdot \text{PAAOUT31} \]

- Subtract underflow - subtraction of a positive number from a negative number with a positive result
  \[ \text{PASBUF} = \text{PAADEA31} \cdot \overline{\text{PAADEB31}} \cdot \overline{\text{PAAOUT31}} \]

- Subtract overflow - subtraction of a negative number from a positive number with a negative result
  \[ \text{PASROF} = \overline{\text{PAADEA31}} \cdot \text{PAADEB31} \cdot \overline{\text{PAAOUT31}} \]

4.4.3.4.3 Overflow Flip-Flop

The overflow FF is set as a result of the above overflow conditions. See Section 4.5.8.

4.4.3.5 Adder Output Detector

4.4.3.5.1 Equal to Zero

All zeros are detected with a tree of negative input AND gates (10109):

\[
\begin{align*}
\text{Output of Adder} \\
(24 \text{ bits})
\end{align*}
\]
4.4.3.5.2 Equal to Minus One

Minus One in two's complement notation is a word of all "1". It is detected by first inverting the adder output and then passing that through a circuit similar to the zeros detector.

4.4.3.5.3 Less Than Zero

Less than zero is detected by examining the sign bit. Negative two's complement numbers always have a "1" sign bit.

4.4.3.5.4 Greater Than Zero

Greater than zero is detected by ANDing "not equal to zero" and "not less than zero".

4.4.4 Alignment Network/Shifter

4.4.4.1 Normalize Decode Network (PANRMD)

A circuit designed to produce a count of the number of leading ones or zeros for the normalize function is attached to the output of CP11. The circuit compares the sign bit with each bit in the fraction starting at bit 22. The first mis-match found causes the count circuit to produce a five bit shift count.

The comparison is done with 10107 exclusive OR gates. The count is generated by selecting the first mis-match with 10165 Priority Encoders and converting that output to a count.

Figure 4.5 shows the circuit.
4.4.4.2 Two's Complement of Shift Count (PA2SCO)

The alignment network/shifter performs left shifts when presented with a negative shift count. The PANRMD produces a positive count, but for normalization, a left shift is required. This circuit, which is merely a 10181 ALU, subtracts the Shift Count Register from zero to produce its negative value.

4.4.4.3 The Shifter

The shifter is designed as a multi-level network capable of any shift in one clock cycle.

4.4.4.3.1 Stage One

Stage one generates a 63 bit shift operand. Input is the result of CP11 and CP12, PAALNI(31-0). Output is PAALNI and PAALNA(31-0). Control for this stage is:

<table>
<thead>
<tr>
<th>PAMORL</th>
<th>PASCR(5)</th>
<th>FUNCTION</th>
</tr>
</thead>
<tbody>
<tr>
<td>29</td>
<td>(Output of CP14)</td>
<td></td>
</tr>
</tbody>
</table>

<p>| | | |</p>
<table>
<thead>
<tr>
<th></th>
<th></th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>PAALNI23 duplicated</td>
</tr>
<tr>
<td>0</td>
<td>1</td>
<td>31 times (for sign extension</td>
</tr>
<tr>
<td>1</td>
<td>0</td>
<td>of arithmetic right shift)</td>
</tr>
<tr>
<td>1</td>
<td>1</td>
<td>PAALNI(31-0)</td>
</tr>
</tbody>
</table>

4.4.4.3.2 Stage Two

Stage two receives the 63 bit shift operand and performs right shifts and left shifts with zero fill. Figure 4.6 shows stages one and two. Control for stage two are bits PASCR(5-3) which are output from CP14.
Figure 4.5 Stages One and Two of Alignment Network
### EFFECTIVE SHIFT (DECIMAL)

<table>
<thead>
<tr>
<th>PASCR</th>
<th>POSITIVE COUNT</th>
<th>NEGATIVE COUNT</th>
</tr>
</thead>
<tbody>
<tr>
<td>5 4 3</td>
<td></td>
<td></td>
</tr>
<tr>
<td>1 0 0</td>
<td></td>
<td>32 left ZF</td>
</tr>
<tr>
<td>1 0 1</td>
<td></td>
<td>24 left ZF</td>
</tr>
<tr>
<td>1 1 0</td>
<td></td>
<td>16 left ZF</td>
</tr>
<tr>
<td>1 1 1</td>
<td></td>
<td>8 left ZF</td>
</tr>
<tr>
<td>0 0 0</td>
<td>0 right</td>
<td></td>
</tr>
<tr>
<td>0 0 1</td>
<td>8 right</td>
<td></td>
</tr>
<tr>
<td>0 1 0</td>
<td>16 right</td>
<td></td>
</tr>
<tr>
<td>0 1 1</td>
<td>24 right</td>
<td></td>
</tr>
</tbody>
</table>

ZF = Zero Fill

#### 4.4.4.3.3 Stage Three

Stage three receives the 39 bit operand from stage two. Figure 4.7 shows stages three and four. Control for this stage are bits PASCR(2-1) which are output from CP14.

### EFFECTIVE SHIFT

<table>
<thead>
<tr>
<th>PASCR</th>
<th>POSITIVE COUNT</th>
<th>NEGATIVE COUNT</th>
</tr>
</thead>
<tbody>
<tr>
<td>2 1</td>
<td></td>
<td></td>
</tr>
<tr>
<td>0 0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>0 1</td>
<td>2 right</td>
<td>2 right</td>
</tr>
<tr>
<td>1 0</td>
<td>4 right</td>
<td>4 right</td>
</tr>
<tr>
<td>1 1</td>
<td>6 right</td>
<td>6 right</td>
</tr>
</tbody>
</table>

#### 4.4.4.3.3 Stage Four

Stage four receives the 33 bit operand from stage three. Figure 4.7 shows this stage. Control is bit PASCR(0) from CP14.

### EFFECTIVE SHIFT

<table>
<thead>
<tr>
<th>PASCR</th>
<th>POSITIVE COUNT</th>
<th>NEGATIVE COUNT</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td></td>
<td></td>
</tr>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>1 right</td>
<td>1 right</td>
</tr>
</tbody>
</table>

Output from stage four is PAALNO(31-0) which is the final shifted output.
Figure 4.7 Stages Three and Four of Alignment Network
4.4.4.4 Shifter Operation

The shift is the logical sum of the shifts performed by stages two, three and four. For right shifts, stage two shifts right an amount equal to the multiple of eight nearest but not less than the shift amount. Stages three and four then complete the shift.

Examples:

<table>
<thead>
<tr>
<th>Shift Amount</th>
<th>Stage 2</th>
<th>Stage 3</th>
<th>Stage 4</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>+8</td>
<td>+3</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>+12</td>
<td>+3</td>
<td>+4</td>
<td>0</td>
</tr>
<tr>
<td>+29</td>
<td>+24</td>
<td>+4</td>
<td>+1</td>
</tr>
<tr>
<td>-4</td>
<td>-8</td>
<td>+4</td>
<td>0</td>
</tr>
<tr>
<td>-16</td>
<td>-16</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td>-25</td>
<td>-32</td>
<td>+6</td>
<td>+1</td>
</tr>
</tbody>
</table>

Where "+" is a right shift and "-" is a left shift.
4.5 Local Control

Global control signals PAMORL(28-24) enable special logic in the AU. This is required because the data in each PE can cause slightly different execution of some instructions. Table 4.5 lists the 32 possible decodings of these lines, the instructions affected by each and the function carried out.

Following is a description of the local control modification produced by the PAMORL(28-24) lines.

4.5.1 ALU Control

\[
\begin{align*}
&\text{IF PAMORL(28-24)=06} & \text{PAQREG01, select subtract} \\
&\text{IF PAMORL(28-24)=06} & \text{PAQREG01, select add} \\
&\text{IF PAMORL(28-24)=18} & \text{PAAREG23, select add} \\
&\text{IF PAMORL(28-24)=18} & \text{PAAREG23, select subtract} \\
\end{align*}
\]

\[
\begin{align*}
C_{\text{in}} &= \text{PAMORLO3 \cdot PAMORL(28-24) \neq (06+18+26) +} \\
&\text{PAMORL(28-24)=06} & \text{PAOXT} + \\
&\text{PAMORL(28-24)=18} & \text{PAAREG23} + \\
&\text{PAMORL(28-24)=26} & \text{PADPC} \\
\text{IF PAMORL(28-24)=6 or 18, PAMORL(3-0) must be 0.} \\
\text{IF PAMORL(28-24)=26} & \text{PAMORL(03) must be 0} \\
\end{align*}
\]

4.5.2 Input A Mantissa Adder CF9

\[
\begin{align*}
&\text{PAMORLO4 = PAMORLO4 + PAMORL(28-24)=02 \cdot PAZROA} \\
&\text{PAMORLO5 = PAMORLO5 + PAMORL(28-24)=02 \cdot PAZROA} \\
&\text{PAMORLO6 = PAMORLO6 + PAMORL(28-24)=02} \\
\text{IF PAMORL(28-24)=2, PAMORL(6-4) must be 0.} \\
\end{align*}
\]

4.5.3 Input B Mantissa Adder CP10

\[
\begin{align*}
&\text{PAMORLO7 = PAMORLO7 + PAMORL(28-24)=02 \cdot PAZROB +} \\
&\text{PAMORL(28-24) = 06 (PAQREG00=PAQREG01 +} \\
&\text{PAQREG00 \neq PAQEXT)} \\
&\text{PAMORLO8 = PAMORLO8 + PAMORL(28-24)=02} + \\
&\text{PAMORL(28-24)=06 (PAQREG01 \cdot PAQREG00 \cdot} \\
&\text{PAQEXT+PAQREG01 \cdot PAQREG00 \cdot PAQEXT)} \\
\text{IF PAMORL(28-24)=2 or 6, PAMORL(8-7) must be 0.} \\
\end{align*}
\]
<table>
<thead>
<tr>
<th>DECODE</th>
<th>PAMORL</th>
<th>USED ON</th>
<th>FUNCTION</th>
</tr>
</thead>
<tbody>
<tr>
<td>00</td>
<td>0 0 0 0 0 0</td>
<td>- -</td>
<td>Rest State</td>
</tr>
<tr>
<td>01</td>
<td>0 0 0 0 0 1</td>
<td>ADD 7, SB 7, ML 18, ADE 2</td>
<td>A Register mantissa and exponent select as a function of PAEXUF. Also IF PAEXOF, PAOVFF ← 1</td>
</tr>
<tr>
<td>02</td>
<td>0 0 0 1 0</td>
<td>ADD 4, SB 4</td>
<td>Adder mantissa A input select as a function of PAZROA Adder mantissa B input select as a function of PAZROB</td>
</tr>
<tr>
<td>03</td>
<td>0 0 0 0 1 1</td>
<td>FIX 1</td>
<td>Clock enable, A mantissa as a function of PAAREG31. Also IF PAAOUT31 ⋅ PAAREG31, PAOVFF ← 1</td>
</tr>
<tr>
<td>04</td>
<td>0 0 1 0 0</td>
<td>ADI 2, SBI 2, LINA 2, LDEA 2</td>
<td>IF PAMOF, PAOVFF ← 1</td>
</tr>
<tr>
<td>05</td>
<td>0 0 1 0 1</td>
<td>ADD 5, SB 5, ML 16, DV 7</td>
<td>A register bit 23 select as a function of PACOFF23. Also IF PATOVF ⋅ PAEXGF, PAOVFF ← 1</td>
</tr>
<tr>
<td>06</td>
<td>0 0 1 1 0</td>
<td>ML 3-14, MLI 3-14</td>
<td>Add/subtract control and adder mantissa control. Also IF DECODE=1,2,5 or 6, PAADXB ← PAAREG(CO) IF DECODE=0,3,4 or 7, PAADXB ← zero. Also IF PAMOF, PABREG(23,22) ← PACOUT(23,23) IF PAMOF, PABREG(23,22) ← PAAOUT(23,23). Also enable PAQEXT ← PAQREG(01) ML 14, MLI 14</td>
</tr>
<tr>
<td>07</td>
<td>0 0 1 1 1</td>
<td>MLI 15</td>
<td>Clock enable, A mantissa as a function of PAOVFF</td>
</tr>
<tr>
<td>08</td>
<td>0 1 0 0 0</td>
<td>DV 32</td>
<td>A register, mantissa input select as a function of PAAOUT23. Also Q register bit 0 input as a function of PAAOUT23.</td>
</tr>
</tbody>
</table>

**TABLE 4.5, PAMORL(28-24) DECODE TABLE**

99
<table>
<thead>
<tr>
<th>DECODE</th>
<th>PAMORL</th>
<th>USED ON</th>
<th>FUNCTION</th>
</tr>
</thead>
<tbody>
<tr>
<td>09</td>
<td>01001</td>
<td>DV 33</td>
<td>Clock enable, A mantissa and clock enable, Q mantissa as a function of PASIGN</td>
</tr>
<tr>
<td>10</td>
<td>01010</td>
<td>DV 2</td>
<td>IF PAAOUT(23-0) = zero, PAOVFF ← 1</td>
</tr>
<tr>
<td>11</td>
<td>01011</td>
<td>DV 3</td>
<td>Enable PASIGN ← PAAOUT23</td>
</tr>
<tr>
<td>12</td>
<td>01100</td>
<td>DV 4</td>
<td>Clock enable, A mantissa as a function of PABREG23</td>
</tr>
<tr>
<td>13</td>
<td>01101</td>
<td>DV 5</td>
<td>Clock enable, B mantissa and exponent. Also IF PATOVF · PAEXOF, PAOVFF ← 1</td>
</tr>
<tr>
<td>14</td>
<td>01110</td>
<td>DV 8</td>
<td>Clock enable, A mantissa and exponent as a function of PAAOUT23. Also IF PAAOUT23 · PASBOF, PAOVFF ← 1</td>
</tr>
<tr>
<td>15</td>
<td>01111</td>
<td>DV 9-31</td>
<td>A register input select as a function of PAAOUT23. Also Q register, mantissa bit 0 input as a function of PAAOUT23</td>
</tr>
<tr>
<td>16</td>
<td>10000</td>
<td>SQ 1</td>
<td>IF PAREG23, PAOVFF ← 1</td>
</tr>
<tr>
<td>17</td>
<td>10001</td>
<td>SQ 2</td>
<td>Clock enable, A mantissa and exponent as a function of PAREG24. Also IF PAREG24 · PAEXOF, PAOVFF ← 1</td>
</tr>
<tr>
<td>18</td>
<td>10010</td>
<td>SQ 5-28</td>
<td>Adder control as follows: IF PAREG23 select SUBTRACT IF PAREG23 select ADD. Also B register input select as a function of PAREG23</td>
</tr>
<tr>
<td>19</td>
<td>10011</td>
<td>ADD 8</td>
<td>Clock enable, A exponent as a function of PAAOUT(23-0) ≠ zero</td>
</tr>
<tr>
<td></td>
<td></td>
<td>SB 8</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>ML 19</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>DV 38</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>SQ 32</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>FLOT 3</td>
<td></td>
</tr>
<tr>
<td>20</td>
<td>10100</td>
<td>DV 36</td>
<td>Clock enable, A mantissa and A mantissa input select as a function of PASBUT. Also IF PASBOF, PAOVFF ← 1</td>
</tr>
<tr>
<td></td>
<td></td>
<td>SBE 2</td>
<td></td>
</tr>
</tbody>
</table>

**TABLE 4.5, CONTINUED**
<table>
<thead>
<tr>
<th>pamlor decode</th>
<th>decode</th>
<th>used on instr.</th>
<th>used on step</th>
<th>function</th>
</tr>
</thead>
<tbody>
<tr>
<td>21</td>
<td>10101</td>
<td>ml</td>
<td>15</td>
<td>clock enable and a mantissa input select as a function of paexuf. also if paexof, paovff ← 1</td>
</tr>
<tr>
<td>22</td>
<td>10110</td>
<td>add</td>
<td>3</td>
<td>alignment network control, alignment network input select, clock enable a exponent and mantissa and clock enable b mantissa</td>
</tr>
<tr>
<td>23</td>
<td>10111</td>
<td>stag</td>
<td>1</td>
<td>enable paxdot(31-0) ← paargout(31-0) also padowsl ← 1</td>
</tr>
<tr>
<td>24</td>
<td>11000</td>
<td>sl</td>
<td>1</td>
<td>b register, exponent input select as a function of paareg23. also paalf ← paargout23</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td>pabeta ← paargout31</td>
</tr>
<tr>
<td>25</td>
<td>11001</td>
<td>ladi</td>
<td>2</td>
<td>if paoutcome22, pads ← 1 otherwise pads ← 0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>lsbi</td>
<td>2</td>
<td></td>
</tr>
<tr>
<td>26</td>
<td>11010</td>
<td>uadi</td>
<td>2</td>
<td>adder c in ← pads. also if pamo, paovff ← 1</td>
</tr>
<tr>
<td></td>
<td></td>
<td>usbi</td>
<td>2</td>
<td></td>
</tr>
<tr>
<td>27</td>
<td>11011</td>
<td>sh</td>
<td>1</td>
<td>b register, exponent input select as a function of paareg23. also paalf ← paargout23</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td>pabeta ← paargout31</td>
</tr>
<tr>
<td>28</td>
<td>11100</td>
<td>add</td>
<td>2</td>
<td>if paargout(31-24) &gt; 23 or if pasbof, paizbof ← 1. also if paargout(31-24) &gt; 23 and if pasbof, paizbof ← 0. also if paargout(31-24) &lt; -23 or if pasbuf, paizroa ← 1. also if paargout(31-24) &lt; -23 and if pasbuf, paizroa ← 0. also paovff ← 0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>sb</td>
<td>2</td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td>rov</td>
<td>1</td>
<td></td>
</tr>
<tr>
<td>29</td>
<td>11101</td>
<td>sh</td>
<td>3-35</td>
<td>if (pabita • pabitb + pabita • pabitc • pabity + pabita • pabitb • pabity) then pafact ← 0. also paalf ← pabreg31. also pabeta ← pabreg30.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>sl</td>
<td>(odd)</td>
<td></td>
</tr>
<tr>
<td>30</td>
<td>11110</td>
<td>sff</td>
<td>1</td>
<td>pafalt ← 1</td>
</tr>
<tr>
<td>31</td>
<td>11111</td>
<td>cf</td>
<td>1</td>
<td>pafalt ← 0</td>
</tr>
</tbody>
</table>

Table 4.5, continued
4.5.4 A Register Mantissa Input CP4

\[
PAMORL14 = \text{PAMORL}14 + \text{PAMORL}(28-24)=08 \cdot \text{PAAOUT}23^+ \\
PAMORL(28-24)=15 \cdot \text{PAAOUT}23^+ \\
PAMORL(28-24)=01 \cdot \text{PAEXUF} \\
PAMORL15 = \text{PAMORL}15 + \text{PAMORL}(28-24)=08 \cdot \text{PAAOUT}23^+ \\
PAMORL(28-24)=01 \cdot \text{PAEXUF}
\]

IF PAMORL(28-24)=1, 8, or 15, PAMORL(15-14) must be 0.

4.5.5 A Register Exponent Input CP3

\[
PAMORL18 = \text{PAMORL}18 + \text{PAMORL}(28-24)=01 \cdot \text{PAEXUF}^+ \\
PAMORL(28-24)=20 \cdot \text{PASBUF}^+ \\
PAMORL(28-24)=21 \cdot \text{FAEXUF}
\]

IF PAMORL(28-24)=1, 20, or 21, PAMORL(18-16) must be 0.

4.5.6 Strobe A Register Mantissa

Enable strobe if \( X \) where

\[
X = \text{PAMORL}(28-24)=03 \cdot \text{PAAREG}31^+ \\
\text{PAMORL}(28-24)=05 \cdot \text{PAOVF}^+ \\
\text{PAMORL}(28-24)=07 \cdot \text{PAOVFF}^+ \\
\text{PAMORL}(28-24)=09 \cdot \text{PASIGN}^+ \\
\text{PAMORL}(28-24)=12 \cdot \text{PAAREG23}^+ \\
\text{PAMORL}(28-24)=14 \cdot \text{PAAOUT}23^+ \\
\text{PAMORL}(28-24)=17 \cdot \text{PAAREG24}^+ \\
\text{PAMORL}(28-24)=20 \cdot \text{PASBUF}^+ \\
\text{PAMORL}(28-24)=21 \cdot \text{PAEXUF}^+ \\
\text{PAMORL}33 \cdot (\text{PAAREG23}+\text{PAAREG23})^+ \\
\text{PAMORL}(28-24)=22 \cdot (\text{PASHCRO5} \cdot \text{PAZROA} \cdot \text{PAZROB})
\]

IF PAMORL(28-24) equals any of the above values or if PAMORL33, PAMORL19 must be \( \geq 1 \)

4.5.6 Strobe A Register Exponent

Enable strobe of \( X \) where

\[
X = \text{PAMORL}(28-24)=05 \cdot \text{PAOVF}^+ \\
\text{PAMORL}(28-24)=14 \cdot \text{PAAOUT}23^+ \\
\text{PAMORL}(28-24)=17 \cdot \text{PAAREG24}^+ \\
\text{PAMORL}(28-24)=19 \cdot \text{FAAOUT}(23-0)\#\text{ZERO}^+ \\
\text{PAMORL}(28-24)=22 \cdot \text{PASBOF} + (\text{PASBUF} \cdot \text{PAAOUT}31)
\]

IF PAMORL(28-24) equals any of the above values, PAMORL20 must be \( \geq 1 \).
4.5.7 Alignment Network/Shifter CP11 and CP12

\[
PAMORL30 = PAMORL30 + PAMORL(28-24) = 22 \cdot PASCHRO5
\]

IF PAMORL(28-24) = 22, PAMORL(31-30) must be 0.

4.5.8 Alignment Network/Shifter, Shift Distance Selector CP14

\[
PAMORL32 = PAMORL32+PAMORL(28-24)=22 \cdot PASHCRO5
\]

IF PAMORL(28-24)=22, PAMORL32 must be 0.

4.5.9 Strobe Overflow FF

\[
\begin{align*}
PAOVFF &\quad 0 \text{ IF STROBE} \cdot \text{PAMORL}(28-24) = 28 \\
PAOVFF &\quad 1 \text{ IF STROBE} \cdot \text{PAMORL}(28-24) =
\end{align*}
\]

01 • PAEXOF +
03 • PAAOUT31 +
04 • PAMOF +
05 • PATOVF • PAEXOF +
06 • PAMLOF +
10 • PAAOUT(23-0) = zero +
13 • PATOVF • PAEXOF +
14 • PASBOF • PAAOUT23 +
16 • PAAREG23 +
17 • PAEXOF • PAAREG24 +
20 • PASBOF +
21 • PAEXOF +
26 • PAMOF

IF PAMORL(28-26) equals any of the above values, PAMORL47 must be set to effect the change in PAOVFF

4.5.10 Strobe Q Mantissa

Enable Strobe if X

where \(X = (\text{PAMORL}(38-24) = 09) \cdot \overline{PASIGN}\)

IF PAMORL(28-24) = 9, PAMORL36 MUST BE 1.

4.5.11 Strobe B Exponent

Enable strobe if X

where \(X = (\text{PAMORL}(28-24) \neq 13) + \text{PATOVF}\)
4.5.12 Strobe B Mantissa

Enable strobe if X

where \( X = PAMORL(28-24) = 12 \times \overline{PA2BREG23} + \\
\quad PAMORL(28-24) = 13 \times \overline{PATOVF} + \\
\quad PAMORL(28-24) = 22 \times (PASHCRO5 + PAXROA + PAZROB) \)

IF PAMORL(28-24) = 12, 13 or 22, PAMORL42 and 43 must be 1

4.5.13 B Register Bits 23-22 Input CP21

Control lines locally generated.

PAMOVBO = (PAMORL(28-24) = 6) \times \overline{PAMOF} \\
PAMOVBO1 = (PAMORL(28-24) = 13)

4.5.14 SQ Result Selector CP22

Control lines locally generated.

PASQRT2 = (PAMORL(28-24) = 18) \\
PASQRT20 = (PAMORL(28-24) = 18) \times \overline{PAAREG(23)}

4.5.15 Q Extension Bit Input CP23

Control is locally generated.

PAQSEL = (PAMORL(28-24) = 6)

4.5.16 Extension Adder B Input CP24

Control is locally generated.

PAAXB = PAQXT (Q Extension FF)

4.5.17 DPC Input CP25

Control is locally generated.

PADPIN = (PAMORL(28-24) = 25)

4.5.18 Strobe Tag Register

Enable strobe if X

where \( X = PAMORL(39-38) = 1 \times \overline{PAEACT} + \\
\quad PAMORL(39-38) = 2 \times \overline{PAEACT} + \)
### 4.5.19 Control Line Select Signals

#### 4.5.19.1 Control signal, data time shared with control

<table>
<thead>
<tr>
<th>PAMORL 48</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>Control lines PAMORL(31-0) contain data</td>
</tr>
<tr>
<td>0</td>
<td>Control lines PAMORL(31-0) contain control</td>
</tr>
</tbody>
</table>

#### 4.5.19.2 Control signal, select control group

<table>
<thead>
<tr>
<th>PAMORL 49</th>
<th>Function</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>Control lines PAMORL(44-32) contain group 2 control</td>
</tr>
<tr>
<td>1</td>
<td>Control lines PAMORL(44-32) contain group 1 control</td>
</tr>
</tbody>
</table>
4.6 Double Integer Hardware

4.6.1 Double Precision Carry FF

Used to hold the carry or borrow between double integer instructions. Input is from a circuit which detects the carry from bit position 22 of the ALU. See Section 4.3.2.25.

4.6.2 Carry-out Bit 22

Since the ALU is constructed of 4 bit wide 10181 ALU chips, the carry between bits 22 and 23 is completely within the chip. The following circuit was designed to reconstruct that line outside of the chip.

\[
\begin{array}{c|c}
\text{ADD } = 1, \text{ SUBTRACT } = 0 & \text{PACOUT 22} \\
\hline
1 & 1 \\
1 & 1 \\
1 & 0 \\
1 & 0 \\
1 & 0 \\
0 & 1 \\
0 & 1 \\
0 & 0 \\
0 & 0 \\
0 & 0 \\
\end{array}
\]

ADD=1

PAADMA23

PAADMB23

PAAOUT23

PACOUT 22
4.7 Floating Point Hardware

4.7.1 PAZROA and PAZROB

These FF are used during the FP ADD and SB instruction. As explained in Section 3.4.2, when an addition of two numbers where one is more than \(2^{23}\) greater or less than the other causes a shift of more than 23 places, special handling is required.

In micro-step two, after the exponents are subtracted, the result is compared to 23. If the result is greater than 23, PAZROB is set. If the result is less than -23, PAZROA is set. Then in micro-step four, the appropriate input to the adder is zeroed out.

4.7.1.1 Greater Than 23 Detector

The circuit for the > 23 detector is shown in Figure 4.8.

4.7.1.2 Less Than -23 Detector

The circuit for the < -23 detector is shown in Figure 4.9.

4.7.2 Extension Hardware

The multiply algorithm described in Section 3.4.3 requires a 25 bit adder. The additional bit is implemented by extending the B and Q Register one bit each and extending the adder one position. The extension is on the least significant bit end.

4.7.3 Sign FF PASIGN

This FF is used during the DV instruction to save the sign of the result. As described in Section 3.4.4, the DV algorithm only divides two positive numbers. If the hardware must take the two's complement of one of the operands, it sets the sign FF. Then following the divide, if the sign FF is set, the two's complement of the quotient is taken.
<table>
<thead>
<tr>
<th>PAAOUT</th>
<th>DECIMAL</th>
<th>FUNCTION</th>
</tr>
</thead>
<tbody>
<tr>
<td>1 0 0' 0 0 0 0 0</td>
<td>-128</td>
<td>0</td>
</tr>
<tr>
<td>1 1 1 1 1 1 1 1</td>
<td>-1</td>
<td>0</td>
</tr>
<tr>
<td>0 0 0 0 0 0 0 0 0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>0</td>
</tr>
<tr>
<td>0 0 0 1 0 1 1 0 0</td>
<td>22</td>
<td>0</td>
</tr>
<tr>
<td>0 0 0 1 0 1 1 1</td>
<td>23</td>
<td>0</td>
</tr>
<tr>
<td>0 0 0 1 1 0 0 0 0</td>
<td>24</td>
<td>1</td>
</tr>
<tr>
<td>0 0 0 1 1 0 0 1</td>
<td>25</td>
<td>1</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1</td>
</tr>
<tr>
<td>0 0 1 0 0 0 0 0 0</td>
<td>32</td>
<td>1</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1</td>
</tr>
<tr>
<td>0 1 0 0 0 0 0 0 0</td>
<td>64</td>
<td>1</td>
</tr>
<tr>
<td></td>
<td></td>
<td>1</td>
</tr>
<tr>
<td>0 1 1 1 1 1 1 1</td>
<td>127</td>
<td>1</td>
</tr>
</tbody>
</table>

Function = 31 \cdot (30 + 29 + (28 \cdot 27))

Tests \geq 24
Tests \geq 32
Tests \geq 0

Figure 4.8 Adder Output Greater Than 23 Detector
<table>
<thead>
<tr>
<th>PLACOUT</th>
<th>DECIMAL</th>
<th>FUNCTION</th>
</tr>
</thead>
<tbody>
<tr>
<td>0 1 1 1 1 1 1 1 1</td>
<td>127</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>0 0 0 0 0 0 0 0 0</td>
<td>0</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>1 1 1 1 1 1 1 1 1</td>
<td>-1</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>1 1 1 0 1 1 0 0 0</td>
<td>-20</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>1 1 1 0 1 0 1 1 1</td>
<td>-21</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>1 1 1 0 1 0 1 0 0</td>
<td>-22</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>1 1 1 0 1 0 0 1 1</td>
<td>-23</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>1 1 1 0 1 0 0 0 0</td>
<td>-24</td>
<td>1</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>1 1 1 0 0 1 1 1 1</td>
<td>-25</td>
<td>1</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>1 1 1 0 0 1 1 0 0</td>
<td>-26</td>
<td>1</td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>1 0 0 0 0 0 0 0 0</td>
<td>-128</td>
<td>1</td>
</tr>
</tbody>
</table>

Function = \[31 \cdot (\overline{30} + \overline{29} + (\overline{28} \cdot \overline{27}) + (\overline{26} \cdot 27 \cdot 26 \cdot 25 \cdot \overline{24}))\]

Figure 4.9 Adder Output Less Than -23 Detector
Input to the FF is bit 23 of the ALU output (the sign bit). The FF is clocked when local control senses PAMORL(28-24) = 11.

4.7.4 Temporary Overflow Hold FF PATOVF

This FF is used to temporarily store an overflow indication during ML, ADD, SB and DV instructions. When an overflow is detected during an FP instruction, the AU attempts to correct it by right shifting the fraction and incrementing the exponent. Only if the exponent then overflows, is the overflow made permanent and PAOVFF is set.

Input to the FF is the mantissa overflow detector PAMOF (Section 4.4.3.4.1).

4.7.5 Carry Out of Bit 23 FF PACOFF23

This FF holds the carry-out of the ALU during ML, ADD, SB and DV instructions. As described in the previous section, when the fraction overflows, a correction is attempted. This FF becomes the new bit 23 of the right shifted fraction.

Input is the carry out of the last ALU chip only if there is a mantissa overflow (PAMOF).
4.8 Instruction Implementation

Each of the following sections is centered around the micro-step sequence of some of the more interesting AU instructions of each of the six types.

Each sequence contains the following information:

- Opcode - 8 bits, in octal
- Instruction Name
- Mnemonic
- Description of the operation
- Number of micro-steps in execution
- Micro-step Sequence

In the sequence, this symbology is used:

$\leftarrow$ indicates that the register or data lines on the left receive the register or data lines on the right.

"Select" indicates the operation being performed by the ALU.

"iff" If and only if.

; Qualifier. The operation on the left is not performed unless the equation on the right is true.

, Concatenate. The bits on the left are combined with the bits on the right to produce a single value.

(XX-YY) Bits XX through YY of the named register or data path.

4.8.1 Activity Instructions

Instructions chosen as examples are SNZL, SNG, ANS, CAS, and PSEL.
4.8.1.1 SNZL

The A Register is compared to zero. If it is, the EA FF is reset. Figure 4.10.

4.8.1.2 SNG

An operand is formed in the B Register during micro-step one. During step 2, the A and B are exclusive "OR"ed by the ALU. The result is checked for zero and if it is, the EA FF is reset. Figure 4.11.

4.8.1.3 ANS

The top of the stack (PASTAK(0)) is "AND"ed with the EA FF and the result placed in the EA FF. The stack is then popped. Figure 4.12.

4.8.1.4 CAS

The top of the stack is set to "0". Figure 4.13.

4.8.1.5 PSEL

The EA is pushed into the activity stack. A 32 bit mask of zeros with a single "1" is generated in the PICU and sent to the AU B Register. A word is fetched from Element Memory and "AND"ed with the mask by the ALU. The output of the ALU is checked for all zeros, if so, the EA FF is reset. Figure 4.14.

4.8.2 Integer Instructions

Instructions chosen as examples are ADI, LSBI and USBI.

4.8.2.1 ADI

During the first micro-step the operand is formed in the B Register. In the second step, the A and B Registers are added together with the result placed in the A Register. Overflow is checked and the Overflow FF is set if necessary. Figure 4.15 shows the micro-step sequence.
<table>
<thead>
<tr>
<th>OPCODE</th>
<th>INSTRUCTION</th>
<th>MNEMONIC</th>
</tr>
</thead>
<tbody>
<tr>
<td>166</td>
<td>Select on Non-Zero Logical</td>
<td>SNZL</td>
</tr>
</tbody>
</table>

**DESCRIPTION**

Set EA to zero in those active AUs in which A-register (31-0) equal zero

Execution: 1 step

**MICRO_STEP_SEQUENCE**

1) \( \text{PAADEA}(31-24) \leftarrow \text{PAAREG}(31-24) \)

\( \text{PAADEB}(31-24) \leftarrow 0 \)

\( \text{PAADMA}(23-0) \leftarrow \text{PAAREG}(23-0) \)

\( \text{PAADMb}(23-0) \leftarrow 0 \)

Select OR

\( \text{PAEACT} \leftarrow 0; \text{iff PAAOUT}(31-0)=0 \)

**FIGURE 4.10, SNZL**
<table>
<thead>
<tr>
<th>OPCODE</th>
<th>INSTRUCTION</th>
<th>MNEMONIC</th>
</tr>
</thead>
<tbody>
<tr>
<td>167</td>
<td>Select on Not Equal Global</td>
<td>SNG</td>
</tr>
</tbody>
</table>

**DESCRIPTION**

Set EA to zero in those active AUs in which A-Register (31-0) are equal to Operand (31-0)

**Execution:** 2 steps

**MICRO STEP SEQUENCE**

1) If \( R=1 \):

\[
\begin{align*}
&\text{PADMDE} \leftarrow 0 \\
&PADAIN(10-0) \leftarrow \text{PAMOBR}(10-0) \\
&\text{Transmit PADREQ to EMC} \\
&PAXDIN(31-0) \leftarrow \text{PXENOT}(31-0); \text{when PADSEL}=1
\end{align*}
\]

If \( R=0 \):

\[
\begin{align*}
&PAMOBR(31-0) \leftarrow \text{PAMOBR}(31-0) \\
&PAXDIN(31-0) \leftarrow \text{PAMORL}(31-0) \\
&PABREG(31-0) \leftarrow \text{PAXDIN}(31-0); \\
\end{align*}
\]

2) \( \text{PAADEA}(31-24) \leftarrow \text{PAAREG}(31-24) \)

\( \text{PADEB}(31-24) \leftarrow \text{PABREG}(31-24) \)

\( \text{PAADMB}(23-0) \leftarrow \text{PABREG}(23-0) \)

\( \text{PAADMA}(23-0) \leftarrow \text{PAAREG}(23-0) \)

Select EXCLUSIVE OR

\[
\text{PAEACT} \leftarrow 0; \text{iff PAOUT}(31-0)=0
\]

**FIGURE 4.11, SNG**

114
OPCODE | INSTRUCTION | MNEMONIC
--- | --- | ---
252 | AND of STACK | ANS

**DESCRIPTION**

In all ensemble AUs:

\[
\text{PAEACT} \leftarrow \text{PAEACT} \cdot \text{AND} \cdot \text{PASTAK}(O)
\]

\[
\text{PASTAK}(I) \leftarrow \text{PASTAK}(I + 1) \; ; \; I = 0, \ldots, 19
\]

\[
\text{PASTAK}(20) \leftarrow 0
\]

Execution: 1 step

**MICRO STEP SEQUENCE**

1) \[
\text{PAEACT} \leftarrow (\text{PAEACT}) \cdot (\text{PASTAK}(O))
\]

\[
\text{PASTAK}(20-0) \leftarrow (0, \text{PASTAK}(20-1))
\]

**FIGURE 4.12, ANS**
<table>
<thead>
<tr>
<th>_OPCODE</th>
<th>INSTRUCTION</th>
<th>MNEMONIC</th>
</tr>
</thead>
<tbody>
<tr>
<td>253</td>
<td>Clear Active Stack</td>
<td>CAS</td>
</tr>
</tbody>
</table>

**DESCRIPTION**

In all active AUs

PASTAK(0) ← 0

Execution: 1 step

**MICRO_STEP_SEQUENCE**

1) PASTAK(20-0) ← PASTAK(20-1), 0; iff PAEACT = 1

**FIGURE 4.13, CAS**
<table>
<thead>
<tr>
<th>OPCODE</th>
<th>INSTRUCTION</th>
<th>MNEMONIC</th>
</tr>
</thead>
<tbody>
<tr>
<td>013</td>
<td>Push and Select on Bit</td>
<td>PSEL</td>
</tr>
</tbody>
</table>

**DESCRIPTION**

In all ensemble AUs, EA is pushed into the activity stack as follows:

\[
\text{PASTAK}(20-1) \leftarrow \text{PASTAK}(19-0)
\]

\[
\text{PASTAK}(0) \leftarrow \text{PAEACT}
\]

If \(\text{PASTAK}(19)=1\), prior to "Push", the overflow error is generated to the ICL.

After PUSH is complete, select on bit (where \(\text{PAEACT} = 1\)) as follows:

Reset \(\text{PAEACT}\) in AUs where the bit specified by Operand 
(15-11) in memory word specified by Operand (10-0) is zero.

Execution: 5 steps

**MICRO STEP SEQUENCE**

1) \[\text{PASTAK}(20-1) \leftarrow \text{PASTAK}(19-0)\]

\[\text{PASTAK}(0) \leftarrow \text{PAEACT}\]

If \(\text{PASTAK}(19)=1\); generate overflow error to ICL

2) \[\text{PAMORL}(31-0) \leftarrow \text{Mask Generate Logic (31-0)}\]

\[\text{PAXDIN}(31-0) \leftarrow \text{PAMORL}(31-0)\]

\[\text{PABREG}(31-0) \leftarrow \text{PAXDIN}(31-0)\]

3) \[\text{PAAREG}(31-0) \leftarrow \text{PABREG}(31-0)\]

4) \[\text{PADAIN}(10-0) \leftarrow \text{PAMOBR}(10-0)\]

\[\text{PADMDE} \leftarrow 0\]

Transmit \(\text{PADREQ}\) to EMC

\[\text{PAXDIN}(31-0) \leftarrow \text{PXEMOT}(31-0)\); when \(\text{PADSEL} = 1\)

\[\text{PABREG}(31-0) \leftarrow \text{PAXDIN}(31-0)\]

5) \[\text{PAADEA}(31-24) \leftarrow \text{PAAREG}(31-24)\]

\[\text{PADEB}(31-24) \leftarrow \text{PABREG}(31-24)\]

\[\text{PAADMA}(23-0) \leftarrow \text{PAAREG}(23-0)\]

\[\text{PAADM}(23-0) \leftarrow \text{PABREG}(23-0)\]

Select AND

\[\text{PAEACT} \leftarrow 0\]; iff \((\text{PAEACT} = 1) \cdot (\text{PAAOUT}(31-0) = 0)\)

\[\text{PAEACT} \leftarrow \text{PAEACT} ; \text{iff} \ (\text{PAEACT} = 0) + (\text{PAAOUT}(31-0) \neq 0)\]

**FIGURE 4.14, PSEL**

117
220  |  Add Integer  |  ADI

**DESCRIPTION**

In all active AUs:

\[ \text{PAAREG(23-0)} \leftarrow \text{PAAREG(23-0)} + \text{OPERAND(23-0)} \]
\[ \text{PAAREG(31-24)} \leftarrow 0 \]

Execution: 2 steps

**MICRO_STEP_SEQUENCE**

1) If R=1:

\[ \text{PADMDE} \leftarrow 0 \]
\[ \text{PADAIN(10-0)} \leftarrow \text{PAMOBR(10-0)} \]

Transmit PADREQ to EMC

\[ \text{PAXDIN(31-0)} \leftarrow \text{EMOT(31-0)}; \text{ when PADSEL = 1} \]

If R=0:

\[ \text{PAMORL(31-0)} \leftarrow \text{PAMOBR(31-0)} \]
\[ \text{PAXDIN(31-0)} \leftarrow \text{PAMORL(31-0)} \]

\[ \text{PABREG(31-0)} \leftarrow \text{PAXDIN(31-0)}; \text{ iff PAEACT=1} \]

2) \[ \text{PAADMA(23-0)} \leftarrow \text{PAAREG(23-0)} \]
\[ \text{PAADMB(23-0)} \leftarrow \text{PABREG(23-0)} \]

Select ADD

\[ \text{PAAREG(23-0)} \leftarrow \text{PAAOUT(23-0)}; \text{ iff PAEACT=1} \]
\[ \text{PAOVFF} \leftarrow 1; \text{ iff (PAEACT=1) \times (overflow)} \]
\[ \text{PAAREG(31-24)} \leftarrow 0; \text{ iff PAEACT=1} \]

**FIGURE 4.15, ADI**
4.8.2.2 LSBI and USBI

During the first micro-step of both instructions the operand is formed. The second micro-step of the LSBI subtracts the two operands and saves the borrow in the DPC FF. The second step of the USBI subtracts its two operands using the DPC FF as a borrow into the low order bit of the ALU. Overflow is checked and the Overflow FF is set if necessary. Figures 4.16 and 4.17 show the micro-step sequences.

4.8.3 Logical/Data Transfer Instructions

Instructions chosen as examples are ORA and TAQ.

4.8.3.1 ORA

During the first micro-step, the explicit operand is loaded into the B Register. Then, the A and B Registers are "OR"ed in the ALU and returned to the A Register. Figure 4.18 shows the micro-step sequence.

4.8.3.2 TAQ

In this instruction, the A Register is routed to the input of the Q Register which is then clocked. Figure 4.19 shows the micro-step sequence.

4.8.4 Floating Point Instructions

Instructions chosen as examples are FIX, FLOT, ADD, ML, DV and SQ. These instructions were described in detail in Section 3.4. The micro-steps in this section follow those algorithms.

4.8.4.1 FIX/FLOT

In the FIX instruction, the exponent is compared to 23. If it is greater than 23, the overflow flip-flop is set. The A Register is then shifted until the exponent is zero and placed in the B Register. The B Register is then shifted right to its original position. The A and B Register are compared and the difference is placed in B. The A Register is then shifted again until the exponent is zero. If the difference in
<table>
<thead>
<tr>
<th>OPCODE</th>
<th>INSTRUCTION</th>
<th>MNEMONIC</th>
</tr>
</thead>
<tbody>
<tr>
<td>226</td>
<td>Lower Subtract Integer</td>
<td>LSBI</td>
</tr>
</tbody>
</table>

**DESCRIPTION**

In all active AUs;

\[ \text{PAAREG}(23-0) \leftarrow \text{PAAREG}(23-0) - \text{OPERAND}(23-0) \]

\[ \text{PAAREG}(31-24) \leftarrow 0 \]

\[ \text{PAXDPC} \leftarrow 1; \text{iff borrow out} \]

Execution: 2 steps

**MICRO_STEP_SEQUENCE**

1) If \( R=1 \):

\[ \text{PADMDE} \leftarrow 0 \]

\[ \text{PADAIN}(10-0) \leftarrow \text{PAMOBR}(10-0) \]

Transmit \( \text{PADREQ} \) to EMC

\[ \text{PAXDIN}(31-0) \leftarrow \text{PXEMOT}(31-0); \text{when PADSEL}=1 \]

If \( R=0 \):

\[ \text{PAMORL}(31-0) \leftarrow \text{PAMOBR}(31-0) \]

\[ \text{PAXDIN}(31-0) \leftarrow \text{PAMORL}(31-0) \]

\[ \text{PABREG}(31-0) \leftarrow \text{PAXDIN}(31-0) \]

2) \[ \text{PAADMA}(23-0) \leftarrow \text{PAAREG}(23-0) \]

\[ \text{PAADMB}(23-0) \leftarrow \text{PAAREG}(23-0) \]

Select SUBTRACT

\[ \text{PAAREG}(23-0) \leftarrow \text{PAAOUT}(23-0); \text{iff PEAECT}=1 \]

\[ \text{PAXDPC} \leftarrow 1 \text{ iff } (\text{borrow out}) \cdot (\text{PEAECT}=1) \]

\[ \text{PAAREG}(31-24) \leftarrow 0; \text{iff PEAECT}=1 \]

**FIGURE 4.16, LSBI**

120
<table>
<thead>
<tr>
<th>OPCODE</th>
<th>INSTRUCTION</th>
<th>MNEMONIC</th>
</tr>
</thead>
<tbody>
<tr>
<td>216</td>
<td>Upper Subtract Integer</td>
<td>USBI</td>
</tr>
</tbody>
</table>

**DESCRIPTION**

In all active AUs;

PAAREG(23-0) ← PAAREG(23-0) - OFERAND(23-0) - PAXDPC

PAAREG(31-24) ← 0

Execution: 2 steps

**MICRO STEP SEQUENCE**

1) If R=3:

   PADMDE ← 0
   PADAIN(10-0) ← PAMOBR(10-0)
   Transmit PADREQ to EMC
   PAXDIN(31-0) ← PXENOT(31-0); when PADSEL=1

   If R≠3:

   PAMORL(31-0) ← PAMOBR(31-0)
   PAXDIN(31-0) ← PAMORL(31-0)
   PABREG(31-0) ← PAXDIN(31-0).

2) PAADMA(23-0) ← PAAREG(23-0)
   PAADMB(23-0) ← PABREG(23-0)
   Select SUBTRACT, carry-in = PAXDPC
   PAAREG(23-0) ← PAAOUT(23-0); iff PAEACT=1
   PAOVFF ← 1 if overflow and PAEACT=1
   PAAREG(31-24) ← 0; iff PAEACT=1

**FIGURE 4.17, USBI**
OPCODE | INSTRUCTION | MNEMONIC
--- | --- | ---
244 | Logical OR | ORA

DESCRIPTION
In all active AUs;
PAAREG(31-0) ← PAAREG(31-0).OR.OPERAND(31-0)

Execution: 2 steps

MICRO STEP SEQUENCE

1) If R=1:

\[
\begin{align*}
& \text{PADE} \leftarrow 0 \\
& \text{PAADIN}(10-0) \leftarrow \text{PAMOB}(10-0) \\
& \text{Transmit PADREQ to EMC} \\
& \text{PAXDIN}(31-0) \leftarrow \text{PXEMOT}(31-0); \text{when PADSEL}=1 \\
\end{align*}
\]

If R=0:

\[
\begin{align*}
& \text{PAMORL}(31-0) \leftarrow \text{PAMOB}(31-0) \\
& \text{PAXDIN}(31-0) \leftarrow \text{PAMORL}(31-0) \\
& \text{PABREG}(31-0) \leftarrow \text{PAXDIN}(31-0) \\
\end{align*}
\]

2) \text{PAADEA}(31-24) ← PAAREG(31-24)

\[
\begin{align*}
& \text{PAADEB}(31-24) \leftarrow \text{PABREG}(31-24) \\
& \text{PAADMA}(23-0) \leftarrow \text{PAAREG}(23-0) \\
& \text{PAADMB}(23-0) \leftarrow \text{PABREG}(23-0) \\
& \text{Select OR} \\
& \text{PAAREG}(31-0) \leftarrow \text{PAAOOUT}(31-0); \text{iff PAEACT}=1 \\
\end{align*}
\]

FIGURE 4.18, ORA
<table>
<thead>
<tr>
<th>OPCODE</th>
<th>INSTRUCTION</th>
<th>MNEMONIC</th>
</tr>
</thead>
<tbody>
<tr>
<td>043</td>
<td>Transfer A to Q</td>
<td>TAQ</td>
</tr>
</tbody>
</table>

**DESCRIPTION**

AU A-Register 31-0 is moved to AU Q-Register 31-0 in those AUs which are active.

Execution: 1 step

**MICRO STEP SEQUENCE**

1) \( \text{PAADEA}(31-24) \leftarrow \text{PAAREG}(31-24) \)
\( \text{PAADEB}(31-24) \leftarrow 0 \)
\( \text{PAADMA}(23-0) \leftarrow \text{PAAREG}(23-0) \)
\( \text{PAADMB}(23-0) \leftarrow 0 \)
Select OR
\( \text{PAQREG}(31-0) \leftarrow \text{PAAOUT}(31-0); \text{iff PAEACT} = 1 \)

**FIGURE 4.19, TAQ**

123
B is less than zero and the A Register is negative, the A Register is incremented by one to become the final integer value.

In the FLOT instruction, the number of leading ones/zeros is counted and placed in the Shift Count Register. This value is subtracted from 23 and the result becomes the exponent of the FP number. The fraction is then shifted right by the amount determined in step one to become the final fraction.

Figure 4.20 shows the micro-step sequence for FIX and 4.21 shows it for FLOT.

4.8.4.2 FP ADD/SB

The floating point add and subtract instructions require a binary point alignment step before the fractions of the two operands can be added or subtracted. During this alignment step, the exponents of the operands are differenced and the fraction with the smallest exponent is enabled, via local control flip-flops, into the alignment network. See Figure 4.22.

After the binary points are aligned and the fractions added (subtracted), the result is normalized and the exponent updated. If the result of the add (subtract) operation is zero, -128 is gated into the exponent of the result.

The floating point add and subtract instructions are identical except for micro-step 4 as shown in the flow chart (Figure 4.23).
<table>
<thead>
<tr>
<th>OPCODE</th>
<th>INSTRUCTION</th>
<th>MNEMONIC</th>
</tr>
</thead>
<tbody>
<tr>
<td>205</td>
<td>Fix</td>
<td>FIX</td>
</tr>
</tbody>
</table>

**DESCRIPTION**

In all active AU's, the floating point number contained in the A-Register (31-0) is converted to integer format in A-Register (23-0) with PAREG(31-24) set to zero. It is assumed that the floating point mantissa has no overflow and is normalized. Truncation is performed as follows:

\[ \pm m + \frac{x}{y} \rightarrow \pm m. \]

Execution: 5 steps

**MICRO_STEP_SEQUENCE**

1) \[ \text{PADEA}(31-24) \leftarrow 23_{10} \]
   \[ \text{PADEB}(31-24) \leftarrow \text{PAREG}(31-24) \]
   \[ \text{PAADMA}(23-0) \leftarrow 0 \]
   \[ \text{PAADMB}(23-0) \leftarrow 0 \]
   Select SUBTRACT
   \[ \text{PASHCR}(5-0) \leftarrow \text{PAOUT}(29-24); \text{iff PAAECT}=1 \]
   \[ \text{PAOVFF} \leftarrow 1; \text{iff (PAAECT}=1) \cdot (\text{PAOUT}(31-24)<0) \]
   \[ \text{PAAREG}(23-0) \leftarrow \text{PAOUT}(23-0); \text{iff (PAAECT}=1) \cdot (\text{PAAREG}(31-24)<0) \]

2) \[ \text{PAALNI}(23-0) \leftarrow \text{PAAREG}(23-0) \]
   \[ \text{PASCN}(5-0) \leftarrow \text{PASHCR}(5-0) \]
   \[ \text{PAAREG}(31-24) \leftarrow 0; \text{iff PAAECT}=1 \]
   \[ \text{PABREG}(23-0) \leftarrow \text{PAALNO}(23-0) \]

**FIGURE 4.20, FIX**
3) \( PAALNI(23-0) \leftarrow PABREG(23-0) \)
\( PASCR(5-0) \leftarrow PA2SCO(5-0) \)
\( PABREG(23-0) \leftarrow PAALNO(23-0) \)

4) \( PAADMA(23-0) \leftarrow PABREG(23-0) \)
\( PAADMB(23-0) \leftarrow PAAREG(23-0) \)
Select SUBTRACT
\( PABREG(23-0) \leftarrow PAAOUT(23-0) \)
\( PASCR(5-0) \leftarrow PASHCR(5-0) \)
\( PAALNI(23-0) \leftarrow PAAREG(23-0) \)
\( PAAREG(23-0) \leftarrow PAALNO(23-0); \text{iff } PAEACT=1 \)

5) \( PAADMA(23-0) \leftarrow PAAREG(23-0) \)
\( PAADMB(23-0) \leftarrow 0 \)
Select INCREMENT
\( PAAREG(23-0) \leftarrow PAAOUT(23-0); \text{iff} \)
\( (PAEACT=1) \cdot (PABREG(23-0) < 0) \cdot (PAAREG(23-0) < 0) \)
**OPCODE**  | **INSTRUCTION** | **MNEMONIC**
--- | --- | ---
206 | Float | FLOT

**DESCRIPTION**

In all active AUs, the integer number contained in the A-Register (31-0) is converted to normalized floating point format in A-Register (31-0).

Execution: 3 steps

**MICRO STEPSEQUENCE**

1) \( \text{PAALNI}(23-0) \leftarrow \text{PAAREG}(23-0) \)
\( \text{PASHCR}(5-0) \leftarrow \text{PANRMD}(5-0); \text{iff PAEACT} = 1 \)

2) \( \text{PASCR}(5-0) \leftarrow \text{PA2SCO}(5-0) \)
\( \text{PAALNI}(23-0) \leftarrow \text{PAAREG}(23-0) \)
\( \text{PAAREG}(23-0) \leftarrow \text{PAALNO}(23-0); \text{iff PAEACT}=1 \)
\( \text{PAADEA}(31-24) \leftarrow +2310 \)
\( \text{PAADEB}(31-24) \leftarrow \text{PA2SCO}(5,5,5-0) \)
Select ADD
\( \text{PAAREG}(31-24) \leftarrow \text{PAAOUT}(31-24); \text{iff PAEACT}=1 \)

3) \( \text{PAADMA}(23-0) \leftarrow \text{PAAREG}(23-0) \)
\( \text{PAADMB}(23-0) \leftarrow 0 \)
\( \text{PAADMB}(23-0) \leftarrow 0 \)
Select ADD
Iff \( \text{PAAOUT}(23-0) = 0 \):
\( \text{PAAREG}(31-24) \leftarrow -128_{10}; \text{iff PAEACT}=1 \)

**FIGURE 4.21, FLOT**
<table>
<thead>
<tr>
<th>Opcode</th>
<th>Instruction</th>
<th>Mnemonic</th>
</tr>
</thead>
<tbody>
<tr>
<td>200</td>
<td>Add Floating Point</td>
<td>ADD</td>
</tr>
</tbody>
</table>

**Description**

In all active AUs, A-Register (31-0) is added to operand (31-0) and the sum placed in A-Register (31-0).

Execution: 8 steps

**Micro Step Sequence**

1) If R=1:

   \[
   \begin{align*}
   & PADMDE \leftarrow 0 \\
   & PADM(10-0) \leftarrow PAMOB(10-0) \\
   & \text{Transmit PARDREQ to EMC} \\
   & PAXDIN(31-0) \leftarrow PXEMOT(31-0) \text{; when PAXSEL = 1} \\
   \end{align*}
   
   If R=0:

   \[
   \begin{align*}
   & \text{PAMORL(31-0) } \leftarrow \text{PAMOB(31-0)} \\
   & \text{PAXDIN(31-0) } \leftarrow \text{PAMORL(31-0)} \\
   & \text{PABREG(31-0) } \leftarrow \text{PAXDIN(31-0)} \\
   \end{align*}
   
2) \text{PADEA}(31-24) \leftarrow \text{PAAREG}(31-24) \\
   \text{PADEB}(31-24) \leftarrow \text{PABREG}(31-24)

Select Subtract

\[
\begin{align*}
& \text{PASCHR(5-0) } \leftarrow \text{PAAOOUT(29-24)} \text{; iff PAEACT=1} \\
& \text{PAZROB } \leftarrow 1 \text{; if (PAAOOUT(29-24) > 24) } \cdot (\text{PAEACT=1}) \\
& \quad \text{or exponent overflow : Else 0} \\
& \text{PAZROA } \leftarrow 1 \text{; if (PAAOOUT(29-24) < -23) } \cdot (\text{PAEACT=1}) \\
& \quad \text{or exponent underflow : Else 0} \\
\end{align*}
\]

3) If (PASCHR(5-0) > 0) \cdot (\overline{\text{PAZROA}}) \cdot (\text{PAZROB}):

   \[
   \begin{align*}
   & \text{PASCHR } \leftarrow \text{PASCHR(5-0)} \\
   & \text{PAALNI(23-0) } \leftarrow \text{PABREG(23-0)} \\
   & \text{PABREG(23-0) } \leftarrow \text{PAALNO(23-0)} \text{; iff PAEACT = 1} \\
   \end{align*}
   
\]

**Figure 4.22**
If \((\text{PASHCR}(5-0) < 0) \cdot (\text{PAZROA}) \cdot (\text{PAZROB}) = 1:\)

\[\text{PASCR} \leftarrow \text{PA2SCO}\]

\[\text{PAALNI}(23-0) \leftarrow \text{PAAREG}(23-0)\]

\[\text{PAAREG}(23-0) \leftarrow \text{PAALNO}(23-0); \text{iff PAEACT} = 1\]

If \((\text{PASHCR}(5-0) < 0) \cdot (\text{PAEACT} = 1)\):

\[\text{PAAREG}(31-24) \leftarrow \text{PABREG}(31-24); \text{iff PAEACT} = 1\]

4) \[\text{PAADMA}(23-0) \leftarrow (\text{PAZROA}) \cdot (\text{PAAREG}(23-0))\]

\[\text{PAADMB}(23-0) \leftarrow (\text{PAZROB}) \cdot (\text{PABREG}(23-0))\]

Select \text{ADD}

\[\text{PAAREG}(23-0) \leftarrow \text{PAAOUT}(23-0); \text{iff PAEACT} = 1\]

\[\text{PATOVF} \leftarrow 0 \text{ if } \text{no overflow} \cdot (\text{PAEACT} = 1)\]

If \((\text{overflow}) \cdot (\text{PAEACT} = 1)\):

\[\text{PATOVF} \leftarrow 1\]

\[\text{PACO23FF} \leftarrow \text{CO23 (carry out of bit 23)}\]

\[\text{PASCHR}(5-0) \leftarrow 1; \text{iff PAEACT} - 1\]

5) If \((\text{PATOVF} = 1) \cdot (\text{PAEACT} = 1)\):

\[\text{PASCR}(5-0) \leftarrow \text{PASHCR}(5-0)\]

\[\text{PAALNI}(23-0) \leftarrow \text{PAAREG}(23-0)\]

\[\text{PAADEA}(31-24) \leftarrow \text{PAAREG}(31-24)\]

\[\text{PAADEB}(31-24) \leftarrow 0\]

Select \text{INCREMENT}

\[\text{PAAREG}(22-0) \leftarrow \text{PAALNO}(22-0)\]

\[\text{PAAREG}(23) \leftarrow \text{PACO23FF}\]

\[\text{PAAREG}(31-24) \leftarrow \text{PAAOUT}(31-24)\]

\[\text{PAOVFF} \leftarrow 1; \text{iff overflow from exponent}\]

If \((\text{PATOVF} = 0)\):

No operation

6) \[\text{PAALNI}(23-) \leftarrow \text{PAAREG}(23-0)\]

\[\text{PASHCR}(5-0) \leftarrow \text{PANRMD}(5-0); \text{iff PAEACT} = 1\]

\[\text{FIGURE 4.22, CONTINUED}\]
7) PASC(5-0) ← PA2SCO(5-0)
    PAALNI(23-0) ← PAAREG(23-0)
    PAADEA(31-24) ← PAAREG(31-24)
    PADEB(31-24) ← PA2SCO(5, 5, 5-0)
    PAADMA ← 0
    PAADMB ← 0
    Select ADD
    If no underflow from exponent:
    PAAREG(23-0) ← PAALNO(23-0); iff PAEACT = 1
    PAAREG(31-24) ← PAOUT(31-24); iff PAEACT = 1
    If exponent(31) underflow:
    PAAREG(31-24) ← -128_{10}
    PAAREG(23-0) ← PAOUT(23-0)

8) PAADMA(23-0) ← PAAREG(23-0)
    PAADMB(23-0) ← 0
    Select ADD
    If PAOUT(23-0) = 0:
    PAAREG(31-24) ← -128; iff PAEACT = 1

FIGURE 4.22, CONTINUED
ADD

START

μs 1

PAXDIN31-0 → PABREG31-0

μs 2

PAAREG(31-24) - PABREG(31-24) → PASHCR

Set ZROAFF

YES

UV or Exp. Diff. <-23

NO

OV or Exp. Diff. >23

YES

Set ZROBFF

NO

μs 3

PASCHR +

YES

If ZROAFF, ZROBFF = 0,
PASHCR → PASCR
PABREG23-0 → PAALNW23-0 → PABREG23-0

NO

If ZROAFF, ZROBFF = 0,
PA2SC0 → PASCR
PAAREG23-0 → PAALNW23-0 → PAAREG23-0
If ZROAFF, ZROBFF = 0, Or
ZROAFF = 1,
PABREG31-24 → PAAREG31-24

*SB is identical except in
μs 4 A and B reg. subtracted.

SH2

Figure 4.23a
Floating Point Add/Subtract
Algorithm

131
Figure 4.23b
Floating Point Add/Subtract
Algorithm
4.8.4.3 Multiply Floating Point

The method used in the execution of the FP multiply instruction is comprised of three functional categories. They are initialize, iterate, and terminate. See Figure 4.24.

During the initialization phase, the multiplier is received and placed into the Q register. The partial product register is zero filled, and the lowest three multiplier bits are decoded as shown below.

<table>
<thead>
<tr>
<th>Multiplier Bits</th>
<th>Operation</th>
</tr>
</thead>
<tbody>
<tr>
<td>0 0 0</td>
<td>+0X</td>
</tr>
<tr>
<td>0 0 1</td>
<td>+1X</td>
</tr>
<tr>
<td>0 1 0</td>
<td>+1X</td>
</tr>
<tr>
<td>0 1 1</td>
<td>+2X</td>
</tr>
<tr>
<td>1 0 0</td>
<td>-2X</td>
</tr>
<tr>
<td>1 0 1</td>
<td>-1X</td>
</tr>
<tr>
<td>1 1 0</td>
<td>-1X</td>
</tr>
<tr>
<td>1 1 1</td>
<td>-0X</td>
</tr>
</tbody>
</table>

During the iterate phase, two final product bits are calculated for each micro-step. Therefore, twelve steps are performed to generate the 24 bit final product.

In the terminate phase, the 24 bits of the fraction are normalized, the addition of the multiplicand and multiplier exponents is performed, and the result (of exponent addition) is corrected for any normalization shifts. In addition, the 24 bit fraction is checked and corrected for the one overflow fault resulting from the multiplication of two -1.0 operands. After all operand corrections have been accomplished, the final exponent and fraction are checked for exponent overflow and fraction equal to zero conditions to set the appropriate indicator. Figure 4.25 illustrates the ML instruction processing.
<table>
<thead>
<tr>
<th>OPCODE</th>
<th>INSTRUCTION</th>
<th>MNEMONIC</th>
</tr>
</thead>
<tbody>
<tr>
<td>202</td>
<td>Multiply Floating Point</td>
<td>ML</td>
</tr>
</tbody>
</table>

**DESCRIPTION**

In all active AU, operand (31-0) is multiplied by A-Register (31-0) and the product is placed in the A-Register (23-0). A-Register is corrected for (1) mantissa overflow during adds, (2) result of multiplication equal to zero, (3) exponent underflow, and (4) A not normalized.

If A-Register of product mantissa equal to 0, then the A-exponent set to -128.

If A-exponent greater than or equal to +128 set overflow flip-flop to 1.

Q register contents are destroyed.

Execution: 19 steps

**MICRO STEP SEQUENCE**

1) If R=1:

```
PADMDE ← 0
PADAIN(10-0) ← PAMOBR(10-0)
Transmit PADREQ to EMC
PAXDIN(31-0) ← PXEMOT(31-0) when PADSEL=1
```

If R=0:

```
PAMORL(31-0) ← PAMOBR(31-0)
PAXDIN(31-0) ← PAMORL(31-0)
PABREC(31-0) ← PAXDIN(31-0)
```

**FIGURE 4.24, ML**
2) PAQREG(23-0) ← PABREG(23-0), iff PAEACT=1
PAQEXT ← 0
PAADMA(23-0) ← 0
PAADMB(23-0) ← 0
Select add (PAADMN)
PABREG(23-0) ← PAAOUT(23-0)
PABEXT ← PAAOUT1

3-13) Decode PAQREG1, PAQREG0, PAQEXT into octal 0-7.
PAADMA(23-0) ← PABREG(23-0)

If decode = 0,7 : PAADMB(23-0) ← 0
If decode = 3,4 : PAADMB(23-0) ← PAAREG(23-0)
If decode = 1,2,5,6: PAADMB(23-0) ← PAAREG(23,23-1)
If decode = 0,1,2,3: Select Add (PAADMN)
If decode = 4,5,6,7: Select subtract (PAADMN)
PABREG(2i-0) ← PAAOUT(23-2)

If decode = 1,2:
PAQREG(22) ← PABEXT + PAAREGO; iff PAEACT=1
If decode = 5,6:
PAQREG(22) ← PABEXT - PAAREGO; iff PAEACT=1
If decode = 0,3,4,7:
PAQREG(22) ← PABEXT; iff PAEACT=1
PAQREG(23) ← PAAOUTO; iff PAEACT=1
PABEXT ← PAAOUT1
PAQREG(21-0) ← PAQREG(23-2); iff PAEACT=1
PAQEXT ← PAQREG1

FIGURE 4.24 (CONTINUED)
If PAMOF: PABREG(23) ← PACO23FF
       PABREG(22) ← PACO23FF

If PAMOF: PABREG(23) ← PAOUT(23)
       PABREG(22) ← PAOUT(23)

14) Decode PAQREG(1), PAQREG(0), PAQEXT into octal 0-7

PAADMA(23-0) ← PABREG(23-0)

If decode = 0, 7 : PAADM(23-0) ← 0
If decode = 3, 4 : PAADM(23-0) ← PAAREG(23-0)
If decode = 1, 2, 5, 6: PAADM(23-0) ← PAAREG(23-23-1)
If decode = 0, 1, 2, 3: Select Add (PAADMN)
If decode = 4, 5, 6, 7: Select Subtract (PAADMN)

PAAREG(23-0) ← PAOUT(23-0); iff PAEACT=1

If decode = 1, 2:
   PAQREG(22) ← PABEXT + PAAREG; iff PAEACT=1

If decode = 5, 6:
   PAQREG(22) ← PABEXT - PAAREG; iff PAEACT=1

If decode = 0, 3, 4, 7:
   PAQREG(22) ← PABEXT; iff PAEACT=1

PAQREG(23) ← PAOUTO; iff PAEACT=1
PAQREG(21-0) ← PAQREG(23-2); iff PAEACT=1
PASHCR(5-0) ← 1; iff PAEACT=1

If PAMOF: PATOVF ← 1

PACO23FF ← CARRY OUT BIT 23
Else PATOVF ← 0

FIGURE 4.24 (CONTINUED)
15) \text{PAADEA}(31-24) \leftarrow \text{PAAREG}(31-24) \quad \text{[ADD EXPONENTS]}
\text{PADEB}(31-24) \leftarrow \text{PABREG}(31-24)

Select Add (PAADEX)

\text{PAADMA}(23-0) \leftarrow 0
\text{PAADMB}(23-0) \leftarrow 0

Select Add (PAADMN)

\text{If } \text{PAEXOF}: \text{PAOVFF} \leftarrow 1

\text{If } \text{PAEXUF} \quad \text{PAAREG}(31-24) \leftarrow -128; \text{ iff } \text{PAEACT}=1

\text{PAAREG}(23-0) \leftarrow \text{PAAOUT}(23-0); \text{ iff } \text{PAEACT}=1

\text{If } \text{PAEXUF}: \text{PAAREG}(31-24) \leftarrow \text{PAAOUT}(31-24); \text{ iff } \text{PAEACT}=1

16) \text{If } \text{PATOVF} = 1:

\text{PASCR}(5-0) \leftarrow \text{PASHCR}(5-0)
\text{PAALNI}(23-0) \leftarrow \text{PAAREG}(23-0)

\text{PAAREG}(22-0) \leftarrow \text{PAALNO}(22-0); \text{ iff } \text{PAEACT}
\text{PAAREG}(23) \leftarrow \text{PAO23FF}; \text{ iff } \text{PAEACT}
\text{PAADEA}(31-24) \leftarrow \text{PAAREG}(31-24)

\text{PADEB}(31-24) \leftarrow 0 \quad \text{[ADD ONE TO EXPONENT IF]}

Select INCREMENT \quad \text{[MANTISSA OVERFLOWS]}

\text{PAAREG}(31-24) \leftarrow \text{PAAOUT}(31-24); \text{ iff } \text{PAEACT}

\text{If } \text{PAEXOF}: \text{PAOVFF} \leftarrow 1; \text{ iff } \text{PAEACT}

17) \text{PAALNI}(23-) \leftarrow \text{PAAREG}(23-0)

\text{PASHCR}(5-0) \leftarrow \text{PANRMD}(5-0); \text{ iff } \text{PAEACT}=1

\text{FIGURE 4.24 (CONTINUED)}
18) \( \text{PAALNI}(23-0) \leftarrow \text{PAAREG}(23-0) \)
\( \text{PAADMA}(23-0) \leftarrow 0 \)
\( \text{PAADMB}(23-0) \leftarrow 0 \)
Select add (PAADMN)
\( \text{PAADEA}(31-24) \leftarrow \text{PAAREG}(31-24) \)
\( \text{PAADEB}(31-24) \leftarrow \text{PA2SCO}(5,5,5,0) \)
Select add (PAADEDX)
If \( \text{PAEXUF} \): \( \text{PAAREG}(31-24) \leftarrow -128 \)
\( \text{PAAREG}(23-0) \leftarrow \text{PAAOUT}(23-0) \)
If \( \text{PAEXUF} \): \( \text{PAAREG}(31-24) \leftarrow \text{PAAOUT}(31-24) \)
\( \text{PAAREG}(23-0) \leftarrow \text{PAALNO}(23-0) \) iff PAEACT=1

19) \( \text{PAADMA}(23-0) \leftarrow \text{PAAREG}(23-0) \)
\( \text{PAADMB}(23-0) \leftarrow 0 \)
Select Add (PAADMN)
If \( \text{PAAOUT}(23-0) = 0: \)
\( \text{PAAREG}(31-24) \leftarrow -128 \) iff PAEACT=1

FIGURE 4.24 (CONTINUED)
MULTIPLY FLOATING POINT (1)

START

μs 1
PAXDIN31-0 → PABREG31-0

μs 2
PABREG → PAQREG
0 → PABREG
0 → B.Q Extension Bits

μs 3-13
Partial Product (B) ± N*M and (A) → PABREG
(N determined by Multiplier Bits)
Product Bits stored in Q23, Q22

Diamond

NO

MOV=1
PAAOUT23 → PABREG23, 22

YES

CO23 → PABREG23, 22

μs 14
Partial Product (B) ± N*M and (A) → PAQREG
(N determined by Multiplier Bits)
Product Bits stored in Q23, Q22

Diamond

YES

MOV=1
1 → TOVFF
CO23 → CO23FF

NO

us 15
Add A, B Reg. Exponents

A

Figure 4.25a
Floating Point Multiply Algorithm
MULTIPLY FLOATING POINT (2)

**Figura 4.25b**

Floating Point Multiply Algorithm
MULTIPLY FLOATING POINT (3)

Figure 4.25c
Floating Point Multiply Algorithm
4.8.4.4 FP Divide

The execution of the floating point divide instruction is comprised of three functional steps; initialize, iterate and terminate. Figure 4.26 shows the micro-step sequence and Figure 4.27 is a flow chart.

The main function performed during the initialize phase is to check the divisor and set the overflow flip-flop if the divisor is zero. When the divisor is not zero, the divisor and dividend fractions are made positive (if negative). It is assumed that the dividend and divisor are normalized initially. In this algorithm, the developed quotient must be less than one. This is accomplished by performing a trial subtraction of the divisor from the dividend. If the result is positive (dividend ≥ divisor), then the dividend is shifted right one bit position and its exponent is corrected for this shift. This procedure thus insures the first quotient bit to be zero and that the quotient will have a value less than one.

In the iterate phase, the divisor is subtracted from the dividend. If the result is positive, a zero bit is entered in the quotient and the dividend is shifted left one place. If the result is negative, a one bit is entered in the quotient and the difference becomes the new dividend.

During the terminate cycle, the quotient and remainder are sign corrected and placed into their respective final registers. The divisor exponent is subtracted from the dividend exponent. The quotient is then normalized and the normalization shift is added to the quotient exponent. The remainder is not normalized. Finally, the exponent is checked for overflow and the quotient for zero to set the appropriate indicator.
<table>
<thead>
<tr>
<th>OPCODE</th>
<th>INSTRUCTION</th>
<th>MNEMONIC</th>
</tr>
</thead>
<tbody>
<tr>
<td>203</td>
<td>Divide Floating Point</td>
<td>DV</td>
</tr>
</tbody>
</table>

**DESCRIPTION**

In all active AUs, the floating point A-Register (31-0) is divided by the floating point operand (31-0) with the quotient going to A-Register (31-0) and the remainder to Q-Register (23-0). The divisor and dividend are assumed to be normalized. The quotient shall be normalized and the remainder not.

If the divisor equals zero or if exponent overflow occurs, then the overflow indicator shall be set and the overflow error transmitted to the ICL.

Execution: 38 steps

**MICRO STEP SEQUENCE**

1) If $R=1$:

   - `PADMDE ← 0`
   - `PADAINT(10-0) ← PAMOBR(10-0)`
   - Transmit PADREQ to EMC
   - `PAXDIN(31-0) ← PXEMOT(31-0);` when `PADSEL=1`

2) If $R=0$:

   - `PAMORL(31-0) ← PAMOBR(31-0)`
   - `PAXDIN(31-0) ← PAMORL(31-0)`
   - `PABREG(31-0) ← FAXDIN(31-0)`

**FIGURE 4.26, DV**
2) \( \text{PAADMB}(23-0) \leftarrow \text{PABREG}(23-0) \)
\( \text{PAADMA}(23-0) \leftarrow 0 \)
Select \text{ADD}
\( \text{PAOVOF} \leftarrow 1; \text{iff} \ \text{PAAOUT}(23-0) = 0 \) and \( \text{PAEACT}=1 \)

3) \( \text{PAADMA}(23-0) \leftarrow \text{PAAREG}(23-0) \)
\( \text{PAADMB}(23-0) \leftarrow \text{PABREG}(23-0) \)
Select \text{EXCLUSIVE OR}
\( \text{PASIGN} \leftarrow \text{PAAOUT}(23) \)

4) \( \text{PAADMA}(23-0) \leftarrow 0 \)
\( \text{PAADMB}(23-0) \leftarrow \text{PABREG}(23-0) \)
Select \text{SUBTRACT}
If \( \text{PABREG}(23) = 1 \):
\( \text{PABREG}(23-0) \leftarrow \text{PAAOUT}(23-0) \)
If overflow:
\( \text{PATOVF} \leftarrow 1 \)
\( \text{PACO23FF} \leftarrow \text{CO}23 \)
\( \text{PASHCR}(5-0) \leftarrow (000001); \text{iff} \ \text{PAEACT}=1 \)

5) If \( \text{PATOVF} = 0 \):
Do nothing
If \( \text{PATOVF} = 1 \):
\( \text{PASCR}(5-0) \leftarrow \text{PASHCR}(5-0) \)
\( \text{PAALNI}(31-0) \leftarrow \text{PABREG}(31-0) \)
\( \text{PAADEA}(31-24) \leftarrow 0 \)
\( \text{PAADEB}(31-24) \leftarrow \text{PABREG}(31-24) \)
Select \text{INCREMENT}

\text{FIGURE 4.26 (CONTINUED)}
PABREG(31-24) ← PAAOUT(31-24)
PAOVF ← 1 ; iff exponent overflow PAEACT=1
PABREG(23) ← PACO23FF
PABREG(22-0) ← PAALNO(22-0)

6) PAADMA(23-0) ← 0
PAADMBO(23-0) ← PAAREG(23-0)

Select SUBTRACT

If PAAREG(23) = 1:
   PAAREG(23-0) ← PAAOUT(23-0); iff PAEACT=1

If overflow:
   PATOVF ← 1
   PACO23FF ← CO23

If not overflow:
   PATOVF ← 0

7) If PATOVF = 0:
   Do nothing

If PATOVF = 1:
   PASCR(5-0) ← PASHCR(5-0)
   PAALNI(31-0) ← PAAREG(31-0)
   PAADEA(31-24) ← PAAREG(31-24)
   PAADEB(31-24) ← 0

Select INCREMENT

PAAREG(31-24) ← PAAOUT(31-24)
PAOVF ← 1 ; iff overflow . PAEACT=1
PAAREG(23) ← PACO23FF ; iff PAEACT=1
PAAREG(22-0) ← PAALNO(23-0) ; iff PAEACT=1

FIGURE 4.26 (CONTINUED)
8) \( \text{PAADMA}(23-0) \leftarrow \text{PAAREG}(23-0) \)
   \( \text{PAADBM}(23-0) \leftarrow \text{PABREG}(23-0) \)
   \( \text{PAADEA}(31-24) \leftarrow \text{PAAREG}(31-24) \)
   \( \text{PADEB}(31-24) \leftarrow \text{PA2SCO}(5,5,5-0) \)

Select SUBTRACT

\( \text{PASCR}(5-0) \leftarrow \text{PASHCR}(5-0) \)
\( \text{PAALNI}(23-0) \leftarrow \text{PAAREG}(23-0) \)

If \( \text{PAAOUT}(23) = 0: \) (A \( \geq \) B)
   \( \text{PAAREG}(23-0) \leftarrow \text{PAALNO}(23-0) \)
   \( \text{PAAREG}(31-24) \leftarrow \text{PAAOUT}(31-24) \) \( ; \text{iff PAEACT}=1 \)
   \( \text{PAOVFF} \leftarrow 1 \); iff overflow \((\text{PAADEX})\)

If \( \text{PAAOUT}(23) = 1: \) (A \( < \) B)
   Do nothing

9-30) \( \text{PAADMA}(23-0) \leftarrow \text{PAAREG}(23-0) \)
   \( \text{PAADMB}(23-0) \leftarrow \text{PABREG}(23-0) \)

Select SUBTRACT

\( \text{PASCR}(5-0) \leftarrow \text{PA2SCO}(5-0) \)
\( \text{PAALNI}(23-0) \leftarrow \text{PAAREG}(23-0) \)
\( \text{PAQREG}(23-1) \leftarrow \text{PAQREG}(22-0) ; \text{iff PAEACT}=1 \)

If \( \text{PAAOUT}(23) = 1: \)
   \( \text{PAAREG}(23-0) \leftarrow \text{PAALNO}(23-0) \)
   \( \text{PAQREG}(0) \leftarrow 0 \) \( ; \text{iff PAEACT}=1 \)

If \( \text{PAAOUT}(23) = 0: \)
   \( \text{PAAREG}(23-1) \leftarrow \text{PAAOUT}(22-0) \)
   \( \text{PAAREG}(0) \leftarrow 0 \)
   \( \text{PAQREG}(0) \leftarrow 1 \)

**FIGURE 4.26 (CONTINUED)**
31) \( \text{PAADM}(23-0) \leftarrow \text{PAARE}(23-0) \)
\( \text{PAADM}(23-0) \leftarrow \text{PABREG}(23-0) \)
\( \text{PAADE}(31-24) \leftarrow 0 \)
\( \text{PADEB}(31-24) \leftarrow 0 \)

Select SUBTRACT

\( \text{PASCR}(5-0) \leftarrow \text{PA2SCO}(5-0) \)
\( \text{PAALNI}(23-0) \leftarrow \text{PAARE}(23-0) \)
\( \text{PAQREG}(23-1) \leftarrow \text{PAQREG}(22-0) \); iff \( \text{PAEACT}=1 \)

If \( \text{PAOUT}(23) = 1 \):

\( \text{PAARE}(23-0) \leftarrow \text{PAALNO}(23-0) \)
\( \text{PAQREG}(0) \leftarrow 0 \)

If \( \text{PAOUT}(23) = 0 \); if \( \text{PAEACT}=1 \)

\( \text{PAARE}(23-1) \leftarrow \text{PAOUT}(22-0) \)
\( \text{PAARE}(0) \leftarrow 0 \)
\( \text{PAQREG}(0) \leftarrow 1 \)

\( \text{PAHCR}(5-0) \leftarrow \text{PAOUT}(29-24) \)

32) \( \text{PAADM}(23-0) \leftarrow \text{PAARE}(23-0) \)
\( \text{PAADM}(23-0) \leftarrow \text{PABREG}(23-0) \)

Select SUBTRACT

\( \text{PASCR}(5-0) \leftarrow \text{PA2SCO}(5-0) \)
\( \text{PAALNI}(23-0) \leftarrow \text{PAARE}(23-0) \)
\( \text{PAQREG}(23-1) \leftarrow \text{PAQREG}(22-0) \); iff \( \text{PAEACT}=1 \)

If \( \text{PAOUT}(23) = 1 \):

\( \text{PAARE}(23-0) \leftarrow \text{PAALNO}(23-0) \)
\( \text{PAQREG}(0) \leftarrow 0 \) \quad \text{iff \( \text{PAEACT}=1 \)}

If \( \text{PAOUT}(23) = 0 \):

\( \text{PAARE}(23-1) \leftarrow \text{PAOUT}(23-0) \)
\( \text{PAQREG}(0) \leftarrow 1 \)

FIGURE 4.26 (CONTINUED)
33) \[ \text{PAADMA}(23-0) \leftarrow \overline{\text{PAQREG}(23-0)} \]
\[ \text{PAADMB}(23-0) \leftarrow 0 \]
Select INCREMENT
If PASIGN = 0:
\[ \text{Do nothing} \]
If PASIGN = 1:
\[ \text{PAQREG}(23-0) \leftarrow \text{PAAOUT}(23-0); \text{iff PAEACT=1} \]

34) \[ \text{PAALNI}(23-0) \leftarrow \text{PAQREG}(23-0) \]
\[ \text{PASHCR}(5-0) \leftarrow \text{PANRMD}(5-0) \]
\[ \text{PAADMA}(23-0) \leftarrow 0 \]
\[ \text{PAADMB}(23-0) \leftarrow \text{PAAREG}(23-0) \]
Select SUBTRACT
If PASIGN = 1:
\[ \text{PAAREG}(23-0) \leftarrow \text{PAAOUT}(23-0); \text{iff PAEACT=1} \]

35) \[ \text{PASCN}(5-0) \leftarrow \text{PA2SCO}(5-0) \]
\[ \text{PAALNI}(23-0) \leftarrow \text{PAQREG}(23-0) \]
\[ \text{PAADMA}(23-0) \leftarrow \text{PAAREG}(23-0) \]
\[ \text{PAADMB}(23-0) \leftarrow 0 \]
Select ADD
\[ \text{PAAREG}(23-0) \leftarrow \text{PAALNO}(23-0); \text{ iff PAEACT=1} \]
\[ \text{PAQREG}(23-0) \leftarrow \text{PAAOUT}(23-0) \]

**FIGURE 4.26 (CONTINUED)**
36) \( \text{PAADEA}(31-24) \leftarrow \text{PAAREG}(31-24) \)
\( \text{PAADEB}(31-24) \leftarrow \text{PABREG}(31-24) \)
\( \text{PAADMA}(23-0) \leftarrow 0 \)
\( \text{PAADMB}(23-0) \leftarrow 0 \)
Select SUBTRACT
If not underflow:
\( \text{PAAREG}(31-24) \leftarrow \text{PAOUT}(31-24) \); iff PAEACT=1
If overflow (exponent):
\( \text{PAOVFF} \leftarrow 1 \); iff PAEACT=1
If underflow (exponent):
\( \text{PAAREG}(31-24) \leftarrow -128_{10} \); iff PAEACT=1
\( \text{PAAREG}(23-0) \leftarrow \text{PAOUT}(23-0)=0 \)

37) \( \text{PAADEA}(31-24) \leftarrow \text{PAAREG}(31-24) \)
\( \text{PAADEB}(31-24) \leftarrow \text{PA2SCO}(5,5,5-0) \)
\( \text{PAADMA}(23-0) \leftarrow 0 \)
\( \text{PAADMB}(23-0) \leftarrow 0 \)
Select ADD
If no underflow (exponent):
\( \text{PAAREG}(31-24) \leftarrow \text{PAOUT}(31-24) \)
If underflow: \( \text{PAAREG}(31-24) \leftarrow -128_{10} \)
\( \text{PAAREG}(23-0) \leftarrow \text{PAOUT}(23-0) \)

38) \( \text{PAADMA}(23-0) \leftarrow \text{PAAREG}(23-0) \)
\( \text{PAADMB}(23-0) \leftarrow 0 \)
Select ADD
If \( \text{PAOUT}(23-0) = 0 \):
\( \text{PAAREG}(31-24) \leftarrow -128_{10} \); iff PAEACT=1

FIGURE 4.26 (CONTINUED)
DIVIDE FLOATING POINT (2)

A

μs 34

SIGNFF = 1

YES

2's Complement A, Return to A
Q → Normalize Decode

NO

μs 35

Normalize Q, Return to A. A → Q

μs 36

A(31-24) → B(31-24)

Overflow

YES

Set OVFF

NO

Underflow

YES

-128 → A(31-24)
0 → A(23-0)

NO

μs 36

PAAOUT(31-24) → A(31-24)

μs 37

A(31-24) - PASHCR5, 5, 5-0

Underflow

YES

-128 → A(31-24)
0 → A(23-0)

NO

μs 37

PAAOUT(31-24) → A(31-24)

μs 38

Check for 0 Result, if YES,
-128 → A(31-24)

END

Figure 4.27b
Floating Point Divide Algorithm

151
4.8.4.5 F.P. Square Root

The execution of the floating point square root instruction is comprised of two functional steps; initialize and iterate. Figure 4.28 shows the micro-step sequence for SQ and Figure 4.29 is the flow chart.

The main function of the initialize phase is to determine if the fraction is negative. If so, the overflow flip-flop is set. If not, the exponent portion of the operand is examined to determine if it is even or odd. If it is odd, the fraction is shifted right one position. Then, the exponent is shifted right one place to divide it by two. The first trial divisor is then taken.

The iterative phase is composed of pairs of micro-steps wherein one square root bit is generated per pair. Within each pair, a trial divisor is generated, a subtraction of this divisor from the dividend (or previous remainder) is performed, a square root bit is generated based on the sign of the result of the subtraction, and a new trial divisor is formed.

The result of the iterative phase is a square root of 12 bits.

4.8.5 Output Instructions

Instructions chosen as examples are STA and OTA.

4.8.5.1 STA

During the STA instruction, the A Register is gated unchanged through the ALU to EM. The micro-step sequence is shown in Figure 4.30. The actual EM write cycle is handled by the PICU and EMC as described in Section 4.2.2.1.2.

4.8.5.2 OTA

During the OTA instruction, the A Register is gated unchanged through the ALU onto the Output Data Bus (PAODAT). The micro-step sequence is shown in Figure 4.31. The AU should have been selected such that only one AU in the ensemble has its EA flip-flop set.
<table>
<thead>
<tr>
<th>OPCODE</th>
<th>INSTRUCTION</th>
<th>MNEMONIC</th>
</tr>
</thead>
<tbody>
<tr>
<td>204</td>
<td>Square Root Floating Point</td>
<td>SQ</td>
</tr>
</tbody>
</table>

**DESCRIPTION**

In all active AUs, the square root of the floating point contents of A-Register (31-0) is placed in A-Register (31-0). Overflow error is generated and PAOVFF set in the case of negative operands. Q-Register contents are destroyed.

**Execution:** 32 steps

**MICRO STEP SEQUENCE**

1) \( \text{PAADEA} \leftarrow 0 \)
   
   \( \text{PAADEB} \leftarrow 0 \)
   
   \( \text{PAADMA} \leftarrow 0 \)
   
   \( \text{PAADMB} \leftarrow 0 \)

Select INCREMENT

\( \text{PABREG}(23-0) \leftarrow \text{PAAOUT}(23-0) = (0...01) \)

\( \text{PASHCR}(5-0) \leftarrow \text{PAAOUT}(29-24) = +1 \) iff \( \text{PAEACT}=1 \)

\( \text{PAOVFF} \leftarrow 1; \) iff \( \text{PAAREG}(23) = 1 \cdot \text{PAEACT}=1 \)

2) \( \text{PAALNI}(23-0) \leftarrow \text{PAAREG}(23-0) \)

If \( \text{PAAREG}(24) = 1 \):

\( \text{PAADEA}(31-24) \leftarrow \text{PAAREG}(31-24) \)

\( \text{PAADEB}(31-24) \leftarrow 0 \)

Select INCREMENT

\( \text{PAOVFF} \leftarrow 1 \) iff overflow

\( \text{PAAREG}(31-24) \leftarrow \text{PAAOUT}(31-24) \) ; iff \( \text{PAEACT}=1 \)

\( \text{PAAREG}(23-0) \leftarrow \text{PAALNO}(23-0) \)

**FIGURE 4.28, SQ**
3) \( \text{PAAREG}(31-24) \leftarrow \text{PAAREG}(31,31-25) \)
\( \text{PAADMA}(23-0) \leftarrow \text{PAAREG}(23-0) \)
\( \text{PAADMB}(23-0) \leftarrow 0 \)
Select ADD
\( \text{PAQREG}(23-0) \leftarrow \text{PAAOOUT}(23-0) \); iff PAEACT=1

4) \( \text{PAADMA}(23-0) \leftarrow 0 \)
\( \text{PAADMB}(23-0) \leftarrow 0 \)
Select ADD
\( \text{PAAREG}(23-0) \leftarrow \text{PAAOOUT}(23-0) \); iff PAEACT=1

5,7,9-27) \( \text{PAQREG}(23-0) \leftarrow (\text{PAQREC}(22-0, 0)) \); iff PAEACT=1
\( \text{PAADMA}(23-0) \leftarrow (\text{PAAREG}(21-0), \text{PAQREG}(22,21)) \)
\( \text{PAADMB}(23-0) \leftarrow \text{PABREG}(23-0) \)
If PAAREG(23) = 0:
Select SUBTRACT
\( \text{PAAREG}(23-0) \leftarrow \text{PAAOOUT}(23-0) \); iff PAEACT=1
If PAAREG(23) = 1:
Select ADD
\( \text{PAAREG}(23-0) \leftarrow \text{PAAOOUT}(23-0) \); iff PAEACT=1

6,8,10,-28) \( \text{PAQREG}(23-0) \leftarrow (\text{PAQREC}(22-0,0)) \); iff PAEACT=1
\( \text{PASCR}(5-0) \leftarrow \text{PA2SCO}(5-0) \)
\( \text{PAALNI}(23-0) \leftarrow \text{PABREG}(23-0) \)
\( \text{PABREG}(23-0) \leftarrow \text{PAALNO}(23-0) \)
If PAAREG(23) = 0 (correct guess)
\( \text{PABREG}(2-0) \leftarrow (101) \); iff PAEACT=1
If PAAREG(23) = 1 (wrong guess)
\( \text{PABREG}(2-0) \leftarrow (011) \); iff PAEACT=1

**Figure 4.28 (continued)**
SQUARE ROOT (1)

START

Sign A Negative

us 1

YES → Set OVFF

NO

Initialize B Mantissa

us 2

Exponent Add

YES → Rt. Shift A Mantissa

Add 1 to A Exponent

NO

Exp. OV

YES → Set OVFF

us 3

Rt. Shift A Exponent 1 bit

A Mantissa + Q Mantissa

us 4

Set A Mantissa = 0

A

Figure 4.29a
Floating Point Square Root Algorithm
SQUARE ROOT (2)

\[ A \]

\[ \mu s \ 5, 7, \ldots, 27 \]

NO \[
A_{23} = 0
\]

YES \[
(A_{REG21-0}, Q_{REG22, 21}) + (B_{REG23-0}) \rightarrow A_{REG23-0}
\]

YES \[
(A_{REG21-0}, Q_{REG22, 21}) - (B_{REG23-0}) \rightarrow A_{REG23-0}
\]

\rightarrow\text{Shift Q Mantissa left 1 bit}

\[ \mu s \ 6, 8, \ldots, 28 \]

\rightarrow\text{Shift B Mantissa left 1 bit}

\rightarrow\text{Shift Q Mantissa left 1 bit}

\rightarrow\text{Shift B Mantissa right 2 bits}

\[ \mu s \ 29 \]

\rightarrow\text{Shift B Mantissa right 2 bits}

\[ \mu s \ 30 \]

\rightarrow\text{B Mantissa \rightarrow Normalize Decoder}

\[ \mu s \ 31 \]

\rightarrow\text{Normalize B Mantissa}

\rightarrow\text{B Mantissa \rightarrow A Mantissa}

\[ \mu s \ 32 \]

\rightarrow\text{A Mantissa = 0}

NO \[
\rightarrow\text{END}
\]

YES \[
-128 \rightarrow A \text{ Exponent}
\]

Figure 4.29b

Floating Point Square Root Algorithm
<table>
<thead>
<tr>
<th>OPCODE</th>
<th>INSTRUCTION</th>
<th>MNEMONIC</th>
</tr>
</thead>
<tbody>
<tr>
<td>031</td>
<td>Store A-Register</td>
<td>STA</td>
</tr>
</tbody>
</table>

**DESCRIPTION**

AU A-Register (31-0) is stored in element memory (word specified by operand (bits 10-0)) from those AUs which are active.

Execution: 1 step

**MICRO STEP SEQUENCE**

1)  \(\text{PADMDE} \leftarrow 1\)

   \(\text{PADAIN}(10-0) \leftarrow \text{FAMOBR}(10-0)\)

   Transmit \(\text{PADREQ}\) to EMC

   \(\text{PAADEA}(31-24) \leftarrow \text{PAAREG}(31-24)\)

   \(\text{PADEB}(31-24) \leftarrow 0\)

   \(\text{PAADMA}(23-0) \leftarrow \text{PAAREG}(23-0)\)

   \(\text{PAADMB}(23-0) \leftarrow 0\)

   Select OR

   \(\text{PXEMOT}(31-0) \leftarrow \text{PAAOUT}(31-0); \text{iff PADSEL} \cdot (\text{PAEACT}=1)\)

   \(\text{PADWSL} \leftarrow 1; \text{iff PADSEL} \cdot (\text{PAEACT}=1)\)

**FIGURE 4.30, STA**
<table>
<thead>
<tr>
<th>OPCODE</th>
<th>INSTRUCTION</th>
<th>MNEMONIC</th>
</tr>
</thead>
<tbody>
<tr>
<td>047</td>
<td>Output to A</td>
<td>OTA</td>
</tr>
</tbody>
</table>

DESCRIPTION

AU A Register (31-0) is gated to the SCL A Register via the Adder Output to the Output Data Bus if Element Activity is 1.

Execution: 1 step

MICRO STEP SEQUENCE

1) PADEEA(31-24) ← PAAREA(31-24)
   PAADEB(31-24) ← 0
   PAADMA(23-0) ← PAAREA(23-0)
   PAADM(23-0) ← 0
   Select OR
   PAODAT(31-0) ← PAOUT(31-0); if PAEACT=1

Figure 4.31, OTA
4.8.6 Distributed Logic Instructions

Instructions chosen as examples are RDA (executed in the SCL), SH, SL and SF.

4.8.6.1 Count AU/RDA

AU Count logic is based on a design by Drs. C.C. Foster (F076) and K. E. Batcher at Goodyear Aerospace. It is an exact counter which requires no more than one full adder per PE.

Each row of nine AU contains one set of the logic shown in Figure 4.32. The AU supplies the output of its EA FF. The sum is generated and sent to the CC where standard adders generate a count for the entire system.

The nine EA signals are fed to a group of full adders. The outputs are labeled with a power of two weighting. The remaining adder stages gather all like weighted signals together to produce a single four bit count.

In the CC, the AU count is available to the SCL, when, during the execution of a RDA, the count of active AU is loaded into the SCL A Register.

4.8.6.2 Select Highest/Lowest

The SH/SL algorithm was described in detail in Section 3.6.2.

Figures 4.33 shows the micro-step sequence for SH. In the following micro-step description

\[
\text{PAALFA} - \text{ Flip-Flop} \\
\text{PABETA} - \text{ Flip-Flop} \\
\text{SAHANY} - \text{ When low, indicates to the PICU that zero or one AU remains active.} \\
\text{PABITA} \\
\text{PABITB} - \text{ A, B, C lines} \\
\text{PABITC} \\
\text{PABITX} \\
\text{PABITY} - \text{ X, Y, Z lines} \\
\text{PABITZ} \\
\]
These logic gates are ECL type 182 single bit full adders.

Figure 4.32 Row Count Logic
OPCODE     INSTRUCTION     MNEMONIC
164          Select Highest      SH

DESCRIPTION
Set EA to zero in all active AU's whose A-Registers contain non-maximal values relative to set of active elements (Integer or Floating Point).

Execution: 35 Steps (Maximum)

MICRO STEP SEQUENCE

1) PAADMA(23-0) ← 0
   PAADMB(23-0) ← PAAREG(23-0)
   SELECT OR
   PABREG(23-0) ← PAAOUT(23-0)
   If PAAREG(23) = 1
      PABREG(31) ← PAAREG(31)
      PABREG(30-24) ← PAAREG(30-24)
   If PAAREG(23) = 0
      PABREG(31) ← PAAREG(31)
      PABREG(30-24) ← PAAREG(30-24)
   PAALFA ← PAAOUT(23)
   If PAAREG(23) = 1
      PABETA ← PAAREG(31)
   If PAAREG(23) = 0
      PABETA ← PAAREG(31)
   PASHCR(5-0) ← 018; Iff PAEACT = 1

FIGURE 4.33, SH
2) PABITA ← PAALFA·PABETA·PAEACT
   PABITB ← PAALFA·PABETA·PAEACT
   PABITC ← PAALFA·PABETA·PAEACT

3) PABITA ← PAALFA·PABETA·PAEACT
   PABITB ← PAALFA·PABETA·PAEACT
   PABITC ← PAALFA·PABETA·PAEACT

PASCR(5-0) ← PA2SCO(5-0)
PAALNI(31-0) ← PABREG(31-0)
PABREG(31-0) ← PAALNO(31-0)

If (PABITA·PABITX) +
   (PABITA·PABITB·PABITY) +
   (PABITA·PABITB·PABITC·PABITZ)

Then PAEACT ← 0
Else PAEACT not changed.

PAALFA ← PABREG(31)
PABETA ← PABREG(30)

even, 4-34) If SAMANY = 0:
   EXIT

If SAMANY = 1:
   PABITA ← PAALFA·PABETA·PAEACT
   PABITB ← PAALFA·PABETA·PAEACT
   PABITC ← PAALFA·PABETA·PAEACT
   PASCR(5-0) ← PA2SCO(5-0)
   PAALNI(31-0) ← PABREG(31-0)
   PABREG(31-0) ← PAALNO(31-0)

FIGURE 4.33 (CONTINUED)
odd, 5-35) If SAMANY = 0:

EXIT

If SAMANY = 1:

PABITA ← PAALFA·PABETA·PAEACT
PABITB ← PAALFA·PABETA·PAEACT
PABITC ← PAALFA·PABETA·PAEACT
PASCN(5-0) ← PA2SCO(5-0)
PAALNI(31-0) ← PABREG(31-0)
PABREG(31-0) ← PAALNO(31-0)

IF (PABITA·PABITX) +
   (PABITA·PABITB·PABITY) +
   (PABITA·PABITB·PABITC·PABITZ)
Then PAEACT ← 0

Else PAEACT not changed

PAALFA ← PABREG(31)
PABETA ← PABREG(30)

FIGURE 4.33 (CONTINUED)
The conversion micro-step uses both normal ALU hardware and special logic. The fraction part of the conversion is performed by the mantissa adder. The exponent part, converted by the Search Conversion Logic, PASRCH(31-24), is as follows:

\[
\begin{align*}
\text{(IF} & \text{ (PAMORL(28-24) = 27 \cdot PAAREG23 OR } \\
& \text{ (PAMORL(28-24) = 24 \cdot PAAREG23 THEN } \\
& \text{ PASRCH31 } \leftarrow \text{ PAAREG31, } \\
& \text{ PASRCH(30-24) } \leftarrow \text{ PAAREG(30-24)) OR } \\
\text{ (IF} & \text{ (PAMORL(28-24) = 27 \cdot PAAREG23 OR } \\
& \text{ (PAMORL(28-24) = 24 \cdot PAAREG23 THEN } \\
& \text{ PASRCH31 } \leftarrow \text{ PAAREG31, } \\
& \text{ PASRCH(30-24) } \leftarrow \text{ PAAREG(30-24))}
\end{align*}
\]

The conversion logic is shown in block diagram form in Figure 4.34.

The two bit interactive logic is shown in Figure 4.35. The A, B, C output lines are sent to the BSD where they are "OR"ed with all other like lines with the Bay. Then they are sent to the CSD where all eight Bays are "OR"ed. The result, now called X, Y, Z lines are then broadcast back to all AU. The global X, Y, Z lines are compared to the local A, B, C lines and the EA FF reset if necessary.

**4.8.6.3 Select First**

The SF instruction encompasses very little logic in the AU. The AU sends its EA FF to the BSD and receives a pointer, PASFPT, back. The BSD sends a row activity signal to the CSD. The row activity is an "OR" of the EA of each AU in the row. The CSD sends a pointer to the first BSD in the first Bay with an active AU. The BSD then sends the pointer to the first active AU in the row.

The AU, upon execution of an SF instruction, waits one clock for the propagation delay of the distributed logic to settle and then the AU loads the pointer into its EA Flip-flop. Figure 4.36 shows these two micro-steps.

The SF logic on the BSD is shown in Figure 4.37. The nine EA lines enter at the left and the nine pointers exit on the right.
Figure 4.34  SN/SL Conversion Micro-Step Logic
Figure 4.35 AU Search Logic
<table>
<thead>
<tr>
<th>OPCODE</th>
<th>INSTRUCTION</th>
<th>MNEMONIC</th>
</tr>
</thead>
<tbody>
<tr>
<td>150</td>
<td>Select First</td>
<td>SF</td>
</tr>
</tbody>
</table>

**DESCRIPTION**

Set EA to zero in all active AUs except one. The one remaining active AU is chosen on the basis of physical hardware location.

Execution: 2 steps

**MICRO STEP SEQUENCE**

1) NOP

2) \( \text{PAEACT} \leftarrow 0; \text{if PASFPT} = 0 \)

Else PAEACT not changed

FIGURE 4.36, SF
Figure 4.37 Row Select First Logic
5.0 Conclusion

The AU design successfully met all design and operational goals set by the specification document. It and the remainder of PEPE was delivered on budget and on schedule to the BMD Advanced Technology Center in Huntsville, Alabama.

A critique of the final design found two areas of the AU that could be improved. The FIX instruction algorithm could be reduced from 5 steps to 3 steps with the addition of hardware. The rather clumsy algorithm now used, were developed because an error was found in the AU during final tests. The correction had to entail minimum change and therefore was not an elegant solution. Hardware to detect the case where significant bits are shifted out would cut steps 4 and 5 from the current algorithm.

The quotient of the floating point multiply and the dividend of the floating point divide should be 48 bits long to avoid any loss of accuracy. AU hardware does not support the long words but this could possibly be added to enhance the AU performance.

PEPE has been studied by Burroughs and System Development Corporation for use in other high speed processing problems. Areas suggested have been air traffic control, image processing, weather forecasting and wind tunnel simulation. Since PEPE was designed as a processor ensemble, there is no direct communication between AU's. Most problems that are amenable to parallel solutions require at least nearest neighbor communications. The possibility of using the AOCU/AOU parallel ensemble as a programmable routing network has been proposed. Under this scheme, the ACU/AU ensemble could be processing data while the AOCU/AOU ensemble was routing data between the
Element Memories. The major advantage of this software routing is that in a hardwired system, like ILLIAC IV, if a PE failed, the communication network was left with a hole whereas if a PE failed, software could reestablish the network using another PE. This work has not yet been completed.

The analysis of algorithms for parallelism is a new, largely unexplored field. Some work [Ra76] and [KU76] has been done but a useable parallelism detecting compiler is still in the future.
References and Bibliography

Search Algorithm


Select First Algorithm


PEPE Hardware


PEPE Software


Parallelism Detection


APPENDICES
Arithmetic Control Unit (ACU) - Controls the Arithmetic Units (AU) in the PEPE elements so that they can execute the required parallel algorithms in an efficient manner. In addition, it executes sequential instructions retrieved from the program memory (PGRM) similar to that of a conventional processor.

Associative Output Unit (AOU) - Performs computations including, but not limited to, those required in the output of data from the PE to the Control Unit.

Associative Output Control Unit (AOCU) - Similar to ACU except controls the associative output units (AOU) in the PEPE elements.

Arithmetic Unit (AU) - Performs computations including, but not limited to, those required to apply complex arithmetic functions to data contained in element memory (EM).

Bay Signal Distributor (BSD) - The portion of the SDS contained in the Element Bay.

Central Signal Distributor (CSD) - The portion of the SDS contained in the Control Console.

Control Console (CC) - That portion of PEPE consisting of all of the functional units except the PE.

Correlation Control Unit (CCU) - Similar to ACU except controls the correlation units (CU) in the PEPE elements.

Correlation Unit (CU) - Performs computations including, but not limited to, those required to correlate incoming data with data already resident in the element.

APPENDIX A, PEPE DICTIONARY
Data Memory (DATA) - One of three memories, one in each control unit, used to hold data in transit between the Host computer and the elements.

Element Bay (EB) - That portion of PEPE consisting of up to 36 processing elements (PE). A complete PEPE system consists of eight element bays.

Element Memory (EM) - A word addressed memory of 2048 locations contained in each processing element (PE). It is shared between the AU, AOU, and CU.

Element Memory Control (EMC) - Unit which performs conflict resolution on requests from the three control units for use of element memory.

Intercommunication Logic (ICL) - A unit in the PEPE Control Console which handles:

- Inter-control unit communication
- Control unit interrupts
- PEPE status and maintenance
- Data collection and timing

Input-Output Unit (IOU) - Provides communication between the PEPE control units and external computers or another PEPE.

Interval Timer (IT) - A programmable timer in the ICL which can be set to interrupt the ACU after a time-out.

Maintenance Control and Diagnostic Unit (MCDU) - Unit which can perform test and diagnostic operations on each control under either manual or external computer control.

Micro-Program Memory (MPM) - A programmable and alterable memory in each SCL and PICU which controls registers and gating in order to execute the micro-sequences that comprise an instruction.
Output Data Control (ODC) - Unit which performs conflict resolution on requests from the ACU and AOCU for use of the output data bus from the elements.

Parallel Instruction Execution - The process of the PICU receiving instructions from the SCL, translating the instruction into micro-sequence control bits and transmitting these controls to the element for execution.

Processing Element (PE) - One minimum building block of PEPE. Includes one AU, AOU, and CU and one EM.

PE Clock Distributor (PECLKD) - The logic which receives a square wave clock signal from the BSD and generates all clock and write enable pulses required by the PE.

Parallel Element Processing Ensemble (PEPE) - A highly-parallel computer consisting of three control units and 288 content-addressable processing elements (PE). PEPE is capable of executing three independent instruction streams simultaneously.

Program Memory (PGRM) - One of three memories, one in each control unit, used to hold the program to be executed by that control unit. Contains both parallel and sequential instructions.

Parallel Instruction Control Unit (PICU) - Unit which transmits controls and data to the element. One PICU in each control unit.

Parallel Instruction Queue (PIQ) - First-in, first-out buffer between the ACU SCL and the ACU PICU. Used to increase the execution overlap of long arithmetic parallel instructions with shorter arithmetic sequential instructions.
Real Time Clock (RTC) - A clock in the ICL which can follow real-time or simulated real-time. Can cause an interrupt to the ACU when it equals a programmable value.

Routing - The process whereby the SCL fetches an instruction from PGRM, determines if it is a parallel or sequential instruction and transfers it to the sequential execution sub-unit of the SCL or to the PICU for parallel execution.

Select First (SF) - A distributed logic function whereby the first active element is selected to perform a sequence of instructions while non-first elements remain idle.

Select Highest/Select Lowest (SH/SL) - A distributed logic function whereby the active element with the maximal/minimal binary value in its A register is selected to perform a sequence of instruction while non-maximal/minimal elements remain idle.

Sequential Control Logic (SCL) - That part of a control unit that performs instruction fetching, sequential instruction execution and parallel instruction preparation.

Signal Distribution System (SDS) - The logic which carries data and control lines from the PICU to the PE and which returns data and status signals from the PE to the PICU and SCL. Contains the Distributed Logic.

Sub-Processor - One of the three processors in the processing element (PE). The three sub-processors are the AU, AOU and CU.

Test and Maintenance Computer (T & M) - An external computer interfaced with PEPE via the MCDU and IOUs. The T & M can have complete control over PEPE for program loading, execution and debugging and for maintenance and diagnostic functions. The T & M is a Burroughs B1714 system.
1. **CLOCK RATE** - 10 MHz, ± 0.01% Crystal Controlled

2. **MECL 10K**
   - 29 types used
   - 16 and 24 pin Dual-In-Line package (DIP)
   - 2 to 30 gates per 16 pin DIP (62 gates per 24 pin DIP)
   - 2 nanosecond delay and rise time

3. **NO. OF CIRCUIT TYPES USED** (excluding memory) - 29
   - PE BAY (11 PE system) - 10,000 IC DIPS
   - (36 PE system) - 33,000 IC DIPS
   - (288 PE system) - 262,000 IC DIPS
   - CONTROL CONSOLE - 11,000 IC DIPS
   - TOTAL EQUIVALENT GATES (288 PE system) - 3 million
     11 gates/DIP average

4. **MEMORY TYPES AND SIZES**
   - Control Console SCL MPM, 3-1K X 48: 144 IC DIPS
   - Control Console DATA & PGM, 5-4K X 32: 640 IC DIPS
   - Control Console ACU PGM, 1-32K X 32: 1024 IC DIPS
   - Control Console PICU MPM, 3-1K X 80: 240 IC DIPS
   - ELEMENT BAY EM, 11-2K X 32: 704 IC DIPS
   - ELEMENT BAY CU Reg. 11-16 X 32: 88 IC DIPS

   TOTAL MEMORY DIPS = 2840
   (2 circuit types)

5. **IMPEDANCE CHARACTERISTICS**
   - Laminated Copper backplane used for low impedance power distribution
   - 50 ohm transmission lines used for signal distribution throughout the system - including all boards and backplanes.

6. **NUMBER OF BOARDS**
   - Control Console, 6 layer - 105 boards
   - No. of types, 6 layer - 41
   - Element Bay, 8 layer - 66 boards
   - No. of types, 6 layer - 6

APPENDIX B, PEPE HARDWARE FEATURES
7. COOLING TECHNIQUES

- Cooled Forced Air with Fin and Tube Heat Exchanger
- Direct Liquid Cooling

8. POWER SUPPLY

- High Frequency Power Conversion
- Output - 4300 Watts
  - 600 Amps at 5.2V
  - 600 Amps at 2.0V
- Efficiency $\geq 70\%$
- Regulation; $\pm 2\%$
- Fault Control - over/under voltage
  - over current
  - over temperature
  - air flow
- Output Rectification - Schottky Diodes
- Cooling - Direct Liquid Cooling for Schottky Diodes and Power Switches
<table>
<thead>
<tr>
<th></th>
<th>0</th>
<th>1</th>
<th>2</th>
<th>3</th>
<th>4</th>
<th>5</th>
<th>6</th>
<th>7</th>
</tr>
</thead>
<tbody>
<tr>
<td><strong>ACT</strong></td>
<td>A</td>
<td>C</td>
<td>A</td>
<td>T</td>
<td>C</td>
<td>A</td>
<td>T</td>
<td>push</td>
</tr>
<tr>
<td><strong>LDE</strong></td>
<td>L</td>
<td>D</td>
<td>E</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td><strong>LDA</strong></td>
<td>L</td>
<td>D</td>
<td>A</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td><strong>STA</strong></td>
<td>S</td>
<td>T</td>
<td>A</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td><strong>TGA</strong></td>
<td>T</td>
<td>G</td>
<td>A</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td><strong>FRO</strong></td>
<td>F</td>
<td>R</td>
<td>O</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td><strong>SEG</strong></td>
<td>S</td>
<td>E</td>
<td>G</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td><strong>SHL</strong></td>
<td>S</td>
<td>H</td>
<td>L</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td><strong>ADD</strong></td>
<td>A</td>
<td>D</td>
<td>D</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td><strong>ADI</strong></td>
<td>A</td>
<td>D</td>
<td>I</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td><strong>CPY</strong></td>
<td>C</td>
<td>P</td>
<td>Y</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

**Appendix C**  Instruction Set of the AU
Listed in the following tables are the AU instructions. First is a list ordered by mnemonic and second is a list ordered by opcode with a brief description of the operation of each instruction.

<table>
<thead>
<tr>
<th>MNEMONIC</th>
<th>Four character code accepted by the PEPE Assembler</th>
</tr>
</thead>
<tbody>
<tr>
<td>TYPE</td>
<td>Instruction class</td>
</tr>
<tr>
<td>1</td>
<td>Activity</td>
</tr>
<tr>
<td>2</td>
<td>Integer</td>
</tr>
<tr>
<td>3</td>
<td>Logical/Data Transfer</td>
</tr>
<tr>
<td>4</td>
<td>Floating Point</td>
</tr>
<tr>
<td>5</td>
<td>Output</td>
</tr>
<tr>
<td>6</td>
<td>Distributed</td>
</tr>
<tr>
<td>OPCODE</td>
<td>8 bit binary</td>
</tr>
<tr>
<td>STEPS</td>
<td>Number of 100 ns clock periods required for execution.</td>
</tr>
<tr>
<td>MNEMONIC</td>
<td>OPCODE</td>
</tr>
<tr>
<td>----------</td>
<td>--------</td>
</tr>
<tr>
<td>ACT</td>
<td>1 010</td>
</tr>
<tr>
<td>ADD</td>
<td>4 200</td>
</tr>
<tr>
<td>ADE</td>
<td>4 210</td>
</tr>
<tr>
<td>ADI</td>
<td>2 220</td>
</tr>
<tr>
<td>ANA</td>
<td>3 242</td>
</tr>
<tr>
<td>ANNT</td>
<td>3 243</td>
</tr>
<tr>
<td>ANS</td>
<td>1 252</td>
</tr>
<tr>
<td>CA</td>
<td>1 251</td>
</tr>
<tr>
<td>CACT</td>
<td>1 011</td>
</tr>
<tr>
<td>CAS</td>
<td>1 253</td>
</tr>
<tr>
<td>CF</td>
<td>1 105</td>
</tr>
<tr>
<td>COPY</td>
<td>1 250</td>
</tr>
<tr>
<td>DV</td>
<td>4 203</td>
</tr>
<tr>
<td>FIX</td>
<td>4 205</td>
</tr>
<tr>
<td>FLOT</td>
<td>4 206</td>
</tr>
<tr>
<td>LADI</td>
<td>2 225</td>
</tr>
<tr>
<td>LDA</td>
<td>3 021</td>
</tr>
<tr>
<td>LDE</td>
<td>3 026</td>
</tr>
<tr>
<td>LDEA</td>
<td>2 224</td>
</tr>
<tr>
<td>LDQ</td>
<td>3 023</td>
</tr>
<tr>
<td>LINA</td>
<td>2 223</td>
</tr>
<tr>
<td>LSBT</td>
<td>2 226</td>
</tr>
<tr>
<td>LTAG</td>
<td>3 027</td>
</tr>
<tr>
<td>ML</td>
<td>4 202</td>
</tr>
<tr>
<td>MLI</td>
<td>2 222</td>
</tr>
<tr>
<td>OCL</td>
<td>3 246</td>
</tr>
<tr>
<td>ORA</td>
<td>3 224</td>
</tr>
<tr>
<td>ORS</td>
<td>1 254</td>
</tr>
<tr>
<td>OTA</td>
<td>5 047</td>
</tr>
<tr>
<td>POP</td>
<td>1 012</td>
</tr>
<tr>
<td>PSEL</td>
<td>1 013</td>
</tr>
<tr>
<td>PUSH</td>
<td>1 014</td>
</tr>
<tr>
<td>RBIT</td>
<td>1 100</td>
</tr>
<tr>
<td>ROV</td>
<td>2 107</td>
</tr>
<tr>
<td>SB</td>
<td>4 201</td>
</tr>
<tr>
<td>SBE</td>
<td>4 211</td>
</tr>
</tbody>
</table>
## AU INSTRUCTION SET

### ACTIVITY INSTRUCTIONS

| 010 ACT | ACTIVATE | PAEACT=TAG REGISTER INPUT LOGIC |
| 011 CACT | CLEAR AND ACTIVATE | PAEACT=1 IF TAG MATCH ELSE 0 |
| 012 POP | POP STACK | PAEACT=FASTAK(0) |
| 013 PSEL | PUSH AND SELECT ON BIT | COMBINATION OF PUSH AND SELB |
| 014 PUSH | PUSH ACTIVITY STACK | FASTAK(0)=PAEACT |
| 015 ACTIVATE | | FASTAK(20-1)=FASTAK(19-0) |

### LOAD INSTRUCTIONS

| 021 LDA | LOAD A REG | PAAREG=OPERAND |
| 023 LDQ | LOAD Q REG | PAQREG=OPERAND |
| 026 LIE | LOAD A EXPONENT | PAAREG(31-24)=OPERAND(31-24) |
| 027 LTAG | LOAD TAGS | PATAGR=OPERAND(7-0) |
|          |            | PAPE=OPERAND(8) |
|          |            | PAEACT=OPERAND(10) |
|          |            | FASTAK=OPERAND(31-11) |

### STORE INSTRUCTIONS

| 031 STA | STORE A | EM(OPERAND(10-0))=PAAREG |
| 033 STQ | STORE Q | EM(OPERAND(10-0))=PAQREG |
| 037 STAG | STORE TAGS | EM(OPERAND(10-0))=REGISTERS IN THE SAME FORMAT AS LTAG |

### TRANSFER INSTRUCTIONS

| 042 TQA | TRANSFER Q TO A | PAAREG=PAQREG |
| 043 TAQ | TRANSFER A TO Q | PAQREG=PAAREG |
| 046 TIDA | TRANSFER ID TO A | PAAREG(8-0)=FE NUMBER |
| 047 OTA | OUTPUT TO A | PAOTAT=PAAREG |
| 100 RBIT | RESET BIT | EM(OPERAND(10-0){OPERAND(15-11)}=0 |
| 101 SBIT | SET BIT | EM(OPERAND(10-0){OPERAND(15-11)}=1 |
| 102 STG | SET TAG IN ACTIVE AU | PATAGR=OPERAND(7-0).AND.OPERAND(15-8).AND.PAEACT=1 |
| 103 STGA | SET TAG IN ALL AU | PATAGR=OPERAND(7-0).AND.OPERAND(15-8) |
| 104 STGI | SET TAG IN IN ACTIVE AU | PATAGR=OPERAND(7-0).AND.OPERAND(15-8).AND.PAEACT=0 |
| 105 CF | CLEAR FAULT | FAFALT=0 |
| 106 SFF | SET FAULT | FAFALT=1 IF PAEACT=1 |
| 107 ROV | RESET OVERFLOW | PAOVFF=0 |
** SELECT INSTRUCTIONS **

120 SEG SELECT ON EQUAL GLOBAL
121 SZL SELECT ON ZERO LOGICAL
122 SGZ SELECT ON GT ZERO
123 SLZ SELECT ON LT ZERO ** SET PEAECT=0 IN THOSE
124 SGE SELECT ON GE ZERO ** ACTIVE AU WHERE THE CONDITION
125 SLE SELECT ON LE ZERO ** IS NOT SATISFIED
126 SNZ SELECT ON NE ZERO
127 SZR SELECT ON EQ ZERO
140 SNOV SELECT ON NON-OVERFLOW
147 SOV SELECT ON OVERFLOW
150 SF SELECT FIRST SET PEAECT TO ZERO IN ALL
   AU EXCEPT THE ONE WITH THE LOWEST
   PE NUMBER
157 SELB SELECT ON BIT SET PEAECT TO ZERO IN ALL ACTIVE AU
   WHERE EM(OPERAND(10-0)(OPERAND(15-11))
   IS NOT EQUAL TO ZERO
164 SH SELECT HIGEST ** SET PEAECT TO ZERO IS ALL ACTIVE AU
165 SL SELECT LOWEST ** WHOSE A REGISTER CONTAIN A
   ** NON MAXIMAL/MINIMAL VALUE
166 SNZL SELECT ON NON ZERO LOGICAL
167 SNG SELECT ON NOT EQUAL GLOBAL

** ARITHMETIC INSTRUCTIONS **

171 SHAI SHIFT ARITHMETIC INTEGER
173 SHL SHIFT LOGICAL
200 ADD ADD FLOATING POINT
201 SUB SUBTRACT FLOATING POINT
202 ML MULTIPLY FLOATING POINT
203 DV DIVIDE FLOATING POINT
204 SQ SQUARE ROOT FLOATING POINT
205 FIX FIX
206 FLOT FLOAT
210 ADE ADD EXPONENT PAREG(31-24)=PAREG(31-24) PLUS
   OPERAND(31-24)
211 SBE SUBTRACT EXPONENT
215 UADI UPPER ADD INTEGER
216 USBI UPPER SUBTRACT INTEGER
220 ADI ADD INTEGER
221 SBI SUBTRACT INTEGER
222 MLI MULTIPLY INTEGER
223 LINA LOAD AND INCREMENT A
224 LDEA LOAD AND DECREMENT A
225 LADI LOWER ADD INTEGER
226 LSBI LOWER SUBTRACT INTEGER
LOGICAL INSTRUCTIONS

242 ANA LOGICAL .AND.
243 ANNT LOGICAL .ANDNOT.
244 ORA LOGICAL .OR.
245 XDA LOGICAL .XOR.
246 OCL LOGICAL .NOT.
247 TCI TWOS COMPLEMENT INTEGER

ACTIVITY INSTRUCTIONS

250 COPY COPY ACTIVITY
    STACK
    PAEACT=PASTAK(0)
251 CA COMPLEMENT
    ACTIVITY
    PAEACT .NOT. PAEACT .AND. PASTAK(0)
252 ANS AND OF STACK
    PAEACT=PAEACT .AND. PASTAK(0)
    PASTAK(20-0)=0, PASTAK(19-1)
253 CAS CLEAR ACTIVE
    STACK
    PASTAK(0)=0
254 ORS OR OF STACK
    PAEACT=PAEACT .OR. PASTAK(0)
    PASTAK(20-0)=0, PASTAK(19-1)
APPENDIX D, MECL 10K Logic Family Used in the AU
ECL 10117
Dual 2 Wide 2-3 Input
OR/AND

ECL 10121
4 Wide 3 Input
OR/AND
**FUNCTION TABLE**

<table>
<thead>
<tr>
<th>SELECT</th>
<th>OPERATING MODE</th>
</tr>
</thead>
<tbody>
<tr>
<td>S1</td>
<td>S2</td>
</tr>
<tr>
<td>L</td>
<td>L</td>
</tr>
<tr>
<td>L</td>
<td>H</td>
</tr>
<tr>
<td>H</td>
<td>L</td>
</tr>
<tr>
<td>H</td>
<td>H</td>
</tr>
</tbody>
</table>

**TRUTH TABLE**

<table>
<thead>
<tr>
<th>PULSE</th>
<th>INPITS</th>
<th>OUTPUTS*</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>L</td>
<td>L</td>
</tr>
<tr>
<td>1</td>
<td>L</td>
<td>L</td>
</tr>
<tr>
<td>2</td>
<td>L</td>
<td>H</td>
</tr>
<tr>
<td>3</td>
<td>L</td>
<td>H</td>
</tr>
<tr>
<td>4</td>
<td>L</td>
<td>X</td>
</tr>
<tr>
<td>5</td>
<td>L</td>
<td>X</td>
</tr>
<tr>
<td>6</td>
<td>L</td>
<td>X</td>
</tr>
<tr>
<td>7</td>
<td>L</td>
<td>X</td>
</tr>
<tr>
<td>8</td>
<td>H</td>
<td>X</td>
</tr>
<tr>
<td>9</td>
<td>H</td>
<td>X</td>
</tr>
<tr>
<td>10</td>
<td>H</td>
<td>X</td>
</tr>
<tr>
<td>11</td>
<td>H</td>
<td>X</td>
</tr>
<tr>
<td>12</td>
<td>H</td>
<td>X</td>
</tr>
<tr>
<td>13</td>
<td>H</td>
<td>X</td>
</tr>
<tr>
<td>14</td>
<td>H</td>
<td>X</td>
</tr>
</tbody>
</table>

---

**ECL 10141**

4 Bit Shift Register

**ECL 10135**

Dual J-K Flip-Flop
ECL 10161
1 to 8 Decoder

Enable Inputs | Outputs
<table>
<thead>
<tr>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>02 15  A4 A2 A1</td>
<td>D0 D1 D2 D3 D4 D5 D6 D7</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>L  L  L  L  L</td>
<td>H  H  H  H  H  H  H  H</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>L  L  L  H  H</td>
<td>H  L  H  H  H  H  H  H</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>L  L  H  H  H</td>
<td>H  H  H  H  H  H  H  L</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>L  L  φ  φ  φ</td>
<td>H  H  H  H  H  H  H  H</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
<tr>
<td>L  φ  φ  φ  φ</td>
<td>H  H  H  H  H  H  H  H</td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

φ = dont care

ECL 10164
8 Input Multiplexer

Enable Inputs | Output
<table>
<thead>
<tr>
<th></th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>02  A4 A2 A1</td>
<td>Z</td>
</tr>
<tr>
<td>L  L  L</td>
<td>x0</td>
</tr>
<tr>
<td>L  L  H</td>
<td>x1</td>
</tr>
<tr>
<td>L  H  H</td>
<td>x2</td>
</tr>
<tr>
<td>L  φ  φ</td>
<td>φ</td>
</tr>
</tbody>
</table>

190
ECL 10165
3 Input Priority Encoder

Data Inputs

<table>
<thead>
<tr>
<th>D0</th>
<th>D1</th>
<th>D2</th>
<th>D3</th>
<th>D4</th>
<th>D5</th>
<th>D6</th>
<th>D7</th>
</tr>
</thead>
<tbody>
<tr>
<td>H</td>
<td>Ø</td>
<td>Ø</td>
<td>Ø</td>
<td>Ø</td>
<td>Ø</td>
<td>Ø</td>
<td>Ø</td>
</tr>
<tr>
<td>L</td>
<td>H</td>
<td>Ø</td>
<td>Ø</td>
<td>Ø</td>
<td>Ø</td>
<td>Ø</td>
<td>Ø</td>
</tr>
<tr>
<td>L</td>
<td>L</td>
<td>H</td>
<td>Ø</td>
<td>Ø</td>
<td>Ø</td>
<td>Ø</td>
<td>Ø</td>
</tr>
<tr>
<td>L</td>
<td>L</td>
<td>L</td>
<td>L</td>
<td>L</td>
<td>L</td>
<td>L</td>
<td>H</td>
</tr>
<tr>
<td>L</td>
<td>L</td>
<td>L</td>
<td>L</td>
<td>L</td>
<td>L</td>
<td>L</td>
<td>L</td>
</tr>
</tbody>
</table>

Outputs

<table>
<thead>
<tr>
<th>Q3</th>
<th>Q2</th>
<th>Q1</th>
<th>Q0</th>
</tr>
</thead>
<tbody>
<tr>
<td>H</td>
<td>L</td>
<td>L</td>
<td>L</td>
</tr>
<tr>
<td>H</td>
<td>L</td>
<td>L</td>
<td>H</td>
</tr>
<tr>
<td>H</td>
<td>L</td>
<td>H</td>
<td>L</td>
</tr>
<tr>
<td>H</td>
<td>H</td>
<td>H</td>
<td>H</td>
</tr>
<tr>
<td>L</td>
<td>L</td>
<td>L</td>
<td>L</td>
</tr>
</tbody>
</table>

Ø = don't care
Quad 2 Input Multiplexer

ECL 10173

Quad 4 Input Multiplexer

ECL 10174
ECL 10176
Hex D Flip-Flops

ECL 10176
Look Ahead Carry Block
ECL 10180
Dual Full Adders

Function Select Table

<table>
<thead>
<tr>
<th>SA</th>
<th>SB</th>
<th>FUNCTION</th>
</tr>
</thead>
<tbody>
<tr>
<td>H</td>
<td>H</td>
<td>S=A plus B</td>
</tr>
<tr>
<td>H</td>
<td>L</td>
<td>S=A minus B</td>
</tr>
<tr>
<td>L</td>
<td>H</td>
<td>S=B minus A</td>
</tr>
<tr>
<td>L</td>
<td>L</td>
<td>S=0 minus A minus B</td>
</tr>
</tbody>
</table>

194
### Operations of Interest

<table>
<thead>
<tr>
<th>Select S3 S2 S1 S0</th>
<th>Mode M</th>
<th>Output</th>
</tr>
</thead>
<tbody>
<tr>
<td>L H H L L</td>
<td>L</td>
<td>A plus B plus CN (add)</td>
</tr>
<tr>
<td>H H L H L</td>
<td>L</td>
<td>A plus B plus CN (subtract)</td>
</tr>
<tr>
<td>L H H L H</td>
<td>H</td>
<td>A exclusive OR B</td>
</tr>
<tr>
<td>H L H H H</td>
<td>H</td>
<td>A OR B</td>
</tr>
<tr>
<td>H H L H H</td>
<td>H</td>
<td>A AND B</td>
</tr>
<tr>
<td>H H H L H</td>
<td>H</td>
<td>A AND B</td>
</tr>
</tbody>
</table>

**ECL 10131**

4 Bit Arithmetic Logic Unit
<table>
<thead>
<tr>
<th>Signal Mnemonic</th>
<th>Definition</th>
</tr>
</thead>
<tbody>
<tr>
<td>PAALO</td>
<td>Alignment Network Control</td>
</tr>
<tr>
<td>PAADEA</td>
<td>A Input to Exponent Adder</td>
</tr>
<tr>
<td>PADEEB</td>
<td>B Input to Exponent Adder</td>
</tr>
<tr>
<td>PAADMA</td>
<td>A Input to Mantissa Adder</td>
</tr>
<tr>
<td>PAADMB</td>
<td>B Input to Mantissa Adder</td>
</tr>
<tr>
<td>PAAALFA</td>
<td>Search: ALPHA Flip-Flop</td>
</tr>
<tr>
<td>PAAALNA</td>
<td>Alignment Network Output of Stage One</td>
</tr>
<tr>
<td>PAAALNB</td>
<td>Alignment Network Output of Stage Two</td>
</tr>
<tr>
<td>PAAALNC</td>
<td>Alignment Network Output of Stage Three</td>
</tr>
<tr>
<td>PAAALNI</td>
<td>Input to Alignment Network</td>
</tr>
<tr>
<td>PAAALNO</td>
<td>Output of Alignment Network</td>
</tr>
<tr>
<td>PAAOUT</td>
<td>ALU Output</td>
</tr>
<tr>
<td>PAAREG</td>
<td>A Register</td>
</tr>
<tr>
<td>PAAXB</td>
<td>Local Control - Extension Adder B Input</td>
</tr>
<tr>
<td>PABETA</td>
<td>Search: BETA Flip-Flop</td>
</tr>
<tr>
<td>PABEXT</td>
<td>B Extension Flip-Flop</td>
</tr>
<tr>
<td>PABITA</td>
<td></td>
</tr>
<tr>
<td>PABITB</td>
<td>Search: Decoded Output</td>
</tr>
<tr>
<td>PABITC</td>
<td></td>
</tr>
<tr>
<td>PABITX</td>
<td></td>
</tr>
<tr>
<td>PABITY</td>
<td>Search: Input to AU</td>
</tr>
<tr>
<td>PABITZ</td>
<td></td>
</tr>
<tr>
<td>PABREG</td>
<td>B Register</td>
</tr>
</tbody>
</table>

APPENDIX E, Signal Name Glossary
<table>
<thead>
<tr>
<th>Signal Mnemonic</th>
<th>Definition</th>
</tr>
</thead>
<tbody>
<tr>
<td>PACIN</td>
<td>Carry Input to Mantissa Adder</td>
</tr>
<tr>
<td>PACOFF23</td>
<td>Carry Output Flip-Flop</td>
</tr>
<tr>
<td>PACOUT23</td>
<td>Carry Out of Position 23</td>
</tr>
<tr>
<td>PACOUT22</td>
<td>Carry Out of Position 22</td>
</tr>
<tr>
<td>PACOXT</td>
<td>Carry Out of Extension Adder</td>
</tr>
<tr>
<td>PADAIN</td>
<td>Global Address to Element Memory</td>
</tr>
<tr>
<td>PADPC</td>
<td>Double Precision Carry Flip-Flop</td>
</tr>
<tr>
<td>PADPIN</td>
<td>Local Control for DPC Input</td>
</tr>
<tr>
<td>PADSEL</td>
<td>Global Control EM Bus Enable</td>
</tr>
<tr>
<td>PAEACT</td>
<td>Element Activity Flip-Flop</td>
</tr>
<tr>
<td>PAEXOF</td>
<td>Exponent Adder Add Overflow</td>
</tr>
<tr>
<td>PAEXUF</td>
<td>Exponent Adder Add Underflow</td>
</tr>
<tr>
<td>PAGEN</td>
<td>Global Clock Control for the AU</td>
</tr>
<tr>
<td>PAMLOF</td>
<td>Detection of All Ones or Zeros from Mantissa Adder</td>
</tr>
<tr>
<td>PAMOBR</td>
<td>PICU Operand Buffer Register</td>
</tr>
<tr>
<td>PAMOF</td>
<td>Mantissa Overflow</td>
</tr>
<tr>
<td>PAMORL</td>
<td>Global Data/Control Inputs</td>
</tr>
<tr>
<td>PAMOVB</td>
<td>Local Control for B Register(23,22)</td>
</tr>
<tr>
<td>PANRMD</td>
<td>Normalize Decode Network Output</td>
</tr>
<tr>
<td>PAODAT</td>
<td>Output Data Bus</td>
</tr>
<tr>
<td>PAOVFF</td>
<td>Overflow Flip-Flop</td>
</tr>
<tr>
<td>PASEL</td>
<td>Local Control for Q Extension Bit</td>
</tr>
<tr>
<td>PASBOF</td>
<td>Exponent Adder Subtract Overflow</td>
</tr>
</tbody>
</table>