GATE | PSUs

COMPUTER SCIENCE & INFORMATION TECHNOLOGY

Computer Organization

Text Book: Theory with worked out Examples and Practice Questions
1. Computer Arithmetic

01. Ans: (b)
Sol: 
\[
\begin{array}{cccccccc}
128 & 64 & 32 & 16 & 8 & 4 & 2 & 1 \\
1 & 1 & 0 & 0 & 0 & 1 & 1 & 1 \\
0 & -1 & 0 & 0 & +1 & 0 & 0 & -1
\end{array}
\]

02. Ans: (d)
Sol: Sign extension is used for converting smaller size signed data to larger size by padding the sign bit to left.

03. Ans: (b)
Sol: In given data EX-OR gate is not available so for one EX-OR gate 2 level network is needed because complemented and un-complemented inputs are available.
So, total delay = 2 + 2 + 2 = 6
In the above, first 2 is for first EX-OR gate
Second 2 is for CLG
Third 2 is for 2\textsuperscript{nd} EX-OR gate

04. Ans (d)
Sol: In non normalized form, \(0.239 \times 2^{13}\) is
\[
0.00111101_2 \times 2^{13} \\
e = 13, \ b = 64, \ so \ E = 77 = 1001101 \\
M = 00111101
\]
\[
0100110100111101 = 4D3D
\]

05. Ans: (d)
Sol: Implicit Normalized form is \(1.11101 \times 2^{10}\)
\[
M = 11101000, \ e = 10 \ b = 64 \\
E = 74 = 1001010 \\
0100 1010 11101 000 = 4AE8
\]

06. Ans: (b)
Sol: Four number of carries are needed to design CLG
For C1, one AND gate + one OR gate
For C2, two AND gates + one OR gate
For C3, three AND gates + one OR gate
For C4, four AND gates + one OR gate are required
So total ten AND gates are needed and four OR gates are needed

07. Ans (a)
Sol: \(S = 1, \ e = \) Original Exponent
\[
E = \) Biased Exponent, \( b = \) biasing amount
\[
14.25 = 1110.01 \times 2^0 = 1.11001 \times 2^3 \\
e = 3, \ E = 3 + 127 = 130 \\
130 = 10000010 \\
M = 1100100…0 (23 bits)
\]

---

ACE Engineering Publications
Hyderabad • Delhi • Bhopal • Pune • Bhubaneswar • Lucknow • Patna • Bengaluru • Chennai • Vijayawada • Vizag • Tirupati • Kolkata • Ahmedabad
2. Memory Organization

Memory Basics

01. Ans: (d)
Sol: A memory has 16 bit Address i.e. it’s
    Starting Address = 0000H
    Ending Address = FFFFH
    ∴ Maximum number of memory locations it
    can address is $2^{16} = 64K$
    = $64 \times 1024$
    = 65536.

02. Ans: (d)
Sol: Number of memory locations × size of one
    location.

03. Ans: (c)
Sol: DRAM needs refresh logic CKT to avoid
    the discharge of capacitor but capacitor is
    not needed in SRAM.

04. Ans: (b)
Sol: $2048 \times 8 = 2^{11} \times 8$
    ∴ Number of Address lines = 11

05. Ans: (a)
Sol: Number of chips needed
    $= \frac{\text{Target size}}{\text{Basic size}}$
    $= \frac{64 \times 8}{16 \times 4}$
    $= 8$

06. Ans: (c)
Sol: $T = 200 \text{ ns}$
    ∴ word frequency
    $= \frac{1}{200 \times 10^{-9}} = 5 \times 10^6 \text{ words/sec}$

07. Ans: (a)
Sol: ROM is used for function table with large
    size because after program; ROM content is
    not possible to destroy and design cost is
    cheap.

08. Ans: (c)
Sol: Remain same because it is non-volatile
    memory

Cache Concept

01. Ans: (a)
Sol: Physical Address = 32 bits
    Cache size = 256 KB = $2^{18}$ B
    Associativity = 4
    Block size = 16 B
    Number of cache blocks = $2^{18}$ B/$2^4$B
    = $2^{14} = 16 \text{ K}$
    Number of cache sets = $\frac{2^{14}}{4} = 2^{12}$

02. Ans: (c)
Sol: Number of tag comparators needed
    = Associativity = 4
    Size of each comparator
    = number of tag bits = 16
03. Ans: (c)
   Sol: Memory size = $4 \times 16 = 2^{12} \times 16$
   \[ \therefore \text{Number of Address lines} = 12 \]
   \[ \text{Data lines} = 16 \]

04. Ans: (b)
   Sol: Number of cache blocks = $2c$
   \[ \text{Associativity} = 2 \]
   Mapping expression is $K \mod S$
   Where, $K = M.M$ block Number and
   $S = \text{Number of cache sets}$
   Number of Cache sets
   \[ = \frac{\text{Number of cache blocks}}{\text{Associativity}} \]
   \[ = \frac{2c}{2} = c \]
   \[ \therefore K \mod c \]

05. Ans: 31
   Sol: Word size = 16 bit,
   So memory size = $2^{31}$ words $= 2^{32}$ bytes
   \[ = 4 \text{ GB} \]
   \[ \therefore 31 \text{ address bits are needed.} \]

06. Ans: (d)
   Sol: Direct-Mapping
   Total Blocks=256
   Number of Tag bits = 19
   Tag Directory size = $((19+1)+1) \times 256$
   \[ = 5376 \]

07. Ans: (c)
   Sol: Number of cache blocks $= \frac{256 \text{ KB}}{32 \text{ B}} = 2^{13}$
   \[ \text{Associativity} = 4 \]
   Number of Sets $= \frac{2^{13}}{4} = 2^{11}$
   \[ \therefore \text{Tag size} = 32 – 11 – 5 = 16. \]

08. Ans: (a)
   Sol: Number of blocks $= 256 \text{ KB}/32\text{B} = 8 \text{ K}$
   Number of Sets with 4-way set-associative
   \[ = 8 \text{ K}/4 = 2 \text{ K} \]
   \[ 8 \text{ K}(16+1+2+1)-\text{bits} = 160 \text{ K-bits} \]

09. Ans: 20
   Sol: Associativity = 4
   Cache Size = 16 KB
   Block size = 16 bytes $= 32$ bytes
   Word size = 32 bits
   \[ \therefore \text{Number of cache blocks} \]
   \[ = \frac{16\text{KB}}{32\text{B}} = \frac{2^{14}\text{B}}{2^5\text{B}} = 2^9 \]
   \[ \therefore \text{Number of cache sets} = \frac{2^9}{2^7} = 2^2 \]
   \[ \therefore \text{Block size} = 8 \times 4 \text{ Bytes} = 32 \text{ bytes} = 2^5 \text{ B} \]
   Physical Address size = 32 bits
   Physical Address format:

<table>
<thead>
<tr>
<th>Tag Offset</th>
<th>Set Offset</th>
<th>Byte Offset</th>
</tr>
</thead>
<tbody>
<tr>
<td>20</td>
<td>7</td>
<td>5</td>
</tr>
</tbody>
</table>

10. Ans: (d)
    Sol: If associativity is doubled, then number of
    tag bits will be increased and set offset size
    is reduced and size of MUX is directly
    proportional to associativity only Physical
    address size and Data bus size are not
    altered.

11. Ans: (a)
    Sol: First cache organization: 32 KB with 2-way
    set associative cache.
The size of address = 32 bits.

<table>
<thead>
<tr>
<th>Tag</th>
<th>Set offset</th>
<th>Word offset</th>
</tr>
</thead>
<tbody>
<tr>
<td>(18-bits)</td>
<td>(9-bits)</td>
<td>(5-bits)</td>
</tr>
</tbody>
</table>

Multiplexer latency = \(0.6\) ns and \(k\)-bit comparator latency is \((K/10)\) ns
The hit latency of this cache = \(0.6 + (18/10)\) = \(2.4\) ns.

12. **Ans:** (d)

**Sol:** Second cache organization: 32 KB with direct mapped cache.
The size of address = 32 bits.

<table>
<thead>
<tr>
<th>Tag</th>
<th>Set offset</th>
<th>Word offset</th>
</tr>
</thead>
<tbody>
<tr>
<td>(17-bits)</td>
<td>(10-bits)</td>
<td>(5-bits)</td>
</tr>
</tbody>
</table>

Only one TAG comparator & no multiplexer
The hit latency of this cache = \((17/10)\) = \(1.7\) ns.

13. **Ans:** (b)

**Sol:**

<table>
<thead>
<tr>
<th>Tag</th>
<th>set</th>
<th>word</th>
</tr>
</thead>
<tbody>
<tr>
<td>6</td>
<td>6</td>
<td>4</td>
</tr>
<tr>
<td></td>
<td>16</td>
<td></td>
</tr>
</tbody>
</table>

\(\therefore\) Number of tag bits = 6

14. **Ans:** (d)

**Sol:** The cache consists of 4 sets with each set consists of 4 blocks.
Set 0 contains the blocks with main memory block / 4 \(= 0\)
Set 1 contains the blocks with main memory block / 4 \(= 1\)
Set 2 contains the blocks with main memory block / 4 \(= 2\)
Set 3 contains the blocks with main memory block / 4 \(= 3\)
Set 0: \(\emptyset, 8, 2, 6, 48, 32, 92\)
Set 1: \(1, 129, 73\)
Set 2:
Set 3: \(\emptyset, 3, 159, 63, 155\)
So 216 will not be in cache if LRU is used.

15. **Ans:** (b)

**Sol:** Physical Address formats for the given Block number and Tag number 6 are

<table>
<thead>
<tr>
<th>Tag</th>
<th>Block word</th>
<th>word Ranges</th>
</tr>
</thead>
<tbody>
<tr>
<td>10</td>
<td>00</td>
<td>xxxx → 128 to 147</td>
</tr>
<tr>
<td>10</td>
<td>01</td>
<td>xxxx → 144 to 159</td>
</tr>
<tr>
<td>00</td>
<td>10</td>
<td>xxxx → 32 to 47</td>
</tr>
<tr>
<td>01</td>
<td>11</td>
<td>xxxx → 112 to 127</td>
</tr>
</tbody>
</table>

\(128, 64, 32, 16, 8, 4, 2, 1\)

\(\therefore\) 150 and 132 are available

16. **Ans:** (b)

**Sol:** Cache accepts only 8 blocks and it uses LRU.

\[H = \text{Hit}, M = \text{Miss Block arrival Address}\]

<table>
<thead>
<tr>
<th>Block number</th>
<th>Hit, Miss</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>45</td>
</tr>
<tr>
<td>1</td>
<td>22</td>
</tr>
<tr>
<td>2</td>
<td>25</td>
</tr>
<tr>
<td>3</td>
<td>8</td>
</tr>
<tr>
<td>4</td>
<td>3</td>
</tr>
<tr>
<td>5</td>
<td>7</td>
</tr>
<tr>
<td>6</td>
<td>16</td>
</tr>
<tr>
<td>7</td>
<td>35</td>
</tr>
</tbody>
</table>

Cache Block number 5 consists 7
17. Ans: (c)
Sol: All blocks are mapped to set 0 only, but each set permits only 2 blocks

<p>| | | | | |</p>
<table>
<thead>
<tr>
<th></th>
<th></th>
<th></th>
<th></th>
<th></th>
</tr>
</thead>
<tbody>
<tr>
<td>M</td>
<td>M</td>
<td>M</td>
<td>H</td>
<td>M</td>
</tr>
</tbody>
</table>

Total Number of misses = 4

18. Ans: 14
Sol: \[ HC + (1 - H)M = (0.8 \times 5) + (0.2 \times 50) = 14 \text{ ns} \]

19. Ans: (a)
Sol:

<table>
<thead>
<tr>
<th>Tag</th>
<th>Line offset</th>
<th>Word offset</th>
</tr>
</thead>
<tbody>
<tr>
<td>1110</td>
<td>00100000000</td>
<td>1111</td>
</tr>
</tbody>
</table>

Tag = \( E_{16} \)  line = \( 201_{16} \)

20. Ans: (c)
Sol: The cache and main memory are divided into blocks of 64 bytes each. The direct mapped cache consists of 32 blocks (mod block \( i/32 \)). The array is stored from main memory locations \( 1100_{H} \). The array is placed in MM from \( 68 \) Block onwards. The total array consists of 2500 bytes, so they require a total of 40 blocks. In the cache all the 32 blocks are filled and the remaining 8 blocks are replacing the previous blocks. A total of 40 data misses will occur during first access. During the second access once again the 16 blocks are replaced for conflict misses, so 16 cache misses occur.

Total numbers of cache misses = 40 + 16 = 56

21. Ans: (a)
Sol: The lines 4 to 11 gets conflict misses frequently.

22. Ans: 24
Sol:

<table>
<thead>
<tr>
<th>Tag</th>
<th>2^{19} Bytes</th>
</tr>
</thead>
<tbody>
<tr>
<td>Final Tag</td>
<td>40</td>
</tr>
<tr>
<td>Tag</td>
<td>3</td>
</tr>
<tr>
<td>21</td>
<td>40</td>
</tr>
</tbody>
</table>

It uses 8 way set associative
\( \therefore \) Tag size = 24 bits.

23. Ans: 76
Sol: When a block is referred first time in cache memory, it is known as compulsory miss and it will be loaded in the cache memory, when a same block is referred in future i.e. 2\text{nd} time onwards; then it is considered as conflict miss.

For first iteration, 4 conflict misses and for each iteration from 2\text{nd} time onwards 8 conflict misses occur.

Total number of conflict misses occur for 10 times = \( 4 + (8 \times 9) = 76. \)

Conflict misses in first iteration

0, 128, 256, 128, (0), 128, (256), 128, 1, 129, 257, 129, (1), 129, (257), 129.

Total 4 times
Conflict misses in second time iteration
(0), 128, (256), 128, (0), 128, (256), 128, (1), 129, (257), 129, (1), 129, (257), 129,
Total 8 times

24. Ans: 59
Sol: Number of rows in a chip = $2^{14} = 16384$
One refresh time = $50 \times 10^{-9}$ sec
Chip refresh time = $16384 \times 50 \times 10^{-9}$ sec
= $819200 \times 10^{-9}$ sec
= 0.8192 msec
Given refresh period = 2 msec
Amount of time required for RD/WR operation
= $2 - 0.8192$
= 1.1808 msec
Amount of time used for RD/WR operation in percentage
= $\frac{1.1808 \text{msec}}{2 \text{msec}} \times 100 = 59.04$
∴ Closest integer value = 59

25. Ans: (b)
Sol: Number of bits required for addressing a byte in physical memory = P
Number of bits required for addressing a byte in cache = N
In Direct Map, Tag field size = P − N
In K-way Block Set Associative mapping, Tag field size = (P − N) + log₂ K

3. Pipeline Organization

01. Ans: (c)
Sol: Max. stage delay = 160 ns
Buffer delay = 5 ns
Pipeline clock = 165 ns
$T_{1000} = (K + n - 1) T_p$ clock
= $(4 + 999) \times 165 = \left( \frac{165495}{1000} \right) \text{µs}$
= 165.5 µs

02. Ans: (d)
Sol: 1. The $(j+1)^{th}$ instruction uses the result of the $j^{th}$ instruction as an operand, comes under data dependency and it causes data hazard. (RAW).
2. The execution of a conditional jump instruction comes under conditional dependency and it causes control hazard.
3. The $j^{th}$ and $(j + 1)$ instruction require the ALU at the same time comes under structural hazard. (WAR).

03. Ans: (c)

04. Ans: (c)
Sol: 2 Stall cycles
CPU clock frequency = 1GHz
Out of $10^9$ instructions 20% of instructions are branch instructions, which requires 3 clock cycles. The remaining 80% instructions require only one clock pulse for their completion.
Total execution time
= $10^9 \times (80/100) \times 10^{-9} + 10^9 \times (20/100) \times 3 \times 10^{-9}$
= 0.8 + 0.6 = 1.4 sec
05. Ans: 33.33
Sol: Old pipeline maximum delay = 800 ns
New pipeline maximum delay = 600 ns
\[ \frac{800}{600} = \frac{4}{3} \]
Increasing throughput = \( \frac{4 - \frac{3}{3}}{3} = 33.33\% \)

06. Ans: (b)
Sol:

<table>
<thead>
<tr>
<th>MUL</th>
<th>IF</th>
<th>ID</th>
<th>OF</th>
<th>PO</th>
<th>WO</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>R_0, R_1</td>
<td>R_0 \times R_1</td>
<td>R_2</td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>DIV</th>
<th>IF</th>
<th>ID</th>
<th>OF</th>
<th>PO</th>
<th>R_3/R_4</th>
<th>WO</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>R_3, R_4</td>
<td>R_5</td>
<td></td>
<td></td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>ADD</th>
<th>IF</th>
<th>ID</th>
<th>OF</th>
<th>R_2</th>
<th>PO</th>
<th>WO</th>
</tr>
</thead>
</table>

<table>
<thead>
<tr>
<th>SUB</th>
<th>IF</th>
<th>ID</th>
<th>OF</th>
<th>R_2</th>
<th>PO</th>
<th>WO</th>
</tr>
</thead>
</table>

07. Ans: (b)
Sol: Non-pipelined system delay = 30 ns
Max. Pipeline delay = 12 ns
\[ S = \frac{30}{12} = 2.5 \]

08. Ans: (b)
Sol: Pipeline clock = Max (stage delay + Overhead)
\[ = \text{Max} (5, 7, 10, 8, 6) + 1 = 11 \text{ns} \]
CPU gets target address after completion of branch instruction in EX stage only.
\[ (n+k-1) \times 11 \text{ns} + \text{stall delay (3)} \]
\[ = (8 + 5 - 1) \times 11 \text{ns} + 3 \times 11 \text{ns} \]
\[ = 165 \text{ns} \]

09. Ans: (d)
Sol: Number of stages = 5
One stage delay = 2 ns
- While executing more number of instructions only one stage delay is sufficient for executing one instruction when there is no Hazard.
- Number of Non Hazard instructions = 80%
\[ \therefore \text{it's Average time} = 0.8 \times 2 \text{ ns} = 1.6 \text{ ns} \]
For executing one Hazard instruction it takes all stage delays i.e., 10 ns.
\[ \therefore \text{It's average time is} 0.2 \times 10 \text{ ns} = 2 \text{ ns} \]
Average instruction time
\[ = 1.6 \text{ ns} + 2 \text{ ns} = 3.6 \text{ ns} \]
10. Ans: (c)
Sol: Let total number of instructions = 100
Non branch time = 80 × 2 = 160 ns
Total Branch instructions = 20% = 20
In 20; 80% are conditional and remaining 20% are unconditional 20 × 20 % = 4.
It’s time = 4 × 10 ns = 40 ns
Time needed for 50% instructions, Branch taken = \( \frac{20 - 4}{2} = 8 \times 2 \) ns = 16
Time needed for 50% instructions, Branch not taken = 8 × 10 ns = 80 ns
Total time = 160 + 40 + 80 + 16 = 296 ns
∴ Average time = 2.96 ns.

11. Ans: (c)
Sol: Efficiency = \( \frac{S}{K} \)
Where S = speed up, K = number of stages
\( K = \frac{S}{\text{efficiency}} \)
\( = \frac{6.6 \div 7.5}{0.88} \)
So, minimum 8 stages are needed

12. Ans: (c)

13. Ans: 3.2
Sol: Non-pipeline CPU frequency = 2.5 GHz
\[ T = 0.4 \text{ ns} \]
∴ One instruction time = 4 × 0.4 ns
\( t_n = 1.6 \) ns
Pipeline CPU frequency = 2 GHz
\[ T = 0.5 \text{ ns} = t_p \]

Sol: Only one clock cycle time is sufficient to execute one instruction.
\[ S = \frac{t_n}{t_p} \]
\[ = \frac{1.6 \text{ ns}}{0.5 \text{ ns}} = 3.2 \]

15. Ans: (b)
Sol: It is also known as W.A.R Hazard.
Anti-dependence Hazard creates Hazard (i.e. needs stall) when a low latency instruction is completed before a longer latency instruction that appears earlier in the program only BUT NOT ALWAYS.

16. Ans: 4
Sol: For ‘n’ number of instructions
\[ t_n = 6 \times n \text{ clks} \ (k = 6). \]
Highest speed up k = 6 if there is no stall cycle and all stage delays are equal but 25% of instructions need 2 stalls.
\[ t_p = (0.75 n \times 1) + (0.25 n \times (2 + 1)) =1.5 n \text{ clks}. \]
∴ \[ S = \frac{t_n}{t_p} = \frac{6 n \text{ clks}}{1.5 n \text{ clks}} = 4 \]
17. Ans: (c)  
Sol: \( f \propto \frac{1}{T} \)  
Minimum clock time gives highest clock frequency for the given pipelined processor.  
For P1 Largest clock time is 2 ns.  
For P2 Largest clock time is 1.5 ns.  
For P3 Largest clock time is 1 ns.  
For P4 Largest clock time is 1.1 ns.  
So, P3 gives highest peak clock frequency. 

18. Ans: 1.54  
Sol: ‘P’ has 5 stages  

<table>
<thead>
<tr>
<th>IF</th>
<th>ID/RF</th>
<th>EX</th>
<th>MEM</th>
<th>WB</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>2.2</td>
<td>2</td>
<td>1</td>
<td>0.75 ns</td>
</tr>
</tbody>
</table>

‘Q’ has 8 stages  

<table>
<thead>
<tr>
<th>IF</th>
<th>ID/RF1</th>
<th>RF2</th>
<th>EX1</th>
<th>EX2</th>
<th>MEM</th>
<th>WB</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>2.2/3</td>
<td>2.2</td>
<td>2.2</td>
<td>1</td>
<td>1</td>
<td>1</td>
</tr>
</tbody>
</table>

P - highest clock cycle time = 2.2 ns.  
Q - highest clock cycle time = 1 ns.  
In ‘P’ pipeline new instruction fetching is stopped for 2 stage delays  
Where in ‘Q’ pipeline new instruction fetching is stopped for 5 stage delays  
Number of branch instructions = 20%.  
\( \therefore \) ‘P’ total time is 
\( (0.8 \times 2.2 \text{ ns})+(0.2 \times (2 +1))2.2 \text{ ns} = 3.08 \text{ ns} \)  
‘Q’ total time is 
\( 0.8 \times 1 \text{ ns}+(0.2\times (5+1))\times1 \text{ ns} = 2 \text{ ns} \)  
\( \therefore \) \( \frac{P}{Q} = \frac{3.08 \text{ ns}}{2 \text{ ns}} = 1.54 \).

19. Ans: (b)  
Sol: For D1 processor, maximum \( T_{seg} = 4 \text{ ns} \),  
\( n = 100, k = 5 \)  
Time = 104 \times 4 \text{ ns} = 416 \text{ ns}  

For D2 processor,  
\( n = 100, k = 8, T_{seg} = 2 \text{ ns} \)  
Time = 107 \times 2 \text{ ns} = 214 \text{ ns}  
Hence, 202 ns time will be saved 

20. Ans: (b)  
Sol: \( t_n = 12 \text{ ns}, \) maximum \( T_{seg} = 6 \text{ ns} \)  
\( \therefore S = \frac{t_n}{t_p} = 2 \)

21. Ans: 1.51  
Sol: For Naive pipelined CPU  
\( K = 5, T_{seg} = 20 + 2 = 22 \text{ ns}, n = 20. \)  
Total time needed for 20 instructions  
\((5 + 20 – 1) \times 22 \text{ ns} = 24 \times 22 \text{ ns} = 528 \text{ ns} \)

For Efficient pipelined processor  
\( T_{seg} = 12 + 2 = 14 \text{ ns}; k = 6, n = 20 \)  
Total time for 20 instructions  
\((6 + 20 – 1) \times 14 \text{ ns} = 350 \text{ ns} \)
Speed up = \( \frac{t_n}{t_e} = \frac{528}{350} \)  
\[ = 1.50857 \]  
\( \approx 1.51 \)
22. Ans: 219
Sol: Number of stages = 5(IF, ID, OF, PO, WR),
k = 5
n = 100, except PO, all stages take one clock
In P.O stage
40 instructions take 3 clocks
35 instructions take 2 clocks
and remaining 25 instructions take 1 clk
If all instructions requires one clock in all
stages, total clocks required = (k + n – 1)
= 5 + 100 – 1 = 104
But, 40 instructions requires 3 clocks each
i.e. 40 instructions execution requires '80' more clocks and 35 instructions requires 2
clocks i.e. 35 instructions required 35 more
clocks.
So, total number of clocks required
= 104 + 80 + 35 = 219

4. CPU Organization

01. Ans: (b)
Sol: The Value in register A is rotated through	right 8 times. During each rotation
operation, if carry flag is set the value of
register B is incremented. After 8 rotations
B register contains the number of 1’s in
register A.

02. Ans: (a)
Sol: Extending the previous question, if the
contents of register A is rotated right once
again, and then register A will retain its
value. Therefore the instruction at X will be
RRC A, #1.

03. Ans: (d)
Sol: The given program is:

<table>
<thead>
<tr>
<th>Instruction</th>
<th>Operation</th>
<th>Instruction Size (No. of words)</th>
</tr>
</thead>
<tbody>
<tr>
<td>MOV R1,(3000)</td>
<td>R1←M[3000]</td>
<td>2</td>
</tr>
<tr>
<td>LOOP:</td>
<td></td>
<td></td>
</tr>
<tr>
<td>MOV R2,(R3)</td>
<td>R2←M[R3]</td>
<td>1</td>
</tr>
<tr>
<td>ADD R2, R1</td>
<td>R2←R2+R1</td>
<td>1</td>
</tr>
<tr>
<td>MOV(R3), R2</td>
<td>M[R3]←R2</td>
<td>1</td>
</tr>
<tr>
<td>INC R3</td>
<td>R3←R3+1</td>
<td>1</td>
</tr>
<tr>
<td>DEC R1</td>
<td>R1←R1–1</td>
<td>1</td>
</tr>
<tr>
<td>BNZ LOOP</td>
<td>Branch on not zero</td>
<td>2</td>
</tr>
<tr>
<td>HALT</td>
<td>Stop</td>
<td>1</td>
</tr>
</tbody>
</table>

Let the data at memory 3000 is 10.
The contents of R3 are 2000.
The content of memory locations from 2000
to 2010 is 100.
The number of memory references for
accessing the data is

<table>
<thead>
<tr>
<th>Instruction</th>
<th>Operation</th>
<th>No. of memory references</th>
</tr>
</thead>
<tbody>
<tr>
<td>MOV R1,(3000)</td>
<td>R1←M[3000]</td>
<td>1</td>
</tr>
<tr>
<td>MOV R2, (R3)</td>
<td>R2←M[R3]</td>
<td>10(Loop is repeated 10 times)</td>
</tr>
<tr>
<td>MOV (R3), R2</td>
<td>M[R3]←R2</td>
<td>10 (loop is repeated 10 times)</td>
</tr>
</tbody>
</table>

Total number of memory references are: 21

04. Ans: (a)
Sol: As the memory locations are incremented 10
times from 2000 to 2009, when the loop is
terminated R3 consists of 2010, whose value
will be 100(previous value) only.
05. Ans: (c)  
Sol: The program is loaded from memory location 1000 onwards. The word size is 32 bits and the memory is byte addressable.

<table>
<thead>
<tr>
<th>Address</th>
<th>Instruction</th>
<th>Word size</th>
</tr>
</thead>
<tbody>
<tr>
<td>1000 to 1007</td>
<td>MOV R1, (3000) R1←M[3000]</td>
<td>2</td>
</tr>
<tr>
<td>1008 to 1011</td>
<td>LOOP: MOV R2, (R3) R2←M[R3]</td>
<td>1</td>
</tr>
<tr>
<td>1012 to 1015</td>
<td>ADD R2, R1 R2←R2+R1</td>
<td>1</td>
</tr>
<tr>
<td>1016 to 1019</td>
<td>MOV(R3),R2 M[R3]←R2</td>
<td>1</td>
</tr>
<tr>
<td>1020 to 1023</td>
<td>INC R3 R3←R3+1</td>
<td>1</td>
</tr>
<tr>
<td>1024 to 1027</td>
<td>DEC R1 R1←R1-1</td>
<td>1</td>
</tr>
<tr>
<td>1028 to 1035</td>
<td>BNZ LOOP Branch on not zero</td>
<td>2</td>
</tr>
<tr>
<td>1036 to 1039</td>
<td>HALT Stop</td>
<td>1</td>
</tr>
</tbody>
</table>

If the interrupt occurs at INC R3 instruction, then first the instruction is executed and the program counter consists of 1024, which is stored in stack.

06. Ans: (d)  
Sol: Opcode size = 13-bit  
Control memory size = 7-bit  
= 128 word = 2^7  
Maximum number of one address instructions to be formulated

<table>
<thead>
<tr>
<th>operation</th>
<th>Address</th>
</tr>
</thead>
<tbody>
<tr>
<td>6</td>
<td>7</td>
</tr>
</tbody>
</table>

= 2^6 = 64

:\. Remaining number of zero address instructions to be formulated

= (64 – 32) x 2^7

= 4096

07. Ans: (c)  
Sol: Stack works on LIFO.

08. Ans: (b)  
Sol: Relative Addressing mode is used to relocate the program from one memory segment to other segment—without—change in code so, it is known as Position Independence Addressing mode.

09. Ans: (c)  
Sol: Relative Addressing mode is used to relocate the program from one memory segment to other segment—without—change in code so, it is known as Position Independence Addressing mode.

10. Ans: (b)  
Sol: In instruction execution cycle, to get the first operand through index addressing mode it takes one machine cycle. To get the second operand through indirect addressing mode (B), it takes two more machine cycles because B is the address.

After the addition is completed the result is needed to send to the destination by using the index addressing mode, which requires one more machine cycle.

So a total of four machine cycles are required to execute the above instruction. (Except fetch cycle)

11. Ans: (d)  
Sol: R1 → M [100]  
M[100] → R2  
M[100] → R3  
The above instructions are used for transferring R1 content to R2 and R3 through memory address 100. So, option ‘d’ is correct.
12. Ans: (a)  
Sol: Address field in the instruction is used to specify Memory Address or One of the processor Register Address.
For example to specify Rs in a processor which is having 16 bit Register from R0 to R15, it’s Address field is ‘0101’, and for implied Register; no address is specified in the instruction.

13. Ans: (d)  
Sol: • Stack grows upword means SP is incremented for PUSH operation and decremented for POP operation.
• One Memory location can store only one word (i.e., one byte)
• After ‘CALL’ execution; to store PC and PSW content; SP is incremented by ‘4’
016E_{16} + 4 = (0172)_{H}

14. Ans: (d)  
Sol: Here R2 will act as base or indexed register and 20 is the displacement.

15. Ans: 16383  
Sol: Word size = 32 bit
Number of CPU Registers = 64 = 2^6
So, for addressing a Register 6 bits are needed.
Instruction Opcode size is 32-bits.
Number of supporting Instructions = 45, so minimum 6 bits are needed.
Instruction is having with operation part, Reg1, Reg2 and Immediate operand

The Range of unsigned operand with 14-bit is 0 to 2^{14}–1
∴ Max unsigned integer is 16383.

16. Ans: (d)  
Sol: Max. number of two address instructions = 2^4.
When it uses only ‘n’, two address instructions then remaining (2^4 – n) with ‘6’ bit combinations are used for one address instructions.
∴ Max. number of one address instructions: (2^4 – n) x 2^6

17. Ans: 16  
Sol:

18. Ans: 500  
Sol: One instruction needs 34 bits,
So number of bytes needed = 5
Program size = 100
∴ Size of the memory in bytes = 500

19. Ans: –16  
Sol: While executing the i + 3 instruction, the PC content will be the starting address of the i+4. If the target of the branch instruction is
20. Ans: (d)
Sol: Max one Address instruction = $2^6 = 64$
But number of one address instructions used = 32.
Max Number of zero Address instructions = $32 \times 128 = 4096$

21. Ans: (c)
Sol: If 63 one address instructions are used then
Number of zero address instructions = $(64 - 63) \times 128 = 128$

22. Ans: (a)
Sol: After issuing an interrupt, while processing L is under execution.
Processor follows the below steps:
Step 1: Completion of current instruction execution
Step 2: Pushes the result of the current instruction status on stack
Step 3: Gets the new address to PC for starting ISR and executes the ISR.
Step 4: Pops the status from stack to continue the interrupted instruction.

5. Control Unit Design

01. Ans: (d)
02. Ans: (a, b, c, d)
03. Ans: (d)
Sol: To execute interrupt cycle, the present content of PC will be pushed to stack with the help of MBR and MAR before placing ISR address in PC. (Always only MAR and MBR are used to address in Basic computer).

04. Ans: (a)
Sol: Total Size of micro-instructions = 26 bits
Size of micro-operation = 13 bits
Total inputs for the multiplexer (Status bits) inputs = 8
So the multiplexer selection lines field(Y) = 3 bits ($2^3 = 8$)
The number of bits in the next address field size(X) = $13 - 3 = 10$ bits
Size of control memory = $2^{10} = 1024$

05. Ans: (d)
Sol: $S_8 = I_1T_4 + I_2T_4 + I_3T_4 + I_4T_4$
= $I \times T_4 = T_4$
$\therefore \ I = (I_1 + I_2 + I_3 + I_4)$
$S_7 = (I_1T_3 + I_2T_3 + I_3T_3 + I_4T_3)$
$\quad + (I_5T_1 + I_5T_2 + I_5T_3 + I_5T_4 + T_4I_3 + T_4I_4)$
$\therefore \ S_7 = T_3 + I_3 + T_4 \times I_4$
06. Ans: (b)
Sol: Fastest Control unit is hard-wired control unit and vertical micro-programming control unit is slowest.

07. Ans: (d)

08. Ans: (d)
Sol: All the given characteristics are belonging to RISC processor.

### 6. I/O Organization

01. Ans: (b)
Sol: For vectored hardware interrupt, the interrupting device supplies the respective address with additional hardware.

02. Ans: 456
Sol: Terminal Count Register size = 16 bit.
So, for one transfer operation of 64 KB, the register content will become zero, so, number of times the content of the register to be filled is

\[
\frac{29154 \text{ KB}}{64 \text{ KB}} \approx 456
\]

03. Ans: (b)

04. Ans: (a)
Sol: CPU gives highest priority for high speed devices and least priority for low speed devices. Hard disk has higher priority than others because it is fastest secondary memory.

05. Ans: (b)
Sol: In single line interrupt system contains a single interrupt request line and an interrupt grant line. In this system it may be possible for more than one I/O device request interrupt at the same time. By using 8259 IC it is possible to connect more number of IO devices. So in single interrupt system vectored interrupts are not possible but multiple interrupting devices are possible.

06. Ans: (b)
Sol: 10 KBPS = 10 KB is transferred in 1 sec = 10^4 B
1 byte takes = 0.1 ms = 100 µs
∴ Minimum waiting time needed is 100 µs for system
Programmed I/O takes 100 µs
Interrupt driven takes 4 µs
∴ Gain = \( \frac{100 \mu s}{4 \mu s} = 25 \)

07. Ans: (d)
Sol: CPU first takes care about it's temperature.

08. Ans: (a)
Sol: Non-pipelined system requires

\[ (2+2+1+1+1) \times 500 + 2 = 3502 \text{ cycles} \]
DMA clock need = 20 + (2 \times 500)
= 1020 cycles

Speed up = \( \frac{3502}{1020} = 3.44 \)
7. Secondary Memories

01. **Ans:** (P = 12.5), (Q = 2500000)

**Sol:**
- T = 200, RPM = 2400, RPS = 40
- Track capacity = 62500 bits
- One revolution time is 25 ms
- ∴ Average latency time = 12.5 ms
- Q = Data Transfer rate = no. of bits/sec.
- In one second it can complete 40 tracks.
- ∴ Q = 40 × 62500 bits/sec = 2500000 bits/sec

02. **Ans:** (d)

**Sol:**
- Transfer rate = 10 K.K bytes/sec
- For 10K bytes it takes 1 msec
- For 20K bytes it takes = 2 ms
- CPU frequency is 600 MHz,
  \[ T = \frac{10^{-6}}{600} = \frac{10}{6} \text{ ns} \]
- For initializing it takes 300 clk
  \[ = 500 \text{ ns} \left(\frac{10}{6} \times 300\right) \]
- For completing 900 clk = 1500ns \( \left(\frac{10}{6} \times 900\right) \)
- Total CPU time is 2000ns = 2 μs
- Processor consumes 2 μs for each 2 msec.
- ∴ In percentage it is 0.1%

03. **Ans:** (c)

**Sol:**
- The address (400,16,29) corresponds to the Sector number
  \[ = (\text{cylinder number} \times \text{number of sectors/cylinder}) + (\text{surface number} \times \text{number of sectors/track}) + \text{Present sector number} \]
  \[ = (400 \times 20 \times 63) + (16 \times 63) + 29 \]
  \[ = 505037 \]

04. **Ans:** (c)

**Sol:**
- \( \frac{1039}{63 \times 20} \):
  - 0th cylinder
- \( \frac{1039}{63} \):
  - 16th surface and remainder gives sector number: 31 (0, 16, 31)

05. **Ans:** (b)

**Sol:**
- Given seek time = 10 ms
- RPM = 6000, RPS = 100.
- One revolution takes 10 ms
- ∴ Average rotational Delay = 5 ms
- Transfer delay is neglected.
- \[ T_{\text{access/one library}} = t_{\text{seek}} + t_{\text{rotational}} + t_{\text{transfer}} \]
  \[ = 10 \text{ ms} + 5 \text{ ms} + 0 \]
  \[ = 15 \text{ ms} \]
- So, for 100 libraries loading;
  it takes \( (15 \text{ ms} \times 100) = 1500 \text{ ms} \)
  \[ = 1.5 \text{ sec} \]

06. **Ans:** (d)

**Sol:**
- Size of the data to be transferred=42,797 KB
- One sector capacity = 512 B
- ∴ Number of Sectors to store 42797 KB
  \[ = \frac{42797 \times 1024}{512} = 85,594 \]
- One cylinder has 64 × 16 = 1024 sectors
83 cylinders can store $83 \times 1024$
\[= 84,992 \text{ sectors} \]
\[\therefore \text{ Remaining number of sectors } = 602 \]
602 sectors occupy more than half of one cylinder capacity
But the given cylinder has started with $<1200, 9, 40>$ means more than half of that cylinder, so next cylinder is also needed for storing complete data.
\[\therefore \text{ Last cylinder number} \]
\[= 1200 + 83 + \text{next one} \]
\[= 1284 \]

07. Ans: 14020

Sol: Seek time = 4 milli seconds per each sector
Reading
RPM = 10000 i.e.,
1 Track rotation time = 6 ms
\[\therefore \text{ Average rotational delay is 3 ms.} \]
One Track has 600 sectors.
So, one sector transfer time is
\[\frac{6 \text{ ms}}{600} = 0.01 \text{ ms} \]
\[\therefore \text{ One sector Access time } = 0.01 + 4 + 3 \]
\[= 7.01 \text{ ms} \]
So, 2000 sectors time = $2000 \times 7.01$
\[= 14,020 \text{ ms} \]

08. Ans: 6.1

Sol: Transfer rate = $50 \times 10^6 \text{ Bytes/sec}$
So, 0.5 KB takes 0.1 ms
RPM = 15000; RPS = 250
1 rotation takes 4 milliseconds
Average rotational delay is 2 ms;
Seek time is 4 ms
\[\therefore \text{ Average time } = \text{ transfer time } + \text{ seek time } + \text{ rotational delay} \]
\[= 0.1 + 4 + 2 \text{ ms} \]
\[= 6.1 \text{ ms} \]