### LINKÖPINGS TEKNISKA HÖGSKOLA Institutionen för datavetenskap Petru Eles #### Tentamen i kursen ### Datorarkitektur - TDDI03 2018-01-11, kl. 08-12 Hjälpmedel: Engelsk ordbok. **Supporting material:** English dictionary. Poänggränser: Maximal poäng är 40. För godkänt krävs sammanlagt 21 poäng. **Points:** Maximum points: 40. In order to pass the exam you need a total of minimum 21 points. Jourhavande lärare: Petru Eles, tel. 013 281396 Good luck !!! # Tentamen i kursen Datorarkitektur - TDDI03, 2018-01-11, kl. 08-11 Du kan skriva på svenska eller engelska! | 1.<br>a) | Why do we need special write strategies for cache memories? | | | |----------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--| | b). | We have discussed three write strategies. How do they work? Which are their advantages and disadvantages? | | | | | (3p) | | | | 2. | A set-associative cache consists of $512 \ (=2^9)$ lines, divided into four-line sets. The main memory contains $128K \ (=2^{17})$ blocks of $128 \ (=2^7)$ bytes each. Show the format of the main memory address (give the total number of address bits, what different groups of address bits indicate, and how long each of these groups is). (3p) | | | | | | | | | 3.<br>a) | What is the role of the page table in a virtual memory system? What is the role of the translation lookaside buffer (TLB)? | | | | | (2p) | | | | 4 | Canaidan a ninglined muse assent with Laineline stores | | | | 4.<br>a) | Consider a pipelined processor with $k$ pipeline stages.<br>What is the theoretical acceleration (ignoring overheads) for a sequence of $n$ instructions compared to a similar but non-pipelined processor? Show how you obtain the formula! | | | | b)<br>c) | What is the acceleration of a sequence of 100 instructions if the number of pipeline stages is 8? What is the acceleration for an infinitely long sequence if the number of pipeline stages is 8? (3p) | | | | 5. | Define the three types of pipeline hazards. Give an example for each. | | | | | (3p) | | | | 6. | The design of RISC architectures is based on certain characteristics of typical programs which are frequently used. Enumerate at least five such characteristics of programs. | | | | | (2p) | | | | 7. | | | | | a) | What is a superscalar architecture? Draw a block-diagram of a superscalar unit. | | | | b) | Draw a block-diagram of a superscalar unit. (3p) | | | | | | | | #### Tentamen i kursen Datorarkitektur - TDDI03, 2018-01-11, kl. 08-11 Du kan skriva på svenska eller engelska! 8. Consider the following sequence of machine instructions: ``` 1: R1 R0 + R2 \leftarrow 2: R7 R1 + R12 \leftarrow 3: R5 \leftarrow R7 - 1 4: R1 R2 * R4 \leftarrow 5: R7 R9 + 10 6: R12 ← R4 - 25 7: R3 R5 * 2 \leftarrow 8: R1 + R3 R4 \leftarrow 9: R8 - 2 R10 ← R1 * 3 10: R1 ← ``` - a) Indicate the data dependencies among instructions. - b) Consider a superscalar computer on which the execution of each instruction takes one cycle; the computer has two units that execute addition&subtraction and one unit for multiplication. Produce a table, according to the model below, showing how instructions are executed in consecutive cycles, if we assume in order execution. - c) Rename the registers in the above sequence to prevent, where possible, dependency problems. Produce a second table, according to the model below, showing how instructions (after register renaming) are executed in consecutive cycles, if we assume out of order execution. In the table cells indicate the sequence number (1, 2, .., 10) of the instruction executed in the corresponding cycle on the respective unit. | | ADD/SUB | ADD/SUB | MUL | |---------|---------|---------|-----| | Cycle 1 | | | | | Cycle 2 | | | | | Cycle 3 | | | | | • • • | | | | (4p) 9. What is trace scheduling? How does it work? Why is it important with VLIW architectures? # Tentamen i kursen Datorarkitektur - TDDI03, 2018-01-11, kl. 08-11 Du kan skriva på svenska eller engelska! | <ul><li>10.</li><li>a)</li><li>b)</li><li>c)</li></ul> | Compare VLIW architectures with superscalar architectures: Show similarities and differences. Show the advantages and disadvantages of the two approaches. Why is a superscalar consuming more power, compared to a VLIW computer? | (3p) | |--------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------| | 11.<br>a)<br>b) | What is branch predication (like in the Itanium architecture)? Compare with ordinary branch prediction. | (2p) | | 12. | Consider an instruction sequence such that 40% of the computation has to be exe sequentially, while the rest is executed with full parallelism on 10 processors. Calculate expected speedup and efficiency. | | | 13. | Flynn's Classification of Computer Architectures: Explain and illustrate by figures. | (3p) | | 14. | What is a vector processor? Draw a block diagram. What is the role of the mask register? What is the basic difference between array processors and vector processors? | (3p) |