Chapter 5(Digital Design and Computer Architecture)


Chapter 5

Performance Analysis

•Definitions:

–CPI: Cycles per instruction

–clock period: seconds/cycle

–IPC: instructions per cycle

Throughput vs Response Time

•Response time (execution time) – the time between the start and the completion of a task

–Important to individual users

•Throughput (bandwidth) – the total amount of work done in a given time

–Important to data center managers

CPU Time

CPU Clocking

Clock period(周期): duration of a clock cycle

250ps = 0.25ns = 250×10–12s

Clock frequency (频率): cycles per second

4.0GHz = 4000MHz = 4.0×109Hz

Instruction Count and CPI

Weighted average CPI

e.g

Single-Cycle Processor

state elements

PC: program counter

I type

Multicycle Processor

What’s improved

read, write or alu operation in one cycle

每个周期内,完成一次read或write操作,或ALU操作;

different instruction adopts different cycle combination

不同指令使用不同的周期组合;

only one adder and one memory, shared in different cycles

仅需要一个adder,在不同周期重复使用;

一个memory, 在不同周期使用.

Instruction cycle

machine cycle(CPU周期)

State Elements

Pipelined Processor

•Divide single-cycle processor into 5 stages:

–Fetch

–Decode

–Execute

–Memory

–Writeback

Pipelined Processor Abstraction

(there is no building block conflict)

Pipelined Datapath

Same control unit as single-cycle processor

Harzard

Data hazard:

register value not yet written back to register file

Data forwarding

Stalling

Control Harzard

next instruction not decided yet (caused by branches)

Early Branch Resolution
Branch Prediction

Exceptions

Advanced Microarchitecture


文章作者: mohan li
版权声明: 本博客所有文章除特別声明外,均采用 CC BY 4.0 许可协议。转载请注明来源 mohan li !
评论