Author

Technique Used

No of Benchmark

Used

Benchmark

Or

Application Kernel

Energy

Improvement

Performance

Improvement

Platform for

Parallel

Implementation

Core DVFS

Lee et al. [11]

DVFS

Not specified

Not specified

65%

Not specified

Not specified

Jiao et al. [8]

DVFS

3

Ÿ Dense matrix multiplication

Ÿ Dense matrix transpose

Ÿ Fast Fourier transform

4%

Not specified

NVidiaGTX-280

Lee et al. [12]

DVFS

39

Ÿ GPGPU-Sim

Ÿ Rodinia

Ÿ ERCBench

Power constraint

20%

GPGPU-Sim

(Simulate Quadro FX 5800)

Mei et al. [13]

DVFS

37

Ÿ CUDA SDK 4.1

Ÿ Rodinia

19.28%

4

NVIDIA

GeForce GTX 560 Ti

Ge et al. [14]

K20c

DVFS

1

Ÿ Matrix multiplication

Ÿ Traveling salesman problem

Ÿ Finite state machine

Not specified

Not specified

NVIDIA

Tesla K20c

Hybrid DVFS

Liu et al. [19]

DVFS with Load Balancing

4

Ÿ AMD OPENCL Sdk

Ÿ IBM

20%

Performance

constraint

AMD Radeon HD 5770

Ma et al. [20]

DVFS

with

Task Mapping

9

Ÿ Rodinia

21.04%

Marginal

performance

degradation

NVIDIA

GeForce 8800 GTX GPU

Komoda et al. [22]

DVFS

with Task Mapping

25

Ÿ Rodinia

Ÿ BLAS Library

Power constraint

93%

NVIDIA

Tesla K20c

Sethia and Mahlke [17]

DVFS

with

Vary No of Thread

27

Ÿ Rodinia

Ÿ Parboil

15%

(Energy efficiency

mode)

20%

(Performance mode)

GPGPU-Sim

(Simulate GTX480)

Wang & Nagarajan [9]

DVFS

with

PID

12

Ÿ CUDA Sdk

23%

4%

GPGPU-Sim

(Simulate GTX480)