Fall 2007 Hall of Fame

This coveted title is earned by the Top 5 students who wrote the fastest code in accomplishing the final, and arguably the most challenging machine problem, MP5, parallel sort. Their results have been independently confirmed by a rigorous TA test suite, and the code has been manually checked and corrected for any irregularities and inconsistencies in the timing mechanism. Note that these students are ranked by absolute CUDA processing time, not the speedup, even though they are consistent. Congratulations!

MP5 Top 5:

Name Program Output
1. Alan Kaatz
Source Code / Report

**===--------------- Grading kaatz -------------------===**
Processing 16000000 elements...
Host CPU Processing time: 8084.667969 (ms)
G80 CUDA Processing time: 311.354004 (ms)
Speedup: 25.966160X
Test PASSED
diffing TA and student outputs test 1
diffing TA and student outputs test 2
diffing TA and student outputs test 3

PASSED: kaatz passed all testss
**===-------------------------------------------------===**

2. Thomas Shen
Source Code / Report

**===--------------- Grading tbshen -------------------===**
Processing 16000000 elements...
Host CPU Processing time: 8011.271973 (ms)
G80 CUDA Processing time: 322.955994 (ms)
Speedup: 24.806079X
Test PASSED
diffing TA and student outputs test 1
diffing TA and student outputs test 2
diffing TA and student outputs test 3

PASSED: tbshen passed all tests.
**===-------------------------------------------------===**

3. Lingling Miao
Source Code / Report
**===--------------- Grading lmiao2-------------------===**
Processing 16000000 elements...
Host CPU Processing time: 8018.119141 (ms)
G80 CUDA Processing time: 461.638000 (ms)
Speedup: 17.368846X
Test PASSED
diffing TA and student outputs test 1
diffing TA and student outputs test 2
diffing TA and student outputs test 3

PASSED: lmiao2 passed all tests.
**===-------------------------------------------------===**

4. Michael Connor
Source Code / Report

**===--------------- Grading connor2-------------------===**
Processing 16000000 elements...
Host CPU Processing time: 8034.141113 (ms)
G80 CUDA Processing time: 504.471985 (ms)
Speedup: 15.925842X
Test PASSED
diffing TA and student outputs test 1
diffing TA and student outputs test 2
diffing TA and student outputs test 3

PASSED: connor2 passed all tests.
**===-------------------------------------------------===**

5. Faycal Benmlih
Source Code / Report
**===--------------- Grading benmlih2-------------------===**
Processing 16000000 elements...
Host CPU Processing time: 7643.923828 (ms)
Number of elements: 16000000
Number of Cuda elements: 16000000
Number of Cuda Memory elements: 16777216
Number of padded elements: 777216
G80 CUDA Processing time: 578.578003 (ms)
Speedup: 13.211570X
Test PASSED
diffing TA and student outputs test 1
diffing TA and student outputs test 2
diffing TA and student outputs test 3

PASSED: benmlih2 passed all tests.
**===-------------------------------------------------===**

All speedup results are computed against an 2.2 GHz AMD Opteron 248 processor, with 1GB of system memory.