トップ 差分 一覧 Farm ソース 検索 ヘルプ PDF RSS ログイン

Diary/2020-4-11

RTX2080Tiの評価

e-treesのGPUサーバのGPUをGTX1080からRTX2080Tiにアップデート.
折角なのでGTX1080とRTX2080Tiの性能を https://github.com/ekondis/mixbench で比較してみた.
結果は,


RTX2080Tiはだいぶ速そうだぞ,ってことで仕事させないと.


それぞれのスペックは,こんな感じ.

------------------------ Device specifications ------------------------
Device:              GeForce GTX 1080
CUDA driver version: 10.20
GPU clock rate:      1911 MHz
Memory clock rate:   2752 MHz
Memory bus width:    256 bits
WarpSize:            32
L2 cache size:       2048 KB
Total global mem:    8117 MB
ECC enabled:         No
Compute Capability:  6.1
Total SPs:           2560 (20 MPs x 128 SPs/MP)
Compute throughput:  9784.32 GFlops (theoretical single precision FMAs)
Memory bandwidth:    352.32 GB/sec
-----------------------------------------------------------------------
Total GPU memory 8511881216, free 8374583296
Buffer size:          64MB
Trade-off type:       compute with global memory (block strided)

------------------------ Device specifications ------------------------
Device:              GeForce RTX 2080 Ti
CUDA driver version: 10.20
GPU clock rate:      1650 MHz
Memory clock rate:   3500 MHz
Memory bus width:    352 bits
WarpSize:            32
L2 cache size:       5632 KB
Total global mem:    11019 MB
ECC enabled:         No
Compute Capability:  7.5
Total SPs:           4352 (68 MPs x 64 SPs/MP)
Compute throughput:  14361.60 GFlops (theoretical single precision FMAs)
Memory bandwidth:    616.00 GB/sec
-----------------------------------------------------------------------
Total GPU memory 11554717696, free 11379408896
Buffer size:          64MB
Trade-off type:       compute with global memory (block strided)


生データとプロットスクリプト
gpu-comparison.dat(137)
gpu-comparison.plt(128)