Descripción
|
|
---|---|
The performance of a Monte Carlo model for the simulation of electromagnetic wave propagation in particle-filled atmospheres has been conducted for different CUDA versions and design approaches. The proposed algorithm exhibits a high degree of parallelism, which allows favorable implementation in a GPU. Practical implementation aspects of the model have been also explained and their impact assessed, such as the use of the different types of memories present in a GPU. A number of setups have been chosen in order to compare performance for manually optimized versus Unified Virtual Memory (UVM ) implementations for different CUDA versions. Features and relative performance impact of the different options have been discussed, extracting practical hints and rules useful to speed up CUDA programs. | |
Internacional
|
Si |
JCR del ISI
|
Si |
Título de la revista
|
?IEEE Transactions on Parallel and Distributed Systems |
ISSN
|
1045-9219 |
Factor de impacto JCR
|
|
Información de impacto
|
|
Volumen
|
27 |
DOI
|
10.1109/TPDS.2015.2463813 |
Número de revista
|
6 |
Desde la página
|
1579 |
Hasta la página
|
1588 |
Mes
|
JUNIO |
Ranking
|