Memorias de investigación
Conferencias:
Mesh traversal and sorting for efficient memory usage in scientific codes
Año:2011

Áreas de investigación
  • Ciencias de la computación y tecnología informática

Datos
Descripción
Applications that operate on meshes are very popular in High Performance Computing (HPC) environments. In the past, many techniques have been developed in order to optimize the memory accesses for these datasets. Different loop transformations and domain decompositions are com- monly used for structured meshes. However, unstructured grids are more challenging. The memory accesses, based on the mesh connectivity, do not map well to the usual lin- ear memory model. This work presents a method to improve the memory performance which is suitable for HPC codes that operate on meshes. We develop a method to adjust the sequence in which the data are used inside the algorithm, by means of traversing and sorting the mesh. This sorted mesh can be transferred sequentially to the lower memory levels and allows for minimum data transfer requirements. The method also reduces the lower memory requirements dra- matically: up to 63% of the L1 cache misses are removed in a traditional cache system. We have obtained speedups of up to 2.58 on memory operations as measured in a general- purpose CPU. An improvement is also observed with se- quential access memories, where we have observed reduc- tions of up to 99% in the required low-level memory size.
Internacional
Si
ISSN o ISBN
1097-2641
Entidad relacionada
Nacionalidad Entidad
Sin nacionalidad
Lugar del congreso
Orlando, EEUU

Esta actividad pertenece a memorias de investigación

Participantes
  • Autor: Pablo Barrio Lopez-Cortijo UPM

Grupos de investigación, Departamentos, Centros e Institutos de I+D+i relacionados
  • Creador: Grupo de Investigación: Laboratorio de Sistemas Integrados (LSI)
  • Departamento: Ingeniería Electrónica