Observatorio de I+D+i UPM

| Otras actividades
HOME

Proyectos Internacionales Art�culos Patentes UPM Software UPM Empresas UPM Otras actividades Memorias de investigaci�n

Memorias de investigación

Ponencias en congresos:

Parallel Efficient Data Loading

A�o:2019

�reas de investigaci�n

Ciencias de la computaci�n y tecnolog�a inform�tica

Datos

Descripci�n
In this paper we discuss how we architected and developed a parallel data loader for LeanXcale database. The loader is characterized for its efficiency and parallelism. LeanXcale can scale up and scale out to very large numbers and loading data in the traditional way it is not exploiting its full potential in terms of the loading rate it can reach. For this reason, we have created a parallel loader that can reach the maximum insertion rate LeanXcale can handle. LeanXcale also exhibits a dual interface, key-value and SQL, that has been exploited by the parallel loader. Basically, the loading leverages the key-value API and results in a highly efficient process that avoids the overhead of SQL processing. Finally, in order to guarantee the parallelism we have developed a data sampler that samples data to generate a histogram of data distribution and use it to pre-split the regions across LeanXcale instances to guarantee that all instances get an even amount of data during loading, thus g uaranteeing the peak processing loading capability of the deployment.
Internacional	Si
Nombre congreso	8th International Conference on Data Science, Technology and Applications, DATA 2019
Tipo de participaci�n	960
Lugar del congreso	Praga
Revisores	Si
ISBN o ISSN	978-989-758-377-3
DOI	10.5220/0008318904650469
Fecha inicio congreso	26/07/2019
Fecha fin congreso	28/07/2019
Desde la p�gina	465
Hasta la p�gina	469
T�tulo de las actas	Proceedings of the 8th International Conference on Data Science, Technology and Applications

Esta actividad pertenece a memorias de investigaci�n

Participantes

Autor: Ricardo Jimenez Peris UPM
Autor: Francisco J. Ballesteros Universidad Rey Juan Carlos, Madrid,
Autor: Ainhoa Azqueta Alzuaz UPM
Autor: Pavlos Kranas . UPM
Autor: Diego Burgos Sancho UPM
Autor: Patricio Mart�nez LeanXcale

Grupos de investigaci�n, Departamentos, Centros e Institutos de I+D+i relacionados

Creador: Departamento: Lenguajes y Sistemas Inform�ticos e Ingenier�a de Software