Research reports
R&D&I project:
Large Scale Data Streaming (MS-2007-07)
Year: 2007

Research areas
  • Programming language

Details
Description
Many data streaming applications will have to cope with data streams so massive that large clusters will be required to process the input. We foresee two classes of applications: 1) applications with many continuous queries, which can scale by distributing the queries among a pool of sites; 2) applications with one or a few queries over very massive data, which will require parallelizing query operators across a large number of sites to keep up with the input stream. Realizing a data streaming system that satisfies the requirements of both classes on very large clusters (e.g. over 100 sites) is a real scientific challenge. We believe that advances in System Area Networks (SANs) will enable a new kind of distributed support for data streaming. On the one hand, SANs provide high-bandwidth, low-latency communication that enables end-to-end inter-host memory access on the order of microseconds. This can be exploited for fast coordination among sites and agile load balancing. On the other hand, these networks feature Network Interface Cards (NICs) that have their own processor, with reasonable capacity, and can access host memory through DMA. Based on our previous work, we foresee that the envisioned highly scalable data streaming platform can be realized.
International
Yes
Project type
Projects and agreements in competitive public calls
Funding entity
Microsoft Research Cambridge (PhD Award)
Nationality of the entity
UNITED KINGDOM
Size of the entity
Large company (>250)
Award date
01/10/2006

This activity belongs to the research reports

Participants

Related research groups, departments, centers and R&D&I institutes
  • Creator: Research Group: Distributed Systems Labs (LSD) Laboratorio de sistemas distribuidos
  • Department: Lenguajes y Sistemas Informáticos e Ingeniería de Software