46,99 €
inkl. MwSt.
Versandkostenfrei*
Versandfertig in 6-10 Tagen
  • Broschiertes Buch

This manuscript provides a synthetic overview of research on data management in support of stream processing. It address all stages of the stream processing pipeline: data collection and in-transit processing at the edge, transfer towards the cloud processing sites, ingestion and persistent storage. First, the general context of stream data management is presented in light of the recent transition from Big to Fast Data. After highlighting the challenges at the data level associated with batch and real-time analytics, we introduce a subjective overview of proposals to address them. They bring…mehr

Produktbeschreibung
This manuscript provides a synthetic overview of research on data management in support of stream processing. It address all stages of the stream processing pipeline: data collection and in-transit processing at the edge, transfer towards the cloud processing sites, ingestion and persistent storage. First, the general context of stream data management is presented in light of the recent transition from Big to Fast Data. After highlighting the challenges at the data level associated with batch and real-time analytics, we introduce a subjective overview of proposals to address them. They bring solutions to the problems of in-transit stream storage and processing, fast data transfers, distributed metadata management, dynamic ingestion and transactional storage. The integration of these solutions into functional prototypes and the results of the large-scale experimental evaluations on clusters, clouds and supercomputers demonstrate their effectiveness for several real-life applications ranging from neuro-science to LHC nuclear physics. Finally, these contributions are put into the perspective of the High Performance Computing - Big Data convergence.
Autorenporträt
Alexandru Costan is an Associate Professor at INSA Rennes and a researcher within the KerData team at IRISA Rennes. His research interests include Big Data management in HPC and clouds, fast data and stream processing, autonomic behaviour and workflow management. He has published one book, more than 20 journal articles and 30 conference papers.