scispace - formally typeset
Open AccessBook

Big Data: Principles and best practices of scalable realtime data systems

Nathan Marz, +1 more
Reads0
Chats0
TLDR
Big Data describes a scalable, easy to understand approach to big data systems that can be built and run by a small team that takes advantage of clustered hardware along with new tools designed specifically to capture and analyze web-scale data.
Abstract
Services like social networks, web analytics, and intelligent e-commerce often need to manage data at a scale too big for a traditional database As scale and demand increase, so does Complexity Fortunately, scalability and simplicity are not mutually exclusiverather than using some trendy technology, a different approach is needed Big data systems use many machines working in parallel to store and process data, which introduces fundamental challenges unfamiliar to most developers Big Data shows how to build these systems using an architecture that takes advantage of clustered hardware along with new tools designed specifically to capture and analyze web-scale data It describes a scalable, easy to understand approach to big data systems that can be built and run by a small team Following a realistic example, this book guides readers through the theory of big data systems, how to use them in practice, and how to deploy and operate them once they're built Purchase of the print book comes with an offer of a free PDF, ePub, and Kindle eBook from Manning Also available is all code from the book

read more

Citations
More filters
Journal ArticleDOI

Data-intensive applications, challenges, techniques and technologies: A survey on Big Data

TL;DR: This paper is aimed to demonstrate a close-up view about Big Data, including Big Data applications, Big Data opportunities and challenges, as well as the state-of-the-art techniques and technologies currently adopt to deal with the Big Data problems.
Journal ArticleDOI

The real-time city? Big data and smart urbanism

TL;DR: In this article, the authors focus on the implications of big data and smart urbanism, examining five emerging concerns: the politics of big urban data, technocratic governance and city development, corporatisation of city governance and technological lock-ins, buggy, brittle and hackable cities, and the panoptic city.
Journal ArticleDOI

Big Data, new epistemologies and paradigm shifts:

TL;DR: The authors examines how the availability of Big Data, coupled with new data analytics, challenges established epistemologies across the sciences, social sciences and humanities, and assesses the extent to which they are engendering paradigm shifts across multiple disciplines.
Journal Article

Apache flink : Stream and batch processing in a single engine

TL;DR: This paper discusses the approach to achieve high throughput for transactional query processing while allowing concurrent analytical queries, and presents its approach to distributed snapshot isolation and optimized two-phase commit protocols.
Journal ArticleDOI

The Real-Time City? Big Data and Smart Urbanism

TL;DR: This paper details how cities are being instrumented with digital devices and infrastructure that produce 'big data', which smart city advocates argue enables real-time analysis of city life, new modes of urban governance, and provides the raw material for envisioning and enacting more efficient, sustainable, competitive, productive, open and transparent cities.
Related Papers (5)