Big Data

Big Data

Author: Viktor Mayer-Schönberger

Publisher: Houghton Mifflin Harcourt

Published: 2013

Total Pages: 257

ISBN-13: 0544002695

DOWNLOAD EBOOK

A exploration of the latest trend in technology and the impact it will have on the economy, science, and society at large.


Big Books in Times of Big Data

Big Books in Times of Big Data

Author: Inge Gerarda Martina Ven

Publisher:

Published: 2019

Total Pages: 253

ISBN-13: 9789400603608

DOWNLOAD EBOOK

Big Books in Times of Big Data examines recent trends of size and scale in the novel in terms of the shift from the bound book to the newer materialities of the digital. Using a wide-ranging international archive of hefty tomes by authors such as Mark Z. Danielewski, Roberto Bolaño, Elena Ferrante, and Karl Ove Knausgård, George R.R. Martin, Jonathan Franzen, and William T. Vollmann, Van de Ven reflects on the place of big book-bound literature in a media genealogy which includes film and television but also online databases, social media, selfies, and Global Information Systems. This study ma.


Big Data

Big Data

Author: James Warren

Publisher: Simon and Schuster

Published: 2015-04-29

Total Pages: 481

ISBN-13: 1638351104

DOWNLOAD EBOOK

Summary Big Data teaches you to build big data systems using an architecture that takes advantage of clustered hardware along with new tools designed specifically to capture and analyze web-scale data. It describes a scalable, easy-to-understand approach to big data systems that can be built and run by a small team. Following a realistic example, this book guides readers through the theory of big data systems, how to implement them in practice, and how to deploy and operate them once they're built. Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications. About the Book Web-scale applications like social networks, real-time analytics, or e-commerce sites deal with a lot of data, whose volume and velocity exceed the limits of traditional database systems. These applications require architectures built around clusters of machines to store and process data of any size, or speed. Fortunately, scale and simplicity are not mutually exclusive. Big Data teaches you to build big data systems using an architecture designed specifically to capture and analyze web-scale data. This book presents the Lambda Architecture, a scalable, easy-to-understand approach that can be built and run by a small team. You'll explore the theory of big data systems and how to implement them in practice. In addition to discovering a general framework for processing big data, you'll learn specific technologies like Hadoop, Storm, and NoSQL databases. This book requires no previous exposure to large-scale data analysis or NoSQL tools. Familiarity with traditional databases is helpful. What's Inside Introduction to big data systems Real-time processing of web-scale data Tools like Hadoop, Cassandra, and Storm Extensions to traditional database skills About the Authors Nathan Marz is the creator of Apache Storm and the originator of the Lambda Architecture for big data systems. James Warren is an analytics architect with a background in machine learning and scientific computing. Table of Contents A new paradigm for Big Data PART 1 BATCH LAYER Data model for Big Data Data model for Big Data: Illustration Data storage on the batch layer Data storage on the batch layer: Illustration Batch layer Batch layer: Illustration An example batch layer: Architecture and algorithms An example batch layer: Implementation PART 2 SERVING LAYER Serving layer Serving layer: Illustration PART 3 SPEED LAYER Realtime views Realtime views: Illustration Queuing and stream processing Queuing and stream processing: Illustration Micro-batch stream processing Micro-batch stream processing: Illustration Lambda Architecture in depth


Data Warehousing in the Age of Big Data

Data Warehousing in the Age of Big Data

Author: Krish Krishnan

Publisher: Newnes

Published: 2013-05-02

Total Pages: 371

ISBN-13: 0124059201

DOWNLOAD EBOOK

Data Warehousing in the Age of the Big Data will help you and your organization make the most of unstructured data with your existing data warehouse. As Big Data continues to revolutionize how we use data, it doesn't have to create more confusion. Expert author Krish Krishnan helps you make sense of how Big Data fits into the world of data warehousing in clear and concise detail. The book is presented in three distinct parts. Part 1 discusses Big Data, its technologies and use cases from early adopters. Part 2 addresses data warehousing, its shortcomings, and new architecture options, workloads, and integration techniques for Big Data and the data warehouse. Part 3 deals with data governance, data visualization, information life-cycle management, data scientists, and implementing a Big Data–ready data warehouse. Extensive appendixes include case studies from vendor implementations and a special segment on how we can build a healthcare information factory. Ultimately, this book will help you navigate through the complex layers of Big Data and data warehousing while providing you information on how to effectively think about using all these technologies and the architectures to design the next-generation data warehouse. - Learn how to leverage Big Data by effectively integrating it into your data warehouse. - Includes real-world examples and use cases that clearly demonstrate Hadoop, NoSQL, HBASE, Hive, and other Big Data technologies - Understand how to optimize and tune your current data warehouse infrastructure and integrate newer infrastructure matching data processing workloads and requirements


Big Data at Work

Big Data at Work

Author: Thomas Davenport

Publisher: Harvard Business Review Press

Published: 2014-02-04

Total Pages: 241

ISBN-13: 1422168174

DOWNLOAD EBOOK

Go ahead, be skeptical about big data. The author was—at first. When the term “big data” first came on the scene, bestselling author Tom Davenport (Competing on Analytics, Analytics at Work) thought it was just another example of technology hype. But his research in the years that followed changed his mind. Now, in clear, conversational language, Davenport explains what big data means—and why everyone in business needs to know about it. Big Data at Work covers all the bases: what big data means from a technical, consumer, and management perspective; what its opportunities and costs are; where it can have real business impact; and which aspects of this hot topic have been oversold. This book will help you understand: • Why big data is important to you and your organization • What technology you need to manage it • How big data could change your job, your company, and your industry • How to hire, rent, or develop the kinds of people who make big data work • The key success factors in implementing any big data project • How big data is leading to a new approach to managing analytics With dozens of company examples, including UPS, GE, Amazon, United Healthcare, Citigroup, and many others, this book will help you seize all opportunities—from improving decisions, products, and services to strengthening customer relationships. It will show you how to put big data to work in your own organization so that you too can harness the power of this ever-evolving new resource.


Bigger Than You: Big Data and Obesity

Bigger Than You: Big Data and Obesity

Author: Katherine Behar

Publisher: punctum books

Published: 2016

Total Pages: 72

ISBN-13: 0692652833

DOWNLOAD EBOOK

"I shall consider human actions and appetites just as if it were a question of lines, planes, and bodies." -Spinoza, in Ethics In her first inquiry toward decelerationist aesthetics, Katherine Behar explores the rise of two "big deal" contemporary phenomena, big data and obesity. In both, scale rearticulates the human as a diffuse informational pattern, causing important shifts in political form as well as aesthetic form. Bigness redraws relationships between the singular and the collective. Understood as informational patterns, collectives can be radically inclusive, even incorporating nonhumans. As a result, the political subject is slowly becoming a new object. This social and informational body belongs to no single individual, but is shared in solidarity with something "bigger than you." In decelerationist aesthetics, the aesthetic properties, proclivities, and performances of objects come to defy the accelerationist imperative to be nimbly individuated. Decelerationist aesthetics rejects atomistic, liberal, humanist subjects; this unit of self is too consonant with capitalist relations and functions. Instead, decelerationist aesthetics favors transhuman sociality embodied in particulate, mattered objects; the aesthetic form of such objects resists capitalist speed and immediacy by taking back and taking up space and time. In just this way, big data calls into question the conventions by which humans are defined as discrete entities, and individual scales of agency are made to form central binding pillars of social existence through which bodies are drawn into relations of power and pathos.


Big Data

Big Data

Author: Bill Schmarzo

Publisher: John Wiley & Sons

Published: 2013-09-23

Total Pages: 245

ISBN-13: 1118740009

DOWNLOAD EBOOK

Leverage big data to add value to your business Social media analytics, web-tracking, and other technologies help companies acquire and handle massive amounts of data to better understand their customers, products, competition, and markets. Armed with the insights from big data, companies can improve customer experience and products, add value, and increase return on investment. The tricky part for busy IT professionals and executives is how to get this done, and that's where this practical book comes in. Big Data: Understanding How Data Powers Big Business is a complete how-to guide to leveraging big data to drive business value. Full of practical techniques, real-world examples, and hands-on exercises, this book explores the technologies involved, as well as how to find areas of the organization that can take full advantage of big data. Shows how to decompose current business strategies in order to link big data initiatives to the organization’s value creation processes Explores different value creation processes and models Explains issues surrounding operationalizing big data, including organizational structures, education challenges, and new big data-related roles Provides methodology worksheets and exercises so readers can apply techniques Includes real-world examples from a variety of organizations leveraging big data Big Data: Understanding How Data Powers Big Business is written by one of Big Data's preeminent experts, William Schmarzo. Don't miss his invaluable insights and advice.


Big Data Analytics with Hadoop 3

Big Data Analytics with Hadoop 3

Author: Sridhar Alla

Publisher: Packt Publishing Ltd

Published: 2018-05-31

Total Pages: 471

ISBN-13: 1788624955

DOWNLOAD EBOOK

Explore big data concepts, platforms, analytics, and their applications using the power of Hadoop 3 Key Features Learn Hadoop 3 to build effective big data analytics solutions on-premise and on cloud Integrate Hadoop with other big data tools such as R, Python, Apache Spark, and Apache Flink Exploit big data using Hadoop 3 with real-world examples Book Description Apache Hadoop is the most popular platform for big data processing, and can be combined with a host of other big data tools to build powerful analytics solutions. Big Data Analytics with Hadoop 3 shows you how to do just that, by providing insights into the software as well as its benefits with the help of practical examples. Once you have taken a tour of Hadoop 3’s latest features, you will get an overview of HDFS, MapReduce, and YARN, and how they enable faster, more efficient big data processing. You will then move on to learning how to integrate Hadoop with the open source tools, such as Python and R, to analyze and visualize data and perform statistical computing on big data. As you get acquainted with all this, you will explore how to use Hadoop 3 with Apache Spark and Apache Flink for real-time data analytics and stream processing. In addition to this, you will understand how to use Hadoop to build analytics solutions on the cloud and an end-to-end pipeline to perform big data analysis using practical use cases. By the end of this book, you will be well-versed with the analytical capabilities of the Hadoop ecosystem. You will be able to build powerful solutions to perform big data analytics and get insight effortlessly. What you will learn Explore the new features of Hadoop 3 along with HDFS, YARN, and MapReduce Get well-versed with the analytical capabilities of Hadoop ecosystem using practical examples Integrate Hadoop with R and Python for more efficient big data processing Learn to use Hadoop with Apache Spark and Apache Flink for real-time data analytics Set up a Hadoop cluster on AWS cloud Perform big data analytics on AWS using Elastic Map Reduce Who this book is for Big Data Analytics with Hadoop 3 is for you if you are looking to build high-performance analytics solutions for your enterprise or business using Hadoop 3’s powerful features, or you’re new to big data analytics. A basic understanding of the Java programming language is required.


Big Data Analytics

Big Data Analytics

Author: Venkat Ankam

Publisher: Packt Publishing Ltd

Published: 2016-09-28

Total Pages: 326

ISBN-13: 1785889702

DOWNLOAD EBOOK

A handy reference guide for data analysts and data scientists to help to obtain value from big data analytics using Spark on Hadoop clusters About This Book This book is based on the latest 2.0 version of Apache Spark and 2.7 version of Hadoop integrated with most commonly used tools. Learn all Spark stack components including latest topics such as DataFrames, DataSets, GraphFrames, Structured Streaming, DataFrame based ML Pipelines and SparkR. Integrations with frameworks such as HDFS, YARN and tools such as Jupyter, Zeppelin, NiFi, Mahout, HBase Spark Connector, GraphFrames, H2O and Hivemall. Who This Book Is For Though this book is primarily aimed at data analysts and data scientists, it will also help architects, programmers, and practitioners. Knowledge of either Spark or Hadoop would be beneficial. It is assumed that you have basic programming background in Scala, Python, SQL, or R programming with basic Linux experience. Working experience within big data environments is not mandatory. What You Will Learn Find out and implement the tools and techniques of big data analytics using Spark on Hadoop clusters with wide variety of tools used with Spark and Hadoop Understand all the Hadoop and Spark ecosystem components Get to know all the Spark components: Spark Core, Spark SQL, DataFrames, DataSets, Conventional and Structured Streaming, MLLib, ML Pipelines and Graphx See batch and real-time data analytics using Spark Core, Spark SQL, and Conventional and Structured Streaming Get to grips with data science and machine learning using MLLib, ML Pipelines, H2O, Hivemall, Graphx, SparkR and Hivemall. In Detail Big Data Analytics book aims at providing the fundamentals of Apache Spark and Hadoop. All Spark components – Spark Core, Spark SQL, DataFrames, Data sets, Conventional Streaming, Structured Streaming, MLlib, Graphx and Hadoop core components – HDFS, MapReduce and Yarn are explored in greater depth with implementation examples on Spark + Hadoop clusters. It is moving away from MapReduce to Spark. So, advantages of Spark over MapReduce are explained at great depth to reap benefits of in-memory speeds. DataFrames API, Data Sources API and new Data set API are explained for building Big Data analytical applications. Real-time data analytics using Spark Streaming with Apache Kafka and HBase is covered to help building streaming applications. New Structured streaming concept is explained with an IOT (Internet of Things) use case. Machine learning techniques are covered using MLLib, ML Pipelines and SparkR and Graph Analytics are covered with GraphX and GraphFrames components of Spark. Readers will also get an opportunity to get started with web based notebooks such as Jupyter, Apache Zeppelin and data flow tool Apache NiFi to analyze and visualize data. Style and approach This step-by-step pragmatic guide will make life easy no matter what your level of experience. You will deep dive into Apache Spark on Hadoop clusters through ample exciting real-life examples. Practical tutorial explains data science in simple terms to help programmers and data analysts get started with Data Science


Too Big to Ignore

Too Big to Ignore

Author: Phil Simon

Publisher: John Wiley & Sons

Published: 2013-03-05

Total Pages: 256

ISBN-13: 1118641868

DOWNLOAD EBOOK

Residents in Boston, Massachusetts are automatically reporting potholes and road hazards via their smartphones. Progressive Insurance tracks real-time customer driving patterns and uses that information to offer rates truly commensurate with individual safety. Google accurately predicts local flu outbreaks based upon thousands of user search queries. Amazon provides remarkably insightful, relevant, and timely product recommendations to its hundreds of millions of customers. Quantcast lets companies target precise audiences and key demographics throughout the Web. NASA runs contests via gamification site TopCoder, awarding prizes to those with the most innovative and cost-effective solutions to its problems. Explorys offers penetrating and previously unknown insights into healthcare behavior. How do these organizations and municipalities do it? Technology is certainly a big part, but in each case the answer lies deeper than that. Individuals at these organizations have realized that they don't have to be Nate Silver to reap massive benefits from today's new and emerging types of data. And each of these organizations has embraced Big Data, allowing them to make astute and otherwise impossible observations, actions, and predictions. It's time to start thinking big. In Too Big to Ignore, recognized technology expert and award-winning author Phil Simon explores an unassailably important trend: Big Data, the massive amounts, new types, and multifaceted sources of information streaming at us faster than ever. Never before have we seen data with the volume, velocity, and variety of today. Big Data is no temporary blip of fad. In fact, it is only going to intensify in the coming years, and its ramifications for the future of business are impossible to overstate. Too Big to Ignore explains why Big Data is a big deal. Simon provides commonsense, jargon-free advice for people and organizations looking to understand and leverage Big Data. Rife with case studies, examples, analysis, and quotes from real-world Big Data practitioners, the book is required reading for chief executives, company owners, industry leaders, and business professionals.