Hadoop in 24 Hours, Sams Teach Yourself

Hadoop in 24 Hours, Sams Teach Yourself

Author: Jeffrey Aven

Publisher: Sams Publishing

Published: 2017-04-07

Total Pages: 851

ISBN-13: 0134456726

DOWNLOAD EBOOK

Apache Hadoop is the technology at the heart of the Big Data revolution, and Hadoop skills are in enormous demand. Now, in just 24 lessons of one hour or less, you can learn all the skills and techniques you'll need to deploy each key component of a Hadoop platform in your local environment or in the cloud, building a fully functional Hadoop cluster and using it with real programs and datasets. Each short, easy lesson builds on all that's come before, helping you master all of Hadoop's essentials, and extend it to meet your unique challenges. Apache Hadoop in 24 Hours, Sams Teach Yourself covers all this, and much more: Understanding Hadoop and the Hadoop Distributed File System (HDFS) Importing data into Hadoop, and process it there Mastering basic MapReduce Java programming, and using advanced MapReduce API concepts Making the most of Apache Pig and Apache Hive Implementing and administering YARN Taking advantage of the full Hadoop ecosystem Managing Hadoop clusters with Apache Ambari Working with the Hadoop User Environment (HUE) Scaling, securing, and troubleshooting Hadoop environments Integrating Hadoop into the enterprise Deploying Hadoop in the cloud Getting started with Apache Spark Step-by-step instructions walk you through common questions, issues, and tasks; Q-and-As, Quizzes, and Exercises build and test your knowledge; "Did You Know?" tips offer insider advice and shortcuts; and "Watch Out!" alerts help you avoid pitfalls. By the time you're finished, you'll be comfortable using Apache Hadoop to solve a wide spectrum of Big Data problems.


Sams Teach Yourself Hadoop in 24 Hours

Sams Teach Yourself Hadoop in 24 Hours

Author: Jeffrey Aven

Publisher: Sams Publishing

Published: 2017

Total Pages: 0

ISBN-13: 9780672338526

DOWNLOAD EBOOK

Apache Hadoop is the technology at the heart of the Big Data revolution, and Hadoop skills are in enormous demand. Now, in just 24 lessons of one hour or less, students can learn all the skills and techniques they'll need to deploy each key component of a Hadoop platform in a local environment or in the cloud, building a fully functional Hadoop cluster and using it with real programs and datasets. Each short, easy lesson builds on all that's come before, helping students master all of Hadoop's essentials, and extend it to meet real-world challenges. Apache Hadoop in 24 Hours, Sams Teach Yourself covers all this, and much more: Understanding Hadoop and the Hadoop Distributed File System (HDFS) Importing data into Hadoop, and process it there Mastering basic MapReduce Java programming, and using advanced MapReduce API concepts Making the most of Apache Pig and Apache Hive Implementing and administering YARN Taking advantage of the full Hadoop ecosystem Managing Hadoop clusters with Apache Ambari Working with the Hadoop User Environment (HUE) Scaling, securing, and troubleshooting Hadoop environments Integrating Hadoop into the enterprise Deploying Hadoop in the cloud Getting started with Apache Spark Step-by-step instructions walk students through common questions, issues, and tasks; Q-and-As, Quizzes, and Exercises build and test your knowledge; Did You Know? tips offer insider advice and shortcuts; and Watch Out! alerts help avoid pitfalls. By the time they're finished, they'll be comfortable using Apache Hadoop to solve a wide spectrum of Big Data problems.


Apache Spark in 24 Hours, Sams Teach Yourself

Apache Spark in 24 Hours, Sams Teach Yourself

Author: Jeffrey Aven

Publisher: Sams Publishing

Published: 2016-08-31

Total Pages: 1353

ISBN-13: 0134445821

DOWNLOAD EBOOK

Apache Spark is a fast, scalable, and flexible open source distributed processing engine for big data systems and is one of the most active open source big data projects to date. In just 24 lessons of one hour or less, Sams Teach Yourself Apache Spark in 24 Hours helps you build practical Big Data solutions that leverage Spark’s amazing speed, scalability, simplicity, and versatility. This book’s straightforward, step-by-step approach shows you how to deploy, program, optimize, manage, integrate, and extend Spark–now, and for years to come. You’ll discover how to create powerful solutions encompassing cloud computing, real-time stream processing, machine learning, and more. Every lesson builds on what you’ve already learned, giving you a rock-solid foundation for real-world success. Whether you are a data analyst, data engineer, data scientist, or data steward, learning Spark will help you to advance your career or embark on a new career in the booming area of Big Data. Learn how to • Discover what Apache Spark does and how it fits into the Big Data landscape • Deploy and run Spark locally or in the cloud • Interact with Spark from the shell • Make the most of the Spark Cluster Architecture • Develop Spark applications with Scala and functional Python • Program with the Spark API, including transformations and actions • Apply practical data engineering/analysis approaches designed for Spark • Use Resilient Distributed Datasets (RDDs) for caching, persistence, and output • Optimize Spark solution performance • Use Spark with SQL (via Spark SQL) and with NoSQL (via Cassandra) • Leverage cutting-edge functional programming techniques • Extend Spark with streaming, R, and Sparkling Water • Start building Spark-based machine learning and graph-processing applications • Explore advanced messaging technologies, including Kafka • Preview and prepare for Spark’s next generation of innovations Instructions walk you through common questions, issues, and tasks; Q-and-As, Quizzes, and Exercises build and test your knowledge; "Did You Know?" tips offer insider advice and shortcuts; and "Watch Out!" alerts help you avoid pitfalls. By the time you're finished, you'll be comfortable using Apache Spark to solve a wide spectrum of Big Data problems.


Sams Teach Yourself Node.js in 24 Hours

Sams Teach Yourself Node.js in 24 Hours

Author: George Ornbo

Publisher: Sams Publishing

Published: 2012-09-05

Total Pages: 1029

ISBN-13: 0132966263

DOWNLOAD EBOOK

In just 24 sessions of one hour or less, Sams Teach Yourself Node.js in 24 Hours will help you master the Node.js platform and use it to build server-side applications with extraordinary speed and scalability. Using this text’s straightforward, step-by-step approach, you’ll move from basic installation, configuration, and programming all the way through real-time messaging between browser and server, testing and deployment. Every lesson and case-study application builds on what you’ve already learned, giving you a rock-solid foundation for real-world success! Step-by-step instructions carefully walk you through the most common Node.js development tasks. Quizzes and Exercises at the end of each chapter help you test your knowledge. By the Way notes present valuable additional information related to the discussion. Did You Know? tips offer advice or show you easier ways to perform tasks. Watch Out! cautions alert you to possible problems and give you advice on how to avoid them. Learn how to... · Create end-to-end applications entirely in JavaScript · Master essential Node.js concepts like callbacks and quickly create your first program · Create basic sites with the HTTP module and Express web framework · Manage data persistence with Node.js and MongoDB · Debug and test Node.js applications · Deploy Node.js applications to thirdparty services, such as Heroku and Nodester · Build powerful real-time solutions, from chat servers to Twitter clients · Create JSON APIs using JavaScript on the server · Use core components of the Node.js API, including processes, child processes, events, buffers, and streams · Create and publish a Node.js module


Sams Teach Yourself SAP in 24 Hours

Sams Teach Yourself SAP in 24 Hours

Author: Tim Rhodes

Publisher: Pearson Education

Published: 2004-07-16

Total Pages: 529

ISBN-13: 0132715104

DOWNLOAD EBOOK

Third Edition: Thoroughly Updated and Expanded, with Extensive New Coverage! In just 24 sessions of one hour or less, you’ll master the entire SAP project lifecycle, from planning through implementation and system administration through day-to-day operations. Using this book’s straightforward, step-by-step approach, you’ll gain a strong real-world foundation in both the technology and business essentials of today’s SAP products and applications—from the ground up. Step-by-step instructions walk you through the most common questions, issues, and tasks you’ll encounter with SAP. Case study-based exercises help you build and test your knowledge. By the Way notes present interesting pieces of information. Did You Know? tips offer advice or teach an easier way. Watch Out! cautions warn about potential problems. Learn how to... Understand SAP’s newest products for enterprises and small-to-midsize businesses, and choose the right solutions for your company Discover how SAP integrates with Web services and service-oriented architecture Develop an efficient roadmap for deploying SAP in your environment Plan your SAP implementation from business, functional, technical, and project management perspectives Leverage NetWeaver 7.0 features to streamline development and integration, and reduce cost Walk through a step-by-step SAP technical installation Master basic SAP system administration and operations Perform essential tasks such as logon, session management, and printing Build SAP queries and reports Prepare for SAP upgrades and enhancements Develop your own personal career as an SAP professional Register your book at informit.com/title/9780137142842 for convenient access to updates and corrections as they become available.


Data Analytics with Spark Using Python

Data Analytics with Spark Using Python

Author: Jeffrey Aven

Publisher: Addison-Wesley Professional

Published: 2018-06-18

Total Pages: 772

ISBN-13: 0134844874

DOWNLOAD EBOOK

Solve Data Analytics Problems with Spark, PySpark, and Related Open Source Tools Spark is at the heart of today’s Big Data revolution, helping data professionals supercharge efficiency and performance in a wide range of data processing and analytics tasks. In this guide, Big Data expert Jeffrey Aven covers all you need to know to leverage Spark, together with its extensions, subprojects, and wider ecosystem. Aven combines a language-agnostic introduction to foundational Spark concepts with extensive programming examples utilizing the popular and intuitive PySpark development environment. This guide’s focus on Python makes it widely accessible to large audiences of data professionals, analysts, and developers—even those with little Hadoop or Spark experience. Aven’s broad coverage ranges from basic to advanced Spark programming, and Spark SQL to machine learning. You’ll learn how to efficiently manage all forms of data with Spark: streaming, structured, semi-structured, and unstructured. Throughout, concise topic overviews quickly get you up to speed, and extensive hands-on exercises prepare you to solve real problems. Coverage includes: • Understand Spark’s evolving role in the Big Data and Hadoop ecosystems • Create Spark clusters using various deployment modes • Control and optimize the operation of Spark clusters and applications • Master Spark Core RDD API programming techniques • Extend, accelerate, and optimize Spark routines with advanced API platform constructs, including shared variables, RDD storage, and partitioning • Efficiently integrate Spark with both SQL and nonrelational data stores • Perform stream processing and messaging with Spark Streaming and Apache Kafka • Implement predictive modeling with SparkR and Spark MLlib


Learning Spark

Learning Spark

Author: Jules S. Damji

Publisher: O'Reilly Media

Published: 2020-07-16

Total Pages: 400

ISBN-13: 1492050016

DOWNLOAD EBOOK

Data is bigger, arrives faster, and comes in a variety of formats—and it all needs to be processed at scale for analytics or machine learning. But how can you process such varied workloads efficiently? Enter Apache Spark. Updated to include Spark 3.0, this second edition shows data engineers and data scientists why structure and unification in Spark matters. Specifically, this book explains how to perform simple and complex data analytics and employ machine learning algorithms. Through step-by-step walk-throughs, code snippets, and notebooks, you’ll be able to: Learn Python, SQL, Scala, or Java high-level Structured APIs Understand Spark operations and SQL Engine Inspect, tune, and debug Spark operations with Spark configurations and Spark UI Connect to data sources: JSON, Parquet, CSV, Avro, ORC, Hive, S3, or Kafka Perform analytics on batch and streaming data using Structured Streaming Build reliable data pipelines with open source Delta Lake and Spark Develop machine learning pipelines with MLlib and productionize models using MLflow


High Performance Spark

High Performance Spark

Author: Holden Karau

Publisher: "O'Reilly Media, Inc."

Published: 2017-05-25

Total Pages: 356

ISBN-13: 1491943173

DOWNLOAD EBOOK

Apache Spark is amazing when everything clicks. But if you haven’t seen the performance improvements you expected, or still don’t feel confident enough to use Spark in production, this practical book is for you. Authors Holden Karau and Rachel Warren demonstrate performance optimizations to help your Spark queries run faster and handle larger data sizes, while using fewer resources. Ideal for software engineers, data engineers, developers, and system administrators working with large-scale data applications, this book describes techniques that can reduce data infrastructure costs and developer hours. Not only will you gain a more comprehensive understanding of Spark, you’ll also learn how to make it sing. With this book, you’ll explore: How Spark SQL’s new interfaces improve performance over SQL’s RDD data structure The choice between data joins in Core Spark and Spark SQL Techniques for getting the most out of standard RDD transformations How to work around performance issues in Spark’s key/value pair paradigm Writing high-performance Spark code without Scala or the JVM How to test for functionality and performance when applying suggested improvements Using Spark MLlib and Spark ML machine learning libraries Spark’s Streaming components and external community packages


Java in 24 Hours, Sams Teach Yourself (Covering Java 9)

Java in 24 Hours, Sams Teach Yourself (Covering Java 9)

Author: Rogers Cadenhead

Publisher: Sams Publishing

Published: 2017-09-08

Total Pages: 1093

ISBN-13: 0134663632

DOWNLOAD EBOOK

Computer programming with Java is easier than it looks. In just 24 lessons of one hour or less, you can learn to write computer programs in Java. Using a straightforward, step-by-step approach, popular author Rogers Cadenhead helps you master the skills and technology you need to create desktop and web programs, web services, an Android app, and even Minecraft mods in Java. Each lesson builds on what you’ve already learned, giving you a rock-solid foundation for real-world success. Full-color figures and clear step-by-step instructions visually show you how to program with Java. Quizzes and Exercises at the end of each chapter help you test your knowledge. Notes, Tips, and Cautions provide related information, advice, and warnings. Learn how to... • Set up your Java programming environment • Write your first working program in just minutes • Control program decisions and behavior • Store and work with information • Build straightforward user interfaces • Create interactive web programs • Use threading to build more responsive programs • Read and write files and XML data • Master best practices for object-oriented programming • Use Java 9’s new HTTP client • Use Java to create an Android app • Expand your skills with closures • Create Minecraft mods with Java Contents at a Glance Part I Getting Started 1 Becoming a Programmer 2 Writing Your First Program 3 Vacationing in Java 4 Understanding How Java Programs Work Part II Learning the Basics of Programming 5 Storing and Changing Information in a Program 6 Using Strings to Communicate 7 Using Conditional Tests to Make Decisions 8 Repeating an Action with Loops Part III Working with Information in New Ways 9 Storing Information with Arrays 10 Creating Your First Object 11 Describing What Your Object is Like 12 Making the Most of Existing Objects Part IV Moving into Advanced Topics 13 Storing Objects in Data Structures 14 Handling Errors in a Program 15 Creating a Threaded Program 16 Using Inner Classes and Closures Part V Programming a Graphical User Interface 17 Building a Simple User Interface in Swing 18 Laying Out a User Interface 19 Responding to User Input Part VI Writing Internet Applications 20 Reading and Writing Files 21 Using Java 9's New HTTP Client 22 Creating Java2D Graphics 23 Creating Minecraft Mods with Java 24 Writing Android Apps Appendixes A Using the NetBeans Integrated Development Environment B Where to Go from Here Java Resources C This Book's Web Site D Fixing a Problem with the Android Studio Emulator