The most comprehensive book on the engineering aspects of building reliable AI systems. "If you intend to use machine learning to solve business problems at scale, I'm delighted you got your hands on this book." -Cassie Kozyrkov, Chief Decision Scientist at Google "Foundational work about the reality of building machine learning models in production." -Karolis Urbonas, Head of Machine Learning and Science at Amazon
Field-tested tips, tricks, and design patterns for building machine learning projects that are deployable, maintainable, and secure from concept to production. In Machine Learning Engineering in Action, you will learn: Evaluating data science problems to find the most effective solution Scoping a machine learning project for usage expectations and budget Process techniques that minimize wasted effort and speed up production Assessing a project using standardized prototyping work and statistical validation Choosing the right technologies and tools for your project Making your codebase more understandable, maintainable, and testable Automating your troubleshooting and logging practices Ferrying a machine learning project from your data science team to your end users is no easy task. Machine Learning Engineering in Action will help you make it simple. Inside, you'll find fantastic advice from veteran industry expert Ben Wilson, Principal Resident Solutions Architect at Databricks. Ben introduces his personal toolbox of techniques for building deployable and maintainable production machine learning systems. You'll learn the importance of Agile methodologies for fast prototyping and conferring with stakeholders, while developing a new appreciation for the importance of planning. Adopting well-established software development standards will help you deliver better code management, and make it easier to test, scale, and even reuse your machine learning code. Every method is explained in a friendly, peer-to-peer style and illustrated with production-ready source code. About the technology Deliver maximum performance from your models and data. This collection of reproducible techniques will help you build stable data pipelines, efficient application workflows, and maintainable models every time. Based on decades of good software engineering practice, machine learning engineering ensures your ML systems are resilient, adaptable, and perform in production. About the book Machine Learning Engineering in Action teaches you core principles and practices for designing, building, and delivering successful machine learning projects. You'll discover software engineering techniques like conducting experiments on your prototypes and implementing modular design that result in resilient architectures and consistent cross-team communication. Based on the author's extensive experience, every method in this book has been used to solve real-world projects. What's inside Scoping a machine learning project for usage expectations and budget Choosing the right technologies for your design Making your codebase more understandable, maintainable, and testable Automating your troubleshooting and logging practices About the reader For data scientists who know machine learning and the basics of object-oriented programming. About the author Ben Wilson is Principal Resident Solutions Architect at Databricks, where he developed the Databricks Labs AutoML project, and is an MLflow committer.
Supercharge the value of your machine learning models by building scalable and robust solutions that can serve them in production environments Key Features Explore hyperparameter optimization and model management tools Learn object-oriented programming and functional programming in Python to build your own ML libraries and packages Explore key ML engineering patterns like microservices and the Extract Transform Machine Learn (ETML) pattern with use cases Book DescriptionMachine learning engineering is a thriving discipline at the interface of software development and machine learning. This book will help developers working with machine learning and Python to put their knowledge to work and create high-quality machine learning products and services. Machine Learning Engineering with Python takes a hands-on approach to help you get to grips with essential technical concepts, implementation patterns, and development methodologies to have you up and running in no time. You'll begin by understanding key steps of the machine learning development life cycle before moving on to practical illustrations and getting to grips with building and deploying robust machine learning solutions. As you advance, you'll explore how to create your own toolsets for training and deployment across all your projects in a consistent way. The book will also help you get hands-on with deployment architectures and discover methods for scaling up your solutions while building a solid understanding of how to use cloud-based tools effectively. Finally, you'll work through examples to help you solve typical business problems. By the end of this book, you'll be able to build end-to-end machine learning services using a variety of techniques and design your own processes for consistently performant machine learning engineering.What you will learn Find out what an effective ML engineering process looks like Uncover options for automating training and deployment and learn how to use them Discover how to build your own wrapper libraries for encapsulating your data science and machine learning logic and solutions Understand what aspects of software engineering you can bring to machine learning Gain insights into adapting software engineering for machine learning using appropriate cloud technologies Perform hyperparameter tuning in a relatively automated way Who this book is for This book is for machine learning engineers, data scientists, and software developers who want to build robust software solutions with machine learning components. If you're someone who manages or wants to understand the production life cycle of these systems, you'll find this book useful. Intermediate-level knowledge of Python is necessary.
Get up and running, and productive in no time with MLflow using the most effective machine learning engineering approach Key FeaturesExplore machine learning workflows for stating ML problems in a concise and clear manner using MLflowUse MLflow to iteratively develop a ML model and manage it Discover and work with the features available in MLflow to seamlessly take a model from the development phase to a production environmentBook Description MLflow is a platform for the machine learning life cycle that enables structured development and iteration of machine learning models and a seamless transition into scalable production environments. This book will take you through the different features of MLflow and how you can implement them in your ML project. You will begin by framing an ML problem and then transform your solution with MLflow, adding a workbench environment, training infrastructure, data management, model management, experimentation, and state-of-the-art ML deployment techniques on the cloud and premises. The book also explores techniques to scale up your workflow as well as performance monitoring techniques. As you progress, you'll discover how to create an operational dashboard to manage machine learning systems. Later, you will learn how you can use MLflow in the AutoML, anomaly detection, and deep learning context with the help of use cases. In addition to this, you will understand how to use machine learning platforms for local development as well as for cloud and managed environments. This book will also show you how to use MLflow in non-Python-based languages such as R and Java, along with covering approaches to extend MLflow with Plugins. By the end of this machine learning book, you will be able to produce and deploy reliable machine learning algorithms using MLflow in multiple environments. What you will learnDevelop your machine learning project locally with MLflow's different featuresSet up a centralized MLflow tracking server to manage multiple MLflow experimentsCreate a model life cycle with MLflow by creating custom modelsUse feature streams to log model results with MLflowDevelop the complete training pipeline infrastructure using MLflow featuresSet up an inference-based API pipeline and batch pipeline in MLflowScale large volumes of data by integrating MLflow with high-performance big data librariesWho this book is for This book is for data scientists, machine learning engineers, and data engineers who want to gain hands-on machine learning engineering experience and learn how they can manage an end-to-end machine learning life cycle with the help of MLflow. Intermediate-level knowledge of the Python programming language is expected.
Feature engineering is a crucial step in the machine-learning pipeline, yet this topic is rarely examined on its own. With this practical book, you’ll learn techniques for extracting and transforming features—the numeric representations of raw data—into formats for machine-learning models. Each chapter guides you through a single data problem, such as how to represent text or image data. Together, these examples illustrate the main principles of feature engineering. Rather than simply teach these principles, authors Alice Zheng and Amanda Casari focus on practical application with exercises throughout the book. The closing chapter brings everything together by tackling a real-world, structured dataset with several feature-engineering techniques. Python packages including numpy, Pandas, Scikit-learn, and Matplotlib are used in code examples. You’ll examine: Feature engineering for numeric data: filtering, binning, scaling, log transforms, and power transforms Natural text techniques: bag-of-words, n-grams, and phrase detection Frequency-based filtering and feature scaling for eliminating uninformative features Encoding techniques of categorical variables, including feature hashing and bin-counting Model-based feature engineering with principal component analysis The concept of model stacking, using k-means as a featurization technique Image feature extraction with manual and deep-learning techniques
All engineers and applied scientists will need to harness the power of machine learning to solve the highly complex and data intensive problems now emerging. This text teaches state-of-the-art machine learning technologies to students and practicing engineers from the traditionally “analog” disciplines—mechanical, aerospace, chemical, nuclear, and civil. Dr. McClarren examines these technologies from an engineering perspective and illustrates their specific value to engineers by presenting concrete examples based on physical systems. The book proceeds from basic learning models to deep neural networks, gradually increasing readers’ ability to apply modern machine learning techniques to their current work and to prepare them for future, as yet unknown, problems. Rather than taking a black box approach, the author teaches a broad range of techniques while conveying the kinds of problems best addressed by each. Examples and case studies in controls, dynamics, heat transfer, and other engineering applications are implemented in Python and the libraries scikit-learn and tensorflow, demonstrating how readers can apply the most up-to-date methods to their own problems. The book equally benefits undergraduate engineering students who wish to acquire the skills required by future employers, and practicing engineers who wish to expand and update their problem-solving toolkit.
A large international conference on Advances in Machine Learning and Systems Engineering was held in UC Berkeley, California, USA, October 20-22, 2009, under the auspices of the World Congress on Engineering and Computer Science (WCECS 2009). Machine Learning and Systems Engineering contains forty-six revised and extended research articles written by prominent researchers participating in the conference. Topics covered include Expert system, Intelligent decision making, Knowledge-based systems, Knowledge extraction, Data analysis tools, Computational biology, Optimization algorithms, Experiment designs, Complex system identification, Computational modeling, and industrial applications. Machine Learning and Systems Engineering offers the state of the art of tremendous advances in machine learning and systems engineering and also serves as an excellent reference text for researchers and graduate students, working on machine learning and systems engineering.
Summary Deep Learning and the Game of Go teaches you how to apply the power of deep learning to complex reasoning tasks by building a Go-playing AI. After exposing you to the foundations of machine and deep learning, you'll use Python to build a bot and then teach it the rules of the game. Foreword by Thore Graepel, DeepMind Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications. About the Technology The ancient strategy game of Go is an incredible case study for AI. In 2016, a deep learning-based system shocked the Go world by defeating a world champion. Shortly after that, the upgraded AlphaGo Zero crushed the original bot by using deep reinforcement learning to master the game. Now, you can learn those same deep learning techniques by building your own Go bot! About the Book Deep Learning and the Game of Go introduces deep learning by teaching you to build a Go-winning bot. As you progress, you'll apply increasingly complex training techniques and strategies using the Python deep learning library Keras. You'll enjoy watching your bot master the game of Go, and along the way, you'll discover how to apply your new deep learning skills to a wide range of other scenarios! What's inside Build and teach a self-improving game AI Enhance classical game AI systems with deep learning Implement neural networks for deep learning About the Reader All you need are basic Python skills and high school-level math. No deep learning experience required. About the Author Max Pumperla and Kevin Ferguson are experienced deep learning specialists skilled in distributed systems and data science. Together, Max and Kevin built the open source bot BetaGo. Table of Contents PART 1 - FOUNDATIONS Toward deep learning: a machine-learning introduction Go as a machine-learning problem Implementing your first Go bot PART 2 - MACHINE LEARNING AND GAME AI Playing games with tree search Getting started with neural networks Designing a neural network for Go data Learning from data: a deep-learning bot Deploying bots in the wild Learning by practice: reinforcement learning Reinforcement learning with policy gradients Reinforcement learning with value methods Reinforcement learning with actor-critic methods PART 3 - GREATER THAN THE SUM OF ITS PARTS AlphaGo: Bringing it all together AlphaGo Zero: Integrating tree search with reinforcement learning
The book provides foundations of machine learning and algorithms with a road map to deep learning, genesis of machine learning, installation of Python, supervised machine learning algorithms and implementations in Python or R, unsupervised machine learning algorithms in Python or R including natural language processing techniques and algorithms, Bayesian statistics, origins of deep learning, neural networks, and all the deep learning algorithms with some implementations in TensorFlow and architectures, installation of TensorFlow, neural net implementations in TensorFlow, Amazon ecosystem for machine learning, swarm intelligence, machine learning algorithms, in-memory computing, genetic algorithms, real-world research projects with supercomputers, deep learning frameworks with Intel deep learning platform, Nvidia deep learning frameworks, IBM PowerAI deep learning frameworks, H2O AI deep learning framework, HPC with deep learning frameworks, GPUs and CPUs, memory architectures, history of supercomputing, infrastructure for supercomputing, installation of Hadoop on Linux operating system, design considerations, e-Therapeutics's big data project, infrastructure for in-memory data fabric Hadoop, healthcare and best practices for data strategies, R, architectures, NoSQL databases, HPC with parallel computing, MPI for data science and HPC, and JupyterLab for HPC.