The Art of Feature Engineering

The Art of Feature Engineering

Author: Pablo Duboue

Publisher: Cambridge University Press

Published: 2020-06-25

Total Pages: 287

ISBN-13: 1108709389

DOWNLOAD EBOOK

A practical guide for data scientists who want to improve the performance of any machine learning solution with feature engineering.


Feature Engineering and Selection

Feature Engineering and Selection

Author: Max Kuhn

Publisher: CRC Press

Published: 2019-07-25

Total Pages: 266

ISBN-13: 1351609467

DOWNLOAD EBOOK

The process of developing predictive models includes many stages. Most resources focus on the modeling algorithms but neglect other critical aspects of the modeling process. This book describes techniques for finding the best representations of predictors for modeling and for nding the best subset of predictors for improving model performance. A variety of example data sets are used to illustrate the techniques along with R programs for reproducing the results.


Feature Engineering for Machine Learning

Feature Engineering for Machine Learning

Author: Alice Zheng

Publisher: "O'Reilly Media, Inc."

Published: 2018-03-23

Total Pages: 218

ISBN-13: 1491953195

DOWNLOAD EBOOK

Feature engineering is a crucial step in the machine-learning pipeline, yet this topic is rarely examined on its own. With this practical book, you’ll learn techniques for extracting and transforming features—the numeric representations of raw data—into formats for machine-learning models. Each chapter guides you through a single data problem, such as how to represent text or image data. Together, these examples illustrate the main principles of feature engineering. Rather than simply teach these principles, authors Alice Zheng and Amanda Casari focus on practical application with exercises throughout the book. The closing chapter brings everything together by tackling a real-world, structured dataset with several feature-engineering techniques. Python packages including numpy, Pandas, Scikit-learn, and Matplotlib are used in code examples. You’ll examine: Feature engineering for numeric data: filtering, binning, scaling, log transforms, and power transforms Natural text techniques: bag-of-words, n-grams, and phrase detection Frequency-based filtering and feature scaling for eliminating uninformative features Encoding techniques of categorical variables, including feature hashing and bin-counting Model-based feature engineering with principal component analysis The concept of model stacking, using k-means as a featurization technique Image feature extraction with manual and deep-learning techniques


Python Feature Engineering Cookbook

Python Feature Engineering Cookbook

Author: Soledad Galli

Publisher: Packt Publishing Ltd

Published: 2020-01-22

Total Pages: 364

ISBN-13: 1789807824

DOWNLOAD EBOOK

Extract accurate information from data to train and improve machine learning models using NumPy, SciPy, pandas, and scikit-learn libraries Key FeaturesDiscover solutions for feature generation, feature extraction, and feature selectionUncover the end-to-end feature engineering process across continuous, discrete, and unstructured datasetsImplement modern feature extraction techniques using Python's pandas, scikit-learn, SciPy and NumPy librariesBook Description Feature engineering is invaluable for developing and enriching your machine learning models. In this cookbook, you will work with the best tools to streamline your feature engineering pipelines and techniques and simplify and improve the quality of your code. Using Python libraries such as pandas, scikit-learn, Featuretools, and Feature-engine, you’ll learn how to work with both continuous and discrete datasets and be able to transform features from unstructured datasets. You will develop the skills necessary to select the best features as well as the most suitable extraction techniques. This book will cover Python recipes that will help you automate feature engineering to simplify complex processes. You’ll also get to grips with different feature engineering strategies, such as the box-cox transform, power transform, and log transform across machine learning, reinforcement learning, and natural language processing (NLP) domains. By the end of this book, you’ll have discovered tips and practical solutions to all of your feature engineering problems. What you will learnSimplify your feature engineering pipelines with powerful Python packagesGet to grips with imputing missing valuesEncode categorical variables with a wide set of techniquesExtract insights from text quickly and effortlesslyDevelop features from transactional data and time series dataDerive new features by combining existing variablesUnderstand how to transform, discretize, and scale your variablesCreate informative variables from date and timeWho this book is for This book is for machine learning professionals, AI engineers, data scientists, and NLP and reinforcement learning engineers who want to optimize and enrich their machine learning models with the best features. Knowledge of machine learning and Python coding will assist you with understanding the concepts covered in this book.


Feature Engineering Bookcamp

Feature Engineering Bookcamp

Author: Sinan Ozdemir

Publisher: Simon and Schuster

Published: 2022-10-18

Total Pages: 270

ISBN-13: 1638351406

DOWNLOAD EBOOK

Deliver huge improvements to your machine learning pipelines without spending hours fine-tuning parameters! This book’s practical case-studies reveal feature engineering techniques that upgrade your data wrangling—and your ML results. In Feature Engineering Bookcamp you will learn how to: Identify and implement feature transformations for your data Build powerful machine learning pipelines with unstructured data like text and images Quantify and minimize bias in machine learning pipelines at the data level Use feature stores to build real-time feature engineering pipelines Enhance existing machine learning pipelines by manipulating the input data Use state-of-the-art deep learning models to extract hidden patterns in data Feature Engineering Bookcamp guides you through a collection of projects that give you hands-on practice with core feature engineering techniques. You’ll work with feature engineering practices that speed up the time it takes to process data and deliver real improvements in your model’s performance. This instantly-useful book skips the abstract mathematical theory and minutely-detailed formulas; instead you’ll learn through interesting code-driven case studies, including tweet classification, COVID detection, recidivism prediction, stock price movement detection, and more. About the technology Get better output from machine learning pipelines by improving your training data! Use feature engineering, a machine learning technique for designing relevant input variables based on your existing data, to simplify training and enhance model performance. While fine-tuning hyperparameters or tweaking models may give you a minor performance bump, feature engineering delivers dramatic improvements by transforming your data pipeline. About the book Feature Engineering Bookcamp walks you through six hands-on projects where you’ll learn to upgrade your training data using feature engineering. Each chapter explores a new code-driven case study, taken from real-world industries like finance and healthcare. You’ll practice cleaning and transforming data, mitigating bias, and more. The book is full of performance-enhancing tips for all major ML subdomains—from natural language processing to time-series analysis. What's inside Identify and implement feature transformations Build machine learning pipelines with unstructured data Quantify and minimize bias in ML pipelines Use feature stores to build real-time feature engineering pipelines Enhance existing pipelines by manipulating input data About the reader For experienced machine learning engineers familiar with Python. About the author Sinan Ozdemir is the founder and CTO of Shiba, a former lecturer of Data Science at Johns Hopkins University, and the author of multiple textbooks on data science and machine learning. Table of Contents 1 Introduction to feature engineering 2 The basics of feature engineering 3 Healthcare: Diagnosing COVID-19 4 Bias and fairness: Modeling recidivism 5 Natural language processing: Classifying social media sentiment 6 Computer vision: Object recognition 7 Time series analysis: Day trading with machine learning 8 Feature stores 9 Putting it all together


Feature Extraction, Construction and Selection

Feature Extraction, Construction and Selection

Author: Huan Liu

Publisher: Springer Science & Business Media

Published: 2012-12-06

Total Pages: 418

ISBN-13: 1461557259

DOWNLOAD EBOOK

There is broad interest in feature extraction, construction, and selection among practitioners from statistics, pattern recognition, and data mining to machine learning. Data preprocessing is an essential step in the knowledge discovery process for real-world applications. This book compiles contributions from many leading and active researchers in this growing field and paints a picture of the state-of-art techniques that can boost the capabilities of many existing data mining tools. The objective of this collection is to increase the awareness of the data mining community about the research of feature extraction, construction and selection, which are currently conducted mainly in isolation. This book is part of our endeavor to produce a contemporary overview of modern solutions, to create synergy among these seemingly different branches, and to pave the way for developing meta-systems and novel approaches. Even with today's advanced computer technologies, discovering knowledge from data can still be fiendishly hard due to the characteristics of the computer generated data. Feature extraction, construction and selection are a set of techniques that transform and simplify data so as to make data mining tasks easier. Feature construction and selection can be viewed as two sides of the representation problem.


Introduction to Machine Learning with Python

Introduction to Machine Learning with Python

Author: Andreas C. Müller

Publisher: "O'Reilly Media, Inc."

Published: 2016-09-26

Total Pages: 429

ISBN-13: 1449369898

DOWNLOAD EBOOK

Machine learning has become an integral part of many commercial applications and research projects, but this field is not exclusive to large companies with extensive research teams. If you use Python, even as a beginner, this book will teach you practical ways to build your own machine learning solutions. With all the data available today, machine learning applications are limited only by your imagination. You’ll learn the steps necessary to create a successful machine-learning application with Python and the scikit-learn library. Authors Andreas Müller and Sarah Guido focus on the practical aspects of using machine learning algorithms, rather than the math behind them. Familiarity with the NumPy and matplotlib libraries will help you get even more from this book. With this book, you’ll learn: Fundamental concepts and applications of machine learning Advantages and shortcomings of widely used machine learning algorithms How to represent data processed by machine learning, including which data aspects to focus on Advanced methods for model evaluation and parameter tuning The concept of pipelines for chaining models and encapsulating your workflow Methods for working with text data, including text-specific processing techniques Suggestions for improving your machine learning and data science skills


Feature Engineering Made Easy

Feature Engineering Made Easy

Author: Sinan Ozdemir

Publisher: Packt Publishing Ltd

Published: 2018-01-22

Total Pages: 310

ISBN-13: 1787286479

DOWNLOAD EBOOK

A perfect guide to speed up the predicting power of machine learning algorithms Key Features Design, discover, and create dynamic, efficient features for your machine learning application Understand your data in-depth and derive astonishing data insights with the help of this Guide Grasp powerful feature-engineering techniques and build machine learning systems Book Description Feature engineering is the most important step in creating powerful machine learning systems. This book will take you through the entire feature-engineering journey to make your machine learning much more systematic and effective. You will start with understanding your data—often the success of your ML models depends on how you leverage different feature types, such as continuous, categorical, and more, You will learn when to include a feature, when to omit it, and why, all by understanding error analysis and the acceptability of your models. You will learn to convert a problem statement into useful new features. You will learn to deliver features driven by business needs as well as mathematical insights. You'll also learn how to use machine learning on your machines, automatically learning amazing features for your data. By the end of the book, you will become proficient in Feature Selection, Feature Learning, and Feature Optimization. What you will learn Identify and leverage different feature types Clean features in data to improve predictive power Understand why and how to perform feature selection, and model error analysis Leverage domain knowledge to construct new features Deliver features based on mathematical insights Use machine-learning algorithms to construct features Master feature engineering and optimization Harness feature engineering for real world applications through a structured case study Who this book is for If you are a data science professional or a machine learning engineer looking to strengthen your predictive analytics model, then this book is a perfect guide for you. Some basic understanding of the machine learning concepts and Python scripting would be enough to get started with this book.


Real-World Machine Learning

Real-World Machine Learning

Author: Henrik Brink

Publisher: Simon and Schuster

Published: 2016-09-15

Total Pages: 380

ISBN-13: 1638357005

DOWNLOAD EBOOK

Summary Real-World Machine Learning is a practical guide designed to teach working developers the art of ML project execution. Without overdosing you on academic theory and complex mathematics, it introduces the day-to-day practice of machine learning, preparing you to successfully build and deploy powerful ML systems. Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications. About the Technology Machine learning systems help you find valuable insights and patterns in data, which you'd never recognize with traditional methods. In the real world, ML techniques give you a way to identify trends, forecast behavior, and make fact-based recommendations. It's a hot and growing field, and up-to-speed ML developers are in demand. About the Book Real-World Machine Learning will teach you the concepts and techniques you need to be a successful machine learning practitioner without overdosing you on abstract theory and complex mathematics. By working through immediately relevant examples in Python, you'll build skills in data acquisition and modeling, classification, and regression. You'll also explore the most important tasks like model validation, optimization, scalability, and real-time streaming. When you're done, you'll be ready to successfully build, deploy, and maintain your own powerful ML systems. What's Inside Predicting future behavior Performance evaluation and optimization Analyzing sentiment and making recommendations About the Reader No prior machine learning experience assumed. Readers should know Python. About the Authors Henrik Brink, Joseph Richards and Mark Fetherolf are experienced data scientists engaged in the daily practice of machine learning. Table of Contents PART 1: THE MACHINE-LEARNING WORKFLOW What is machine learning? Real-world data Modeling and prediction Model evaluation and optimization Basic feature engineering PART 2: PRACTICAL APPLICATION Example: NYC taxi data Advanced feature engineering Advanced NLP example: movie review sentiment Scaling machine-learning workflows Example: digital display advertising


Rosie Revere, Engineer

Rosie Revere, Engineer

Author: Andrea Beaty

Publisher: Abrams

Published: 2013-09-03

Total Pages: 38

ISBN-13: 1613125305

DOWNLOAD EBOOK

In this beloved New York Times bestselling picture book, meet Rosie Revere, a seemingly quiet girl by day but a brilliant inventor of gizmos and gadgets by night. Rosie dreams of becoming a great engineer, and her room becomes a secret workshop where she constructs ingenious inventions from odds and ends. From hot dog dispensers to helium pants and python-repelling cheese hats, Rosie's creations would astound anyone—if only she'd let them see. But Rosie is afraid of failure, so she hides her inventions under her bed. That is, until her great-great-aunt Rose (also known as Rosie the Riveter) pays her a visit. Aunt Rose teaches Rosie that the first flop isn't something to fear; it's something to celebrate. Failure only truly happens if you quit. And so, Rosie learns to embrace her passion, celebrate her missteps, and pursue her dreams with persistence. This empowering picture book encourages young readers to explore their creativity, persevere through challenges, and celebrate the journey toward achieving their goals. Whether you're a budding engineer or simply love stories of resilience, Rosie Revere, Engineer is a delightful read for all ages. Add this inspiring tale to your family library and discover the magic of celebrating each failure on the road to success. Don’t miss the book that the Duchess of York recently chose to read aloud at a Literally Healing visit to a children’s hospital. For more STEM-themed adventures, check out other titles by Andrea Beaty and David Roberts, including Ada Twist, Scientist, Iggy Peck, Architect, and Rosie Revere and the Raucous Riveters. “Will no doubt inspire conversations with children about the benefits of failure and the pursuit of dreams.” —School Library Journal Check out all the books in the Questioneers Series: The Questioneers Picture Book Series: Iggy Peck, Architect | Rosie Revere, Engineer | Ada Twist, Scientist | Sofia Valdez, Future Prez | Aaron Slater, Illustrator | Lila Greer, Teacher of the Year The Questioneers Chapter Book Series: Rosie Revere and the Raucous Riveters | Ada Twist and the Perilous Pants | Iggy Peck and the Mysterious Mansion | Sofia Valdez and the Vanishing Vote | Ada Twist and the Disappearing Dogs | Aaron Slater and the Sneaky Snake Questioneers: The Why Files Series: Exploring Flight! | All About Plants! | The Science of Baking | Bug Bonanza! | Rockin’ Robots! Questioneers: Ada Twist, Scientist Series: Ghost Busted | Show Me the Bunny | Ada Twist, Scientist: Brainstorm Book | 5-Minute Ada Twist, Scientist Stories The Questioneers Big Project Book Series: Iggy Peck’s Big Project Book for Amazing Architects | Rosie Revere’s Big Project Book for Bold Engineers | Ada Twist’s Big Project Book for Stellar Scientists | Sofia Valdez’s Big Project Book for Awesome Activists | Aaron Slater’s Big Project Book for Astonishing Artists