This book is about making machine learning models and their decisions interpretable. After exploring the concepts of interpretability, you will learn about simple, interpretable models such as decision trees, decision rules and linear regression. Later chapters focus on general model-agnostic methods for interpreting black box models, such as feature importance and accumulated local effects, and on explaining individual predictions with Shapley values and LIME. All interpretation methods are explained in depth and discussed critically. How do they work under the hood? What are their strengths and weaknesses? How can their outputs be interpreted? This book will enable you to select and correctly apply the interpretation method that is most suitable for your machine learning project.
An insightful, hands-on focus on the statistical methods used by compensation and human resources professionals in their everyday work
Across various industries, compensation professionals work to organize and analyze aspects of employment that deal with elements of pay, such as deciding base salary, bonus, and commission provided by an employer to its employees for work performed. Acknowledging the numerous quantitative analyses of data that are a part of this everyday work, Statistics for Compensation provides a comprehensive guide to the key statistical tools and techniques needed to perform those analyses and to help organizations make fully informed compensation decisions. This self-contained book is the first of its kind to explore the use of various quantitative methods, from basic notions about percents to multiple linear regression, that are used in the management, design, and implementation of powerful compensation strategies. Drawing upon his extensive experience as a consultant, practitioner, and teacher of both statistics and compensation, the author focuses on the usefulness of the techniques and their immediate application to everyday compensation work, thoroughly explaining major areas such as: • Frequency distributions and histograms • Measures of location and variability • Model building • Linear models • Exponential curve models • Maturity curve models • Power models • Market models and salary survey analysis • Linear and exponential integrated market models • Job pricing market models
Throughout the book, rigorous definitions and step-by-step procedures clearly explain and demonstrate how to apply the presented statistical techniques. Each chapter concludes with a set of exercises, and various case studies showcase the topic's real-world relevance. The book also features an extensive glossary of key statistical terms and an appendix with technical details. Data for the examples and practice problems are available in the book and on a related FTP site.
Statistics for Compensation is an excellent reference for compensation professionals, human resources professionals, and other practitioners responsible for any aspect of base pay, incentive pay, sales compensation, and executive compensation in their organizations. It can also serve as a supplement for compensation courses at the upper-undergraduate and graduate levels.
Maximize performance with better data
Developing a successful workforce requires more than a gut check. Data can help guide your decisions on everything from where to seat a team to optimizing production processes to engaging with your employees in ways that ring true to them. People analytics is the study of your number one business asset, your people, and this book shows you how to collect data, analyze that data, and then apply your findings to create a happier and more engaged workforce. • Start a people analytics project • Work with qualitative data • Collect data via communications • Find the right tools and approach for analyzing data
If your organization is ready to better understand why high performers leave, why one department has more personnel issues than another, and why employees violate, People Analytics For Dummies makes it easier.
Companion Website materials: https://tzkeith.com/ Multiple Regression and Beyond offers a conceptually-oriented introduction to multiple regression (MR) analysis and structural equation modeling (SEM), along with analyses that flow naturally from those methods. By focusing on the concepts and purposes of MR and related methods, rather than the derivation and calculation of formulae, this book introduces material to students more clearly, and in a less threatening way. In addition to illuminating content necessary for coursework, the accessibility of this approach means students are more likely to be able to conduct research using MR or SEM--and more likely to use the methods wisely. This book: • Covers both MR and SEM, while explaining their relevance to one another • Includes path analysis, confirmatory factor analysis, and latent growth modeling • Makes extensive use of real-world research examples in the chapters and in the end-of-chapter exercises • Extensive use of figures and tables providing examples and illustrating key concepts and techniques New to this edition: • New chapter on mediation, moderation, and common cause • New chapter on the analysis of interactions with latent variables and multilevel SEM • Expanded coverage of advanced SEM techniques in chapters 18 through 22 • International case studies and examples • Updated instructor and student online resources
Handbook of Statistical Analysis and Data Mining Applications, Second Edition, is a comprehensive professional reference book that guides business analysts, scientists, engineers and researchers, both academic and industrial, through all stages of data analysis, model building and implementation. The handbook helps users discern technical and business problems, understand the strengths and weaknesses of modern data mining algorithms and employ the right statistical methods for practical application. This book is an ideal reference for users who want to address massive and complex datasets with novel statistical approaches and be able to objectively evaluate analyses and solutions. It has clear, intuitive explanations of the principles and tools for solving problems using modern analytic techniques and discusses their application to real problems in ways accessible and beneficial to practitioners across several areas—from science and engineering, to medicine, academia and commerce. - Includes input by practitioners for practitioners - Includes tutorials in numerous fields of study that provide step-by-step instruction on how to use supplied tools to build models - Contains practical advice from successful real-world implementations - Brings together, in a single resource, all the information a beginner needs to understand the tools and issues in data mining to build successful data mining solutions - Features clear, intuitive explanations of novel analytical tools and techniques, and their practical applications
Learn how to use R to turn raw data into insight, knowledge, and understanding. This book introduces you to R, RStudio, and the tidyverse, a collection of R packages designed to work together to make data science fast, fluent, and fun. Suitable for readers with no previous programming experience, R for Data Science is designed to get you doing data science as quickly as possible. Authors Hadley Wickham and Garrett Grolemund guide you through the steps of importing, wrangling, exploring, and modeling your data and communicating the results. You'll get a complete, big-picture understanding of the data science cycle, along with basic tools you need to manage the details. Each section of the book is paired with exercises to help you practice what you've learned along the way. You'll learn how to: • Wrangle: transform your datasets into a form convenient for analysis • Program: learn powerful R tools for solving data problems with greater clarity and ease • Explore: examine your data, generate hypotheses, and quickly test them • Model: provide a low-dimensional summary that captures true "signals" in your dataset • Communicate: learn R Markdown for integrating prose, code, and results
Effectively and ethically leveraging people data to deliver real business value is what sets the best HR leaders and teams apart. Excellence in People Analytics provides business and human resources leaders with everything they need to know about creating value from people analytics. Written by two leading experts in the field, this practical guide outlines how to create sustainable business value with people analytics and develop a data-driven culture in HR. Most importantly, it allows HR professionals and business executives to translate their data into tangible actions to improve business performance while navigating the rapidly evolving world of work. Full of practical tools and advice assembled around the Insight222 Nine Dimensions in People Analytics® model, this book demonstrates how to use people data to increase profits, improve staff retention and workplace productivity as well as develop individual employee experience. Featuring case studies from leading companies including Microsoft, HSBC, Syngenta, Capital One, Novartis, Bosch, Uber, Santander Brasil and American Eagle Outfitters®, Excellence in People Analytics is essential reading for all HR professionals needing to unlock the potential in their people data and gain competitive advantage.
Applied Predictive Modeling covers the overall predictive modeling process, beginning with the crucial steps of data preprocessing, data splitting and foundations of model tuning. The text then provides intuitive explanations of numerous common and modern regression and classification techniques, always with an emphasis on illustrating and solving real data problems. The text illustrates all parts of the modeling process through many hands-on, real-life examples, and every chapter contains extensive R code for each step of the process. This multi-purpose text can be used as an introduction to predictive models and the overall modeling process, a practitioner’s reference handbook, or as a text for advanced undergraduate or graduate level predictive modeling courses. To that end, each chapter contains problem sets to help solidify the covered concepts and uses data available in the book’s R package. This text is intended for a broad audience as both an introduction to predictive models as well as a guide to applying them. Non-mathematical readers will appreciate the intuitive explanations of the techniques while an emphasis on problem-solving with real data across a wide variety of applications will aid practitioners who wish to extend their expertise. Readers should have knowledge of basic statistical ideas, such as correlation and linear regression analysis. While the text is biased against complex equations, a mathematical background is needed for advanced topics.
A comprehensive overview of data science covering the analytics, programming, and business skills necessary to master the discipline
Finding a good data scientist has been likened to hunting for a unicorn: the required combination of technical skills is simply very hard to find in one person. In addition, good data science is not just rote application of trainable skill sets; it requires the ability to think flexibly about all these areas and understand the connections between them. This book provides a crash course in data science, combining all the necessary skills into a unified discipline. Unlike many analytics books, computer science and software engineering are given extensive coverage since they play such a central role in the daily work of a data scientist. The author also describes classic machine learning algorithms, from their mathematical foundations to real-world applications. Visualization tools are reviewed, and their central importance in data science is highlighted. Classical statistics is addressed to help readers think critically about the interpretation of data and its common pitfalls. The clear communication of technical results, which is perhaps the most undertrained of data science skills, is given its own chapter, and all topics are explained in the context of solving real-world data problems.
The book also features: • Extensive sample code and tutorials using Python™ along with its technical libraries • Core technologies of “Big Data,” including their strengths and limitations and how they can be used to solve real-world problems • Coverage of the practical realities of the tools, keeping theory to a minimum; however, when theory is presented, it is done in an intuitive way to encourage critical thinking and creativity • A wide variety of case studies from industry • Practical advice on the realities of being a data scientist today, including the overall workflow, where time is spent, the types of datasets worked on, and the skill sets needed
The Data Science Handbook is an ideal resource for data analysis methodology and big data software tools. The book is appropriate for people who want to practice data science, but lack the required skill sets. This includes software professionals who need to better understand analytics and statisticians who need to understand software. Modern data science is a unified discipline, and it is presented as such. This book is also an appropriate reference for researchers and entry-level graduate students who need to learn real-world analytics and expand their skill set.
FIELD CADY is the data scientist at the Allen Institute for Artificial Intelligence, where he develops tools that use machine learning to mine scientific literature. He has also worked at Google and several Big Data startups. He has a BS in physics and math from Stanford University, and an MS in computer science from Carnegie Mellon.
People analytics (also known as HR analytics) is revolutionizing Human Resource Management. Get ready for the future of HR and discover how you can leverage the power of data to drive better outcomes for your business and employees. We set out to write an inspiring book for (HR) professionals, managers, and directors who want to get a feel for the scope of HR analytics and learn how it can help both the employees and the business. In this book, we combined our experiences with lots of inspiring examples. It's concise, easy to read and teaches you all the basic principles of people analytics. After reading this book, you will: - have a solid understanding of what HR analytics is - know the difference between HR analytics and HR reporting - have a clear picture of the scope and the added value of HR analytics - understand the capabilities needed to build an HR analytics team - have plenty of ideas for applying HR analytics to your organization - know which pitfalls to avoid to prevent failure Who should read this book? If you're new to HR analytics and want to learn all the basics without having to plow through pages full of jargon, this book is for you. It's concise and easy to read, especially for people without a background in statistics or IT. Also, if you're not working in human resource management yet and want to explore this exciting new field, this book provides you the foundation you are looking for. If you already have a career in HR analytics and are looking for in-depth knowledge and information, this book is NOT for you. It's definitely interesting and inspiring for those who have already started, but don't expect in-depth (statistical) information.