This is the essential how-to manual for anyone interested in baseball research. How to Do Baseball Research updates and greatly expands The Baseball Research Handbook, published by the Society for American Baseball Research (SABR) in 1987. A group of talented SABR members provide information and advice in a variety of areas, including how to use libraries and archives, find illustrations, and prepare manuscripts for publication. Particularly noteworthy is the new information on using the computer for baseball research and statistical analysis. Contributions from SABR committee chairs and longtime SABR researchers add valuable specifics to the fundamental advice in the ten chapters.
Baseball Hacks isn't your typical baseball book--it's a book about how to watch, research, and understand baseball. It's an instruction manual for the free baseball databases. It's a cookbook for baseball research. Every part of this book is designed to teach baseball fans how to do something. In short, it's a how-to book--one that will increase your enjoyment and knowledge of the game. So much of the way baseball is played today hinges upon interpreting statistical data. Players are acquired based on their performance in statistical categories that ownership deems most important. Managers make in-game decisions based not on instincts, but on probability: how a particular batter might fare against left-handed pitching, for instance. The goal of this unique book is to show fans all the baseball-related stuff that they can do for free (or close to free). Just as open source projects have made great software freely available, collaborative projects such as Retrosheet and Baseball DataBank have made great data freely available. You can use these data sources to research your favorite players, win your fantasy league, or appreciate the game of baseball even more than you do now. Baseball Hacks shows how easy it is to get data, process it, and use it to truly understand baseball. The book lists a number of sources for current and historical baseball data, and explains how to load it into a database for analysis. It then introduces several powerful statistical tools for understanding data and forecasting results. For the uninitiated baseball fan, author Joseph Adler walks readers through the core statistical categories for hitters (batting average, on-base percentage, etc.), pitchers (earned run average, strikeout-to-walk ratio, etc.), and fielders (putouts, errors, etc.). He then extrapolates from these numbers to examine more advanced data groups like career averages, team stats, season-by-season comparisons, and more.
Whether you're a mathematician, scientist, or season-ticket holder to your favorite team, Baseball Hacks is sure to have something for you. Advance praise for Baseball Hacks: "Baseball Hacks is the best book ever written for understanding and practicing baseball analytics. A must-read for baseball professionals and enthusiasts alike." -- Ari Kaplan, database consultant to the Montreal Expos, San Diego Padres, and Baltimore Orioles "The game was born in the 19th century, but the passion for its analysis continues to grow into the 21st. In Baseball Hacks, Joe Adler not only demonstrates that the latest data-mining technologies have useful application to the study of baseball statistics, he also teaches the reader how to do the analysis himself, arming the dedicated baseball fan with tools to take his understanding of the game to a higher level." -- Mark E. Johnson, Ph.D., Founder, SportMetrika, Inc. and Baseball Analyst for the 2004 St. Louis Cardinals
Sandlot Stats uses the national pastime to help students who love baseball learn, and enjoy, statistics. As Derek Jeter strolls toward the plate, the announcer tosses out a smattering of statistics, from hitting streaks to batting averages. But what do the numbers mean? And how can America’s favorite pastime be a model for learning about statistics? Sandlot Stats is an innovative textbook that explains the mathematical underpinnings of baseball so that students can understand the world of statistics and probability. Carefully illustrated and filled with exercises and examples, this book teaches the fundamentals of probability and statistics through the feats of baseball legends such as Hank Aaron, Joe DiMaggio, and Ted Williams, and more recent players such as Barry Bonds, Albert Pujols, and Alex Rodriguez. Exercises require only pen-and-paper or Microsoft Excel to perform the analyses. Sandlot Stats covers all the bases, including
• descriptive and inferential statistics
• linear regression and correlation
• probability
• sports betting
• probability distribution functions
• sampling distributions
• hypothesis testing
• confidence intervals
• chi-square distribution
Sandlot Stats offers information covered in most introductory statistics books, yet is peppered with interesting facts from the history of baseball to enhance the interest of the student and make learning fun.
The past 30 years have seen an explosion in the number and variety of baseball books and articles. Following the lead of pioneers Bill James, John Thorn, and Pete Palmer, researchers have steadily challenged the ways we think about player and team performance--and along the way revised what we thought we knew of baseball history. This book by the authors of Understanding Sabermetrics (2008) goes beyond the explanation of new statistics to demonstrate their use in solving some of the more familiar problems of baseball research, such as how to compare players across generations; how to account for the effects of ballparks and rules changes; and how to measure the effectiveness of the sacrifice bunt or the range of the Gold Glove-winning shortstop.
From the authority on baseball research and statistics comes a vast and fascinating compendium of unique baseball lists and records. The SABR Baseball List & Record Book is an expansive collection of pitching, hitting, fielding, home run, team, and rookie records not available online or in any other book. This is a treasure trove of baseball history for statistically minded baseball fans that's also packed with intriguing marginalia. For instance, on July 25, 1967, Chicago's Ken Berry ended Game Two of a doubleheader against Cleveland with a home run in the bottom of the sixteenth inning -- Chicago's second game-winning homer of the day. The comprehensive lists include Most Career Home Runs by Two Brothers (Tommie and Hank Aaron have 768), Most Seasons with 15 or More Wins (Cy Young and Greg Maddux each have 18), and Highest On Base Percentage in a Season by a Rookie (listing every rookie above .400). Unlike other record books that only list the record holders -- say, most RBI by a rookie, held by Ted Williams with 145 -- SABR details every rookie to reach 100 RBI. Other record books might note the last pitcher in each league to steal home; here SABR has included every pitcher to do it. The book also includes a number of idiosyncratic features, such as a rundown of every player who has hit a triple and then stolen home, or every reliever who has won two games in one day. Many of the lists include a comments column for key historical notes and entertaining trivia (Bob Horner hit four home runs in a 1986 game, but his team lost). This is a must-have for every fan's library. Edited by Lyle Spatz, Chairman of the Baseball Records Committee for SABR.
SABR 50 at 50 celebrates and highlights the Society for American Baseball Research’s wide-ranging contributions to baseball history. Established in 1971 in Cooperstown, New York, SABR has sought to foster and disseminate the research of baseball, with groundbreaking work from statisticians, historians, and independent researchers, and has published dozens of articles with far-reaching and long-lasting impact on the game. Among its current membership are many Major and Minor League Baseball officials, broadcasters, and writers as well as numerous former players. The diversity of SABR members’ interests is reflected in this fiftieth-anniversary volume, from baseball and the arts to statistical analysis to the Deadball Era to women in baseball. SABR 50 at 50 includes the most important and influential research published by members across a multitude of topics, including the sabermetric work of Dick Cramer, Pete Palmer, and Bill James, along with Jerry Malloy on the Negro Leagues, Keith Olbermann on why the shortstop position is number 6, John Thorn and Jules Tygiel on the untold story behind Jackie Robinson’s signing with the Dodgers, and Gai Berlage on the Colorado Silver Bullets women’s team in the 1990s. To provide history and context, each notable research article is accompanied by a short introduction. As SABR celebrates fifty years, this collection gathers the organization’s most notable research and baseball history for the serious baseball reader.
Rescued in 2010 from the small creek that runs next to Doubleday Field in Cooperstown, New York, a simple baseball launched an epic quest that spanned the United States and beyond. For eight years, "The Hall Ball" went on a journey to have its picture taken with every member of the Baseball Hall of Fame, both living and deceased. The goal? To enshrine the first crowd-sourced artifact ever donated to the Hall. Part travelogue, part baseball history, part photo journal, this book tells the full story for the first time. The narratives that accompany the ball's odyssey are as funny and moving as any in the history of the game.
With its flexible capabilities and open-source platform, R has become a major tool for analyzing detailed, high-quality baseball data. Analyzing Baseball Data with R provides an introduction to R for sabermetricians, baseball enthusiasts, and students interested in exploring the rich sources of baseball data. It equips readers with the necessary skills and software tools to perform all of the analysis steps, from gathering the datasets and entering them in a convenient format to visualizing the data via graphs to performing a statistical analysis. The authors first present an overview of publicly available baseball datasets and a gentle introduction to the type of data structures and exploratory and data management capabilities of R. They also cover the traditional graphics functions in the base package and introduce more sophisticated graphical displays available through the lattice and ggplot2 packages. Much of the book illustrates the use of R through popular sabermetrics topics, including the Pythagorean formula, runs expectancy, career trajectories, simulation of games and seasons, patterns of streaky behavior of players, and fielding measures. Each chapter contains exercises that encourage readers to perform their own analyses using R. All of the datasets and R code used in the text are available online. This book helps readers answer questions about baseball teams, players, and strategy using large, publicly available datasets. It offers detailed instructions on downloading the datasets and putting them into formats that simplify data exploration and analysis. Through the book’s various examples, readers will learn about modern sabermetrics and be able to conduct their own baseball analyses.