Baseball's All-Time Best Sluggers: Adjusted Batting Performance from Strikeouts to Home Runs

by Michael J. Schell

These are only three of the intriguing questions Michael Schell addresses in Baseball's All-Time Best Sluggers, a lively examination of the game of baseball using the most sophisticated statistical tools available. The book provides an in-depth evaluation of every major offensive event in baseball history, and identifies the players with the 100 best seasons and most productive careers. For the first time ever, ballpark effects across baseball history are presented for doubles, triples, right- and left-handed home-run hitting, and strikeouts. The book culminates with a ranking of the game's best all-around batters.

Using a brisk conversational style, Schell brings to the plate the two most important credentials essential to producing a book of this kind: an encyclopedic knowledge of baseball and a professional background in statistics. Building on the traditions of renowned baseball historians Pete Palmer and Bill James, he has analyzed the most important factors impacting the sport, including the relative difficulty of hitting in different ballparks, the length of hitters' careers, the talent pool from which players are drawn, player aging, and changes in the game that have raised or lowered major-league batting averages.

Schell's book finally levels the playing field, giving new credit to hitters who played in adverse conditions, and downgrading others who faced fewer obstacles. It also provides rankings based on players' positions. For example, Derek Jeter ranks 295th out of 1,140 on the best batters list, but jumps to 103rd in the position-adjusted list, reflecting his offensive prowess among shortstops.

Replete with dozens of never-before reported stories and statistics, Baseball's All-Time Best Sluggers will forever shape the way baseball fans view the greatest heroes of America's national pastime.

Answers: 1. Coors Field 2. Mel Ott 3. Barry Bonds, 2001-2004 seasons


Editorial Reviews

Michael J. Schell has produced what may be the most rigorous effort yet to compare baseball players from various eras. And in the process, he has offered a tantalizing suggestion that steroids may not have affected the game as much as many people assume.
Library Journal
Schell (Baseball's All-Time Best Hitters) now calls on statistical analyses to determine the game's top sluggers. His latest study focuses on 1140 major leaguers who appeared at the plate 4000 times. Schell adjusts the numbers compiled by ballplayers, taking into account league averages, the impact of various stadiums, length of career, positions played, and transformations in the game itself. Hardly surprising, the top ten "adjusted home runs per 500 ABs" single-seasons include four each by Barry Bonds and Babe Ruth and one by Mark McGwire and 19th-century slugger Buck Freeman. Baseball fans, ever fascinated with statistics, should enjoy rifling through this information-packed work. For general libraries. Copyright 2005 Reed Business Information.
From the Publisher

"Baseball fans, ever fascinated with statistics, should enjoy rifling through this information-packed work."--Library Journal

"Michael J. Schell has produced what may be the most rigorous effort yet to compare baseball players from various eras. And in the process, he has offered a tantalizing suggestion that steroids may not have affected the game as much as many people assume."--Christopher Shea, The Boston Globe

Chapter One


Why Adjustments Are Needed

King Arthur's quest for it in the Middle Ages became a large part of his legend. Monty Python and Indiana Jones launched their searches in popular 1974 and 1989 movies. The mythic quest for the Holy Grail, the name given in Western tradition to the chalice used by Jesus Christ at his Passover meal the night before his death, is now often a metaphor for a quintessential search.

In the illustrious history of baseball, the "holy grail" is a ranking of each player's overall value on the baseball diamond. Because player skills are multifaceted, it is not clear that such a ranking is possible. In comparing two players, you see that one hits home runs much better, whereas the other gets on base more often, is faster on the base paths, and is a better fielder. So which player should rank higher?

In Baseball's All-Time Best Hitters, I identified which players were best at getting a hit in a given at-bat, calling them the best hitters. Many reviewers either disapproved of or failed to note my definition of "best hitter." Although frequently used in baseball writings, the terms "good hitter" and "best hitter" are rarely defined.

In a July 1997 Sports Illustrated article, Tom Verducci called Tony Gwynn "the best hitter since Ted Williams" while considering only batting average. With the likes of Willie Mays, Hank Aaron, and Mickey Mantle as candidates to rival Gwynn, it is clear that Verducci used best hitter in the same, limited way.

Best Batters and the Offensive Events Used to Determine Them

A broader category is best batters. These are the players with the best all-around ability to produce runs based on events emerging from their plate appearances. Those in the "holy grail" category would be the best players, the players who are best in all-around play.

The goal of Baseball's All-Time Best Sluggers is to identify the best batters in baseball history. Consequently, neither pitching nor fielding is considered in this book.

When a player steps up to the plate, one of more than a dozen events can happen. Here are the major ones. The player can get a single, double, triple, or home run, based on the number of bases earned on a hit. Home runs can be either inside or outside the park. Players also make outs. Outs, like hits, can be divided into several categories, including strikeouts, groundouts, and flyouts. Another major event is a base on balls, sometimes subcategorized into intentional or nonintentional. Additional events include sacrifice flies, sacrifice hits, hit-by-pitch, and reached-base-on-an-error.

The currency of baseball is runs. Since scoring more runs than one's opponent wins ballgames, a batter's primary role is to produce runs. A value could be placed on each offensive event, based on how it contributes to or hampers run scoring. A batter's overall value would then be calculated by multiplying the values of these events by the rate at which he gets them.

This concept, known as linear weights, dates back to the early 1960s and the pioneering work of George Lindsey. In the 1980s, John Thorn and Pete Palmer expanded on this idea with their "Batting Runs" formula, which plays a central role in their "Total Player Rating" in Total Baseball. Batting Runs has adjustments for both era-of-play and ballpark effects.
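As a rough illustration of the linear-weights idea described above, the following Python sketch multiplies a run value for each event by the rate at which a batter produces it and sums the results. The function name, the event labels, and the run values are all hypothetical placeholders chosen for illustration; they are not the weights used by Lindsey, by Thorn and Palmer, or in this book.

```python
# Illustrative run values per event. These are placeholder numbers (an assumption),
# not weights taken from any published formula.
ILLUSTRATIVE_RUN_VALUES = {
    "single": 0.5,
    "double": 0.8,
    "triple": 1.1,
    "home_run": 1.4,
    "walk": 0.3,
    "out": -0.25,
}


def linear_weights_value(event_rates, run_values=ILLUSTRATIVE_RUN_VALUES):
    """Return the sum over events of (run value) x (rate per plate appearance)."""
    return sum(run_values[event] * rate for event, rate in event_rates.items())


# Hypothetical batter: event rates per plate appearance (they sum to 1.0).
example_rates = {
    "single": 0.16,
    "double": 0.05,
    "triple": 0.01,
    "home_run": 0.05,
    "walk": 0.10,
    "out": 0.63,
}
example_value = linear_weights_value(example_rates)
print(f"Estimated runs per plate appearance: {example_value:.4f}")
```

Because outs carry a negative value in this sketch, a batter who makes outs more often is penalized, which mirrors the text's point that events can contribute to or hamper run scoring.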

The study in this book extends the work of Thorn and Palmer. The basic formula, called Event-Specific Batting Runs, is developed. For the adjustment process, we will look at 10 basic offensive events: batting average, doubles-plus-triples (the sum of doubles and triples), triples, home runs, runs, RBIs, walks, strikeouts, stolen bases, and hit-by-pitches. Four adjustments are applied to these events: the effects of aging, ballpark effects, and two factors based on the era of play, as described later in this chapter.

Seven offensive events are used to rank the batters: the four kinds of hits, walks, hit-by-pitches, and total outs. Four of these (triples, home runs, walks, and hit-by-pitches) are basic offensive events. The other three are derived from basic offensive events.

Singles are obtained by subtracting extra-base hits (doubles-plus-triples plus home runs) from hits, where hits equal the batting average multiplied by at-bats. Doubles are obtained by subtracting triples from doubles-plus-triples. Total outs are obtained by subtracting hits from at-bats.
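The arithmetic in the preceding paragraph can be sketched in a few lines of Python. The function and argument names are hypothetical, and the sample season is invented for illustration; only the subtractions themselves come from the text.

```python
def derived_events(batting_average, at_bats, doubles_plus_triples, triples, home_runs):
    """Derive singles, doubles, and total outs from the basic offensive events."""
    hits = batting_average * at_bats
    extra_base_hits = doubles_plus_triples + home_runs
    return {
        "hits": hits,
        "singles": hits - extra_base_hits,
        "doubles": doubles_plus_triples - triples,
        "total_outs": at_bats - hits,
    }


# Invented season line: .300 average in 500 at-bats, 35 doubles-plus-triples,
# 5 triples, and 20 home runs.
season = derived_events(0.300, 500, 35, 5, 20)
for name, value in season.items():
    print(f"{name}: {value:.1f}")
```

In this invented example the batter has 150 hits, so the 55 extra-base hits leave 95 singles, the 35 doubles-plus-triples split into 30 doubles and 5 triples, and the remaining 350 at-bats are outs.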

The reasons why doubles-plus-triples (DPTs), rather than doubles, are considered a basic offensive event are discussed in Chapters 4 and 8. In brief, a double is a hit intermediate between a single and a triple, but more like a triple in nature. The answer to the question of which batted balls become doubles versus triples is blurry, and depends significantly on the speed of the batter. Thus it is better to apply the four adjustments to doubles-plus-triples and to triples and obtain adjusted doubles by subtracting the latter from the former.

Why not use the other four basic offensive events in the Event-Specific Batting Runs formula? Individual run and RBI totals depend on the ability of one's teammates and the batter's position in the lineup, not just on the relative abilities of pitcher and batter. Batters fortunate to have teammates on base more often can garner RBIs more easily; a single hit with a man on third nets an RBI, but a bases-empty single doesn't. Hence runs and RBIs are not used in the best-batter ranking system. They are used, however, to build a case for the existence of clutch hitters, with Yogi Berra, Pie Traynor, and Joe DiMaggio all ranking among the top 10.

Strikeouts, like other outs, hurt run scoring. They haven't been shown, however, to have a significantly different negative value from non-strikeout outs. Consequently they are not needed in the valuation.

Stolen bases are not used for two reasons. First, although they are an offensive event, they are not a batting event. Second, to evaluate stolen bases fairly, one also needs caught stealing data. Caught stealing data, however, have been consistently available only since 1951. Thus it becomes difficult to compare players from before and after that time.

Rankings are also provided for five additional derived offensive events: OBP, slugging average, OPS, Event-Specific Batting Runs (ESBRs), and Career Batter Rating (CBR). The first three events are popular ways to evaluate players, while the final two are the main formulas developed in this book to identify the best batters and to get a preliminary ranking of baseball's best players. Finally, position-adjusted results are also given for ESBRs and CBR.

Philosophy of Player Comparison Over Time

It is a tall order to develop a method that can compare athletic performance fairly over time. For most running and swimming events, race times have progressively declined over time. Does this mean that recent world record holders are better than their forebears? Not necessarily. To ensure fairness, the philosophy of this study involves a complete time-transport of players. In other words, the athlete from bygone times needs to have all the modern advantages today's stars enjoy, such as current equipment, today's sports medicine, training advances, new techniques, and nutritional advances. How can this be done?

The basic strategy is first to rank players within their own eras and then to integrate these era-restricted rankings into an overall ranking. We could develop a statistical method to test whether the collective abilities of players in the league are stable, improve, or decline over time. Such a method would be quite complex, however, and is therefore not attempted. Instead the rankings in this book are based on the assumption that the players from each era are equally talented, as described in Chapter 2.

The Evolving Game of Baseball

To the casual fan, baseball is a game that seemingly has changed little over time. Baseball announcers and reporters encourage this view, often unwittingly, when the accomplishments of current players are compared to those achieved in bygone eras without any caveats. Record-breaking performances, heralded events in the lore of baseball, invite such misperceptions. For example, Mark McGwire's 70 home runs and Sammy Sosa's 66 home runs in 1998 vaulted them to the top of the single-season home run list, past Roger Maris's 61 in 1961 and Babe Ruth's 60 in 1927. Then in 2001 Barry Bonds established a new mark, with 73 home runs. Constancy is only a thin veneer, however, and among the most enduring features of more than a century of major league baseball play is that the game is constantly changing. In Chapter 3, we look at how offensive events have varied across baseball history.

Adjusting Player Averages

In this book, four principal adjustments are applied to a batter's raw statistical data in order to rank his overall batting ability. These adjustments are for hitting feasts and famines, ballpark differences, the talent pool, and late career declines, which are conceptually the same as those used in Baseball's All-Time Best Hitters. Since the rationale for each adjustment is explained in that book, only a brief description suffices here.

Hitting Feasts and Famines: Across time, different aspects of the game are in the ascendancy. For example, batting averages were quite high in the 1920s and 1930s. In today's game, power events such as doubles and home runs are at or near all-time highs. These shifts are adjusted out under the assumption that they have occurred as a result of changes other than shifts in player ability, such as rules, training, equipment, or ballparks. This adjustment makes the performance of the average regular player (defined in Chapter 2) equivalent across time.

Ballpark Effects: It is much easier to collect hits, hit home runs, and score runs in some ballparks than in others. Since players play half their games at home, their true value is more fairly appraised by adjusting out the effects of their home parks. In this book, the adjustments are made using ballpark effects that will be obtained for every basic offensive event except stolen bases and HBPs.

Talent Pool: A key consideration in comparing players from different eras is whether play has remained the same or has improved or declined over time. In Full House, Stephen J. Gould considered the standard deviation of the offensive event being studied to be a measure of the talent pool from which players in that season were selected: a large standard deviation means greater variability in performance among the players, implying that they were drawn from a smaller talent pool.

Although I believe that the standard deviation for a given event only imperfectly measures the talent pool for it, it is still an appropriate adjustment under a "percentile equivalence" assumption explained in Chapter 2.
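To make the role of the standard deviation concrete, here is a minimal Python sketch that compares the spread of batting averages in two invented seasons and expresses a player's average in standard-deviation units (a z-score). The z-score is used here only as a familiar stand-in, an assumption for illustration; it is not the book's "percentile equivalence" procedure, which is explained in Chapter 2. All data and names are hypothetical.

```python
import statistics

# Invented batting averages for the regular players in two seasons.
wide_spread_season = [0.250, 0.270, 0.290, 0.310, 0.330]
narrow_spread_season = [0.270, 0.280, 0.290, 0.300, 0.310]


def z_score(value, season_values):
    """Express a value in standard deviations above the season mean."""
    mean = statistics.mean(season_values)
    spread = statistics.pstdev(season_values)  # population standard deviation
    return (value - mean) / spread


spread_wide = statistics.pstdev(wide_spread_season)
spread_narrow = statistics.pstdev(narrow_spread_season)

# The top batter in each season: .330 in the wide season, .310 in the narrow one.
z_wide = z_score(0.330, wide_spread_season)
z_narrow = z_score(0.310, narrow_spread_season)

print(f"spread (wide season):   {spread_wide:.4f}")
print(f"spread (narrow season): {spread_narrow:.4f}")
print(f"z-scores of top batters: {z_wide:.3f} vs {z_narrow:.3f}")
```

In this invented data the two seasons share the same mean, but the first has twice the spread. The .330 batter therefore stands no further above his peers, in standard-deviation terms, than the .310 batter does above his, which is the kind of comparison a spread-based adjustment is meant to support.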

Late Career Declines: Age and experience certainly play roles in the seasonal performances of ballplayers. Although experience is beneficial, the abilities of most players decline at the end of their careers. I believe that a better comparison can be made between players by discounting late career plate appearances of players with long careers. An aging profile for each offensive event is shown, and a cutoff based on number of at-bats or at-bats plus walks plus hit-by-pitches is used to define a player's "productive career" average.

The 100 Best Batters

Let's take a sneak preview of the best batters through the 2003 season (Table 1.1). It will surprise few baseball fans that Babe Ruth and Ted Williams claim the top two spots. Next comes Rogers Hornsby, who was the standout player in the National League in the 1920s, counterbalancing Ruth's American League dominance. In fourth place is the greatest star from our time, Barry Bonds, who fashioned three seasons (2001, 2002, and 2003) to cap an already fabulous career. Lou Gehrig, Ruth's longtime teammate, claims fifth place. Mickey Mantle places sixth, giving the Yankees three of the top six batters. Stan Musial, Ted Williams' National League counterpart, ranks seventh. Ty Cobb, who received more votes than Babe Ruth in the inaugural Hall of Fame class, ranks as the eighth best batter of all-time. Jimmie Foxx and Willie Mays round out the top 10.

As the four adjustments are further developed and applied throughout this book, we will see how this ranking was obtained.

Organization of the Book

In this book, a number of statistical and baseball terms are used. They are italicized at their point of first definition in the text. In other chapters, they are italicized when first used. All such terms are defined in the glossary, in addition to the chapter or appendix in which they are presented in detail.

The book is divided into two major sections: "The Methods" and "The Findings."

The Methods

A summary of the method for adjusting player averages is given in Chapter 2, with the detailed statistical aspects provided in Appendices A-G. The four adjustments are discussed in detail in Chapters 3-6. Because the adjustments are based on statistical principles, these chapters necessarily contain some references and descriptions of statistical methods, although the more technical details are provided in the appendices.

Chapters 3-6 provide additional insights into how the method detailed in Chapter 2 works. However, the essential ingredients for actually making the adjustments are given in Chapter 2 using data given in Appendices H-J. Readers anxious to see the findings may choose to skip those chapters.

The Findings

The basic offensive events are discussed in Chapters 7-12. Throughout the book, the basic offensive events are presented in the following order: batting average, doubles-plus-triples, triples, home runs, runs, RBIs, walks, strikeouts, and stolen bases. In each chapter, some additional results that go beyond the adjusted averages approach defined in "The Methods" are given. At the end of each chapter, the top 10 single-season and productive career performances based on the adjustment methods are highlighted. Lists of the top 100 single-season and productive career averages are given in Appendices L and M.

Chapters 13-17 close in on the major goal of this book: determining the 100 best batters. Chapter 13 begins with two traditional, derived offensive event measures, OBP and slugging average, and their sum, OPS. Chapter 14 introduces two new methods, Event-Specific Batting Runs and Career Batter Rating, which yield the 100 best batters. Chapter 15 lists the 25 best batters at each position. Later, a list of the 100 best batters after adjustment for fielding position is provided. This list accounts for the fact that offensive performance tends to be lower for positions that have greater defensive demands.

Because of the involved nature of the adjustments, the findings are given through the 2003 season. However, Chapter 16 provides a brief update of the top performances and players from the 2004 season. Chapter 17 compares the Career Batter Rating to the work of Pete Palmer and Bill James, and maps a course for further improving the ranking of batting ability across baseball history.

Appendix N contains a complete list of all 1140 players who qualified for inclusion in this book, along with their adjusted averages and Career Batter Ratings.


What People are saying about this

Carl Morris
Everyone knows that batting .300 in the major leagues is much harder than batting .300 in the minors. Although baseball rules and equipment change over time and parks differ, such differences in difficulty are ignored regularly by those who compare batters who played in different decades and/or in different stadiums. Michael Schell has painstakingly made the needed adjustments for eras, for park factors, for players' ages, and for variability in performances, so as to determine which batters really have been most dominant. There are many other treasures to be found here, and many methodological lessons to be learned and enjoyed by baseball enthusiasts and by those who think about player evaluations.
Carl Morris, Harvard University
Jim Albert
A significant contribution to the sabermetrics field. This book will be a fun read for any baseball fan.
Jim Albert, Bowling Green State University.
Tom Tippett
Some say it's impossible to compare hitters from different eras. In this book, Michael Schell meets that challenge head-on, using modern statistical methods to adjust for differences in eras, ballparks, and the level of competition. It may not settle every argument about the game's best all-time hitters, but it's sure to raise the quality of those arguments.
Tom Tippett, Principal Designer, "Diamond Mind Baseball"
Daniel Levitt
Well-written and organized. Baseball's All-Time Best Sluggers strikes the right balance between the statistical lingo of the professional statistician and the more familiar verbiage of baseball books.
Daniel Levitt, co-author, with Mark Armour, of "Paths to Glory: How Great Baseball Teams Got That Way"
Pete Palmer
Michael Schell has expanded on his original study of Baseball's All-Time Best Hitters to include all aspects of batting. He has written a well thought out and soundly based book, taking into account sophisticated time, age, park and positional adjustments to reach valid conclusions. There is plenty of math, but it is not necessary to understand the intricacies of the equations to appreciate the results.
Pete Palmer, co-editor of "The Baseball Encyclopedia" (with Gary Gillette) and co-author of "The Hidden Game of Baseball" (with John Thorn)
Rob Neyer
The way these things work, I don't suppose that Michael Schell's book will be the final word on ranking hitters. What I do know is that anybody who wants the final word will have to read this book first. And that will be the easy part.
Rob Neyer,

Meet the Author

Michael J. Schell is Professor of Biostatistics at the University of North Carolina and Director of the Biostatistics Core Facility in the Lineberger Comprehensive Cancer Center. He is the author of "Baseball's All-Time Best Hitters: How Statistics Can Level the Playing Field" (Princeton).

