Here are 100 books that Fundamentals of Data Visualization fans have personally recommended if you like
Fundamentals of Data Visualization.
Shepherd is a community of 12,000+ authors and super readers sharing their favorite books with the world.
I started my career as a research scientist building machine learning algorithms for weather forecasting. Twenty years later, I found myself at a precision agriculture startup creating models that provided guidance to farmers on when to plant, what to plant, etc. So, I am part of the movement from academia to industry. Now, at Google Cloud, my team builds cross-industry solutions and I see firsthand what our customers need in their data science teams. This set of books is what I suggest when a CTO asks how to upskill their workforce, or when a graduate student asks me how to break into the industry.
What if you are faced with a problem for which a standard approach doesn’t yet exist? In such a case, you will need to be able to figure out the approach from the first principles. This book will help you learn how to derive insights starting from raw data.
'A statistical national treasure' Jeremy Vine, BBC Radio 2
'Required reading for all politicians, journalists, medics and anyone who tries to influence people (or is influenced) by statistics. A tour de force' Popular Science
Do busier hospitals have higher survival rates? How many trees are there on the planet? Why do old men have big ears? David Spiegelhalter reveals the answers to these and many other questions - questions that can only be addressed using statistical science.
Statistics has played a leading role in our scientific understanding of the world for centuries, yet we are all familiar with the way…
The Victorian mansion, Evenmere, is the mechanism that runs the universe.
The lamps must be lit, or the stars die. The clocks must be wound, or Time ceases. The Balance between Order and Chaos must be preserved, or Existence crumbles.
Appointed the Steward of Evenmere, Carter Anderson must learn the…
I am Wes McKinney, creator of the Python pandas project and author of Python for Data Analysis. I have been using Python for data work since 2007 and have worked extensively in the open source community to build accessible and fast data processing tools for Python programmers.
This is a super useful book published more recently that shows how to make the most of pandas’s deep toolbelt of features.
Compared with Python for Data Analysis, it explores some of the newer features added to pandas, and I think that any advanced pandas user will become more effective in their day to day work by reading it.
Best practices for manipulating data with Pandas. This book will arm you with years of knowledge and experience that are condensed into an easy to follow format. Rather than taking months reading blogs and websites and searching mailing lists and groups, this book will teach you how to write good Pandas code.
It covers:
Series manipulation
Creating columns
Summary statistics
Grouping, pivoting, and cross-tabulation
Time series data
Visualization
Chaining
Debugging code
and more...
I started my career as a research scientist building machine learning algorithms for weather forecasting. Twenty years later, I found myself at a precision agriculture startup creating models that provided guidance to farmers on when to plant, what to plant, etc. So, I am part of the movement from academia to industry. Now, at Google Cloud, my team builds cross-industry solutions and I see firsthand what our customers need in their data science teams. This set of books is what I suggest when a CTO asks how to upskill their workforce, or when a graduate student asks me how to break into the industry.
As a data scientist in the industry, it is very helpful to understand the business context behind the problems that you are solving. In many cases, you are trying to predict behavior—who is likely to buy an item, who is likely to click on a link, who is likely to repay a loan, etc.
This book by Eric Siegel is a great introduction to predictive analytics as used in real-life. It will help you frame data science problems in standard ways. For example, suppose you are asked to score sales leads so that salespeople can prioritize their efforts. How would you do it? The common way to frame this problem is to predict the customer lifetime value (LTV) of every sales lead. Before you can do prediction, you have to be able to do analysis though.
The way you estimate the LTV is to break the problem into three sub-problems:…
"The Freakonomics of big data." -Stein Kretsinger, founding executive of Advertising.com
Award-winning | Used by over 30 universities | Translated into 9 languages
An introduction for everyone. In this rich, fascinating - surprisingly accessible - introduction, leading expert Eric Siegel reveals how predictive analytics (aka machine learning) works, and how it affects everyone every day. Rather than a "how to" for hands-on techies, the book serves lay readers and experts alike by covering new case studies and the latest state-of-the-art techniques.
Prediction is booming. It reinvents industries and runs the world. Companies, governments, law…
The Guardian of the Palace is the first novel in a modern fantasy series set in a New York City where magic is real—but hidden, suppressed, and dangerous when exposed.
When an ancient magic begins to leak into the world, a small group of unlikely allies is forced to act…
I started my career as a research scientist building machine learning algorithms for weather forecasting. Twenty years later, I found myself at a precision agriculture startup creating models that provided guidance to farmers on when to plant, what to plant, etc. So, I am part of the movement from academia to industry. Now, at Google Cloud, my team builds cross-industry solutions and I see firsthand what our customers need in their data science teams. This set of books is what I suggest when a CTO asks how to upskill their workforce, or when a graduate student asks me how to break into the industry.
In industry, your data is very likely to live within a data warehouse such as BigQuery, Redshift, or Snowflake. Therefore, to be an effective data scientist in the industry, you should learn how to use data warehouses effectively.
Once you learn data warehousing and SQL with any one of these products, it is quite easy to pick up another. So which one do you start with?
You can use Snowflake on all three of the major public clouds. Because it’s a standalone product, it is the most similar to a “traditional” data warehouse and can be picked up easily even if you are not familiar with cloud computing. That makes it a good data warehouse to start with, and is the reason my second book pick is this book on Snowflake.
BigQuery is also available on all three major public clouds, but it works best (and is used most commonly)…
Explore the modern market of data analytics platforms and the benefits of using Snowflake computing, the data warehouse built for the cloud.
With the rise of cloud technologies, organizations prefer to deploy their analytics using cloud providers such as Amazon Web Services (AWS), Microsoft Azure, or Google Cloud Platform. Cloud vendors are offering modern data platforms for building cloud analytics solutions to collect data and consolidate into single storage solutions that provide insights for business users. The core of any analytics framework is the data warehouse, and previously customers did not have many choices of platform to use.
I am a leader in analytics and AI strategy, and have a broad range of experience in aviation, energy, financial services, and the public sector. I have worked with several major organizations to help them establish a leadership position in data science and to unlock real business value using advanced analytics.
Data scientists and analytics specialists are great at building models and algorithms, but often wrap them in a presentation or dashboard that diminishes their value and reduces the likelihood of their work being adopted. This book encourages practitioners to always consider the last mile and to pay as much attention to presentation and aesthetics as we do to the model itself.
Master the art and science of data storytelling-with frameworks and techniques to help you craft compelling stories with data.
The ability to effectively communicate with data is no longer a luxury in today's economy; it is a necessity. Transforming data into visual communication is only one part of the picture. It is equally important to engage your audience with a narrative-to tell a story with the numbers. Effective Data Storytelling will teach you the essential skills necessary to communicate your insights through persuasive and memorable data stories.
Narratives are more powerful than raw statistics, more enduring than pretty charts. When…
Colin Koopman researches and teaches about technology ethics at the University of Oregon, where he is a Professor of Philosophy and Director of the interdisciplinary certificate program in New Media & Culture. His research pursuits have spanned from the history of efforts in the early twentieth century to standardize birth certificates to our understanding of ourselves as effects of the code inscribed into our genes. Koopman is currently at work on a book that will develop our understanding of what it takes to achieve equality and fairness in data systems, tentatively titled Data Equals.
W.E.B. Du Bois is widely acknowledged as the leading activist for racial equality of his generation. But until very recently little had been known of his deep commitment to the pursuit of equality within and through data technology. As Du Bois was preparing notes for his famous 1903 book The Souls of Black Folk, he was also preparing an exposition of what we would today call “infographics” (or what the editors of this volume aptly call “data portraits”) for exhibition at the 1900 Paris Exposition world’s fair. This volume handsomely reproduces for the first time a full-color complete set of Du Bois’s charts, graphs, maps, and ingenious spirals. A beautiful book to live with, it also subtly transforms one’s understanding of the history of racial progress and inequality in America.
"As visually arresting as it is informative."-The Boston Globe
"Du Bois's bold colors and geometric shapes were decades ahead of modernist graphic design in America."-Fast Company's Co.Design
W.E.B. Du Bois's Data Portraits is the first complete publication of W.E.B. Du Bois's groundbreaking charts, graphs, and maps presented at the 1900 Paris Exposition.
Famed sociologist, writer, and Black rights activist W.E.B. Du Bois fundamentally changed the representation of Black Americans with his exhibition of data visualizations at the 1900 Paris Exposition. Beautiful in design and powerful in content, these data portraits make visible a wide spectrum of African American culture, from…
Aury and Scott travel to the Finger Lakes in New York’s wine country to get to the bottom of the mysterious happenings at the Songscape Winery. Disturbed furniture and curious noises are one thing, but when a customer winds up dead, it’s time to dig into the details and see…
I am not very good at making things. I am good enough to appreciate the craftsmanship of those much better than me. I am more of an ideas person, perhaps why I ended up with a PhD in Philosophy of Science. But I have always held a secret admiration—with a tinge of envy—for people who are makers. As I went deeper into my career as a philosopher of science, I became aware that the material/making aspect of science—and technology—was largely ignored by ideas-obsessed philosophers. So, this is where I focused my attention, and I’ve loved vicariously being able to be part of making the world.
When I was a kid, one of my favorite books was The Way Things Work, not the more recent David Macaulay book—which is also good—but the earlier 1967 book by T. Lodewijk. With great diagrams, it showed how complicated machines work.
Randall Munroe's Thing Explainer, while less comprehensive, similarly captures this magic for me. It has great diagrams and simple clarifying text—self-consciously limited to the 1,000 words people use the most. I could stare at the diagrams for hours, learning about everything from cameras (“picture takers”) to submarines (“boats that go under the sea”).
From the No. 1 bestselling author of What If? - the man who created xkcd and explained the laws of science with cartoons - comes a series of brilliantly simple diagrams ('blueprints' if you want to be complicated about it) that show how important things work: from the nuclear bomb to the biro.
It's good to know what the parts of a thing are called, but it's much more interesting to know what they do. Richard Feynman once said that if you can't explain something to a first-year student, you don't really get it. In Thing Explainer, Randall Munroe takes…
In sixth grade, my teacher tried to teach the class how to read line charts – and something fell into place for me. Ever since then, I’ve tried to sort data into forms that we can use to make sense of it. As a researcher at Microsoft, I consulted with teams across the organization – from sales to legal; and from Excel to XBox – to help them understand their data. At Honeycomb, I design tools for software operations teams to diagnose their complex systems. These books each gave me an “ah-hah” moment that made me think differently about the craft of creating visualization. They now sit on my shelf in easy reach – I hope you find them fascinating too.
A new edition of Bertin’s 1963 Semiology was released a few years ago, and my heart swelled with joy. For years, I’d worked off of bad photocopies of an inter-library loan book that had long since gone out of print. In this new edition, I could see how Bertin works through different dimensions and axes – when you want to plot two different quantitative axes over a map, what are your choices? What if you want to plot them over a graph, instead? What changes? I loved exploring these choices with Bertin, as he explores how different color mappings, iconic representations, and design choices change the way the reader interprets the graph.
Originally published in French in 1967, Semiology of Graphics is internationally recognized as a foundational work in the fields of design and cartography. Based on Jacques Bertin's practical experience as a cartographer, part one of this work is an unprecedented attempt to synthesize principles of graphic communication with the logic of standard rules applied to writing and topography. Part two brings Bertin's theory to life, presenting a close study of graphic techniques, including shape, orientation, colour, texture, volume, and size, in an array of more than 1,000 maps and diagrams.
I’m an applied statistician and academic researcher/lecturer at New Zealand’s oldest university – the University of Otago. R facilitates everything I do – research, academic publication, and teaching. It’s the latter part of my job that motivated my own book on R. From first-year statistics students who have never seen R to my own Ph.D. students using R to implement novel and highly complex statistical methods and models, my experience is that all ultimately love the ease with which the R language permits exploration, visualisation, analysis, and inference of one’s data. The ever-growing need in today’s society for skilled statisticians and data scientists means there's never been a better time to learn this essential language.
For those intending to use R with an eye on the popular 'Tidyverse' suite of packages – which facilitate the handling, manipulation, and visualisation of data sets – it's hard to go past this book. From the founding contributors of the RStudio/Tidyverse worlds, this is a great way to learn about this dialect of R against the overarching backdrop of statistical data analysis and data science.
Learn how to use R to turn raw data into insight, knowledge, and understanding. This book introduces you to R, RStudio, and the tidyverse, a collection of R packages designed to work together to make data science fast, fluent, and fun. Suitable for readers with no previous programming experience, R for Data Science is designed to get you doing data science as quickly as possible. Authors Hadley Wickham and Garrett Grolemund guide you through the steps of importing, wrangling, exploring, and modeling your data and communicating the results. You'll get a complete, big-picture understanding of the data science cycle, along…
Magical realism meets the magic of Christmas in this mix of Jewish, New Testament, and Santa stories–all reenacted in an urban psychiatric hospital!
On locked ward 5C4, Josh, a patient with many similarities to Jesus, is hospitalized concurrently with Nick, a patient with many similarities to Santa. The two argue…
I studied statistics and data science for years before anyone ever suggested to me that these topics might have an ethical dimension, or that my numerical tools were products of human beings with motivations specific to their time and place. I’ve since written about the history and philosophy of mathematical probability and statistics, and I’ve come to understand just how important that historical background is and how critically important it is that the next generation of data scientists understand where these ideas come from and their potential to do harm. I hope anyone who reads these books avoids getting blinkered by the ideas that data = objectivity and that science is morally neutral.
This book is now 50 years old, but its message is as relevant and important now as when it was written. In a series of witty essays that border on rants, Andreski attacks much of social science as fluff obscured by technical jargon and methodology. In particular, he laments the growth of quantitative methods as an attempt to add objectivity to social science and make it appear “harder.” True objectivity is about more than mechanical number-crunching, he says; it’s about a commitment to fairness and resisting the temptations of wishful thinking – a challenge anyone who works with data concerning people and their lives should take seriously.
"Seldom have the social sciences been subject to quite so comprehensive, yet non-partisan, attack. There can be little doubt SOCIAL SCIENCES AS SORCERY is an uncomfortably important and embarassingly comprehensive book." -- Times Literary Supplement "Liberating!" -- Harpers "Andreski has written a new book that is certain to enrage his colleagues ... He documents his charges and spares few of the luminaries of social science in the process." -- TIME Magazine