Here are 99 books that Introduction to Machine Learning with Python fans have personally recommended if you like
Introduction to Machine Learning with Python.
Shepherd is a community of 12,000+ authors and super readers sharing their favorite books with the world.
I am Wes McKinney, creator of the Python pandas project and author of Python for Data Analysis. I have been using Python for data work since 2007 and have worked extensively in the open source community to build accessible and fast data processing tools for Python programmers.
This is a super useful book published more recently that shows how to make the most of pandas’s deep toolbelt of features.
Compared with Python for Data Analysis, it explores some of the newer features added to pandas, and I think that any advanced pandas user will become more effective in their day to day work by reading it.
Best practices for manipulating data with Pandas. This book will arm you with years of knowledge and experience that are condensed into an easy to follow format. Rather than taking months reading blogs and websites and searching mailing lists and groups, this book will teach you how to write good Pandas code.
It covers:
Series manipulation
Creating columns
Summary statistics
Grouping, pivoting, and cross-tabulation
Time series data
Visualization
Chaining
Debugging code
and more...
The Victorian mansion, Evenmere, is the mechanism that runs the universe.
The lamps must be lit, or the stars die. The clocks must be wound, or Time ceases. The Balance between Order and Chaos must be preserved, or Existence crumbles.
Appointed the Steward of Evenmere, Carter Anderson must learn the…
I have been a machine learning engineer applying my ML expertise in computational advertising, and search domain. I am an author of 8 machine learning books. My first book was ranked the #1 bestseller in its category on Amazon in 2017 and 2018 and was translated into many languages. I am also a ML education enthusiast and used to teach ML courses in Toronto, Canada.
This could be the first stop of your brand new machine learning journey. I personally like how the technical concept is translated into plain English – each chapter starts with a high-level overview of a ML algorithm or methodology, concise and clear, followed by lots of visual examples and real world scenarios. I can guarantee you won’t get lost halfway. The book focuses on getting you introduced to ML with minimal math. But if you want to grasp some more of math, the next book I recommend is waiting for you.
NOTICE: To buy the newest edition of this book (2021), please search "Machine Learning Absolute Beginners Third Edition" on Amazon. The product page you are currently viewing is for the 2nd Edition (2017) of this book.
Featured by Tableau as the first of "7 Books About Machine Learning for Beginners."
Ready to spin up a virtual GPU instance and smash through petabytes of data? Want to add 'Machine Learning' to your LinkedIn profile?
Well, hold on there...
Before you embark on your epic journey, there are some high-level theory and statistical principles to weave through first. But rather than spend…
I have been a machine learning engineer applying my ML expertise in computational advertising, and search domain. I am an author of 8 machine learning books. My first book was ranked the #1 bestseller in its category on Amazon in 2017 and 2018 and was translated into many languages. I am also a ML education enthusiast and used to teach ML courses in Toronto, Canada.
The book is a well-curated collection of the essential mathematical concepts that form ML. You may experience a cultural shock jumping to this book from the previous one, because the writing in this book is a bit formal. However, it is the missing but necessary piece for building solid foundations for practical ML. You will find it more valuable combining the intuition behind ML that you gained previously. And the explanations in the book are succinct and from the ML perspectives. For instance, partial derivatives are explained in terms of neural network weight optimization. I wish the concepts in Linear Algebra, Vector Calculus, and Probability courses back in college were introduced this way so I understand better how they are applied.
The fundamental mathematical tools needed to understand machine learning include linear algebra, analytic geometry, matrix decompositions, vector calculus, optimization, probability and statistics. These topics are traditionally taught in disparate courses, making it hard for data science or computer science students, or professionals, to efficiently learn the mathematics. This self-contained textbook bridges the gap between mathematical and machine learning texts, introducing the mathematical concepts with a minimum of prerequisites. It uses these concepts to derive four central machine learning methods: linear regression, principal component analysis, Gaussian mixture models and support vector machines. For students and others with a mathematical background, these…
The Guardian of the Palace is the first novel in a modern fantasy series set in a New York City where magic is real—but hidden, suppressed, and dangerous when exposed.
When an ancient magic begins to leak into the world, a small group of unlikely allies is forced to act…
I have been a machine learning engineer applying my ML expertise in computational advertising, and search domain. I am an author of 8 machine learning books. My first book was ranked the #1 bestseller in its category on Amazon in 2017 and 2018 and was translated into many languages. I am also a ML education enthusiast and used to teach ML courses in Toronto, Canada.
This was my favorite book when I started my career. It talks about how information is processed, in an intelligent way, in the internet age. It acts as a tutorial to teach developers how to code our own ML programs, from online dating services, to document analyzer, and search engine. The author did an excellent job of explaining abstract ML algorithms with clear examples. His coding style in Python reads clearly, which makes the book more beginner-friendly.
Don’t get disappointed when you know this book is more than a decade old. It was a visionary book back in the day and it is still relevant today.
Want to tap the power behind search rankings, product recommendations, social bookmarking, and online matchmaking? This fascinating book demonstrates how you can build Web 2.0 applications to mine the enormous amount of data created by people on the Internet. With the sophisticated algorithms in this book, you can write smart programs to access interesting datasets from other web sites, collect data from users of your own applications, and analyze and understand the data once you've found it. Programming Collective Intelligence takes you into the world of machine learning and statistics, and explains how to draw conclusions about user experience, marketing,…
I have been a machine learning engineer applying my ML expertise in computational advertising, and search domain. I am an author of 8 machine learning books. My first book was ranked the #1 bestseller in its category on Amazon in 2017 and 2018 and was translated into many languages. I am also a ML education enthusiast and used to teach ML courses in Toronto, Canada.
Another practical book that I highly recommend. Its intuitive structure is the first thing I like about it. It gives you a comprehensive walkthrough of the ML workflow, from data exploration to learning. It covers abundant practical guides that get you prepared for real world challenges, such as how to handle outliers and to impute missing data. As a ML practitioner, I appreciate the dedicated case studies throughout the entire book. They really excite learners for future real world applications.
The second edition of a comprehensive introduction to machine learning approaches used in predictive data analytics, covering both theory and practice.
Machine learning is often used to build predictive models by extracting patterns from large datasets. These models are used in predictive data analytics applications including price prediction, risk assessment, predicting customer behavior, and document classification. This introductory textbook offers a detailed and focused treatment of the most important machine learning approaches used in predictive data analytics, covering both theoretical concepts and practical applications. Technical and mathematical material is augmented with explanatory worked examples, and case studies illustrate the application…
I am Wes McKinney, creator of the Python pandas project and author of Python for Data Analysis. I have been using Python for data work since 2007 and have worked extensively in the open source community to build accessible and fast data processing tools for Python programmers.
Polars is a new modern data frame library, powered by Apache Arrow, that has grown like wildfire in the Python ecosystem in recent years. It provides many of the same capabilities as pandas, but with substantially better performance and scalability.
I recommend that Python programmers become proficient at both Polars and pandas, since while pandas is ubiquitous, more and more work will shift to Polars in the coming years.
Want to speed up your data analysis and work with larger-than-memory datasets? Python Polars offers a blazingly fast, multithreaded, and elegant API for data loading, manipulation, and processing. With this hands-on guide, you'll walk through every aspect of Polars and learn how to tackle practical use cases using real-world datasets.
Jeroen Janssens and Thijs Nieuwdorp from Xomnia in Amsterdam show you how this superfast DataFrame library is perfect for efficient data wrangling, ETL pipelines, and so much more. This book helps you quickly learn the syntax and understand Polars' underlying concepts. You don't need to have experience with pandas or…
Aury and Scott travel to the Finger Lakes in New York’s wine country to get to the bottom of the mysterious happenings at the Songscape Winery. Disturbed furniture and curious noises are one thing, but when a customer winds up dead, it’s time to dig into the details and see…
I am Wes McKinney, creator of the Python pandas project and author of Python for Data Analysis. I have been using Python for data work since 2007 and have worked extensively in the open source community to build accessible and fast data processing tools for Python programmers.
This is one of the best books I’ve read on how to write better code and build more maintainable software in Python. It is well-written, concise, and to the point.
Brett’s book is a perfect companion to the other books on this list, which are more focused on data analysis and using specific libraries to build data systems.
It's easy to start developing programs with Python, which is why the language is so popular. However, Python's unique strengths, charms, and expressiveness can be hard to grasp, and there are hidden pitfalls that can easily trip you up. This second edition of Effective Python will help you master a truly "Pythonic" approach to programming, harnessing Python's full power to write exceptionally robust and well-performing code. Using the concise, scenario-driven style pioneered in Scott Meyers' best-selling Effective C++, Brett Slatkin brings together 90 Python best practices, tips, and shortcuts, and explains them with realistic…
I am Wes McKinney, creator of the Python pandas project and author of Python for Data Analysis. I have been using Python for data work since 2007 and have worked extensively in the open source community to build accessible and fast data processing tools for Python programmers.
While this book has a good amount of overlap with my book, it provides a valuable introduction to scikit-learn, one of the most popular libraries for machine learning in Python. There is also excellent content to improve your data visualization skills with matplotlib.
For many researchers, Python is a first-class tool mainly because of its libraries for storing, manipulating, and gaining insight from data. Several resources exist for individual pieces of this data science stack, but only with the Python Data Science Handbook do you get them all-IPython, NumPy, Pandas, Matplotlib, Scikit-Learn, and other related tools. Working scientists and data crunchers familiar with reading and writing Python code will find this comprehensive desk reference ideal for tackling day-to-day issues: manipulating, transforming, and cleaning data; visualizing different types of data; and using data to build statistical or machine learning models. Quite simply, this is…
I am motivated by working on products that many people use. I've been a part of companies that deliver products impacting millions of people. To achieve it, I am working in the Big Data ecosystem and striving to simplify it by contributing to Dremio's Data LakeHouse solution. I worked on projects using Spark, HDFS, Cassandra, and Kafka technologies. I have been working in the software engineering industry for ten years now, and I've tried to share my experience and lessons learned in the Software Mistakes and Tradeoffs book, hoping that it will allow current and the next generation of engineers to create better software, leading to more happy users.
Apache Spark has a very high point of entry for newcomers to the Big Data ecosystem.
However, it is a key tool that almost everyone is using for running distributed processing. I recommend everyone to read this book before delving into production solutions based on Apache Spark.
This book will allow you to alleviate many spark problems, such as serialization, memory utilization, and parallelization of processing.
In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. You'll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques-classification, collaborative filtering, and anomaly detection among others-to fields such as genomics, security, and finance. If you have an entry-level understanding of machine learning and statistics, and you program in Java, Python, or Scala, you'll find these patterns useful for…
Magical realism meets the magic of Christmas in this mix of Jewish, New Testament, and Santa stories–all reenacted in an urban psychiatric hospital!
On locked ward 5C4, Josh, a patient with many similarities to Jesus, is hospitalized concurrently with Nick, a patient with many similarities to Santa. The two argue…
I am a financial data scientist. I think it is important that data scientists are highly specialized if they want to be effective in their careers. I run a business called Conlan Scientific out of Charlotte, NC where me and my team of financial data scientists tackle complicated machine learning problems for our clients. Quant trading is a gladiator’s arena of financial data science. Anyone can try it, but few succeed at it. I am sharing my top five list of math books that are essential to success in this field. I hope you enjoy.
This book might as well be called Introduction to machine learning, and it is probably one of the only books truly deserving of the title. Did you know neural networks have been used for decades to scan checks at the bank? They are called Boltzman Machine. Have you ever heard of how decision trees were used in old-school data mining? You could only get them from proprietary software packages from the early 2000s.
In quant trading, you will constantly face compute power constraints, so it is invaluable to understand the mathematical foundations of the most old-school machine learning methods out there. Researchers 20 years ago used to do a lot of impressive work with a lot less computing power.
This book describes the important ideas in a variety of fields such as medicine, biology, finance, and marketing in a common conceptual framework. While the approach is statistical, the emphasis is on concepts rather than mathematics. Many examples are given, with a liberal use of colour graphics. It is a valuable resource for statisticians and anyone interested in data mining in science or industry. The book's coverage is broad, from supervised learning (prediction) to unsupervised learning. The many topics include neural networks, support vector machines, classification trees and boosting---the first comprehensive treatment of this topic in any book.