Stephen Klosterman

OK
Os clientes também compraram itens de
Você é um autor?
Atualizações do Autor
Livros de Stephen Klosterman
No fim do curso, você terá as habilidades necessárias para usar confiantemente vários algoritmos de machine learning a fim de executar análises de dados detalhadas e extrair insights significativos dos dados.
Objetivos do livro:
• Instalação dos pacotes necessários para a definição de um ambiente de codificação de ciência de dados
• Carregamento de dados em um Jupyter Notebook executando Python
• Uso do Matplotlib para a criação de visualizações de dados
• Criação de um modelo com o uso do scikit-learn
• Uso do lasso e da regressão ridge para dedução do overfitting
• Criação e ajuste de um modelo de floresta aleatória e comparação do desempenho com o da regressão logística
• Criação de visualizações com o uso da saída do Jupyter Notebook.
Gain hands-on experience of Python programming with industry-standard machine learning techniques using pandas, scikit-learn, and XGBoost
Key Features
- Think critically about data and use it to form and test a hypothesis
- Choose an appropriate machine learning model and train it on your data
- Communicate data-driven insights with confidence and clarity
Book Description
If data is the new oil, then machine learning is the drill. As companies gain access to ever-increasing quantities of raw data, the ability to deliver state-of-the-art predictive models that support business decision-making becomes more and more valuable.
In this book, you’ll work on an end-to-end project based around a realistic data set and split up into bite-sized practical exercises. This creates a case-study approach that simulates the working conditions you’ll experience in real-world data science projects.
You’ll learn how to use key Python packages, including pandas, Matplotlib, and scikit-learn, and master the process of data exploration and data processing, before moving on to fitting, evaluating, and tuning algorithms such as regularized logistic regression and random forest.
Now in its second edition, this book will take you through the end-to-end process of exploring data and delivering machine learning models. Updated for 2021, this edition includes brand new content on XGBoost, SHAP values, algorithmic fairness, and the ethical concerns of deploying a model in the real world.
By the end of this data science book, you’ll have the skills, understanding, and confidence to build your own machine learning models and gain insights from real data.
What you will learn
- Load, explore, and process data using the pandas Python package
- Use Matplotlib to create compelling data visualizations
- Implement predictive machine learning models with scikit-learn
- Use lasso and ridge regression to reduce model overfitting
- Evaluate random forest and logistic regression model performance
- Deliver business insights by presenting clear, convincing conclusions
Who this book is for
Data Science Projects with Python – Second Edition is for anyone who wants to get started with data science and machine learning. If you’re keen to advance your career by using data analysis and predictive modeling to generate business insights, then this book is the perfect place to begin. To quickly grasp the concepts covered, it is recommended that you have basic experience of programming with Python or another similar language, and a general interest in statistics.
Table of Contents
- Data Exploration and Cleaning
- Introduction to Scikit-Learn and Model Evaluation
- Details of Logistic Regression and Feature Exploration
- The Bias-Variance Trade-off
- Decision Trees and Random Forests
- Gradient Boosting, XGBoost, and SHAP (SHapley Additive exPlanations) Values
- Test Set Analysis, Financial Insights, and Delivery to the Client
Gain hands-on experience with industry-standard data analysis and machine learning tools in Python
Key Features
- Tackle data science problems by identifying the problem to be solved
- Illustrate patterns in data using appropriate visualizations
- Implement suitable machine learning algorithms to gain insights from data
Book Description
Data Science Projects with Python is designed to give you practical guidance on industry-standard data analysis and machine learning tools, by applying them to realistic data problems. You will learn how to use pandas and Matplotlib to critically examine datasets with summary statistics and graphs, and extract the insights you seek to derive. You will build your knowledge as you prepare data using the scikit-learn package and feed it to machine learning algorithms such as regularized logistic regression and random forest. You'll discover how to tune algorithms to provide the most accurate predictions on new and unseen data. As you progress, you'll gain insights into the working and output of these algorithms, building your understanding of both the predictive capabilities of the models and why they make these predictions.
By then end of this book, you will have the necessary skills to confidently use machine learning algorithms to perform detailed data analysis and extract meaningful insights from unstructured data.
What you will learn
- Install the required packages to set up a data science coding environment
- Load data into a Jupyter notebook running Python
- Use Matplotlib to create data visualizations
- Fit machine learning models using scikit-learn
- Use lasso and ridge regression to regularize your models
- Compare performance between models to find the best outcomes
- Use k-fold cross-validation to select model hyperparameters
Who this book is for
If you are a data analyst, data scientist, or business analyst who wants to get started using Python and machine learning techniques to analyze data and predict outcomes, this book is for you. Basic knowledge of Python and data analytics will help you get the most from this book. Familiarity with mathematical concepts such as algebra and basic statistics will also be useful.
Table of Contents
- Data Exploration and Cleaning
- Introduction to Scikit-Learn and Model Evaluation
- Details of Logistic Regression and Feature Exploration
- The Bias-Variance Trade-off
- Decision Trees and Random Forests
- Imputation of Missing Data, Financial Analysis, and Delivery to Client