Skip to content

Introduction to Data Analysis with Python


GoSkills

Summary

Price
£199 inc VAT
Or £66.33/mo. for 3 months...
Study method
Online
Duration
22 hours · Self-paced
Qualification
No formal qualification
CPD
22 CPD hours / points
Certificates
  • Certificate of completion - Free

Add to basket or enquire

Overview

This Python data analysis course will strengthen your skills as a data scientist so that you can visualize many different types of data. If you’re ready to move beyond using Excel to process and organize your data, then this course will help you grow your skill-set without any previous coding experience required.

Award-winning instructor, John Elder, will take you through an introduction to data analysis with Python using essential Python data analysis tools like Numpy and Pandas. By the end of the course, you will understand the fundamentals of Python for data analysis.

Certificates

Certificate of completion

Digital certificate - Included

CPD

22 CPD hours / points
Accredited by The CPD Certification Service

Description

Development Environment Setup

Introduction - What to Expect From the Course

A quick rundown of what to expect from this course, how we're going to move forward, and the roadmap we'll follow as we progress through the course.

Installing Python

Let's start downloading the tools we need for the course, including downloading Python itself.

Installing The Git Bash Terminal

We need one more major tool to do data analysis with Python: the Git Bash Terminal. In this video, I'll show you how to download and install it.

Setting up a Virtual Environment

A virtual environment is a helpful tool that allows us to install programs and try out code without affecting the rest of our computer. In this video, you'll learn how to set up your own virtual environment.

Installing and Using Jupyter Notebooks

We're going to use Jupyter Notebooks throughout the rest of the course to do all our coding, so in this video, let's install and start using it.

Numpy

Pip Installing and Importing Numpy

In this video, we'll download and use our first data analysis tool: Numpy, which stands for Numerical Python.

Numpy Overview - What is It Used for

Before we start coding with Numpy, let's discuss what it does and how it works. This video will help you understand why Numpy is so important for data analysis.

Numpy Arrays

In this video, we discuss how Numpy arrays work, including how they are much, much faster to work with than regular Python arrays.

Numpy Operations

Let's discuss mathematical operations we can use in Numpy, including scalars.

Numpy Universal Functions

Numpy has universal functions built into it that we can play with and use. In this video, we'll discuss square roots, absolute values, exponents, and more.

Pandas

Installing Pandas and Overview - What is It Used for

Pandas is a powerful Python data analysis toolkit. In this video, we'll discuss what it does and how it works.

Pandas Series

Pandas series is one of the main "workhorses" of Pandas. We'll discuss how series work and some of the helpful ways you can use them.

Pandas DataFrames

DataFrames are like spreadsheets, and in this video, we start building and using them.

Selecting Columns and Multiple Columns

We discuss how to select individual and multiple columns within a data frame.

Creating New Columns

In this video, we cover how to add a new column to your data frame.

Removing Columns

Let's discuss how to use the drop function to remove a column from your data frame.

Deleting Rows

Deleting a row in a data frame is very similar to deleting a column - but with one small difference.

Selecting Rows

In this video, we'll discuss two different ways to select a row within a data frame, including the loc function.

Selecting Subsets of Rows and Columns

Let's discuss how to select specific points within your data frames, such as the intersection of a row and column, or multiple points within different rows and columns.

Conditional Selection of DataFrames

Selecting data based on conditionals - such as greater than, less than, equal to, not equal to - is an essential part of data analysis. We'll discuss that in this video.

Multiple Conditions

In this video, we'll explain how to select data based on multiple conditional statements.

Changing Indexes

There are times when you may need to reset your index within a data frame, so we'll explain how that works in this video.

Selecting Specific Cells From Multi-Index

Let's discuss multi-indexes: what they are, how they work, and how to select data from them.

Dealing With Missing Data

You may encounter missing data while doing analysis. What should you do? In this video, we'll discuss some helpful alternatives.

Groupby Grouping Data

The groupby function allows us to group different pieces of data together. We'll discuss the function in this video.

Concatenating, Merging, and Joining Part 1

There are different ways to combine various pieces of data within your data frame. In this video, we'll discuss concatenating.

Concatenating, Merging, and Joining Part 2

We discuss how to merge and join data frames together in this video.

Operations: Unique Values and Value Counts

In this video, we discuss how to find unique values within a column and how to find value counts as well.

Operations: Apply Method

The apply method allows us to create our own functions and apply them to the data in our columns.

Operations: Sorting, Ordering, and Finding Nulls

In this video, we discuss how to sort and order your data.

Pandas Visualization

Plotting Histograms

In this video, we begin discussing data visualization, and install matplotlib to help us create histograms.

Plotting Area Plots

We discuss how to create another type of data visualization: area plots.

Plotting Bar Plots

In this video, we explain how to create bar plots using your data.

Plotting Line Plots

Let's create line plots and discuss some of the options you can use to change their appearance.

Plotting Scatter Plots

Scatter plots are a bit more complicated, but can be helpful ways to visualize your data.

Plotting Box Plots

In this video, we discuss how to create your own box plots.

Plotting HexPlots

In this video, we explain what a hexplot is and how to create your own.

Plotting Density Plots

In this video, we discuss density plots, kernel density estimation (KDE) plots, and how to create them.

Linear Regression With SciKitLearn

What Is Linear Regression - Least Squares Method

Let's start talking about machine learning. In this video, we introduce linear regression and the least squares method.

Linear Regression Installation

We need to install some more elements to do a linear regression, so we'll discuss what we need in this video.

Split Data into Training and Testing Set

As we begin to set up our linear regression model, we must define testing and training splits.

Train a Linear Regression Model and Fit the Model

In this video, learn how to train a linear regression model and how easy it is to fit the model.

Explore the Coefficents and Intercepts

We'll take a look at the coeffecients and intercepts we discovered at the end of the last video and explain what they mean.

Make Predictions and Analyze Results

In this video, we'll learn how to make predictions and analyze results based on the data analysis we've performed so far.

Who is this course for?

This course is for Python beginners.

Requirements

No pre-requisites required.

Career path

Data science, IT, Web development

Questions and answers

Currently there are no Q&As for this course. Be the first to ask a question.

Reviews

Currently there are no reviews for this course. Be the first to leave a review.

FAQs

Study method describes the format in which the course will be delivered. At Reed Courses, courses are delivered in a number of ways, including online courses, where the course content can be accessed online remotely, and classroom courses, where courses are delivered in person at a classroom venue.

CPD stands for Continuing Professional Development. If you work in certain professions or for certain companies, your employer may require you to complete a number of CPD hours or points, per year. You can find a range of CPD courses on Reed Courses, many of which can be completed online.

A regulated qualification is delivered by a learning institution which is regulated by a government body. In England, the government body which regulates courses is Ofqual. Ofqual regulated qualifications sit on the Regulated Qualifications Framework (RQF), which can help students understand how different qualifications in different fields compare to each other. The framework also helps students to understand what qualifications they need to progress towards a higher learning goal, such as a university degree or equivalent higher education award.

An endorsed course is a skills based course which has been checked over and approved by an independent awarding body. Endorsed courses are not regulated so do not result in a qualification - however, the student can usually purchase a certificate showing the awarding body's logo if they wish. Certain awarding bodies - such as Quality Licence Scheme and TQUK - have developed endorsement schemes as a way to help students select the best skills based courses for them.