Loading
  • Main Menu
GreaterHeight Technologies LLC ~ GreaterHeight Academy
  • All Courses
    • BI and Visualization
      • Mastering Data and Business Analytics
        • Basic Excel for Data Analysis
        • Intermediate and Advanced Excel for Data Analysis
        • Excel for Business Analysis & Analyst
        • PivotTable, PowerPivot, PowerQuery & DAX for Data Analysi
        • Data Analytics and Visualization with Tableau
        • Data Analytics with Power-BI
        • Data Analytics and Visualisation with SQL
      • Mastering Python for Data Analytics and Visualization
        • Python Foundation for Data Analytics
        • Data Analysis Using Python With Numpy and Pandas
        • Data Visualization Using Python with Matplotlib and Seaborn
        • Data Science with SQL Server and Azure SQL Database
        • Data Analytics and Visualisation with PowerBI
      • Complete Microsoft Excel Master Program
        • Basic Excel for Data Analysis
        • Excel Interactive Dashboard for Data Analysis
        • Intermediate and Advanced Excel for Data Analysis
        • PivotTable PowerPivot, PowerQuery & DAX for Data Analysis
        • Excel for Data Analysis and Visualization
        • Excel for Business Analysis & Analyst
      • Data Analytics With SQL Master Program
      • Master Data Analytics With PowerBI
      • Financial Reporting with PowerBI
      • Data Analysts with Power-BI
      • Data Analytics and Visualization with Excel
    • Mastering Python
      • Python Developer Masters Program
        • Python Programming Certification Course
        • Data Science With Python Certification
        • Artificial Intelligence Certification Course
        • PySpark Certification Training Course
        • Python Statistics for Data Science
      • The Complete Python Developer
      • Data Analysis and Visualization with Python
      • Complete Data Scientist with Python
      • Data Engineer with SQL and Python
      • Machine Learning Engineer with Python
    • Azure Cloud Computing
      • DevOps Engineer and Solutions Architect Master Program
      • Greaterheight Azure GH-602 Cloud Solution Architect Master
      • Greaterheight Azure GH-601 Cloud DevOps Master
      • Microsoft Azure az-900 Fundamentals
      • Microsoft Azure az-104 Administrator
      • Microsoft Azure az-204 Developer
      • Microsoft Azure az-305 Solutions Architect
      • Microsoft Azure az-400 DevOps Engineer
      • Microsoft Azure AI-900 Fundamentals
      • Microsoft Azure DP-100 Data Science
    • SQL and SQL-Server Database
      • Mastering SQL Server Development
      • Data Analytics With SQL Master Program
      • Data Engineer Course Online Masters Program
      • Data Science with SQL Server and Azure SQL Database
    • DevOps Development Program
      • DevOps Engineer & Solution Architect Expert Program
    • Data Science
      • Data Science With Python Certification
      • Pythom Statistics for Data Science
      • Data Science with SQL Server and Azure SQL Database
      • Complete Data Scientist with Python
  • Who We Serve
    • Individuals
    • Business
    • Universities
  • Partners
    • Employer Networks
    • Community Partnership
    • Opportunity Funds
    • Future Finance
    • Scholarships
  • Resources
    • Webinars
    • Blog
    • Tutorials
    • White Papers
    • Podcast
    • Events
  • Get Advice
GreaterHeight Technologies LLC ~ GreaterHeight Academy
  • All Courses
    • BI and Visualization
      • Mastering Data and Business Analytics
        • Basic Excel for Data Analysis
        • Intermediate and Advanced Excel for Data Analysis
        • Excel for Business Analysis & Analyst
        • PivotTable, PowerPivot, PowerQuery & DAX for Data Analysi
        • Data Analytics and Visualization with Tableau
        • Data Analytics with Power-BI
        • Data Analytics and Visualisation with SQL
      • Mastering Python for Data Analytics and Visualization
        • Python Foundation for Data Analytics
        • Data Analysis Using Python With Numpy and Pandas
        • Data Visualization Using Python with Matplotlib and Seaborn
        • Data Science with SQL Server and Azure SQL Database
        • Data Analytics and Visualisation with PowerBI
      • Complete Microsoft Excel Master Program
        • Basic Excel for Data Analysis
        • Excel Interactive Dashboard for Data Analysis
        • Intermediate and Advanced Excel for Data Analysis
        • PivotTable PowerPivot, PowerQuery & DAX for Data Analysis
        • Excel for Data Analysis and Visualization
        • Excel for Business Analysis & Analyst
      • Data Analytics With SQL Master Program
      • Master Data Analytics With PowerBI
      • Financial Reporting with PowerBI
      • Data Analysts with Power-BI
      • Data Analytics and Visualization with Excel
    • Mastering Python
      • Python Developer Masters Program
        • Python Programming Certification Course
        • Data Science With Python Certification
        • Artificial Intelligence Certification Course
        • PySpark Certification Training Course
        • Python Statistics for Data Science
      • The Complete Python Developer
      • Data Analysis and Visualization with Python
      • Complete Data Scientist with Python
      • Data Engineer with SQL and Python
      • Machine Learning Engineer with Python
    • Azure Cloud Computing
      • DevOps Engineer and Solutions Architect Master Program
      • Greaterheight Azure GH-602 Cloud Solution Architect Master
      • Greaterheight Azure GH-601 Cloud DevOps Master
      • Microsoft Azure az-900 Fundamentals
      • Microsoft Azure az-104 Administrator
      • Microsoft Azure az-204 Developer
      • Microsoft Azure az-305 Solutions Architect
      • Microsoft Azure az-400 DevOps Engineer
      • Microsoft Azure AI-900 Fundamentals
      • Microsoft Azure DP-100 Data Science
    • SQL and SQL-Server Database
      • Mastering SQL Server Development
      • Data Analytics With SQL Master Program
      • Data Engineer Course Online Masters Program
      • Data Science with SQL Server and Azure SQL Database
    • DevOps Development Program
      • DevOps Engineer & Solution Architect Expert Program
    • Data Science
      • Data Science With Python Certification
      • Pythom Statistics for Data Science
      • Data Science with SQL Server and Azure SQL Database
      • Complete Data Scientist with Python
  • Who We Serve
    • Individuals
    • Business
    • Universities
  • Partners
    • Employer Networks
    • Community Partnership
    • Opportunity Funds
    • Future Finance
    • Scholarships
  • Resources
    • Webinars
    • Blog
    • Tutorials
    • White Papers
    • Podcast
    • Events
  • Get Advice



Complete Data Analytics & Visualization With Python


Start from scratch and learn how to import, clean, manipulate, and visualize data; and also, explore Python's most popular and robust data visualization libraries, including Matplotlib, Seaborn, Bokeh, and others, to create beautiful static and interactive visualizations of categorical, aggregated, and geospatial data.


Get Advice

Data Analysis and Visualization with Python 


Who this course is for:

  • Python developers curious about the data analysis libraries.
  • Python developers curious about the data visualization libraries.
  • Anyone interested in learning Python.
  • Data Analysts
  • Anyone working with data


What you will Learn:

  • Python, we will be using Python3 in this course.
  • Data Analysis Libraries in Python such as NumPy and Pandas.
  • Data Visualization.
  • Data Visualization Libraries in Python such as Matplotlib and Seaborn.
  • How to use Python to manipulate & process data.
  • Data analysis & data visualization using Python.
  • How to analyze data.
  • Jupyter Notebooks IDE / Anaconda Distribution.

Course Benefits & Key Features

Data Analysis and Visualisation with Python’s benefits and key features.
Modules

30+ Modules.

Lessons

80+ Lessons

Practical

40+ Hands-On Labs

Life Projects

5+ Projects

Resume

CV Preparations

Job

Jobs Reference

Recording

Session Recording

Interviews

Mock Interviews

Support

On Job Supports

Membership

Membership Access

Networks

Networking

Certification

Certificate of Completion


INSTRUCTOR-LED LIVE ONLINE CLASSES

Our learn-by-building-project method enables you to build

practical or coding experience that sticks. 95% of our                        

learners say they have confidence and remember more               

when they learn by building real world projects which is                

required to work in your real life.


  • Get step-by-step guidance to practice your skills without getting stuck
  • Validate your technical problem-solving skills in a real environment
  • Troubleshoot complex scenarios to practice what you learned
  • Develop production experience that translates into real-world
.

Python Developer Program Job Outlook


.

Ranked #1 Programming
Language

TIOBE and PYPL ranks Python as the most popular global programming language.

Python Salary
Trend

The average salary for a Python Developer is $114,489 per year in the United States.

44.8% Compound annual growth rate (CAGR)

The global python market size is expected to reach USD 100.6 million in 2030.

Why Data Analysis and Visualization with Python?

Learn In-demand Skills

Those with careers in data analysis learn relevant in-demand skills that span industries and add value to every digital-enabled organization.


Earn a Higher Salary

Experienced data analysts can earn up to $112,000 per year and transition into higher-paying jobs as Senior Data Analysts, Data Scientists, or Analytics Managers.

Positive Job Outlook

The data analytics market is predicted to hit $132.90 Billion USD by 2026. COVID-19 pandemic accelerated the adoption of data analytics solutions and services.

Shape the Future

Data analysts transform organizations by capitalizing on data to improve their business decisions and solve critical real-world problems.

Become a Leader

Being a central part of an organization’s decision-making processes, analytics experts often pick up strong leadership skills as well.

Data Analysis are Constantly Evolving

Data analysis moves quickly, and data analysts are constantly learning and advancing in their careers.




GreaterHeight Certificates holders are prepared to work at companies like these.

Some Alumni Testimonies

Investing in the course "Become a Data Analyst" with GreaterHeight Academy is great value for the money and I highly recommend. The trainer is very knowledgeable, very engaging, provided us with quality training sessions on all courses and was easily acessible for queries. We also had access to the course materials and also the timely availability of the recorded videos made it easy and aided the learning process..

QUEEN OBIWULU

Team Lead, Customer Success

The training was fantastic, the instructor is an awesome lecturer, relentless and not tired in his delivery. He obviously enjoys teaching, it comes natural to him. We got more than we expected. He extended my knowledge of Excel beyond what I knew, and the courses were brilliantly delivered. They reach out, follow up, ask questions, and in fact the support has been great. They are highly recommended and I would definitely subscribe to other training programs from them.

BISOLA OGUNRO

Fraud Analytics Risk Oversight Manager

It's one thing to look for just a Data Analysis training, and it's another to get the knowledge transferred through certified professional trainers. No matter your initial level of proficiency in any of the Data Analysis tools, GreaterHeight Academy would meet you there and take you up to a highly proficienct and confident level in a short time at a reasonable pace. I learnt a lot of Data Analysis tools and skills at GreaterHeight from patient and resourceful teachers.

TUNDE MEREDITH

Operation Director - Abbfem Technology

The Data Analysis training program was one of the best I have attended. The way GreaterHeight took off with Excel and concluded the four courses with Excel was a mind blowing - it was WOW!! I concluded that I'm on the right path with the right mentor to take me from a novice to professional. GreaterHeight is the best as far as impacting Data Analysis knowledge is concern. I would shout it at the rooftop to recommend GreaterHeight to any trainee that really wants to learn.

JOHN OSI PETER

Greaterheight

I wanted to take a moment to express my deepest gratitude for the opportunity to study data analytics at GreaterHeight Academy. I am truly impressed by the level of dedication and support that the sponsor and CEO have put into this program. GreaterHeight Academy is without a doubt the best tech institution out there, providing top-notch education and resources for its students. One of the advantages of studying at GreaterHeight Academy is the access to the best tools and technologies in the field. 

AYODELE PAYNE

Sales/Data Analyst

It is an unforgettable experience that will surely stand the test of time learning to become a Data Analyst with GreaterHeights Academy. The Lecture delivery was so impactful and the Trainer is vast and well knowledgeable in using the applicable tools for the Sessions. Always ready to go extra mile with you. The supports you get during and after the lectures are top notch with materials and resources available to build your confidence on and off the job.

ADEBAYO OLADEJO

Customer Service Advisor (Special Operations)

Data Analysis and Visualization with Python Courses


Start your journey to becoming a data analyst using Python - one of the most popular programming languages in the world. No prior coding experience is required; you’ll start from scratch and learn how to import, clean, manipulate, and visualize data—all integral skills for any aspiring data professional or researcher. You’ll begin your data analyst training with interactive exercises and get hands-on with some of the most popular Python libraries, including pandas, NumPy, Seaborn, and many more. You’ll learn why Python for data analysis is so popular and work with real-world datasets to grow your data manipulation and exploratory data analysis skills. As you progress through the courses, you’ll cover topics such as data manipulation and joining data. You’ll also learn key statistics skills, like hypothesis testing. Get started today, grow your Python skills, and begin your journey to becoming a confident data analyst.

Supercharge your data science skills by learning how to create data visualization in Python. Over four courses and one assessment, you’ll explore Python's most popular and robust data visualization libraries, including Matplotlib, Seaborn, Bokeh, and others, to create beautiful static and interactive visualizations of categorical, aggregated, and geospatial data. Along the way, you’ll develop the essential skills to create informative visualizations that can showcase your data, giving you the confidence to create your own data visualizations with Python. Data visualization is fast becoming an essential skill in industries as diverse as finance, education, healthcare, retail, and more. This track will help you develop practical Python data visualization skills to apply across various data-driven roles, helping you tell stories with your data.


Introduction to Python
An Introduction to Python
Python has grown to become the market leader in programming languages and the language of choice for data analysts and data scientists. Demand for data skills is rising because companies want to gain actionable insights from their data.

Discover the Python Basics
This is a Python course for beginners, and we designed it for people with no prior Python experience. It is even suitable if you have no coding experience at all. You will cover the basics of Python, helping you understand common, everyday functions and applications, including how to use Python as a calculator, understanding variables and types, and building Python lists. The first half of this course prepares you to use Python interactively and teaches you how to store, access, and manipulate data using one of the most popular programming languages in the world.

Explore Python Functions and Packages
The second half of the course starts with a view of how you can use functions, methods, and packages to use code that other Python developers have written. As an open-source language, Python has plenty of existing packages and libraries that you can use to solve your problems.

Get Started with NumPy
NumPy is an essential Python package for data science. You’ll finish this course by learning to use some of the most popular tools in the NumPy array and start exploring data in Python.

4 Modules | 6+ Hours | 4 Skills

Course Modules 


An introduction to the basic concepts of Python. Learn how to use Python interactively and by using a script. Create your first variables and acquaint yourself with Python's basic data types.


  1. Hello Python!
  2. Your first Python code
  3. Any comments?
  4. Python as a calculator
  5. Variables and Types
  6. Variable Assignment
  7. Calculations with variables
  8. Other variable types
  9. Operations with other types

Learn to store, access, and manipulate data in lists: the first step toward efficiently working with huge amounts of data.


  1. Python Lists
  2. Create a list
  3. Create lists with different types
  4. List of lists
  5. Subsetting Lists
  6. Subset and conquer
  7. Slicing and dicing
  8. Subsetting lists of lists
  9. Manipulating Lists
  10. Replace list elements
  11. Extend a list
  12. Delete list elements
  13. Inner workings of lists

You'll learn how to use functions, methods, and packages to efficiently leverage the code that brilliant Python developers have written. The goal is to reduce the amount of code you need to solve challenging problems!


  1. Functions
  2. Familiar functions
  3. Help!
  4. Multiple arguments
  5. Methods
  6. String Methods
  7. List Methods
  8. List Methods (2)
  9. Packages
  10. Import package
  11. Selective import
  12. Different ways of importing

NumPy is a fundamental Python package to efficiently practice data science. Learn to work with powerful tools in the NumPy array, and get started with data exploration.


  1. NumPy
  2. Your First NumPy Array
  3. Baseball players' height
  4. NumPy Side Effects
  5. Subsetting NumPy Arrays
  6. 2D NumPy Arrays
  7. Your First 2D NumPy Array
  8. Baseball data in 2D form
  9. Subsetting 2D NumPy Arrays
  10. 2D Arithmetic
  11. NumPy: Basic Statistics
  12. Average versus median
  13. Explore the baseball data


Intermediate Python
Improve Your Python Skills
Learning Python is crucial for any aspiring data science practitioner. Learn to visualize real data with Matplotlib’s functions and get acquainted with data structures such as the dictionary and pandas DataFrame. This four-hour intermediate course will help you to build on your existing Python skills and explore new Python applications and functions that expand your repertoire and help you work more efficiently.

Learn to Use Python Dictionaries and pandas
Dictionaries offer an alternative to Python lists, while the pandas dataframe is the most popular way of working with tabular data. In the second module of this course, you’ll find out how you can create and manipulate datasets, and how to access them using these structures. Hands-on practice throughout the course will build your confidence in each area.

Explore Python Boolean Logic and Python Loops
In the second half of this course, you’ll look at logic, control flow, filtering and loops. These functions work to control decision-making in Python programs and help you to perform more operations with your data, including repeated statements. You’ll finish the course by applying all of your new skills by using hacker statistics to calculate your chances of winning a bet.

Once you’ve completed all of the modules, you’ll be ready to apply your new skills in your job, new career, or personal project, and be prepared to move onto more advanced Python learning!

5 Modules | 6+ Hours | 5 Skills

Course Modules 


Data visualization is a key skill for aspiring data scientists. Matplotlib makes it easy to create meaningful and insightful plots. In this module, you’ll learn how to build various types of plots, and customize them to be more visually appealing and interpretable.


  1. Basic plots with Matplotlib
  2. Line plot (1)
  3. Line Plot (2): Interpretation
  4. Line plot (3)
  5. Scatter Plot (1)
  6. Scatter plot (2)
  7. Histogram
  8. Build a histogram (1)
  9. Build a histogram (2): bins
  10. Build a histogram (3): compare
  11. Choose the right plot (1)
  12. Choose the right plot (2)
  13. Customization
  14. Labels
  15. Ticks
  16. Sizes
  17. Colors
  18. Additional Customizations
  19. Interpretation

Learn about the dictionary, an alternative to the Python list, and the pandas DataFrame, the de facto standard to work with tabular data in Python. You will get hands-on practice with creating and manipulating datasets, and you’ll learn how to access the information you need from these data structures.


  1. Dictionaries, Part 1
  2. Motivation for dictionaries
  3. Create dictionary
  4. Access dictionary
  5. Dictionaries, Part 2
  6. Dictionary Manipulation (1)
  7. Dictionary Manipulation (2)
  8. Dictionariception
  9. Pandas, Part 1
  10. Dictionary to DataFrame (1)
  11. Dictionary to DataFrame (2)
  12. CSV to DataFrame (1)
  13. CSV to DataFrame (2)
  14. Pandas, Part 2
  15. Square Brackets (1)
  16. Square Brackets (2)
  17. loc and iloc (1)
  18. loc and iloc (2)
  19. loc and iloc (3)

Boolean logic is the foundation of decision-making in Python programs. Learn about different comparison operators, how to combine them with Boolean operators, and how to use the Boolean outcomes in control structures. You'll also learn to filter data in pandas DataFrames using logic.


  1. Comparison Operators
  2. Equality
  3. Greater and less than
  4. Compare arrays
  5. Boolean Operators
  6. and, or, not (1)
  7. and, or, not (2)
  8. Boolean operators with NumPy
  9. if, elif, else
  10. Warmup
  11. if
  12. Add else
  13. Customize further: elif
  14. Filtering pandas DataFrames
  15. Driving right (1)
  16. Driving right (2)
  17. Cars per capita (1)
  18. Cars per capita (2)

There are several techniques you can use to repeatedly execute Python code. While loops are like repeated if statements, the for loop iterates over all kinds of data structures. Learn all about them in this module.


  1. while loop
  2. while: warming up
  3. Basic while loop
  4. Add conditionals
  5. for loop
  6. Loop over a list
  7. Indexes and values (1)
  8. Indexes and values (2)
  9. Loop over list of lists
  10. Loop Data Structures Part 1
  11. Loop over dictionary
  12. Loop over NumPy array
  13. Loop Data Structures Part 2
  14. Loop over DataFrame (1)
  15. Loop over DataFrame (2)
  16. Add column (1)
  17. Add column (2)

This module will allow you to apply all the concepts you've learned in this course. You will use hacker statistics to calculate your chances of winning a bet. Use random number generators, loops, and Matplotlib to gain a competitive edge!


  1. Random Numbers
  2. Random float
  3. Roll the dice
  4. Determine your next move
  5. Random Walk
  6. The next step
  7. How low can you go?
  8. Visualize the walk
  9. Distribution
  10. Simulate multiple walks
  11. Visualize all walks
  12. Implement clumsiness
  13. Plot the distribution
  14. Calculate the odds


Introduction to Functions in Python
It's time to push forward and develop your Python chops even further. Python has tons of fantastic functions and a module ecosystem. However, as a data professional or developer, you'll constantly need to write your own functions to solve problems that are dictated by your data. You will learn the art of function writing in this first course. You'll come out of this course being able to write your very own custom functions, complete with multiple parameters and multiple return values, along with default arguments and variable-length arguments. You'll gain insight into scoping in Python, be able to write lambda functions and handle errors in your function writing practice. You'll wrap up each module by using your new skills to write functions that analyze Twitter data.

3 Modules | 4+ Hours | 3 Skills

Course Modules 


You'll learn how to write simple functions, as well as functions that accept multiple arguments and return multiple values. You'll also have the opportunity to apply these new skills to questions commonly encountered by data professionals and developers.


  1. User-defined functions
  2. Strings in Python
  3. Recapping built-in functions
  4. Write a simple function
  5. Single-parameter functions
  6. Functions that return single values
  7. Multiple parameters and return values
  8. Functions with multiple parameters
  9. A brief introduction to tuples
  10. Functions that return multiple values
  11. Bringing it all together
  12. Bringing it all together (1)
  13. Bringing it all together (2)

You'll learn to write functions with default arguments so that the user doesn't always need to specify them, and variable-length arguments so they can pass an arbitrary number of arguments on to your functions. You'll also learn about the essential concept of scope.


  1. Scope and user-defined functions
  2. Pop quiz on understanding scope
  3. The keyword global
  4. Python's built-in scope
  5. Nested functions
  6. Nested Functions I
  7. Nested Functions II
  8. The keyword nonlocal and nested functions
  9. Default and flexible arguments
  10. Functions with one default argument
  11. Functions with multiple default arguments
  12. Functions with variable-length arguments (*args)
  13. Functions with variable-length keyword arguments (**kwargs)
  14. Bringing it all together
  15. Bringing it all together (1)
  16. Bringing it all together (2)

Learn about lambda functions, which allow you to write functions quickly and on the fly. You'll also practice handling errors in your functions, which is an essential skill. Then, apply your new skills to answer data science questions.


  1. Lambda functions
  2. Pop quiz on lambda functions
  3. Writing a lambda function you already know
  4. Map() and lambda functions
  5. Filter() and lambda functions
  6. Reduce() and lambda functions
  7. Introduction to error handling
  8. Pop quiz about errors
  9. Error handling with try-except
  10. Error handling by raising an error
  11. Bringing it all together
  12. Bringing it all together (1)
  13. Bringing it all together (2)
  14. Bringing it all together (3)
  15. Bringing it all together: testing your error handling skills


Python Toolbox
In this Python Toolbox course, you'll continue to build more advanced Python skills. First, you'll learn about iterators, objects you have already encountered in the context of for loops. You'll then learn about list comprehensions, which are extremely handy tools for all data professionals and developers working in Python. You'll end the course by working through a case study in which you'll apply all the techniques you learned in both parts of this course.

3 Modules | 4+ Hours | 3 Skills

Course Modules 


You'll learn all about iterators and iterables, which you have already worked with when writing for loops. You'll learn some handy functions that will allow you to effectively work with iterators. And you’ll finish the module with a use case that is pertinent to the world of data science and dealing with large amounts of data—in this case, data from Twitter that you will load in chunks using iterators.


  1. Introduction to iterators
  2. Iterators vs. Iterables
  3. Iterating over iterables (1)
  4. Iterating over iterables (2)
  5. Iterators as function arguments
  6. Playing with iterators
  7. Using enumerate
  8. Using zip
  9. Using * and zip to 'unzip'
  10. Using iterators to load large files into memory
  11. Processing large amounts of Twitter data
  12. Extracting information for large amounts of Twitter data

In this module, you'll build on your knowledge of iterators and be introduced to list comprehensions, which allow you to create complicated lists—and lists of lists—in one line of code! List comprehensions can dramatically simplify your code and make it more efficient, and will become a vital part of your Python toolbox. You'll then learn about generators, which are extremely helpful when working with large sequences of data that you may not want to store in memory, but instead generate on the fly.


  1. List comprehensions
  2. Write a basic list comprehension
  3. List comprehension over iterables
  4. Writing list comprehensions
  5. Nested list comprehensions
  6. Advanced comprehensions
  7. Using conditionals in comprehensions (1)
  8. Using conditionals in comprehensions (2)
  9. Dict comprehensions
  10. Introduction to generator expressions
  11. List comprehensions vs. generators
  12. Write your own generator expressions
  13. Changing the output in generator expressions
  14. Build a generator
  15. Wrapping up comprehensions and generators.
  16. List comprehensions for time-stamped data
  17. Conditional list comprehensions for time-stamped data

This module will allow you to apply your newly acquired skills toward wrangling and extracting meaningful information from a real-world dataset—the World Bank's World Development Indicators. You'll have the chance to write your own functions and list comprehensions as you work with iterators and generators to solidify your Python chops.


  1. Welcome to the case study!
  2. Zipping dictionaries
  3. Writing a function to help you
  4. Using a list comprehension
  5. Turning this all into a DataFrame
  6. Using Python generators for streaming data
  7. Processing data in chunks (1)
  8. Writing a generator to load data in chunks (2)
  9. Writing a generator to load data in chunks (3)
  10. Using pandas' read_csv iterator for streaming data
  11. Writing an iterator to load data in chunks (1)
  12. Writing an iterator to load data in chunks (2)
  13. Writing an iterator to load data in chunks (3)
  14. Writing an iterator to load data in chunks (4)
  15. Writing an iterator to load data in chunks (5)
  16. Final thoughts


Data Manipulation With Pandas
Discover Data Manipulation with pandas
With this course, you’ll learn why pandas is the world's most popular Python library, used for everything from data manipulation to data analysis. You’ll explore how to manipulate DataFrames, as you extract, filter, and transform real-world datasets for analysis.

With pandas, you’ll explore all the core data science concepts. Using real-world data, including Walmart sales figures and global temperature time series, you’ll learn how to import, clean, calculate statistics, and create visualizations—using pandas to add to the power of Python.

Work with pandas Data to Explore Core Data Science Concepts
You’ll start by mastering the pandas basics, including how to inspect DataFrames and perform some fundamental manipulations. You’ll also learn about aggregating DataFrames, before moving on to slicing and indexing.

You’ll wrap up the course by learning how to visualize the contents of your DataFrames, working with a dataset that contains weekly US avocado sales.

Learn to Manipulate DataFrames
By completing this pandas course, you’ll understand how to use this Python library for data manipulation. You’ll have an understanding of DataFrames and how to use them, as well as be able to visualize your data in Python.

4 Modules | 5+ Hours | 4 Skills

Course Modules 


Let’s master the pandas basics. Learn how to inspect DataFrames and perform fundamental manipulations, including sorting rows, subsetting, and adding new columns.


  1. Introducing DataFrames
  2. Inspecting a DataFrame
  3. Parts of a DataFrame
  4. Sorting and subsetting
  5. Sorting rows
  6. Subsetting columns
  7. Subsetting rows
  8. Subsetting rows by categorical variables
  9. New columns
  10. Adding new columns
  11. Combo-attack!

In this module, you’ll calculate summary statistics on DataFrame columns, and master grouped summary statistics and pivot tables.


  1. Summary statistics
  2. Mean and median
  3. Summarizing dates
  4. Efficient summaries
  5. Cumulative statistics
  6. Counting
  7. Dropping duplicates
  8. Counting categorical variables
  9. Grouped summary statistics
  10. What percent of sales occurred at each store type?
  11. Calculations with .groupby()
  12. Multiple grouped summaries
  13. Pivot tables
  14. Pivoting on one variable
  15. Fill in missing values and sum values with pivot tables

Indexes are supercharged row and column names. Learn how they can be combined with slicing for powerful DataFrame subsetting.


  1. Explicit indexes
  2. Setting and removing indexes
  3. Subsetting with .loc[]
  4. Setting multi-level indexes
  5. Sorting by index values
  6. Slicing and subsetting with .loc and .iloc
  7. Slicing index values
  8. Slicing in both directions
  9. Slicing time series
  10. Subsetting by row/column number
  11. Working with pivot tables
  12. Pivot temperature by city and year
  13. Subsetting pivot tablesp
  14. Calculating on a pivot table

Learn to visualize the contents of your DataFrames, handle missing data values, and import data from and export data to CSV files.


  1. Visualizing your data
  2. Which avocado size is most popular?
  3. Changes in sales over time
  4. Avocado supply and demand
  5. Price of conventional vs. organic avocados
  6. Missing values
  7. Finding missing values
  8. Removing missing values
  9. Replacing missing values
  10. Creating DataFrames
  11. List of dictionaries
  12. Dictionary of lists
  13. Reading and writing CSVs
  14. CSV to DataFrame
  15. DataFrame to CSV
  16. Wrap-up


Joining Data with Pandas

Being able to combine and work with multiple datasets is an essential skill for any aspiring Data Scientist. pandas is a crucial cornerstone of the Python data science ecosystem, with Stack Overflow recording 5 million views for pandas questions. Learn to handle multiple DataFrames by combining, organizing, joining, and reshaping them using pandas. You'll work with datasets from the World Bank and the City Of Chicago. You will finish the course with a solid skillset for data-joining in pandas.

4 Modules | 5+ Hours | 4 Skills

Course Modules 


Learn how you can merge disparate data using inner joins. By combining information from multiple sources you’ll uncover compelling insights that may have previously been hidden. You’ll also learn how the relationship between those sources, such as one-to-one or one-to-many, can affect your result.


  1. Inner join
  2. What column to merge on?
  3. Your first inner join
  4. Inner joins and number of rows returned
  5. One-to-many relationships
  6. One-to-many classification
  7. One-to-many merge
  8. Merging multiple DataFrames
  9. Total riders in a month
  10. Three table merge
  11. One-to-many merge with multiple tables

Take your knowledge of joins to the next level. In this module, you’ll work with TMDb movie data as you learn about left, right, and outer joins. You’ll also discover how to merge a table to itself and merge on a DataFrame index.


  1. Left join
  2. Counting missing rows with left join
  3. Enriching a dataset
  4. How many rows with a left join?
  5. Other joins
  6. Right join to find unique movies
  7. Popular genres with right join
  8. Using outer join to select actors
  9. Merging a table to itself
  10. Self join
  11. How does pandas handle self joins?
  12. Merging on indexes
  13. Index merge for movie ratings
  14. Do sequels earn more?

In this module, you’ll leverage powerful filtering techniques, including semi-joins and anti-joins. You’ll also learn how to glue DataFrames by vertically combining and using the pandas.concat function to create new datasets. Finally, because data is rarely clean, you’ll also learn how to validate your newly combined data structures.


  1. Filtering joins
  2. Steps of a semi join
  3. Performing an anti join
  4. Performing a semi join
  5. Concatenate DataFrames together vertically
  6. Concatenation basics
  7. Concatenating with keys
  8. Verifying integrity
  9. Validating a merge
  10. Concatenate and merge to find common songs

In this final module, you’ll step up a gear and learn to apply pandas' specialized methods for merging time-series and ordered data together with real-world financial and economic data from the city of Chicago. You’ll also learn how to query resulting tables using a SQL-style format, and unpivot data using the melt method.


  1. Using merge_ordered()
  2. Correlation between GDP and S&P500
  3. Phillips curve using merge_ordered()
  4. merge_ordered() caution, multiple columns
  5. Using merge_asof()
  6. Using merge_asof() to study stocks
  7. Using merge_asof() to create dataset
  8. merge_asof() and merge_ordered() differences
  9. Selecting data with .query()
  10. Explore financials with .query()
  11. Subsetting rows with .query()
  12. Reshaping data with .melt()
  13. Select the right .melt() arguments
  14. Using .melt() to reshape government data
  15. Using .melt() for stocks vs bond performance
  16. Course wrap-up


Introduction to Statistics in Python
Statistics is the study of how to collect, analyze, and draw conclusions from data. It’s a hugely valuable tool that you can use to bring the future into focus and infer the answer to tons of questions. For example, what is the likelihood of someone purchasing your product, how many calls will your support team receive, and how many jeans sizes should you manufacture to fit 95% of the population? In this course, you'll discover how to answer questions like these as you grow your statistical skills and learn how to calculate averages, use scatterplots to show the relationship between numeric values, and calculate correlation. You'll also tackle probability, the backbone of statistical reasoning, and learn how to use Python to conduct a well-designed study to draw your own conclusions from data.

4 Modules | 5+ Hours | 4 Skills

Course Modules 


Summary statistics gives you the tools you need to boil down massive datasets to reveal the highlights. In this module, you'll explore summary statistics including mean, median, and standard deviation, and learn how to accurately interpret them. You'll also develop your critical thinking skills, allowing you to choose the best summary statistics for your data.


  1. What is statistics?
  2. Descriptive and inferential statistics
  3. Data type classification
  4. Measures of center
  5. Mean and median
  6. Mean vs. median
  7. Measures of spread
  8. Quartiles, quantiles, and quintiles
  9. Variance and standard deviation
  10. Finding outliers using IQR

In this module, you'll learn how to generate random samples and measure chance using probability. You'll work with real-world sales data to calculate the probability of a salesperson being successful. Finally, you’ll use the binomial distribution to model events with binary outcomes.


  1. What are the chances?
  2. With or without replacement?
  3. Calculating probabilities
  4. Sampling deals
  5. Discrete distributions
  6. Creating a probability distribution
  7. Identifying distributions
  8. Expected value vs. sample mean
  9. Continuous distributions
  10. Which distribution?
  11. Data back-ups
  12. Simulating wait times
  13. The binomial distribution
  14. Simulating sales deals
  15. Calculating binomial probabilities
  16. How many sales will be won?

It’s time to explore one of the most important probability distributions in statistics, normal distribution. You’ll create histograms to plot normal distributions and gain an understanding of the central limit theorem, before expanding your knowledge of statistical functions by adding the Poisson, exponential, and t-distributions to your repertoire.


  1. The normal distribution
  2. Distribution of Amir's sales
  3. Probabilities from the normal distribution
  4. Simulating sales under new market conditions
  5. Which market is better?
  6. The central limit theorem
  7. Visualizing sampling distributions
  8. The CLT in action
  9. The mean of means
  10. The Poisson distribution
  11. Identifying lambda
  12. Tracking lead responses
  13. More probability distributions
  14. Distribution dragging and dropping
  15. Modeling time between leads
  16. The t-distribution

In this module, you'll learn how to quantify the strength of a linear relationship between two variables, and explore how confounding variables can affect the relationship between two other variables. You'll also see how a study’s design can influence its results, change how the data should be analyzed, and potentially affect the reliability of your conclusions.


  1. Correlation
  2. Guess the correlation
  3. Relationships between variables
  4. Correlation caveats
  5. What can't correlation measure?
  6. Transforming variables
  7. Does sugar improve happiness?
  8. Confounders
  9. Design of experiments
  10. Study types
  11. Longitudinal vs. cross-sectional studies
  12. Course Wrap up!


Introduction to Data Visualization with Seaborn
Create Your Own Seaborn Plots
Seaborn is a powerful Python library that makes it easy to create informative and attractive data visualizations. This 4-hour course provides an introduction to how you can use Seaborn to create a variety of plots, including scatter plots, count plots, bar plots, and box plots, and how you can customize your visualizations.

Turn Real Datasets into Custom Seaborn Visualizations
You’ll explore this library and create your Seaborn plots based on a variety of real-world data sets, including exploring how air pollution in a city changes through the day and looking at what young people like to do in their free time. This data will give you the opportunity to find out about Seaborn’s advantages first hand, including how you can easily create subplots in a single figure and how to automatically calculate confidence intervals.

Improve Your Data Communication Skills
By the end of this course, you’ll be able to use Seaborn in various situations to explore your data and effectively communicate the results of your data analysis to others. These skills are highly sought-after for data analysts, data scientists, and any other job that may involve creating data visualizations. If you’d like to continue your learning, this course is part of several tracks, including the Data Visualization track, where you can add more libraries and techniques to your skillset.

4 Modules | 5+ Hours | 4 Skills

Course Modules 


What is Seaborn, and when should you use it? In this module, you will find out! Plus, you will learn how to create scatter plots and count plots with both lists of data and pandas DataFrames. You will also be introduced to one of the big advantages of using Seaborn - the ability to easily add a third variable to your plots by using color to represent different subgroups.


  1. Introduction to Seaborn
  2. Making a scatter plot with lists
  3. Making a count plot with a list
  4. Using pandas with Seaborn
  5. "Tidy" vs. "untidy" data
  6. Making a count plot with a DataFrame
  7. Adding a third variable with hue
  8. Hue and scatter plots
  9. Hue and count plots

In this module, you will create and customize plots that visualize the relationship between two quantitative variables. To do this, you will use scatter plots and line plots to explore how the level of air pollution in a city changes over the course of a day and how horsepower relates to fuel efficiency in cars. You will also see another big advantage of using Seaborn - the ability to easily create subplots in a single figure!


  1. Introduction to relational plots and subplots
  2. Creating subplots with col and row
  3. Creating two-factor subplots
  4. Customizing scatter plots
  5. Changing the size of scatter plot points
  6. Changing the style of scatter plot points
  7. Introduction to line plots
  8. Interpreting line plots
  9. Visualizing standard deviation with line plots
  10. Plotting subgroups in line plots

Categorical variables are present in nearly every dataset, but they are especially prominent in survey data. In this module, you will learn how to create and customize categorical plots such as box plots, bar plots, count plots, and point plots. Along the way, you will explore survey data from young people about their interests, students about their study habits, and adult men about their feelings about masculinity.


  1. Count plots and bar plots
  2. Count plots
  3. Bar plots with percentages
  4. Customizing bar plots
  5. Box plots
  6. Create and interpret a box plot
  7. Omitting outliers
  8. Adjusting the whiskers
  9. Point plots
  10. Customizing point plots
  11. Point plots with subgroups

In this final module, you will learn how to add informative plot titles and axis labels, which are one of the most important parts of any data visualization! You will also learn how to customize the style of your visualizations in order to more quickly orient your audience to the key takeaways. Then, you will put everything you have learned together for the final exercises of the course!


  1. Changing plot style and color
  2. Changing style and palette
  3. Changing the scale
  4. Using a custom palette
  5. Adding titles and labels: Part 1
  6. FacetGrids vs. AxesSubplots
  7. Adding a title to a FacetGrid object
  8. Adding titles and labels: Part 2
  9. Adding a title and axis labels
  10. Rotating x-tick labels
  11. Putting it all together
  12. Box plot with subgroups
  13. Bar plot with subgroups and subplots
  14. Wrap up!


Exploratory Data Analysis in Python
So you’ve got some interesting data - where do you begin your analysis? This course will cover the process of exploring and analyzing data, from understanding what’s included in a dataset to incorporating exploration findings into a data science workflow.

Using data on unemployment figures and plane ticket prices, you’ll leverage Python to summarize and validate data, calculate, identify and replace missing values, and clean both numerical and categorical values. Throughout the course, you’ll create beautiful Seaborn visualizations to understand variables and their relationships.

For example, you’ll examine how alcohol use and student performance are related. Finally, the course will show how exploratory findings feed into data science workflows by creating new features, balancing categorical features, and generating hypotheses from findings.

By the end of this course, you’ll have the confidence to perform your own exploratory data analysis (EDA) in Python.You’ll be able to explain your findings visually to others and suggest the next steps for gathering insights from your data!

4 Modules | 5+ Hours | 4 Skills

Course Modules 


What's the best way to approach a new dataset? Learn to validate and summarize categorical and numerical data and create Seaborn visualizations to communicate your findings.


  1. Initial exploration
  2. Functions for initial exploration
  3. Counting categorical values
  4. Global unemployment in 2021
  5. Data validation
  6. Detecting data types
  7. Validating continents
  8. Validating range
  9. Data summarization
  10. Summaries with .groupby() and .agg()
  11. Named aggregations
  12. Visualizing categorical summaries

Exploring and analyzing data often means dealing with missing values, incorrect data types, and outliers. In this module, you’ll learn techniques to handle these issues and streamline your EDA processes!


  1. Addressing missing data
  2. Dealing with missing data
  3. Strategies for remaining missing data
  4. Imputing missing plane prices
  5. Converting and analyzing categorical data
  6. Finding the number of unique values
  7. Flight duration categories
  8. Adding duration categories
  9. Working with numeric data
  10. Flight duration
  11. Adding descriptive statistics
  12. Handling outliers
  13. What to do with outliers
  14. Identifying outliers
  15. Removing outliers

Variables in datasets don't exist in a vacuum; they have relationships with each other. In this module, you'll look at relationships across numerical, categorical, and even DateTime data, exploring the direction and strength of these relationships as well as ways to visualize them.


  1. Patterns over time
  2. Importing DateTime data
  3. Updating data type to DateTime
  4. Visualizing relationships over time
  5. Correlation
  6. Interpreting a heatmap
  7. Visualizing variable relationships
  8. Visualizing multiple variable relationships
  9. Factor relationships and distributions
  10. Categorical data in scatter plots
  11. Exploring with KDE plots

Exploratory data analysis is a crucial step in the data science workflow, but it isn't the end! Now it's time to learn techniques and considerations you can use to successfully move forward with your projects after you've finished exploring!


  1. Considerations for categorical data
  2. Checking for class imbalance
  3. Cross-tabulation
  4. Generating new features
  5. Extracting features for correlation
  6. Calculating salary percentiles
  7. Categorizing salaries
  8. Generating hypotheses
  9. Comparing salaries
  10. Choosing a hypothesis
  11. Recap!


Sampling in Python
Sampling in Python is the cornerstone of inference statistics and hypothesis testing. It's a powerful skill used in survey analysis and experimental design to draw conclusions without surveying an entire population. In this Sampling in Python course, you’ll discover when to use sampling and how to perform common types of sampling—from simple random sampling to more complex methods like stratified and cluster sampling. Using real-world datasets, including coffee ratings, Spotify songs, and employee attrition, you’ll learn to estimate population statistics and quantify uncertainty in your estimates by generating sampling distributions and bootstrap distributions.

4 Modules | 5+ Hours | 4 Skills

Course Modules 


Learn what sampling is and why it is so powerful. You’ll also learn about the problems caused by convenience sampling and the differences between true randomness and pseudo-randomness.


  1. Sampling and point estimates
  2. Reasons for sampling
  3. Simple sampling with pandas
  4. Simple sampling and calculating with NumPy
  5. Convenience sampling
  6. Are findings from the sample generalizable?
  7. Are these findings generalizable?
  8. Pseudo-random number generation
  9. Generating random numbers
  10. Understanding random seeds

It’s time to get hands-on and perform the four random sampling methods in Python: simple, systematic, stratified, and cluster.


  1. Simple random and systematic sampling
  2. Simple random sampling
  3. Systematic sampling
  4. Is systematic sampling OK?
  5. Stratified and weighted random sampling
  6. Which sampling method?
  7. Proportional stratified sampling
  8. Equal counts stratified sampling
  9. Weighted sampling
  10. Cluster sampling
  11. Benefits of clustering
  12. Performing cluster sampling
  13. Comparing sampling methods
  14. 3 kinds of sampling
  15. Comparing point estimates

Let’s test your sampling. In this module, you’ll discover how to quantify the accuracy of sample statistics using relative errors, and measure variation in your estimates by generating sampling distributions.


  1. Relative error of point estimates
  2. Calculating relative errors
  3. Relative error vs. sample size
  4. Creating a sampling distribution
  5. Replicating samples
  6. Replication parameters
  7. Approximate sampling distributions
  8. Exact sampling distribution
  9. Generating an approximate sampling distribution
  10. Exact vs. approximate
  11. Standard errors and the Central Limit Theorem
  12. Population & sampling distribution means
  13. Population & sampling distribution variation

You’ll get to grips with resampling to perform bootstrapping and estimate variation in an unknown population. You’ll learn the difference between sampling distributions and bootstrap distributions using resampling.


  1. Introduction to bootstrapping
  2. Principles of bootstrapping
  3. With or without replacement?
  4. Generating a bootstrap distribution
  5. Comparing sampling and bootstrap distributions
  6. Bootstrap statistics and population statistics
  7. Sampling distribution vs. bootstrap distribution
  8. Compare sampling and bootstrap means
  9. Compare sampling and bootstrap standard deviations
  10. Confidence intervals
  11. Confidence interval interpretation
  12. Calculating confidence intervals
  13. Recap!


Hypothesis Testing in Python
Hypothesis testing lets you answer questions about your datasets in a statistically rigorous way. In this course, you'll grow your Python analytical skills as you learn how and when to use common tests like t-tests, proportion tests, and chi-square tests. Working with real-world data, including Stack Overflow user feedback and supply-chain data for medical supply shipments, you'll gain a deep understanding of how these tests work and the key assumptions that underpin them. You'll also discover how non-parametric tests can be used to go beyond the limitations of traditional hypothesis tests.

4 Modules | 5+ Hours | 4 Skills

Course Modules 


How does hypothesis testing work and what problems can it solve? To find out, you’ll walk through the workflow for a one sample proportion test. In doing so, you'll encounter important concepts like z-scores, p-values, and false negative and false positive errors.


  1. Hypothesis tests and z-scores
  2. Uses of A/B testing
  3. Calculating the sample mean
  4. Calculating a z-score
  5. p-values
  6. Criminal trials and hypothesis tests
  7. Left tail, right tail, two tails
  8. Calculating p-values
  9. Statistical significance
  10. Decisions from p-values
  11. Calculating a confidence interval
  12. Type I and type II errors

In this module, you’ll learn how to test for differences in means between two groups using t-tests and extend this to more than two groups using ANOVA and pairwise t-tests.


  1. Performing t-tests
  2. Hypothesis testing workflow
  3. Two sample mean test statistic
  4. Calculating p-values from t-statistics
  5. Why is t needed?
  6. The t-distribution
  7. From t to p
  8. Paired t-tests
  9. Is pairing needed?
  10. Visualizing the difference
  11. Using ttest()
  12. ANOVA tests
  13. Visualizing many categories
  14. Conducting an ANOVA test
  15. Pairwise t-tests

Now it’s time to test for differences in proportions between two groups using proportion tests. Through hands-on exercises, you’ll extend your proportion tests to more than two groups with chi-square independence tests, and return to the one sample case with chi-square goodness of fit tests.


  1. One-sample proportion tests
  2. t for proportions?
  3. Test for single proportions
  4. Two-sample proportion tests
  5. Test of two proportions
  6. proportions_ztest() for two samples
  7. Chi-square test of independence
  8. The chi-square distribution
  9. How many tails for chi-square tests?
  10. Performing a chi-square test
  11. Chi-square goodness of fit tests
  12. Visualizing goodness of fit
  13. Performing a goodness of fit test

Finally, it’s time to learn about the assumptions made by parametric hypothesis tests, and see how non-parametric tests can be used when those assumptions aren't met.


  1. Assumptions in hypothesis testing
  2. Common assumptions of hypothesis tests
  3. Testing sample size
  4. Non-parametric tests
  5. Which parametric test?
  6. Wilcoxon signed-rank test
  7. Non-parametric ANOVA and unpaired t-tests
  8. Wilcoxon-Mann-Whitney
  9. Kruskal-Wallis
  10. Recap!


Introduction to Data Visualization with Matplotlib 
Visualizing data in plots and figures exposes the underlying patterns in the data and provides insights. Good visualizations also help you communicate your data to others, and are useful to data analysts and other consumers of the data. In this course, you will learn how to use Matplotlib, a powerful Python data visualization library. Matplotlib provides the building blocks to create rich visualizations of many different kinds of datasets. You will learn how to create visualizations for different kinds of data and how to customize, automate, and share these visualizations.

4 Modules | 5+ Hours | 4 Skills

Course Modules 


This module introduces the Matplotlib visualization library and demonstrates how to use it with data.


  1. Introduction to data visualization with Matplotlib
  2. Using the matplotlib.pyplot interface
  3. Adding data to an Axes object
  4. Customizing your plots
  5. Customizing data appearance
  6. Customizing axis labels and adding titles
  7. Small multiples
  8. Creating a grid of subplots
  9. Creating small multiples with plt.subplots
  10. Small multiples with shared y axis

Time series data is data that is recorded. Visualizing this type of data helps clarify trends and illuminates relationships between data.


  1. Plotting time-series data
  2. Read data with a time index
  3. Plot time-series data
  4. Using a time index to zoom in
  5. Plotting time-series with different variables
  6. Plotting two variables
  7. Defining a function that plots time-series data
  8. Using a plotting function
  9. Annotating time-series data
  10. Annotating a plot of time-series data
  11. Plotting time-series: putting it all together

Visualizations can be used to compare data in a quantitative manner. This module explains several methods for quantitative visualizations.


  1. Quantitative comparisons: bar-charts
  2. Bar chart
  3. Stacked bar chart
  4. Quantitative comparisons: histograms
  5. Creating histograms
  6. "Step" histogram
  7. Statistical plotting
  8. Adding error-bars to a bar chart
  9. Adding error-bars to a plot
  10. Creating boxplots
  11. Quantitative comparisons: scatter plots
  12. Simple scatter plot
  13. Encoding time by color

This module shows you how to share your visualizations with others: how to save your figures as files, how to adjust their look and feel, and how to automate their creation based on input data.


  1. Preparing your figures to share with others
  2. Selecting a style for printing
  3. Switching between styles
  4. Saving your visualizations
  5. Saving a file several times
  6. Save a figure with different sizes
  7. Automating figures from data
  8. Unique values of a column
  9. Automate your visualization
  10. Where to go next


Improving Your Data Visualizations in Python
Great data visualization is the cornerstone of impactful data science. Visualization helps you to both find insight in your data and share those insights with your audience. Everyone learns how to make a basic scatter plot or bar chart on their journey to becoming a data scientist, but the true potential of data visualization is realized when you take a step back and think about what, why, and how you are visualizing your data. In this course you will learn how to construct compelling and attractive visualizations that help you communicate the results of your analyses efficiently and effectively. We will cover comparing data, the ins and outs of color, showing uncertainty, and how to build the right visualization for your given audience through the investigation of a datasets on air pollution around the US and farmer's markets. We will finish the course by examining open-access farmers market data to build a polished and impactful visual report.

4 Modules | 5+ Hours | 4 Skills

Course Modules 


How do you show all of your data while making sure that viewers don't miss an important point or points? Here we discuss how to guide your viewer through the data with color-based highlights and text. We also introduce a dataset on common pollutant values across the United States.


  1. Highlighting data
  2. Hardcoding a highlight
  3. Programmatically creating a highlight
  4. Comparing groups
  5. Comparing with two KDEs
  6. Improving your KDEs
  7. Beeswarms
  8. Annotations
  9. A basic text annotation
  10. Arrow annotations
  11. Combining annotations and color

Color is a powerful tool for encoded values in data visualization. However, with this power comes danger. In this module, we talk about how to choose an appropriate color palette for your visualization based upon the type of data it is showing.


  1. Color in visualizations
  2. Getting rid of unnecessary color
  3. Fixing Seaborn's bar charts
  4. Continuous color palettes
  5. Making a custom continuous palette
  6. Customizing a diverging palette heatmap
  7. Adjusting your palette according to context
  8. Categorical palettes
  9. Using a custom categorical palette
  10. Dealing with too many categories
  11. Coloring ordinal categories
  12. Choosing the right variable to encode with color

Uncertainty occurs everywhere in data science, but it's frequently left out of visualizations where it should be included. Here, we review what a confidence interval is and how to visualize them for both single estimates and continuous functions. Additionally, we discuss the bootstrap resampling technique for assessing uncertainty and how to visualize it properly.


  1. Point estimate intervals
  2. Basic confidence intervals
  3. Annotating confidence intervals
  4. Confidence bands
  5. Making a confidence band
  6. Separating a lot of bands
  7. Cleaning up bands for overlaps
  8. Beyond 95%
  9. 90, 95, and 99% intervals
  10. 90 and 95% bands
  11. Using band thickness instead of coloring
  12. Visualizing the bootstrap
  13. The bootstrap histogram
  14. Bootstrapped regressions
  15. Lots of bootstraps with beeswarms

Often visualization is taught in isolation, with best practices only discussed in a general way. In reality, you will need to bend the rules for different scenarios. From messy exploratory visualizations to polishing the font sizes of your final product; in this module, we dive into how to optimize your visualizations at each step of a data science workflow.


  1. First explorations
  2. Looking at the farmers market data
  3. Scatter matrix of numeric columns
  4. Digging in with basic transforms
  5. Exploring the patterns
  6. Is latitude related to months open?
  7. What state is the most market-friendly?
  8. Popularity of goods sold by state
  9. Making your visualizations efficient
  10. Stacking to find trends
  11. Using a plot as a legend
  12. Tweaking your plots
  13. Cleaning up the background
  14. Remixing a plot
  15. Enhancing legibility
  16. Recap!


Visualizing Geospatial Data in Python
One of the most important tasks of a data scientist is to understand the relationships between their data's physical location and their geographical context. In this course you'll be learning to make attractive visualizations of geospatial data with the GeoPandas package. You will learn to spatially join datasets, linking data to context. Finally you will learn to overlay geospatial data to maps to add even more spatial cues to your work. You will use several datasets from the City of Nashville's open data portal to find out where the chickens are in Nashville, which neighborhood has the most public art, and more!

4 Modules | 5+ Hours | 4 Skills

Course Modules 


In this module, you will learn how to create a two-layer map by first plotting regions from a shapefile and then plotting location points as a scatterplot.


  1. Introduction
  2. Plotting a scatterplot from longitude and latitude
  3. Styling a scatterplot
  4. Extracting longitude and latitude
  5. Plotting chicken locations
  6. Geometries and shapefiles
  7. Creating a GeoDataFrame & examining the geometry
  8. Plotting shapefile polygons
  9. Scatterplots over polygons
  10. Geometry
  11. Plotting points over polygons - part 1
  12. Plotting points over polygons - part 2

You'll work with GeoJSON to create polygonal plots, learn about projections and coordinate reference systems, and get practice spatially joining data in this module.


  1. GeoJSON and plotting with geopandas
  2. Working with GeoJSON
  3. Colormaps
  4. Map Nashville neighborhoods
  5. Projections and coordinate reference systems
  6. Changing coordinate reference systems
  7. Construct a GeoDataFrame from a DataFrame
  8. Spatial joins
  9. Spatial join practice
  10. Finding the neighborhood with the most public art
  11. Aggregating points within polygons
  12. Plotting the Urban Residents neighborhood and art

First you will learn to get information about the geometries in your data with three different GeoSeries attributes and methods. Then you will learn to create a street map layer using folium.


  1. GeoSeries attributes and methods I
  2. Find the area of the Urban Residents neighborhood
  3. GeoSeries attributes and methods II
  4. The center of the Urban Residents neighborhood
  5. Prepare to calculate distances
  6. Art distances from neighborhood center
  7. Street maps with folium
  8. Create a folium location from the urban centroid
  9. Create a folium map of downtown Nashville
  10. Folium street map of the downtown neighborhood
  11. Creating markers and popups in folium
  12. Adding markers for the public art
  13. Troubleshooting data issues
  14. A map of downtown art

In this module, you will learn about a special map called a choropleth. Then you will learn and practice building choropleths using two different packages: geopandas and folium.


  1. What is a choropleth?
  2. Finding counts from a spatial join
  3. Council district areas and permit counts
  4. Calculating a normalized metric
  5. Choropleths with geopandas
  6. Geopandas choropleths
  7. Area in km squared, geometry in decimal degrees
  8. Spatially joining and getting counts
  9. Building a polished Geopandas choropleth
  10. Choropleths with folium
  11. Folium choropleth
  12. Folium choropleth with markers and popups
  13. Closing thoughts

DATA ANALYSIS & VISUALIZATION WITH PYTHON COST


United States

$799.99

United Kingdom

£899.99

Career and Certifications


GreaterHeight Academy's Certificate Holders also prepared work at companies like:



Our Advisor is just a CALL away

+1 5169831065                                    +447474275645
Available 24x7 for your queries


Talk to our advisors

Our advisors will get in touch with you in the next 24 hours.


Get Advice


FAQs

Complete Data Analysis & Visualization with Python Course

  • Python, created by Guido van Rossum in 1991, is a high-level, readable programming language known for its simplicity. It's versatile, with applications in web development, data analysis, AI, and more. Python's extensive standard library and rich ecosystem enhance its capabilities. It's cross-platform compatible and supported by a large community. Python's popularity has grown, making it widely used in diverse industries.

  • A Python developer is a software developer or programmer who specializes in using the Python programming language for creating applications, software, or solutions. They have expertise in writing Python code, understanding the language's syntax, libraries, and frameworks. Python developers are skilled in utilizing Python's features to develop web applications, data analysis tools, machine learning models, automation scripts, and other software solutions.
  • They work in various industries, collaborating with teams or independently to design, implement, test, and maintain Python-based projects. Python developers often possess knowledge of related technologies and tools to enhance their development process.

  • Python Developer Masters Program is a structured learning path recommended by leading industry experts and ensures that you transform into a proficient Python Developer. Being a full fledged Python Developer requires you to master multiple technologies and this program aims at providing you an in-depth knowledge of the entire Python programming practices. Individual courses at GreaterHeight Academy focus on specialization in one or two specific skills; however, if you intend to become a master in Python programming then this is your go to path to follow.

  • Yes. But you can also raise a ticket with the dedicated support team at any time. If your query does not get resolved through email, we can also arrange one-on-one sessions with our support team. However, our support is provided for a period of Twelve Weeks from the start date of your course.

There are several reasons why becoming a Python developer can be a rewarding career choice. Here are a few:

  • Versatility and Popularity: Python is a versatile programming language that can be used for various purposes, such as web development, data analysis, machine learning, artificial intelligence, scientific computing, and more. It has gained immense popularity in recent years due to its simplicity, readability, and extensive library ecosystem. Python is widely used in both small-scale and large-scale projects, making it a valuable skill in the job market.
  • Ease of Learning: Python has a clean and intuitive syntax that emphasizes readability, which makes it relatively easy to learn compared to other programming languages. Its simplicity allows beginners to grasp the fundamentals quickly and start building useful applications in a relatively short amount of time. This accessibility makes Python an attractive choice for both novice and experienced programmers.
  • Rich Ecosystem and Libraries: Python offers a vast collection of libraries and frameworks that can accelerate development and simplify complex tasks. For example, Django and Flask are popular web development frameworks that provide robust tools for building scalable and secure web applications. NumPy, Pandas, and Matplotlib are widely used libraries for data analysis and visualization. TensorFlow and PyTorch are prominent libraries for machine learning and deep learning. These libraries, among many others, contribute to Python's efficiency and effectiveness as a development language.
  • Job Opportunities: The demand for Python developers has been steadily growing in recent years. Many industries, including technology, finance, healthcare, and academia, rely on Python for various applications. By becoming a Python developer, you open up a wide range of career opportunities, whether you choose to work for a large corporation, a startup, or even as a freelancer. Additionally, Python's versatility allows you to explore different domains and switch roles if desired.
  • Community and Support: Python has a vibrant and supportive community of developers worldwide. This community actively contributes to the language's development, creates open-source libraries, and provides assistance through forums, online communities, and resources.

  • There are no prerequisites for enrollment to this Masters Program. Whether you are an experienced professional working in the IT industry or an aspirant planning to enter the world of Python programming, this masters program is designed and developed to accommodate various professional backgrounds.

  • Python Developer Masters Program has been curated after thorough research and recommendations from industry experts. It will help you differentiate yourself with multi-platform fluency and have real-world experience with the most important tools and platforms. GreaterHeight Academy will be by your side throughout the learning journey - We’re Ridiculously Committed.

  • The recommended duration to complete this Python Developer Masters Program is about 20 weeks, however, it is up to the individual to complete this program at their own pace.

The roles and responsibilities of a Python developer may vary depending on the specific job requirements and industry. However, here are some common tasks and responsibilities associated with the role:

  1. Developing Applications: Python developers are responsible for designing, coding, testing, and debugging applications using Python programming language. This includes writing clean, efficient, and maintainable code to create robust software solutions.
  2. Web Development: Python is widely used for web development. As a Python developer, you may be involved in building web applications, using frameworks like Django or Flask. This includes developing backend logic, integrating databases, handling data processing, and ensuring the smooth functioning of the web application.
  3. Data Analysis and Visualization: Python offers powerful libraries like NumPy, Pandas, and Matplotlib, which are extensively used for data analysis and visualization. Python developers may be responsible for manipulating and analyzing large datasets, extracting insights, and presenting them visually.
  4. Machine Learning and AI: Python is a popular choice for machine learning and artificial intelligence projects. Python developers may work on implementing machine learning algorithms, training models, and integrating them into applications. This involves using libraries like TensorFlow, PyTorch, or scikit-learn.
  5. Collaborating and Teamwork: Python developers often work as part of a development team. They collaborate with other team members, including designers, frontend developers, project managers, and stakeholders. Effective communication and teamwork skills are crucial to ensure smooth project execution.
  6. Documentation: Python developers are expected to document their code, providing clear explanations and instructions for others who may work on or maintain the codebase in the future. Documentation helps in understanding the code and facilitating collaboration.
  7. Continuous Learning: Technology is constantly evolving, and as a Python developer, you need to stay updated with the latest advancements, libraries, frameworks, and best practices. Continuous learning and self-improvement are essential to excel in this role.

The Python Developer training course is for those who want to fast-track their Python programming career. This Python Developer Masters Program will benefit people working in the following roles:

  1. Freshers
  2. Engineers
  3. IT professionals
  4. Data Scientist
  5. Machine Learning Engineer
  6. AI Engineer
  7. Business analysts
  8. Data analysts

  • Top companies such as Microsoft, Google, Meta, Citibank, Well Fargo, and many more are actively hiring certified Python professionals at various positions.

  • On completing this Python Developer Masters Program, you’ll be eligible for the roles like: Python Developer, Web Developer, Data Analyst, Data Scientist, Software Engineer and many more.

  • There is undoubtedly great demand for data analytics as 96% of organizations seek to hire Data Analysts. The most significant data analyst companies that employ graduates who wish to have a data analyst career are Manthan, SAP, Oracle, Accenture Analytics, Alteryx, Qlik, Mu Sigma Analytics, Fractal Analytics, and Tiger Analytics. Professional Data Analyst training will make you become a magician of any organization, and you will spin insights by playing with big data.

A successful data analyst possesses a combination of technical skills and leadership skills.

  • Technical skills include knowledge of database languages such as SQL, R, or Python; spreadsheet tools such as Microsoft Excel or Google Sheets for statistical analysis; and data visualization software such as Tableau or Qlik. Mathematical and statistical skills are also valuable to help gather, measure, organize, and analyze data while using these common tools.
  • Leadership skills prepare a data analyst to complete decision-making and problem-solving tasks. These abilities allow analysts to think strategically about the information that will help stakeholders make data-driven business decisions and to communicate the value of this information effectively. For example, project managers rely on data analysts to track the most important metrics for their projects, to diagnose problems that may be occurring, and to predict how different courses of action could address a problem.

Career openings are available practically from all industries, from telecommunications to retail, banking, healthcare, and even fitness. Without extensive training and effort, it isn't easy to get data analyst career benefits. So, earning our Data Analyst certification will allow you to keep up-to-date on recent trends in the industry.

  • Yes, we do. We will discuss all possible technical interview questions and answers during the training program so that you can prepare yourself for interview.

  • No. Any abuse of copyright is taken seriously. Thanks for your understanding on this one.

  • Yes, we would be providing you with the certificate of completion of the program once you have successfully submitted all the assessment and it has been verified by our subject matter experts.

  • GreaterHeight is offering you the most updated, relevant, and high-value real-world projects as part of the training program. This way, you can implement the learning that you have acquired in real-world industry setup. All training comes with multiple projects that thoroughly test your skills, learning, and practical knowledge, making you completely industry ready.
  • You will work on highly exciting projects in the domains of high technology, ecommerce, marketing, sales, networking, banking, insurance, etc. After completing the projects successfully, your skills will be equal to 6 months of rigorous industry experience.

All our mentors are highly qualified and experience professionals. All have at least 15-20 yrs. of development experience in various technologies and are trained by GreaterHeight Academy to deliver interactive training to the participants.

Yes, we do. As the technology upgrades, we do update our content and provide your training on latest version of that technology.

  • All online training classes are recorded. You will get the recorded sessions so that you can watch the online classes when you want. Also, you can join other class to do your missing classes.

OUR POPULAR COURSES

Data  Analytics and Visualization With Python

Data Analytics and Visualization With Python

Advanced developments of expertise in cleaning, transforming, and modelling data to obtain insight into  corporate decision making as a Senior Data Analyst - Using Python.

View Details
Data Science Training Masters Program

Data Science Training Masters Program

Learn Python, Statistics, Data Preparation, Data Analysis, Querying Data, Machine Learning, Clustering, Text Processing, Collaborative Filtering, Image Processing, etc..

View Details
Microsoft Azure DP-100 Data Science

Microsoft Azure DP-100 Data Science

You will Optimize & Manage Models,   Perform Administration by using T-SQL, Run Experiment & Train Models, Deploy & Consume Models, and Automate Tasks.

View Details
Machine Learning using Python

Machine Learning using Python

Learn Data Science and Machine Learning from scratch, get hired, and have fun along the way with the most modern, up-to-date Data Science course on.

View Details
Microsoft Azure PL-300 Data Analysis

Microsoft Azure PL-300 Data Analysis

You will learn how to Design a Data Model in Power BI, Optimize Model Performance,   Manage Datasets in Power BI and Create Paginated Reports.

View Details
Microsoft Azure DP-203 Data Engineer

Microsoft Azure DP-203 Data Engineer

You will learn Batch & Real Time Analytics, Azure Synapse Analytics, Azure Databricks,   Implementing Security and ETL & ELT Pipelines.

View Details

The GreaterHeight Advantage

0+

Accredited Courseware

Most of our training courses are accredited by the respective governing bodies.

0+

Assured Classes

All our training courses are assured & scheduled dates are confirmed to run by SME.

0+

Expert Instructor Led Programs

We have well equipped and highly experienced instructors to train the professionals.

OUR CLIENTS

We Have Worked With Some Amazing Companies Around The World

Our awesome clients we've had the pleasure to work with!


Client 01
Client 02
Client 03
Client 04
Client 05
Client 06
Client 07
  • Contact info
  • Facebook
  • WhatsApp
  • (+86)1234567809
  • creative@gmail.com
  • Back to top