Data Science Week 2: Visualizing Data

Visualizing data

This week I focused on getting through Visualizing Data. At the time of this post I’m over halfway through it, with 55% left to complete on the second half of the lessons.

In the interest of time and getting to items that I’m less familiar with, I’ve decided to focus on the mandatory sections and then come back later to complete the optional (OPT) sections. As I am doing both this and the FEND at the same time, I need to make the best use of my time. Once I finish the core concepts, I’ll go back and complete anything optional if I have the time.

Back to the topic of Visualizing Data. The items that I’ve covered this week are:

    • Bin sizes
    • Frequency
    • Histograms
    • Distribution
    • Skewness (is this really a word?)
    • Normal distribution
    • Tables vs. Histograms
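
For anyone who wants to see a few of these ideas concretely, here is a minimal sketch in Python of how bin size changes a frequency table and the histogram built from it. This is not from the course material: the ages list is made-up data, and the helper names frequency_table and text_histogram are my own, chosen just for illustration.

```python
from collections import Counter

# Hypothetical sample: ages of 12 survey respondents (made-up numbers).
ages = [21, 22, 25, 27, 31, 33, 34, 38, 41, 45, 52, 67]


def frequency_table(values, bin_size):
    """Count how many values fall into each bin [start, start + bin_size)."""
    counts = Counter((v // bin_size) * bin_size for v in values)
    # Sort by bin start so the table reads left to right like a histogram.
    return dict(sorted(counts.items()))


def text_histogram(table, bin_size):
    """Render a frequency table as rows of '#' marks, one row per bin."""
    rows = []
    for start, count in table.items():
        label = f"{start}-{start + bin_size - 1}"
        rows.append(f"{label:<6} {'#' * count}")
    return "\n".join(rows)


# Same data, two bin sizes: smaller bins show more detail,
# larger bins smooth the shape out.
table_10 = frequency_table(ages, 10)
table_20 = frequency_table(ages, 20)

print(text_histogram(table_10, 10))
print()
print(text_histogram(table_20, 20))
```

With a bin size of 10 the counts trail off toward the older ages, which is the kind of shape the course calls positively skewed. Note that this sketch simply skips any bin with no values in it; a real plotting tool would normally show those as empty bars.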

Thankfully, almost all of the course so far has been review for me from prior statistics courses. I imagine that for folks taking this course with no prior statistics experience, learning all the terminology might be challenging.

If you’re wondering why I don’t provide more detail here about what I’m learning, keep in mind that so far this is all review for me. This blog is about what I’m learning, not about what I already know. If something pops up that I get stuck on or have to noodle on, I’ll definitely mention it here. At some point when I’m not so pressed for time I may also go back and flesh out individual topics. As we get further into the course I expect to go into much more detail, as I’ll be learning new things once we get to the Python & SQL sections.

Today I plan to finish up the last section on Visualizing Data and next week my focus will be to complete Central Tendency & Variability.

Current Progress

Intro to Descriptive Statistics – Week 1
Intro to Research Methods – Week 1
PS 1A: Intro to Research Methods – Week 1
OPT PS 1B: Additional Practice
Visualizing Data – Week 2
PS 2A: Visualizing Data – Week 2 (45% complete)
OPT PS 2B: Additional Practice
Google Spreadsheet Tutorial – Week 1
Central Tendency
PS 3A: Central Tendency
OPT PS 3B: Additional Practice
Variability
PS 4: Variability
Standardizing
PS 5A: Standardizing
OPT PS 5B: Additional Practice
Normal Distribution
PS 6: Normal Distribution
Sampling Distributions
PS 7: Sampling Distributions
Advanced concepts: Python & SQL
Why Python Programming
Data Types & Operators
Control Flow
Functions
Scripting
Basic SQL
SQL Joins
SQL Aggregations
SQL Subqueries & Temporary Tables
SQL Data Cleaning