Skip to content

Latest commit

 

History

History
43 lines (31 loc) · 2.2 KB

File metadata and controls

43 lines (31 loc) · 2.2 KB

Data Science as a process

Data science is a process of discovery. Machine learning and Deep learning are techniques used in data science to discover patterns within data.

Existing MOOC specializations

Course Environment Length Notes
Data Science Specialization R 10 courses
Microsoft Professional Program Data Science R, Python, Azure 9 courses

Overview

Course Environment Notes
Data Science Process R, Python, Azure

Data science in three stages

Data science is a process of asking questions to your data, and getting from raw data to a statistically valid answer to those questions. It involves getting useable data, exploring the data by visualizing it or calculating summary statistics, and then establishing a hypothesis and running an experiment to test it.

1. Preparing your data

How to clean up data. If you've worked in Business Intelligence before, you can probably skip this section.

Course Environment Notes
Getting and cleaning data R
Introduction to data science in python Python

2. Exploring your data

Much of data science begins with intuition. Often the best intuitions come from looking at the data.

Course Environment Notes
Data Analysis and Visualization Python, SAS
Plotting, Charting and data visualization Python

3. Running an experiment

This is where you learn about the science in data science.

Course Environment Notes
Data Science in Real Life None
Designing Experiments ? Focussed on User-Experience design