This project is my deep dive into data wrangling, a course from Udacity's Data Analyst Nanodegree program.
The standard data wrangling process can be divided into four parts:
- Gathering data: collect enough data, for example by downloading files, calling an API, or using a web scraper.
- Assessing data: identify the quality and tidiness issues in the dataset.
- Cleaning and tidying data: apply whatever methods are needed to make the data ready for further analysis.
- Saving data: store the result as a CSV file or in a database.
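The four steps above can be sketched in pandas. This is a minimal illustration only: the in-memory table stands in for data that would normally be gathered via requests or an API, and the column names and cleaning rules are hypothetical, not the project's actual data.

```python
import pandas as pd

# Gather: in a real run this would come from requests.get(...) or an API call;
# a small in-memory table stands in for the downloaded dataset here.
raw = pd.DataFrame({
    "id": [1, 2, 2, 3, 4],
    "rating": ["10/10", "12/10", "12/10", None, "9/10"],
})

# Assess: look for quality issues programmatically
n_dupes = raw.duplicated().sum()        # fully duplicated rows
n_missing = raw["rating"].isna().sum()  # rows missing a rating

# Clean: drop duplicates and rows missing required fields, then
# split the rating string into numeric numerator/denominator columns
clean = raw.drop_duplicates().dropna(subset=["rating"]).copy()
clean[["numerator", "denominator"]] = (
    clean["rating"].str.split("/", expand=True).astype(int)
)

# Save: write the tidy result to a CSV file
# (DataFrame.to_sql would target a database instead)
clean.to_csv("clean_data.csv", index=False)
```

The same gather/assess/clean/save sequence applies whatever the source is; only the gathering step changes between a downloaded file, an API response, and scraped HTML.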
I used requests, APIs, pandas, NumPy, and matplotlib for the analysis, and ran all the code in a Jupyter Notebook.
I followed the project instructions step by step to gather, assess, clean, and save the data.
This project is Hao Xu's Udacity Nanodegree project. The datasets and instructions all come from Udacity.