-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathNotes
More file actions
37 lines (34 loc) · 1.26 KB
/
Notes
File metadata and controls
37 lines (34 loc) · 1.26 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
Why python for ML?
->Easy, Fast enough,gross plateform,ML libraries
input->data output->rules for proccessing on data
features :- To identify the labels
feature->ML algorithm(processing on data) -> labels
for data - UCI ML repo
In ML input is features and output are labels
Two types of learning
1. Supervised learning:
i.e, Linear regression classifier
1. already tagged data
2. features and labels are present
Two categories:
1. classification(returns already existing labels)
2. regression(returns value of new label)
2. Unsupervised learning
All machine knows is the data in front of it. No features! No labels!
1.Data not already tagged
2. features and labels are not present
3. training not done
4. clustering and association
i.e, Hierarchical clustering
Modules: sklearn, numpy, matplotlib,pandas
sklearn: for datasets
numpy: Array faster than list
matplotlib: to plot graph
pandas: dataframe ,brings data to the RAM for faster access
steps in typical ML problem
1. problem identification
2. Data collection and validation
3. Modal building
4. Feedback
training and test spliting:
Given data usage = (To prepare/train modal) + (To test the modal)