Natural Language Process NLP Notes

Key steps Follow

1) Import Library

2) Import data

3) Clean and compress (Data Preprocessing)

    1) Sentence Tokenization (Paragraphs to Sentence) (. ! ? ;)
    2) Word Tokenization (Sentance to word) (space, _ :)
    3) Punctual and Special character removal and Making text lowercase
    4) Stop word removal (is, a, an, the, them, couldn't, ....)
    5) Lemmatization and Stemming (Extract only the root words from data)

4) Exploratory Data Analysis (EDA)

    1) Generate a Word Cloud by plotting the data.

5) Encoding data (Text data to Numerical data)

TF-IDF (Term Frequency-Inverse Document Frequency)
Score of words in a particular row = (Number of times words in row / Total number of words in row) * log (Number of rows / Number of rows containing the word in them)

6) Apply Machine Learning

1) Split the data

     Features (X-axis) (2D Matrix)
     Targets (Y - axis) (1D Array)
     Train, Test, Split, Random state

2) Scaling the data

       1) Import model
       2) Initialize
       3) Fit (Learning process)
       4) Transform

3) Apply Machine learning algorithm

       1) Import model
       2) Initialize
       3) Fit (learning process)
       4) Predict

4) Evaluation matric (Check whether the model is correct or not)

       1) Regression - The evaluation metric for regression is R^2 between minus infinite to 1 
        A higher the R^2 is a better model
        
       2) Classification - The evaluation metric for classification is
           1) Accuracy score [ Higher accuracy is a better model (The value should be near 1) ]
           2) F1 score [ F1 score between 0 (low) to 1 (high), a Higher F1 score is better for the model ]

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Natural Language Process (NLP) Notes.ipynb		Natural Language Process (NLP) Notes.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Natural Language Process NLP Notes

Key steps Follow

1) Import Library

2) Import data

3) Clean and compress (Data Preprocessing)

4) Exploratory Data Analysis (EDA)

5) Encoding data (Text data to Numerical data)

6) Apply Machine Learning

1) Split the data

2) Scaling the data

3) Apply Machine learning algorithm

4) Evaluation matric (Check whether the model is correct or not)

7) Sentiment analysis

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Natural Language Process NLP Notes

Key steps Follow

1) Import Library

2) Import data

3) Clean and compress (Data Preprocessing)

4) Exploratory Data Analysis (EDA)

5) Encoding data (Text data to Numerical data)

6) Apply Machine Learning

1) Split the data

2) Scaling the data

3) Apply Machine learning algorithm

4) Evaluation matric (Check whether the model is correct or not)

7) Sentiment analysis

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages