From 50b122bc0f6dea68db4905ba044fd382bf2c5d19 Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh Date: Tue, 8 Feb 2022 16:17:41 +0530 Subject: [PATCH 01/18] added web scarping files --- Web Scarping/README.md | 1 + Web Scarping/Web Scarping.ipynb | 1 + 2 files changed, 2 insertions(+) create mode 100644 Web Scarping/README.md create mode 100644 Web Scarping/Web Scarping.ipynb diff --git a/Web Scarping/README.md b/Web Scarping/README.md new file mode 100644 index 0000000..164faca --- /dev/null +++ b/Web Scarping/README.md @@ -0,0 +1 @@ +# Web Scarping using Python diff --git a/Web Scarping/Web Scarping.ipynb b/Web Scarping/Web Scarping.ipynb new file mode 100644 index 0000000..370935e --- /dev/null +++ b/Web Scarping/Web Scarping.ipynb @@ -0,0 +1 @@ +print("Hello Python") From 7141444e9f7404cdf2121db67e759f17f84a6253 Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Tue, 8 Feb 2022 17:00:46 +0530 Subject: [PATCH 02/18] Update README.md --- Web Scarping/README.md | 33 +++++++++++++++++++++++++++++++++ 1 file changed, 33 insertions(+) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index 164faca..22a8e5f 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -1 +1,34 @@ # Web Scarping using Python +## Course Outline +### Introduction + - Introduction + - Library for scraping: BeautifulSoup, Selenium, Scrapy + - Status Codes + - HTML for Web Scraping: Tags & Elements + - HTML for web scraping: Tree Structure +### Introduction to BeautifulSoup + - Installation & Basics of BeautifulSoup + - Intro to Requests + - Get html form website and html parser + - Scrape Single page + - Scrape multiple page + - Exporting data to CSV file +### XPath & CSS Selector + - XPath Systax, Functions and Operators + - CSS Selector + - Test Your XPath + - Navigation using XPath(going up and down) + - Special Characters and Systax +### Introduction to Selenium + - How to Identify Java Script Drived Websites + - Installation & Chrome Driver + - Chrome Driver Options and Creating the driver + - How to Find Elements with Selenium + - Clicking Button + - Extracting Data from Table + - Selecting Elements within a Dropdown +### Project #1: Twitter Bot + - How to Login to Websites + - Building a Function + - Infinite Scrolling + - Scraping the website with infinite Scrolling From 311b7fa19d1a5e4c34a13e277e0bc8e54e07e268 Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Tue, 8 Feb 2022 17:19:41 +0530 Subject: [PATCH 03/18] Update README.md --- Web Scarping/README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index 22a8e5f..b3906a7 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -1,4 +1,4 @@ -# Web Scarping using Python +# Web Scraping using Python ## Course Outline ### Introduction - Introduction From 9e7f883526f640a3d260f067f87b9e2e4d82ac6c Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Tue, 8 Feb 2022 17:38:43 +0530 Subject: [PATCH 04/18] Update README.md --- Web Scarping/README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index b3906a7..3851ae5 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -3,7 +3,7 @@ ### Introduction - Introduction - Library for scraping: BeautifulSoup, Selenium, Scrapy - - Status Codes + - Types of HTTP Status Codes - HTML for Web Scraping: Tags & Elements - HTML for web scraping: Tree Structure ### Introduction to BeautifulSoup From ecd1ad450890010317bc4d4cf9c717fed89ad0bb Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Wed, 9 Feb 2022 11:29:52 +0530 Subject: [PATCH 05/18] Update README.md --- Web Scarping/README.md | 55 +++++++++++++++++++++--------------------- 1 file changed, 28 insertions(+), 27 deletions(-) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index 3851ae5..83398d3 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -1,34 +1,35 @@ # Web Scraping using Python ## Course Outline ### Introduction - - Introduction - - Library for scraping: BeautifulSoup, Selenium, Scrapy - - Types of HTTP Status Codes - - HTML for Web Scraping: Tags & Elements - - HTML for web scraping: Tree Structure +- Introduction + - What is Web Scraping + - Why Web Scraping + - How to Scrap(Library for scraping: BeautifulSoup, Selenium, Scrapy) +- Types of HTTP Status Codes +- HTML for Web Scraping: Tags & Elements +- HTML for web scraping: Tree Structure ### Introduction to BeautifulSoup - - Installation & Basics of BeautifulSoup - - Intro to Requests - - Get html form website and html parser - - Scrape Single page - - Scrape multiple page - - Exporting data to CSV file +- Installation & Basics of BeautifulSoup +- Intro to Requests +- Get html form website and html parser +- Scrape Single page +- Scrape multiple Links +- Exporting data to CSV file ### XPath & CSS Selector - - XPath Systax, Functions and Operators - - CSS Selector - - Test Your XPath - - Navigation using XPath(going up and down) - - Special Characters and Systax +- XPath Systax, Functions and Operators +- CSS Selector +- Navigation using XPath(going up and down) +- Special Characters and Systax ### Introduction to Selenium - - How to Identify Java Script Drived Websites - - Installation & Chrome Driver - - Chrome Driver Options and Creating the driver - - How to Find Elements with Selenium - - Clicking Button - - Extracting Data from Table - - Selecting Elements within a Dropdown +- How to Identify Java Script Drived Websites +- Installation & Chrome Driver +- Chrome Driver Options and Creating the driver +- How to Find Elements with Selenium +- Clicking Button +- Extracting Data from Table +- Selecting Elements within a Dropdown ### Project #1: Twitter Bot - - How to Login to Websites - - Building a Function - - Infinite Scrolling - - Scraping the website with infinite Scrolling +- How to Login to Websites +- Building a Function +- Infinite Scrolling +- Scraping the website with infinite Scrolling From fd3e12363a088f901cc02dbba2862e147fe7d965 Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Wed, 9 Feb 2022 11:50:03 +0530 Subject: [PATCH 06/18] Update README.md --- Web Scarping/README.md | 24 +++++++++++++++++------- 1 file changed, 17 insertions(+), 7 deletions(-) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index 83398d3..f7bbda6 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -1,15 +1,18 @@ # Web Scraping using Python ## Course Outline -### Introduction +### Getting started with web scraping - Introduction - What is Web Scraping - - Why Web Scraping + - Why we need Web Scraping - How to Scrap(Library for scraping: BeautifulSoup, Selenium, Scrapy) - Types of HTTP Status Codes - HTML for Web Scraping: Tags & Elements - HTML for web scraping: Tree Structure -### Introduction to BeautifulSoup -- Installation & Basics of BeautifulSoup +### BeautifulSoup +- Introduction + - Why we should use BeautifulSoup + - Installation + - Basics of BeautifulSoup - Intro to Requests - Get html form website and html parser - Scrape Single page @@ -20,9 +23,11 @@ - CSS Selector - Navigation using XPath(going up and down) - Special Characters and Systax -### Introduction to Selenium -- How to Identify Java Script Drived Websites -- Installation & Chrome Driver +### Selenium +- Introduction + - What is Selenium and When we should use it + - How to Identify Java Script Drived Websites + - Installation & Chrome Driver - Chrome Driver Options and Creating the driver - How to Find Elements with Selenium - Clicking Button @@ -33,3 +38,8 @@ - Building a Function - Infinite Scrolling - Scraping the website with infinite Scrolling + +### Wrap up +- What did we learn in this module? +- Other resources you can follow +- What next you can do From c14d610b9ae844bbb2bc2edc31454efdac4d2867 Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Fri, 11 Feb 2022 18:00:43 +0530 Subject: [PATCH 07/18] Update README.md --- Web Scarping/README.md | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index f7bbda6..e81afc8 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -1,9 +1,10 @@ # Web Scraping using Python ## Course Outline ### Getting started with web scraping -- Introduction +- Course Overview +- Exploring Web Scraping with Python - What is Web Scraping - - Why we need Web Scraping + - Why do we need Web Scraping - How to Scrap(Library for scraping: BeautifulSoup, Selenium, Scrapy) - Types of HTTP Status Codes - HTML for Web Scraping: Tags & Elements From a27c2a09cd1a40a708ac75e78168504be895dd62 Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Thu, 17 Feb 2022 16:38:59 +0530 Subject: [PATCH 08/18] Update README.md --- Web Scarping/README.md | 82 ++++++++++++++++++++---------------------- 1 file changed, 39 insertions(+), 43 deletions(-) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index e81afc8..f554cc6 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -1,46 +1,42 @@ # Web Scraping using Python ## Course Outline -### Getting started with web scraping -- Course Overview -- Exploring Web Scraping with Python - - What is Web Scraping - - Why do we need Web Scraping - - How to Scrap(Library for scraping: BeautifulSoup, Selenium, Scrapy) -- Types of HTTP Status Codes -- HTML for Web Scraping: Tags & Elements -- HTML for web scraping: Tree Structure -### BeautifulSoup -- Introduction - - Why we should use BeautifulSoup - - Installation - - Basics of BeautifulSoup -- Intro to Requests -- Get html form website and html parser -- Scrape Single page -- Scrape multiple Links -- Exporting data to CSV file -### XPath & CSS Selector -- XPath Systax, Functions and Operators -- CSS Selector -- Navigation using XPath(going up and down) -- Special Characters and Systax -### Selenium -- Introduction - - What is Selenium and When we should use it - - How to Identify Java Script Drived Websites - - Installation & Chrome Driver -- Chrome Driver Options and Creating the driver -- How to Find Elements with Selenium -- Clicking Button -- Extracting Data from Table -- Selecting Elements within a Dropdown -### Project #1: Twitter Bot -- How to Login to Websites -- Building a Function -- Infinite Scrolling -- Scraping the website with infinite Scrolling -### Wrap up -- What did we learn in this module? -- Other resources you can follow -- What next you can do +### 1. Course Overview +- [ ] 1. Course Overview +### 2. Getting Started with BeautifulSoup +- [ ] 1. Module Overview and Prerequisites +- [ ] 2. HTTP Requests and Responses +- [ ] 3. GET and POST Requests Using the Requests Library +- [ ] 4. Handling Redirects with the Requests Library +- [ ] 5. Module Summary +### 3. Getting Started with BeautifulSoup +- [ ] 1. Module Overview +- [ ] 2. The HTML Parse Tree +- [ ] 3. Introducing Beautiful Soup +- [ ] 4. Extracting Specific Page Elements +- [ ] 5. Filtering Elements Using Find and Find All +- [ ] 6. Searching and Filtering Using Custom Functions +- [ ] 7. Extracting Links from a Page +- [ ] 8. XML and XPath +- [ ] 9. Performing Advanced Search on the Parse Tree +- [ ] 10. CSS Selectors Using Soup Sieve +- [ ] 11. Using XPath to Navigate an XML Tree +- [ ] 12. Module Summary +### 4. Exploring Selenium with Python +- [ ] 1. Module Overview +- [ ] 2. How to Identify Java Script Driven Websites +- [ ] 3. Installing Selenium and ChromeDriver +- [ ] 4. Importing Libraries and Creating The Driver +- [ ] 5. How to Find Elements with Selenium +- [ ] 6. Clicking on a Button +- [ ] 7. Extracting Data from a Table +- [ ] 8. Selecting Elements within a Dropdown +- [ ] 9. Module Summary +### 5. Project: Twitter Bot +- [ ] 1. Module Overview +- [ ] 2. How to Login to Websites +- [ ] 3. Building a Function +- [ ] 4. Infinite Scrolling +- [ ] 5. Scraping the Website with Infinite Scrolling +- [ ] 6. Module Summary +### 6. Web Scraping Wrap Up From b0aae0ae18dc9c39e1ab3683c4b4ad13131ad8c6 Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Thu, 17 Feb 2022 17:56:44 +0530 Subject: [PATCH 09/18] Update README.md --- Web Scarping/README.md | 5 ++++- 1 file changed, 4 insertions(+), 1 deletion(-) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index f554cc6..348348b 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -2,8 +2,11 @@ ## Course Outline ### 1. Course Overview -- [ ] 1. Course Overview +- [x] 1. Course Overview ### 2. Getting Started with BeautifulSoup +- [x] 1. What is Web Scraping +- [x] 2. Real Life Example of Web Scraping +- [x] 3. Different Libraries for Web Scraping - [ ] 1. Module Overview and Prerequisites - [ ] 2. HTTP Requests and Responses - [ ] 3. GET and POST Requests Using the Requests Library From 53b87ce0997d57c16b7bb00c6bf31fd735bac1e8 Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Thu, 17 Feb 2022 18:07:41 +0530 Subject: [PATCH 10/18] Update README.md --- Web Scarping/README.md | 19 ++++++++++--------- 1 file changed, 10 insertions(+), 9 deletions(-) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index 348348b..08f6ee8 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -2,16 +2,17 @@ ## Course Outline ### 1. Course Overview -- [x] 1. Course Overview +- [ ] 1. Course Overview ### 2. Getting Started with BeautifulSoup -- [x] 1. What is Web Scraping -- [x] 2. Real Life Example of Web Scraping -- [x] 3. Different Libraries for Web Scraping -- [ ] 1. Module Overview and Prerequisites -- [ ] 2. HTTP Requests and Responses -- [ ] 3. GET and POST Requests Using the Requests Library -- [ ] 4. Handling Redirects with the Requests Library -- [ ] 5. Module Summary +- [ ] 5. Module Overview and Prerequisites +- [ ] 1. What is Web Scraping +- [ ] 2. Real Life Example of Web Scraping +- [ ] 3. Different Libraries for Web Scraping +- [ ] 4. Basics of HTML +- [ ] 5. HTTP Requests and Responses +- [ ] 6. GET and POST Requests Using the Requests Library +- [ ] 7. Handling Redirects with the Requests Library +- [ ] 8. Module Summary ### 3. Getting Started with BeautifulSoup - [ ] 1. Module Overview - [ ] 2. The HTML Parse Tree From 532aac1bd9c72353688d34414bad0ffb45559985 Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Thu, 17 Feb 2022 18:11:05 +0530 Subject: [PATCH 11/18] Update README.md --- Web Scarping/README.md | 39 +++++++++++++++++++-------------------- 1 file changed, 19 insertions(+), 20 deletions(-) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index 08f6ee8..4e10b7c 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -4,28 +4,27 @@ ### 1. Course Overview - [ ] 1. Course Overview ### 2. Getting Started with BeautifulSoup -- [ ] 5. Module Overview and Prerequisites -- [ ] 1. What is Web Scraping -- [ ] 2. Real Life Example of Web Scraping -- [ ] 3. Different Libraries for Web Scraping -- [ ] 4. Basics of HTML -- [ ] 5. HTTP Requests and Responses -- [ ] 6. GET and POST Requests Using the Requests Library -- [ ] 7. Handling Redirects with the Requests Library -- [ ] 8. Module Summary +- [ ] 1. Module Overview and Prerequisites +- [ ] 2. What is Web Scraping +- [ ] 3. Real Life Example of Web Scraping +- [ ] 4. Different Libraries for Web Scraping +- [ ] 5. Basics of HTML and HTML Parse Tree +- [ ] 6. HTTP Requests and Responses +- [ ] 7. GET and POST Requests Using the Requests Library +- [ ] 8. Handling Redirects with the Requests Library +- [ ] 9. Module Summary ### 3. Getting Started with BeautifulSoup - [ ] 1. Module Overview -- [ ] 2. The HTML Parse Tree -- [ ] 3. Introducing Beautiful Soup -- [ ] 4. Extracting Specific Page Elements -- [ ] 5. Filtering Elements Using Find and Find All -- [ ] 6. Searching and Filtering Using Custom Functions -- [ ] 7. Extracting Links from a Page -- [ ] 8. XML and XPath -- [ ] 9. Performing Advanced Search on the Parse Tree -- [ ] 10. CSS Selectors Using Soup Sieve -- [ ] 11. Using XPath to Navigate an XML Tree -- [ ] 12. Module Summary +- [ ] 2. Introducing Beautiful Soup +- [ ] 3. Extracting Specific Page Elements +- [ ] 4. Filtering Elements Using Find and Find All +- [ ] 5. Searching and Filtering Using Custom Functions +- [ ] 6. Extracting Links from a Page +- [ ] 7. XML and XPath +- [ ] 8. Performing Advanced Search on the Parse Tree +- [ ] 0. CSS Selectors Using Soup Sieve +- [ ] 10. Using XPath to Navigate an XML Tree +- [ ] 11. Module Summary ### 4. Exploring Selenium with Python - [ ] 1. Module Overview - [ ] 2. How to Identify Java Script Driven Websites From 6bbc436b4e0810b3be3155d0a8f47624ac62a130 Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Thu, 17 Feb 2022 18:12:57 +0530 Subject: [PATCH 12/18] Update README.md --- Web Scarping/README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index 4e10b7c..e5e1aea 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -3,7 +3,7 @@ ### 1. Course Overview - [ ] 1. Course Overview -### 2. Getting Started with BeautifulSoup +### 2. Getting Started with Web Scraping - [ ] 1. Module Overview and Prerequisites - [ ] 2. What is Web Scraping - [ ] 3. Real Life Example of Web Scraping From 7263fe46d140523ead7993ca482c3c3f5e622a09 Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Fri, 18 Feb 2022 15:23:15 +0530 Subject: [PATCH 13/18] Update README.md --- Web Scarping/README.md | 15 +++++++-------- 1 file changed, 7 insertions(+), 8 deletions(-) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index e5e1aea..e9bf48a 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -5,14 +5,13 @@ - [ ] 1. Course Overview ### 2. Getting Started with Web Scraping - [ ] 1. Module Overview and Prerequisites -- [ ] 2. What is Web Scraping -- [ ] 3. Real Life Example of Web Scraping -- [ ] 4. Different Libraries for Web Scraping -- [ ] 5. Basics of HTML and HTML Parse Tree -- [ ] 6. HTTP Requests and Responses -- [ ] 7. GET and POST Requests Using the Requests Library -- [ ] 8. Handling Redirects with the Requests Library -- [ ] 9. Module Summary +- [ ] 2. What is Web Scraping & Real Life Example of Web Scraping +- [ ] 3. Different Libraries for Web Scraping +- [ ] 4. Basics of HTML and HTML Parse Tree +- [ ] 5. HTTP Requests and Responses +- [ ] 6. GET and POST Requests Using the Requests Library +- [ ] 7. Handling Redirects with the Requests Library +- [ ] 8. Module Summary ### 3. Getting Started with BeautifulSoup - [ ] 1. Module Overview - [ ] 2. Introducing Beautiful Soup From 519b575e9b384986ca69ebbd07fd5e24c788cd56 Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Fri, 18 Feb 2022 15:42:02 +0530 Subject: [PATCH 14/18] Update README.md --- Web Scarping/README.md | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index e9bf48a..ca09a0e 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -5,10 +5,10 @@ - [ ] 1. Course Overview ### 2. Getting Started with Web Scraping - [ ] 1. Module Overview and Prerequisites -- [ ] 2. What is Web Scraping & Real Life Example of Web Scraping -- [ ] 3. Different Libraries for Web Scraping -- [ ] 4. Basics of HTML and HTML Parse Tree -- [ ] 5. HTTP Requests and Responses +- [x] 2. What is Web Scraping & Real Life Example of Web Scraping +- [x] 3. Different Libraries for Web Scraping +- [x] 4. Basics of HTML and HTML Parse Tree +- [x] 5. HTTP Requests and Responses - [ ] 6. GET and POST Requests Using the Requests Library - [ ] 7. Handling Redirects with the Requests Library - [ ] 8. Module Summary From 8ac1d3af2b0bb471c98729ec682b9b30b8820bfc Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Fri, 18 Feb 2022 20:06:47 +0530 Subject: [PATCH 15/18] Update README.md --- Web Scarping/README.md | 22 +++++++++++----------- 1 file changed, 11 insertions(+), 11 deletions(-) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index ca09a0e..851e74d 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -9,27 +9,27 @@ - [x] 3. Different Libraries for Web Scraping - [x] 4. Basics of HTML and HTML Parse Tree - [x] 5. HTTP Requests and Responses -- [ ] 6. GET and POST Requests Using the Requests Library -- [ ] 7. Handling Redirects with the Requests Library +- [ ] 6. GET and POST Requests +- [ ] 7. Handling Redirects - [ ] 8. Module Summary ### 3. Getting Started with BeautifulSoup - [ ] 1. Module Overview - [ ] 2. Introducing Beautiful Soup -- [ ] 3. Extracting Specific Page Elements -- [ ] 4. Filtering Elements Using Find and Find All -- [ ] 5. Searching and Filtering Using Custom Functions +- [ ] 3. Extracting Page Elements +- [ ] 4. Finding Elements Using Find and Find All +- [ ] 5. Filtering Using Custom Functions - [ ] 6. Extracting Links from a Page -- [ ] 7. XML and XPath +- [ ] 7. Intro to XML and XPath - [ ] 8. Performing Advanced Search on the Parse Tree -- [ ] 0. CSS Selectors Using Soup Sieve -- [ ] 10. Using XPath to Navigate an XML Tree +- [ ] 0. CSS Selectors +- [ ] 10. XPath to Navigate an XML Tree - [ ] 11. Module Summary ### 4. Exploring Selenium with Python - [ ] 1. Module Overview -- [ ] 2. How to Identify Java Script Driven Websites +- [ ] 2. How to Identify Dynamic Websites - [ ] 3. Installing Selenium and ChromeDriver -- [ ] 4. Importing Libraries and Creating The Driver -- [ ] 5. How to Find Elements with Selenium +- [ ] 4. Creating The Driver +- [ ] 5. Find Elements with Selenium - [ ] 6. Clicking on a Button - [ ] 7. Extracting Data from a Table - [ ] 8. Selecting Elements within a Dropdown From 52f5c43206569a7c87c163067482d071d1697fd2 Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Fri, 18 Feb 2022 20:08:19 +0530 Subject: [PATCH 16/18] Update README.md --- Web Scarping/README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index 851e74d..50a8e50 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -9,8 +9,8 @@ - [x] 3. Different Libraries for Web Scraping - [x] 4. Basics of HTML and HTML Parse Tree - [x] 5. HTTP Requests and Responses -- [ ] 6. GET and POST Requests -- [ ] 7. Handling Redirects +- [x] 6. GET and POST Requests +- [x] 7. Handling Redirects - [ ] 8. Module Summary ### 3. Getting Started with BeautifulSoup - [ ] 1. Module Overview From 71490674347aa38562a350b6d1187653a07a5d7a Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Fri, 18 Feb 2022 21:16:51 +0530 Subject: [PATCH 17/18] Update README.md --- Web Scarping/README.md | 19 ++++++++++--------- 1 file changed, 10 insertions(+), 9 deletions(-) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index 50a8e50..fbcfd98 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -15,15 +15,16 @@ ### 3. Getting Started with BeautifulSoup - [ ] 1. Module Overview - [ ] 2. Introducing Beautiful Soup -- [ ] 3. Extracting Page Elements -- [ ] 4. Finding Elements Using Find and Find All -- [ ] 5. Filtering Using Custom Functions -- [ ] 6. Extracting Links from a Page -- [ ] 7. Intro to XML and XPath -- [ ] 8. Performing Advanced Search on the Parse Tree -- [ ] 0. CSS Selectors -- [ ] 10. XPath to Navigate an XML Tree -- [ ] 11. Module Summary +- [ ] 3. Code: How to use Beautiful Soup +- [ ] 4. Extracting Page Elements +- [ ] 5. Finding Elements Using Find and Find All +- [ ] 6. Filtering Using Custom Functions +- [ ] 7. Extracting Links from a Page +- [ ] 8. Intro to XML and XPath +- [ ] 9. Performing Advanced Search on the Parse Tree +- [ ] 10. CSS Selectors +- [ ] 11. XPath to Navigate an XML Tree +- [ ] 12. Module Summary ### 4. Exploring Selenium with Python - [ ] 1. Module Overview - [ ] 2. How to Identify Dynamic Websites From ecc20805e41daa5cb74db0d0d983e0e91cf47896 Mon Sep 17 00:00:00 2001 From: Purna Chandra Mansingh <42216008+purna135@users.noreply.github.com> Date: Fri, 18 Feb 2022 21:38:06 +0530 Subject: [PATCH 18/18] Update README.md --- Web Scarping/README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/Web Scarping/README.md b/Web Scarping/README.md index fbcfd98..834f8bc 100644 --- a/Web Scarping/README.md +++ b/Web Scarping/README.md @@ -20,6 +20,7 @@ - [ ] 5. Finding Elements Using Find and Find All - [ ] 6. Filtering Using Custom Functions - [ ] 7. Extracting Links from a Page +- [ ] 8. Soup Strainer - [ ] 8. Intro to XML and XPath - [ ] 9. Performing Advanced Search on the Parse Tree - [ ] 10. CSS Selectors