A2. Scraping Your First Web Page with Python (Janani Ravi, 2019)
1. Course Overview:
1. Course Overview 00:00:00
2. Getting Started with Web Scraping:
01. Version Check 00:01:45
02. Module Overview 00:02:02
03. Prerequisites and Course Outline 00:03:10
04. HTTP Requests and Responses 00:04:31
05. Web Scraping 00:10:17
06. HTTP Client Libraries 00:12:41
07. Making GET Requests Using httplib2 00:17:02
08. Making OPTIONS POST PUT Requests with httplib2 00:24:20
09. Handling Redirects with httplib2 00:28:28
10. Making HTTP Requests and Parsing URLs Using urllib 00:32:02
11. GET and POST Requests Using the Requests Library 00:39:31
12. Handling Redirects with the Requests Library 00:44:07
13. Module Summary 00:47:24
3. Working with the Parse Tree in BeautifulSoup:
01. Module Overview 00:48:41
02. The HTML Parse Tree 00:49:57
03. Beautiful Soup for HTML Parsing 00:53:35
04. Introducing Beautiful Soup 00:55:38
05. Extracting Specific Page Elements 01:01:00
06. Filtering Elements Using Find and Find All 01:07:18
07. Searching and Filtering Using Custom Functions 01:14:31
08. Extracting Links from a Page 01:17:21
09. Using a Soup Strainer to Parse a Subset of a Document 01:23:23
10. Module Summary 01:27:09
4. Selecting Elements Using the Scrapy Shell:
1. Module Overview 01:28:22
2. Parsing Web Content 01:29:27
3. Introducing Scrapy 01:31:47
4. Getting Started with Scrapy 01:35:45
5. Introducing the Scrapy Shell 01:39:58
6. Selecting Elements Using CSS Selectors 01:44:27
7. Advanced Selections Using CSS Selectors 01:51:20
8. Selecting Elements Using XPath Selectors 01:56:33
9. Module Summary 02:03:15
5. Scraping Web Sites Using Scrapy Spiders:
1. Module Overview 02:04:22
2. How Scrapy Works 02:05:29
3. Creating Your First Custom Spider 02:08:46
4. Writing Scraped Contents to a File 02:15:49
5. Exploring Items Using the Scrapy Shell 02:18:15
6. Using Items to Store Extracted Content 02:22:11
7. Using Item Loaders and Input and Output Processors for Scraped Data 02:26:32
8. Using Pipelines to Transform Scraped Data 02:33:35
9. Module Summary 02:38:19