• LOGIN
  • No products in the cart.

See the in-depth capabilities of Python’s web scraping tools.


Course Description

Python is a high-level programming language used for general-purpose programming. It has a design philosophy which emphasizes code readability and a syntax which allows programmers to express concepts in fewer lines of code than possible in languages such as C++ or Java.
This course is a rich collection of recipes that will come in handy when you are scraping a website using Python, addressing your usual and unusual problems while scraping websites by diving deep into the capabilities of Python’sweb scraping tools such as Selenium, BeautifulSoup, and urllib2. The course will start with showing how to use selenium module for scraping by setting up a web driver, debugging with the Console and downloading files and streamlining with a Headless Browser (PhantomJS). The course will then move on to demonstrate how to do parsing with Beautifulsoup which would include an introduction to the BeautifulSoupObjects, Nested Selectors and Regular Expressions Basics and how to do UTF-8 Encoding. The course will finally end by showing how to do fetching with urlib2 by using the developer tools Network tab, how to bypass the browser and retrieve files.
By The end of this course, you will be successfully able to understand the in-depth capabilities of python web scraping tools.

Learning Outcomes

  • Use the Selenium module and scrape with Selenium
  • Find out how to set up a web driver
  • Perform debugging with the console and download files
  • Learn to work with Nested selectors and regular expression basics
  • Discover how to perform parsing with BeautifulSoup
  • Understand authentication with Wireshark.
  • Master the use of URL Query Strings and HTTP Requests (GET and POST)
  • Implement streamlining with headless browser

Pre-requisite

  • Basic foundation in Python programming
  • Basic Python + pip knowledge suggested
  • Web development experience beneficial, but not mandatory
  • Software’s used – Python (3.3+), Pip package manager, Windows 10, PhantomJS, and Selenium

Who is this course intended for?

This course is for Python developers and web analysts who want to improve their web scraping skills in Python. It is ideal for those who are looking for reference guide they can use to solve any challenges encountered while web scraping in Python.

Your Instructor

Packt Publishing

Packt has been committed to developer learning since 2004. A lot has changed in software since then – but Packt has remained responsive to these changes, continuing to look forward at the trends and tools defining the way we work and live. And how to put them to work.

With an extensive library of content – more than 4000 books and video courses -Packt’s mission is to help developers stay relevant in a rapidly changing world. From new web frameworks and programming languages to cutting-edge data analytics, and DevOps, Packt takes software professionals in every field to what’s important to them now.

From skills that will help you to develop and future-proof your career to immediate solutions to everyday tech challenges, Packt is a go-to resource to make you a better, smarter developer.

Course Curriculum

Scraping with Selenium
The Course Overview 00:00:00
When to Web Scrape 00:00:00
What Makes up a Website 00:00:00
How to Interact with a Website 00:00:00
Using the Selenium Module 00:00:00
Ethical Web Scraping 00:00:00
Parsing with BeautifulSoup
Requesting HTML 00:00:00
Using the BeautifulSoup Module 00:00:00
Example: Parsing Wikipedia 00:00:00
Fetching the urlib2 and API’s
Bypassing the Browser 00:00:00
Introduction to APIs 00:00:00
Working with APIs 00:00:00

Course Reviews

N.A

ratings
  • 5 stars0
  • 4 stars0
  • 3 stars0
  • 2 stars0
  • 1 stars0

No Reviews found for this course.

TAKE THIS COURSE
  • $99.00 $10.00
  • UNLIMITED ACCESS
  • Course Certificate
1 STUDENTS ENROLLED
© Learnsector