Books Pagination Scraper

This project is a Python web scraping script that extracts detailed book information from every catalogue page of the website:

https://books.toscrape.com

The script automatically navigates through all catalogue pages and collects structured data from individual book detail pages.
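The navigation described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names (fetch_html, iter_book_urls) and the CSS selectors (article.product_pod h3 a, li.next a) are assumptions about how the catalogue pages are marked up. The fetch function is passed in as a parameter so the loop can be tried without network access.

```python
from urllib.parse import urljoin

from bs4 import BeautifulSoup

START_URL = "https://books.toscrape.com/"


def fetch_html(url):
    """Download one page (hypothetical helper, not taken from the project)."""
    # Imported here so the crawl logic below can be exercised offline.
    import requests

    response = requests.get(url, timeout=10)
    response.raise_for_status()
    # Returning bytes lets BeautifulSoup detect the page encoding itself.
    return response.content


def iter_book_urls(start_url=START_URL, fetch=fetch_html):
    """Yield the absolute URL of every book detail page, following 'next' links."""
    page_url = start_url
    while page_url:
        soup = BeautifulSoup(fetch(page_url), "html.parser")

        # Assumed markup: each book sits in <article class="product_pod">.
        for link in soup.select("article.product_pod h3 a"):
            yield urljoin(page_url, link["href"])

        # Assumed markup: the pager exposes <li class="next"><a href="...">.
        next_link = soup.select_one("li.next a")
        page_url = urljoin(page_url, next_link["href"]) if next_link else None
```

The loop stops when a page has no "next" link, so the number of pages does not need to be known in advance.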

Features

Pagination scraping
Multi-page data extraction
Scrapes individual book detail pages
Extracts structured product information
Handles missing descriptions safely
Saves all data into CSV format
Uses urljoin for safe URL handling
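
To illustrate why urljoin matters here: links on the site are relative, and the same book can be linked with a different relative path depending on which page the link appears on. The short example below uses an example book path; the helper name resolve is hypothetical.

```python
from urllib.parse import urljoin


def resolve(page_url, href):
    """Turn a relative href found on page_url into an absolute URL."""
    return urljoin(page_url, href)


# The same book linked from the home page and from a catalogue page.
from_home = resolve(
    "https://books.toscrape.com/index.html",
    "catalogue/a-light-in-the-attic_1000/index.html",
)
from_catalogue = resolve(
    "https://books.toscrape.com/catalogue/page-2.html",
    "a-light-in-the-attic_1000/index.html",
)

# Both resolve to the same absolute URL.
print(from_home == from_catalogue)  # True
```

Joining against the URL of the page currently being parsed, rather than a fixed base string, is what keeps both cases correct.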

Data Extracted

This script extracts:

Book Name
Star Rating
Product Description
UPC
Product Type
Price (excl. tax)
Price (incl. tax)
Tax
Availability
Number of Reviews
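
A rough sketch of how these fields might be pulled from one book detail page is shown below. It is an illustration under assumptions, not the project's actual code: the function name parse_book_detail is hypothetical, and the markup it relies on (an h1 title, a p tag with class star-rating plus a word such as Three, an element with id product_description followed by a paragraph, and a table of th/td rows) is assumed. A missing description yields an empty string instead of raising an error.

```python
from bs4 import BeautifulSoup

# The rating is assumed to be encoded as a word in the tag's class list.
RATING_WORDS = {"One": 1, "Two": 2, "Three": 3, "Four": 4, "Five": 5}


def parse_book_detail(html):
    """Extract one book record from the HTML of a detail page."""
    soup = BeautifulSoup(html, "html.parser")
    record = {}

    title = soup.find("h1")
    record["Book Name"] = title.get_text(strip=True) if title else ""

    rating_tag = soup.find("p", class_="star-rating")
    classes = rating_tag.get("class", []) if rating_tag else []
    record["Star Rating"] = next(
        (RATING_WORDS[name] for name in classes if name in RATING_WORDS), None
    )

    # Some books have no description block; fall back to an empty string.
    header = soup.find(id="product_description")
    paragraph = header.find_next_sibling("p") if header else None
    record["Product Description"] = paragraph.get_text(strip=True) if paragraph else ""

    # Each table row is assumed to hold a label (th) and a value (td).
    for row in soup.select("table tr"):
        label, value = row.find("th"), row.find("td")
        if label and value:
            record[label.get_text(strip=True)] = value.get_text(strip=True)

    return record
```

Table labels are stored exactly as they appear on the page, so they may need to be mapped to the column names used in the CSV.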

Tools Used

Python
Requests
BeautifulSoup
CSV Module
urllib.parse (urljoin)

Output

The script generates:

book_pagination_data.csv

This CSV file contains structured information about all books from the website.
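
For illustration, the CSV step could look something like the sketch below. The helper names (rows_to_csv_text, save_books) are hypothetical and the column order simply follows the Data Extracted list above; the real script may organise this differently.

```python
import csv
import io

FIELDNAMES = [
    "Book Name", "Star Rating", "Product Description", "UPC", "Product Type",
    "Price (excl. tax)", "Price (incl. tax)", "Tax", "Availability",
    "Number of Reviews",
]


def rows_to_csv_text(rows):
    """Render a list of book dicts as CSV text with a header row."""
    buffer = io.StringIO(newline="")
    writer = csv.DictWriter(
        buffer,
        fieldnames=FIELDNAMES,
        restval="",             # missing fields become empty cells
        extrasaction="ignore",  # unexpected keys are dropped, not an error
    )
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def save_books(rows, path="book_pagination_data.csv"):
    """Write the rendered CSV to disk as UTF-8."""
    # newline="" stops Python from altering the line endings csv produced.
    with open(path, "w", newline="", encoding="utf-8") as handle:
        handle.write(rows_to_csv_text(rows))
```

Writing the file as UTF-8 keeps currency symbols in the price columns intact.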

How to Run

Install required libraries:

pip install requests beautifulsoup4

Run the script:

python books_pagination.py

Project Purpose

This project demonstrates advanced web scraping techniques including:

Pagination scraping
Nested page scraping
Structured table data extraction

It was created as part of learning real-world data extraction workflows.
