GitHub - yhamidaddin/clean-chatgpt-html.py: Python code that cleans a saved ChatGPT page from scripts and external references, making it available as a stand-alone html for offline reading.

README for `clean-chatgpt-html.py`

# clean-chatgpt-html.py

A Python script that cleans saved ChatGPT HTML pages by removing unnecessary elements like external references, scripts, and extraneous HTML. It outputs a stand-alone HTML file that does not rely on any external resources (images, CSS, or JavaScript). The output file can be easily viewed offline.

## Table of Contents
- [Overview](#overview)
- [Installation](#installation)
- [Usage](#usage)
- [Features](#features)
- [Dependencies](#dependencies)
- [License](#license)
- [Contact](#contact)

## Overview

The `clean-chatgpt-html.py` script is designed to clean up saved HTML files of ChatGPT outputs by removing unwanted scripts, external references, and tags. The resulting HTML file is simplified, self-contained, and can be viewed offline without dependencies. This is particularly useful for saving and sharing ChatGPT interactions in a clean format, making them portable and easy to read.

## Installation

### Prerequisites

To run this script, you will need the following Python packages:

- `BeautifulSoup` (from `bs4`): A library for parsing HTML content.
- `lxml`: Required for parsing HTML content efficiently (it’s the default parser for BeautifulSoup).

### Step 1: Install Dependencies

You can install the required dependencies using `pip`:

```bash
pip install beautifulsoup4 lxml

Step 2: Download the Script

Clone the repository containing this script or download the Python file directly.

git clone https://github.com/yourusername/clean-chatgpt-html.git

Step 3: Ensure Python Environment

Make sure you are running this script in an environment where the dependencies are installed (either a virtual environment or globally).

Usage

To use the script, follow these steps:

Run the Python script:
```
python clean-chatgpt-html.py
```
The script will prompt you to enter the full path of the HTML file you want to process.
```
Enter the full path of the HTML file to be processed: /path/to/chatgpt_output.html
```
The script will clean up the HTML file, remove unnecessary content, and create a new file with the suffix -clean added to the original file name.

Example:
- Input file: chatgpt_output.html
- Output file: chatgpt_output-clean.html
The cleaned file will be saved in the same directory as the original file.

Example

Enter the full path of the HTML file to be processed: /home/user/chatgpt_output.html
File is opened.
Processed HTML has been saved to /home/user/chatgpt_output-clean.html

Features

Removes Unnecessary Tags: Automatically removes <script>, <iframe>, and unwanted <div> or <button> elements.
Self-contained HTML: Cleans the page so it doesn't require any external images, CSS, or JavaScript files.
Title Injection: Adds a custom title to the <head> section based on the file name.
Invert Color Button: Adds a button to toggle between light and dark modes for better readability.
Stand-alone Output: The output HTML file is fully self-contained and can be opened offline without relying on external resources.

Dependencies

This script requires the following Python libraries:

BeautifulSoup4: For parsing HTML and making modifications.
lxml: Efficient HTML parser for BeautifulSoup.

Install them via pip:

pip install beautifulsoup4 lxml

License

This script is licensed under the MIT License. Feel free to modify and redistribute it under the terms of the license.

Contact

For questions, feedback, or suggestions, please reach out to the author:

Author: Yahya Hamidaddin
Email: [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENSE		LICENSE
README.md		README.md
clean-chatgpt-html.py		clean-chatgpt-html.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

README for `clean-chatgpt-html.py`

Step 2: Download the Script

Step 3: Ensure Python Environment

Usage

Example

Features

Dependencies

License

Contact

About

Uh oh!

Releases

Packages

Languages

License

yhamidaddin/clean-chatgpt-html.py

Folders and files

Latest commit

History

Repository files navigation

README for clean-chatgpt-html.py

Step 2: Download the Script

Step 3: Ensure Python Environment

Usage

Example

Features

Dependencies

License

Contact

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

README for `clean-chatgpt-html.py`

Packages