PDF Duplicate Finder

License: GPL v3 | Python 3.8+ | Code style: black

PDF Duplicate Finder finds and manages duplicate PDF files on your computer. It helps you identify and remove duplicate PDF documents, freeing disk space and keeping your files organized.

✨ Features

📦 Installation

Prerequisites

See PREREQUISITES.md for platform-specific setup.

Install from source

  1. Clone the repository:

    git clone https://github.com/Nsfr750/PDF_finder.git
    cd PDF_finder
    
  2. Create and activate a virtual environment (recommended):

    python -m venv venv
    .\venv\Scripts\activate  # Windows
    source venv/bin/activate  # Linux/Mac
    
  3. Install the required dependencies:

    pip install -r requirements.txt
    

Usage

  1. Launch the application:

    python main.py
    
  2. Click “Scan Folder” to select a directory to scan for duplicate PDFs.

  3. Review the results in the main window. After a scan completes, the file list is automatically populated with the scanned PDFs and duplicate groups.

  4. Use the tools to manage duplicates:

Key Features in Detail

Smart PDF Comparison
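
This README does not describe the comparison method the application uses. As an illustration only, the sketch below shows one common way to detect byte-identical PDFs: bucket files by size, then group them by SHA-256 content hash. The function names `file_digest` and `find_duplicate_pdfs` are hypothetical and are not part of this project's code.

```python
import hashlib
from collections import defaultdict
from pathlib import Path
from typing import Dict, List


def file_digest(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicate_pdfs(root: Path) -> List[List[Path]]:
    """Group byte-identical *.pdf files found recursively under root.

    Hypothetical helper for illustration; not the application's own API.
    """
    # Files of different sizes cannot be identical, so bucket by size first
    # and hash only the files that share a size with at least one other file.
    by_size: Dict[int, List[Path]] = defaultdict(list)
    for path in root.rglob("*.pdf"):
        if path.is_file():
            by_size[path.stat().st_size].append(path)

    by_hash: Dict[str, List[Path]] = defaultdict(list)
    for paths in by_size.values():
        if len(paths) < 2:
            continue
        for path in paths:
            by_hash[file_digest(path)].append(path)

    # Keep only groups that contain two or more files.
    return [sorted(group) for group in by_hash.values() if len(group) > 1]
```

Calling `find_duplicate_pdfs(Path("some/folder"))` returns a list of groups, each holding the paths of files with identical bytes. An exact-hash comparison of this kind misses PDFs that look the same but differ in metadata or encoding, so a content-aware comparison would need additional logic.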

Performance Optimizations

User Experience

PDF Backends and Fallback
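
This README does not list the backends the application supports or the order in which it tries them. As a rough sketch of the general fallback pattern only, the code below picks the first importable module from a list of candidates. The function name `first_available` and the candidate module names (`fitz` for PyMuPDF, `pypdf`) are assumptions for illustration and may not match what the project actually uses.

```python
import importlib.util
from typing import Optional, Sequence

# Assumed candidate backends, in order of preference (illustrative only).
DEFAULT_CANDIDATES = ("fitz", "pypdf")


def first_available(candidates: Sequence[str] = DEFAULT_CANDIDATES) -> Optional[str]:
    """Return the name of the first importable module, or None if none is installed."""
    for name in candidates:
        # find_spec returns None when a top-level module cannot be located.
        if importlib.util.find_spec(name) is not None:
            return name
    return None
```

A caller could pass the returned name to `importlib.import_module` and report a clear error when the result is `None`, instead of failing later during a scan.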

Version History

See CHANGELOG.md for a complete list of changes in each version.

Contributing

Contributions are welcome! Please read our Contributing Guidelines for details on how to contribute to this project.

📄 License

This project is licensed under the GNU General Public License v3.0 - see the LICENSE file for details.

🙏 Acknowledgments

🐞 Known Bugs


📅 Last Updated: August 2025
🐍 Python Version: 3.8+
📜 License: GPL-3.0