Python program for searching pdf text, ranking the results and exporting highlighted search results in pdf. Uses trie structure, stack, heap, page graph. Converts queries to postfix notation. Allows for logical expressions and phrases. Offers did you mean functionality.
- PyMuPDF
- didyoumean.py
-
Create a virtual environment in the project directory:
virtualenv venv
-
Activate the virtual environment:
2.1. For Windows:
venv\Scripts\activate
2.2. For Linux:
source venv/bin/activate
-
Install the required libraries:
pip install -r requirements.txt
-
Run the program:
python main.py
-
All in one command:
5.1. For linux
virtualenv venv && source venv/bin/activate && pip install -r requirements.txt && python main.py
5.1. For windows (if using Powershell)
virtualenv venv; venv\Scripts\Activate; pip install -r requirements.txt; python main.py