This project evaluates the mathematical capabilities of Large Language Models (LLMs) and examines how providing hints in prompts affects their performance. We further study how adversarial hints and examples can misdirect an LLM's answers, and whether the model can recognize the misdirection and recover from it.
We prompt LLMs with mathematical problems, with and without hints, and analyze the responses to assess:
- The baseline mathematical ability of LLMs
- How hint-enhanced prompts impact the accuracy and approach of LLM solutions
- How adversarial hints affect an LLM's problem solving (see the sketch after this list)
- The capabilities of VLMs on mathematical problems
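The snippet below is a minimal sketch of how the baseline, hinted, and adversarial prompt conditions can be constructed and sent to a model. It is not code from this repository: the client, model name, problem, and hint text are all illustrative assumptions, and any OpenAI-compatible chat API would work the same way.

```python
# Sketch only: building the three prompt variants described above.
# The model name, problem, and hints are placeholders, not repo data.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

problem = "What is the sum of the first 10 positive odd integers?"
correct_hint = "Hint: the sum of the first n odd integers is n^2."
adversarial_hint = "Hint: the sum of the first n odd integers is n(n+1)."  # deliberately wrong

def ask(prompt: str) -> str:
    """Send a single math prompt and return the model's answer text."""
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder; substitute the model under evaluation
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content

baseline_answer = ask(problem)                              # no hint
hinted_answer = ask(f"{correct_hint}\n{problem}")           # helpful hint
misdirected_answer = ask(f"{adversarial_hint}\n{problem}")  # adversarial hint
```

The adversarial hint deliberately encodes a wrong formula, so comparing the three answers shows whether the model follows the misdirection or notices and corrects the mistake.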
```bash
git clone https://github.com/vlgiitr/LLM-Math.git
pip install -r requirements.txt
```
To generate the baseline:
```bash
cd Evaluation
python baseline.py
```
Similarly, to generate other results:
```bash
cd Evaluation
python {filename}.py
```
Contributions are welcome! Please read our Contributing Guide for more information.
This project is licensed under the terms described in the LICENSE file; see that file for details.