promptfoo is a tool for testing, evaluating, and securing LLM applications. It supports a test-driven development approach, letting developers optimize prompts, models, and APIs efficiently.
Whether you're using the CLI, integrating into CI/CD, or hardening an app through automated red teaming, promptfoo aims to improve the reliability and security of your LLM apps.
promptfoo is a tool for testing, evaluating, and red-teaming LLM apps.
With promptfoo, you can:
The goal: test-driven LLM development instead of trial-and-error.
npx promptfoo@latest init
There are many different ways to evaluate prompts. Here are some reasons to consider promptfoo:
Start by establishing a handful of test cases – core use cases and failure cases that you want to ensure your prompt can handle.
As you explore modifications to the prompt, use promptfoo eval to rate all outputs. This ensures the prompt is actually improving overall.
As you collect more examples and establish a user feedback loop, continue to build the pool of test cases.
To get started, run this command:
npx promptfoo@latest init
This will create a promptfooconfig.yaml placeholder in your current directory.
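For a rough idea of what an edited config might look like, here is a minimal sketch. The prompt text, the provider ID, the variable names, and the test values are all assumptions chosen for illustration; the file that init generates will differ, and you should swap in a provider you actually have credentials for.

```yaml
# promptfooconfig.yaml (illustrative sketch, not the generated placeholder)
description: "Translation prompt checks"

# Prompts under test; {{language}} and {{input}} are filled from each test's vars.
prompts:
  - "Translate the following text to {{language}}: {{input}}"

# Assumed provider ID; replace with the model you want to evaluate.
providers:
  - openai:gpt-4o-mini

# A core use case plus a failure case, as suggested above.
tests:
  - vars:
      language: French
      input: Hello world
    assert:
      # Case-insensitive substring check on the model output.
      - type: icontains
        value: bonjour
  - vars:
      language: French
      input: ""
    assert:
      # Model-graded check; the rubric wording is hypothetical.
      - type: llm-rubric
        value: Does not invent text when the input is empty
```

Each entry under tests pairs a set of variables with one or more assertions, so adding a newly discovered failure case later is a matter of appending another entry.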
After editing the prompts and variables to your liking, run the eval command to kick off an evaluation:
npx promptfoo@latest eval