Datasets

Datasets#

The datasets component within PyRIT is the first piece of an attack. By fetching datsets from different sources, we often load them into PyRIT as a SeedPromptDataset to build out the prompts which we will attack with. The building block of this dataset consists of a SeedPrompt which can use a template with parameters or just a prompt. The datasets can be loaded through different formats such as from an open source repository or through a YAML file. By storing these datasets within PyRIT, we can further distinguish them by incorporating data attributes such as harm_categories or other labels. In order to further define these datasets, we use SeedPrompts.

Seed Prompts:

By using SeedPrompts through loading from a YAML file or loading them via system prompt, the following sections will demonstrate specific examples using prompts or templates. These currently support multi-modal datasets including images, audio, and videos.

Loading Datasets:

We also show examples of common methods to fetch datasets into PyRIT from different sources. Most datasets will be loaded as a SeedPromptDataset. Outside of these examples, the fetch functions which are currently available can be found in the pyrit.datasets module. There is a wide range of datasets which are included and can be used as example to also load in other datasets. As these datasets are the first component of building an attack in PyRIT, the following notebooks also continue to demonstrate how these prompts can be used in the process.

Datasets Loading Process: Seed Prompt De-duplication: PyRIT checks for existence of duplicate seed prompts using hashes to make sure it is not uploading duplicate seed prompts in the memory. The feature follows following decision-tree:

If PyRIT receives duplicate seed-prompt within the same dataset, it doesn’t upload the seed-prompt
But if it receives a new seed-prompt in the same dataset with even a slight modification and having a different hash, it accepts it.
If PyRIT receives a duplicate seed-prompt in a different dataset, it accepts it.