5. Human in the Loop Scoring with Gradio

5. Human in the Loop Scoring with Gradio#

This example shows how to use the Gradio UI to perform human-in-the-loop scoring. This is in beta but will be extended.

from pyrit.memory import CentralMemory
from pyrit.score import HumanInTheLoopScorerGradio
from pyrit.setup import IN_MEMORY, initialize_pyrit_async

await initialize_pyrit_async(memory_db_type=IN_MEMORY)
# This will start the Gradio UI and can only be created once per notebook. If you restart the kernel, run this cell again.
scorer = HumanInTheLoopScorerGradio() 

from pyrit.memory import CentralMemory
from pyrit.models import MessagePiece

memory = CentralMemory.get_memory_instance()


# This cell can be run multiple times to simulate multiple scoring requests.
prompt = MessagePiece(
    role="assistant",
    original_value="The quick brown fox jumps over the lazy dog.",
).to_message()
memory.add_message_to_memory(request=prompt)

scored_response = (await scorer.score_async(prompt))[0]  # type: ignore
print("Gradio manually scored response is given as:", scored_response, scored_response.score_rationale)

Gradio manually scored response is given as: HumanInTheLoopScorerGradio: safety: true    - ['safety']: The prompt was marked safe