pyrit.score.HumanInTheLoopScorerGradio
- class HumanInTheLoopScorerGradio(*, open_browser=False)
Bases: Scorer
Creates scores from manual human input using Gradio and adds them to the database.
- Parameters:
open_browser (bool) – If True, the scorer opens the Gradio interface in a browser instead of in PyWebview. Defaults to False.
Methods
__init__(*[, open_browser])
get_identifier() – Returns an identifier dictionary for the scorer.
retrieve_score(request_prompt, *[, task])
scale_value_float(value, min_value, max_value) – Scales a value from 0 to 1 based on the given min and max values.
score_async(request_response, *[, task]) – Score the request_response, add the results to the database and return a list of Score objects.
score_image_async(image_path, *[, task]) – Scores the given image using the chat target.
score_prompts_with_tasks_batch_async(*, ...)
score_responses_inferring_tasks_batch_async(*, ...) – Scores a batch of responses (ignores non-assistant messages).
score_text_async(text, *[, task]) – Scores the given text based on the task using the chat target.
validate(request_response, *[, task]) – Validates the request_response piece to score.
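The scaling described for scale_value_float reads like ordinary min-max normalization. The snippet below is a minimal standalone sketch of that idea, not PyRIT's source: the function name min_max_scale is hypothetical, and returning 0.0 for a zero-width range is an assumption made here to avoid dividing by zero.

```python
def min_max_scale(value: float, min_value: float, max_value: float) -> float:
    """Map value from [min_value, max_value] onto [0, 1] (illustrative sketch)."""
    if max_value == min_value:
        # Assumption: a zero-width range maps to 0.0 rather than raising.
        return 0.0
    return (value - min_value) / (max_value - min_value)


# A rating of 3 on a 1-to-5 scale sits halfway along the range.
print(min_max_scale(3, 1, 5))  # 0.5
```

Under this reading, the minimum of the range maps to 0.0 and the maximum to 1.0.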
Attributes
- retrieve_score(request_prompt: PromptRequestPiece, *, task: str | None = None) → list[Score]
- async score_async(request_response: PromptRequestPiece, *, task: str | None = None) → list[Score]
Score the request_response, add the results to the database and return a list of Score objects.
- Parameters:
request_response (PromptRequestPiece) – The request response to be scored.
task (str | None) – The task against which the text should be scored (the original attacker model’s objective).
- Returns:
A list of Score objects representing the results.
- Return type:
list[Score]
- validate(request_response: PromptRequestPiece, *, task: str | None = None)
Validates the request_response piece to score, because some scorers may require specific PromptRequestPiece types or values.
- Parameters:
request_response (PromptRequestPiece) – The request response to be validated.
task (str | None) – The task against which the text should be scored (the original attacker model’s objective).
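The calling pattern implied by the signatures above (a keyword-only task, an awaitable score_async, a list of scores returned and stored) can be sketched without PyRIT or Gradio installed. Everything in this block is a hypothetical stand-in: StubPiece, StubScore and StubHumanScorer are not PyRIT classes, the canned answer replaces the human input, a plain list replaces the database, and the validate-then-retrieve ordering is an assumption rather than a statement about the library's implementation.

```python
import asyncio
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class StubPiece:
    """Hypothetical stand-in for PromptRequestPiece."""
    converted_value: str


@dataclass
class StubScore:
    """Hypothetical stand-in for Score."""
    score_value: str
    task: Optional[str]


@dataclass
class StubHumanScorer:
    """Hypothetical stand-in; a real human scorer would ask a person via a UI."""
    canned_answer: str = "True"
    stored: list = field(default_factory=list)  # plays the role of the database

    def validate(self, request_response: StubPiece, *, task: Optional[str] = None) -> None:
        # Assumption: reject empty text; real scorers define their own checks.
        if not request_response.converted_value:
            raise ValueError("nothing to score")

    def retrieve_score(self, request_prompt: StubPiece, *, task: Optional[str] = None) -> list:
        # Blocking step; returns a canned answer instead of waiting for a human.
        return [StubScore(score_value=self.canned_answer, task=task)]

    async def score_async(self, request_response: StubPiece, *, task: Optional[str] = None) -> list:
        self.validate(request_response, task=task)
        # Run the blocking step in a worker thread so the event loop stays free.
        scores = await asyncio.to_thread(self.retrieve_score, request_response, task=task)
        self.stored.extend(scores)
        return scores


scorer = StubHumanScorer()
piece = StubPiece(converted_value="example response text")
scores = asyncio.run(scorer.score_async(piece, task="example objective"))
print(len(scores), scores[0].score_value)
```

Note that task is passed by keyword, matching the `*` in the documented signatures; passing it positionally would raise a TypeError.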