agent_inspect.models.tools package
Submodules
agent_inspect.models.tools.analysis_models module
- class agent_inspect.models.tools.analysis_models.AnalyzedSubgoalValidation(subgoal_validation, data_sample_id, base_error, agent_run_id=None)[source]
Bases:
objectRepresents the error analysis result for a single subgoal validation within a data sample.
- Parameters:
subgoal_validation (SubGoalValidationResult)
data_sample_id (int)
base_error (str | None)
agent_run_id (int | None)
-
agent_run_id:
Optional[int] = None Unique identifier for the agent run associated with this data sample.
-
base_error:
Optional[str] A description of the identified error in the subgoal validation.
-
data_sample_id:
int The unique identifier of the data sample that this result’s subgoal validation originates from.
-
subgoal_validation:
SubGoalValidationResult The subgoal validation result being analyzed.
- class agent_inspect.models.tools.analysis_models.ErrorAnalysisDataSample(data_sample_id, subgoal_validations, agent_run_id=None)[source]
Bases:
objectRepresents a single data sample with its associated subgoal validations.
- Parameters:
data_sample_id (int)
subgoal_validations (List[SubGoalValidationResult])
agent_run_id (int | None)
-
agent_run_id:
Optional[int] = None Unique identifier for the agent run associated with this data sample.
-
data_sample_id:
int Unique identifier for the data sample. Final results of error analysis will reference this id.
-
subgoal_validations:
List[SubGoalValidationResult] A list of ordered subgoal validation results to perform error analysis on.
- class agent_inspect.models.tools.analysis_models.ErrorAnalysisResult(analyzed_validations_clustered_by_errors, completed_subgoal_validations)[source]
Bases:
objectRepresents the overall error analysis result for a set of data samples.
- Parameters:
analyzed_validations_clustered_by_errors (Dict[str, List[AnalyzedSubgoalValidation]])
completed_subgoal_validations (List[AnalyzedSubgoalValidation])
-
analyzed_validations_clustered_by_errors:
Dict[str,List[AnalyzedSubgoalValidation]] A mapping from generalized errors to lists of analyzed subgoal validations that exhibit those errors.
-
completed_subgoal_validations:
List[AnalyzedSubgoalValidation] A list of analyzed subgoal validations consisting of the subgoal validations that were marked completed, and thus have no associated errors.
- class agent_inspect.models.tools.analysis_models.StatisticAnalysisResult(data_sample_id, subgoal_validations, judge_expectation=None, judge_std=None, agent_run_id=None)[source]
Bases:
objectRepresents the statistical analysis result for a single data sample.
- Parameters:
data_sample_id (int)
subgoal_validations (List[SubGoalValidationResult])
judge_expectation (float | None)
judge_std (float | None)
agent_run_id (int | None)
-
agent_run_id:
Optional[int] = None Unique identifier for the agent run associated with this data sample.
-
data_sample_id:
int The unique identifier of the data sample that this statistic analysis result corresponds to.
-
judge_expectation:
Optional[float] = None The computed expectation (mean) of judge scores across all subgoals in one data sample.
-
judge_std:
Optional[float] = None The computed standard deviation of judge scores across all subgoals in one data sample.
-
subgoal_validations:
List[SubGoalValidationResult] A list of subgoal validation results to perform error analysis on.
Module contents
- class agent_inspect.models.tools.AnalyzedSubgoalValidation(subgoal_validation, data_sample_id, base_error, agent_run_id=None)[source]
Bases:
objectRepresents the error analysis result for a single subgoal validation within a data sample.
- Parameters:
subgoal_validation (SubGoalValidationResult)
data_sample_id (int)
base_error (str | None)
agent_run_id (int | None)
-
agent_run_id:
Optional[int] = None Unique identifier for the agent run associated with this data sample.
-
base_error:
Optional[str] A description of the identified error in the subgoal validation.
-
data_sample_id:
int The unique identifier of the data sample that this result’s subgoal validation originates from.
-
subgoal_validation:
SubGoalValidationResult The subgoal validation result being analyzed.
- class agent_inspect.models.tools.ErrorAnalysisDataSample(data_sample_id, subgoal_validations, agent_run_id=None)[source]
Bases:
objectRepresents a single data sample with its associated subgoal validations.
- Parameters:
data_sample_id (int)
subgoal_validations (List[SubGoalValidationResult])
agent_run_id (int | None)
-
agent_run_id:
Optional[int] = None Unique identifier for the agent run associated with this data sample.
-
data_sample_id:
int Unique identifier for the data sample. Final results of error analysis will reference this id.
-
subgoal_validations:
List[SubGoalValidationResult] A list of ordered subgoal validation results to perform error analysis on.
- class agent_inspect.models.tools.ErrorAnalysisResult(analyzed_validations_clustered_by_errors, completed_subgoal_validations)[source]
Bases:
objectRepresents the overall error analysis result for a set of data samples.
- Parameters:
analyzed_validations_clustered_by_errors (Dict[str, List[AnalyzedSubgoalValidation]])
completed_subgoal_validations (List[AnalyzedSubgoalValidation])
-
analyzed_validations_clustered_by_errors:
Dict[str,List[AnalyzedSubgoalValidation]] A mapping from generalized errors to lists of analyzed subgoal validations that exhibit those errors.
-
completed_subgoal_validations:
List[AnalyzedSubgoalValidation] A list of analyzed subgoal validations consisting of the subgoal validations that were marked completed, and thus have no associated errors.
- class agent_inspect.models.tools.StatisticAnalysisResult(data_sample_id, subgoal_validations, judge_expectation=None, judge_std=None, agent_run_id=None)[source]
Bases:
objectRepresents the statistical analysis result for a single data sample.
- Parameters:
data_sample_id (int)
subgoal_validations (List[SubGoalValidationResult])
judge_expectation (float | None)
judge_std (float | None)
agent_run_id (int | None)
-
agent_run_id:
Optional[int] = None Unique identifier for the agent run associated with this data sample.
-
data_sample_id:
int The unique identifier of the data sample that this statistic analysis result corresponds to.
-
judge_expectation:
Optional[float] = None The computed expectation (mean) of judge scores across all subgoals in one data sample.
-
judge_std:
Optional[float] = None The computed standard deviation of judge scores across all subgoals in one data sample.
-
subgoal_validations:
List[SubGoalValidationResult] A list of subgoal validation results to perform error analysis on.