Wandb Sweep Config Dataclasses¤

Wandb Sweep Config Dataclasses.

Weights & Biases just provide a JSON Schema, so we've converted here to dataclasses.

`Controller` `dataclass` ¤

Controller.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

@dataclass
class Controller:
    """Controller."""

    type: ControllerType

`ControllerType` ¤

Bases: LowercaseStrEnum

Controller Type.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

class ControllerType(LowercaseStrEnum):
    """Controller Type."""

    CLOUD = auto()
    """Weights & Biases cloud controller.

    Utilizes Weights & Biases as the sweep controller, enabling launching of multiple nodes that all
    communicate with the Weights & Biases cloud service to coordinate the sweep.
    """

    LOCAL = auto()
    """Local controller.

    Manages the sweep operation locally, without the need for cloud-based coordination or external
    services.
    """

`CLOUD = auto()` `class-attribute` `instance-attribute` ¤

Weights & Biases cloud controller.

Utilizes Weights & Biases as the sweep controller, enabling launching of multiple nodes that all communicate with the Weights & Biases cloud service to coordinate the sweep.

`LOCAL = auto()` `class-attribute` `instance-attribute` ¤

Local controller.

Manages the sweep operation locally, without the need for cloud-based coordination or external services.

`Distribution` ¤

Bases: LowercaseStrEnum

Sweep Distribution.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

class Distribution(LowercaseStrEnum):
    """Sweep Distribution."""

    BETA = auto()
    """Beta distribution.

    Utilizes the Beta distribution, a family of continuous probability distributions defined on the
    interval [0, 1], for parameter sampling.
    """

    CATEGORICAL = auto()
    """Categorical distribution.

    Employs a categorical distribution for discrete variable sampling, where each category has an
    equal probability of being selected.
    """

    CATEGORICAL_W_PROBABILITIES = auto()
    """Categorical distribution with probabilities.

    Similar to categorical distribution but allows assigning different probabilities to each
    category.
    """

    CONSTANT = auto()
    """Constant distribution.

    Uses a constant value for the parameter, ensuring it remains the same across all runs.
    """

    INT_UNIFORM = auto()
    """Integer uniform distribution.

    Samples integer values uniformly across a specified range.
    """

    INV_LOG_UNIFORM = auto()
    """Inverse log-uniform distribution.

    Samples values according to an inverse log-uniform distribution, useful for parameters that span
    several orders of magnitude.
    """

    INV_LOG_UNIFORM_VALUES = auto()
    """Inverse log-uniform values distribution.

    Similar to the inverse log-uniform distribution but allows specifying exact values to be
    sampled.
    """

`BETA = auto()` `class-attribute` `instance-attribute` ¤

Beta distribution.

Utilizes the Beta distribution, a family of continuous probability distributions defined on the interval [0, 1], for parameter sampling.

`CATEGORICAL = auto()` `class-attribute` `instance-attribute` ¤

Categorical distribution.

Employs a categorical distribution for discrete variable sampling, where each category has an equal probability of being selected.

`CATEGORICAL_W_PROBABILITIES = auto()` `class-attribute` `instance-attribute` ¤

Categorical distribution with probabilities.

Similar to categorical distribution but allows assigning different probabilities to each category.

`CONSTANT = auto()` `class-attribute` `instance-attribute` ¤

Constant distribution.

Uses a constant value for the parameter, ensuring it remains the same across all runs.

`INT_UNIFORM = auto()` `class-attribute` `instance-attribute` ¤

Integer uniform distribution.

Samples integer values uniformly across a specified range.

`INV_LOG_UNIFORM = auto()` `class-attribute` `instance-attribute` ¤

Inverse log-uniform distribution.

Samples values according to an inverse log-uniform distribution, useful for parameters that span several orders of magnitude.

`INV_LOG_UNIFORM_VALUES = auto()` `class-attribute` `instance-attribute` ¤

Inverse log-uniform values distribution.

Similar to the inverse log-uniform distribution but allows specifying exact values to be sampled.

`Goal` ¤

Bases: LowercaseStrEnum

Goal.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

class Goal(LowercaseStrEnum):
    """Goal."""

    MAXIMIZE = auto()
    """Maximization goal.

    Sets the objective of the hyperparameter tuning process to maximize a specified metric.
    """

    MINIMIZE = auto()
    """Minimization goal.

    Aims to minimize a specified metric during the hyperparameter tuning process.
    """

`MAXIMIZE = auto()` `class-attribute` `instance-attribute` ¤

Maximization goal.

Sets the objective of the hyperparameter tuning process to maximize a specified metric.

`MINIMIZE = auto()` `class-attribute` `instance-attribute` ¤

Minimization goal.

Aims to minimize a specified metric during the hyperparameter tuning process.

`HyperbandStopping` `dataclass` ¤

Hyperband Stopping Config.

Speed up hyperparameter search by killing off runs that appear to have lower performance than successful training runs.

Example

HyperbandStopping(type=HyperbandStoppingType.HYPERBAND) HyperbandStopping(type=hyperband)

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

@dataclass
class HyperbandStopping:
    """Hyperband Stopping Config.

    Speed up hyperparameter search by killing off runs that appear to have lower performance
    than successful training runs.

    Example:
        >>> HyperbandStopping(type=HyperbandStoppingType.HYPERBAND)
        HyperbandStopping(type=hyperband)
    """

    type: HyperbandStoppingType | None = HyperbandStoppingType.HYPERBAND

    eta: float | None = None
    """ETA.

    Specify the bracket multiplier schedule (default: 3).
    """

    maxiter: int | None = None
    """Max Iterations.

    Specify the maximum number of iterations. Note this is number of times the metric is logged, not
    the number of activations.
    """

    miniter: int | None = None
    """Min Iterations.

    Set the first epoch to start trimming runs, and hyperband will automatically calculate
    the subsequent epochs to trim runs.
    """

    s: float | None = None
    """Set the number of steps you trim runs at, working backwards from the max_iter."""

    strict: bool | None = None
    """Use a more aggressive condition for termination, stops more runs."""

    @final
    def __str__(self) -> str:
        """String representation of this object."""
        items_representation = []
        for key, value in self.__dict__.items():
            if value is not None:
                items_representation.append(f"{key}={value}")
        joined_items = ", ".join(items_representation)

        class_name = self.__class__.__name__

        return f"{class_name}({joined_items})"

    @final
    def __repr__(self) -> str:
        """Representation of this object."""
        return self.__str__()

`eta: float | None = None` `class-attribute` `instance-attribute` ¤

ETA.

Specify the bracket multiplier schedule (default: 3).

`maxiter: int | None = None` `class-attribute` `instance-attribute` ¤

Max Iterations.

Specify the maximum number of iterations. Note this is number of times the metric is logged, not the number of activations.

`miniter: int | None = None` `class-attribute` `instance-attribute` ¤

Min Iterations.

Set the first epoch to start trimming runs, and hyperband will automatically calculate the subsequent epochs to trim runs.

`s: float | None = None` `class-attribute` `instance-attribute` ¤

Set the number of steps you trim runs at, working backwards from the max_iter.

`strict: bool | None = None` `class-attribute` `instance-attribute` ¤

Use a more aggressive condition for termination, stops more runs.

`repr()` ¤

Representation of this object.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

@final
def __repr__(self) -> str:
    """Representation of this object."""
    return self.__str__()

`str()` ¤

String representation of this object.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

@final
def __str__(self) -> str:
    """String representation of this object."""
    items_representation = []
    for key, value in self.__dict__.items():
        if value is not None:
            items_representation.append(f"{key}={value}")
    joined_items = ", ".join(items_representation)

    class_name = self.__class__.__name__

    return f"{class_name}({joined_items})"

`HyperbandStoppingType` ¤

Bases: LowercaseStrEnum

Hyperband Stopping Type.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

class HyperbandStoppingType(LowercaseStrEnum):
    """Hyperband Stopping Type."""

    HYPERBAND = auto()
    """Hyperband algorithm.

    Implements the Hyperband stopping algorithm, an adaptive resource allocation and early-stopping
    method to efficiently tune hyperparameters.
    """

`HYPERBAND = auto()` `class-attribute` `instance-attribute` ¤

Hyperband algorithm.

Implements the Hyperband stopping algorithm, an adaptive resource allocation and early-stopping method to efficiently tune hyperparameters.

`Impute` ¤

Bases: LowercaseStrEnum

Metric value to use in bayes search for runs that fail, crash, or are killed.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

class Impute(LowercaseStrEnum):
    """Metric value to use in bayes search for runs that fail, crash, or are killed."""

    BEST = auto()
    LATEST = auto()
    WORST = auto()

`ImputeWhileRunning` ¤

Bases: LowercaseStrEnum

Appends a calculated metric even when epochs are in a running state.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

class ImputeWhileRunning(LowercaseStrEnum):
    """Appends a calculated metric even when epochs are in a running state."""

    BEST = auto()
    FALSE = auto()
    LATEST = auto()
    WORST = auto()

`Kind` ¤

Bases: LowercaseStrEnum

Kind.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

class Kind(LowercaseStrEnum):
    """Kind."""

    SWEEP = auto()

`Method` ¤

Bases: LowercaseStrEnum

Method.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

class Method(LowercaseStrEnum):
    """Method."""

    BAYES = auto()
    """Bayesian optimization.

    Employs Bayesian optimization for hyperparameter tuning, a probabilistic model-based approach
    for finding the optimal set of parameters.
    """

    CUSTOM = auto()
    """Custom method.

    Allows for a user-defined custom method for hyperparameter tuning, providing flexibility in the
    sweep process.
    """

    GRID = auto()
    """Grid search.

    Utilizes a grid search approach for hyperparameter tuning, systematically working through
    multiple combinations of parameter values.
    """

    RANDOM = auto()
    """Random search.

    Implements a random search strategy for hyperparameter tuning, exploring the parameter space
    randomly.
    """

`BAYES = auto()` `class-attribute` `instance-attribute` ¤

Bayesian optimization.

Employs Bayesian optimization for hyperparameter tuning, a probabilistic model-based approach for finding the optimal set of parameters.

`CUSTOM = auto()` `class-attribute` `instance-attribute` ¤

Custom method.

Allows for a user-defined custom method for hyperparameter tuning, providing flexibility in the sweep process.

`GRID = auto()` `class-attribute` `instance-attribute` ¤

Grid search.

Utilizes a grid search approach for hyperparameter tuning, systematically working through multiple combinations of parameter values.

`RANDOM = auto()` `class-attribute` `instance-attribute` ¤

Random search.

Implements a random search strategy for hyperparameter tuning, exploring the parameter space randomly.

`Metric` `dataclass` ¤

Metric to optimize.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

@dataclass(frozen=True)
class Metric:
    """Metric to optimize."""

    name: str
    """Name of metric."""

    goal: Goal | None = Goal.MINIMIZE

    impute: Impute | None = None
    """Metric value to use in bayes search for runs that fail, crash, or are killed"""

    imputewhilerunning: ImputeWhileRunning | None = None
    """Appends a calculated metric even when epochs are in a running state."""

    target: float | None = None
    """The sweep will finish once any run achieves this value."""

    @final
    def __str__(self) -> str:
        """String representation of this object."""
        items_representation = []
        for key, value in self.__dict__.items():
            if value is not None:
                items_representation.append(f"{key}={value}")
        joined_items = ", ".join(items_representation)

        class_name = self.__class__.__name__

        return f"{class_name}({joined_items})"

    @final
    def __repr__(self) -> str:
        """Representation of this object."""
        return self.__str__()

`impute: Impute | None = None` `class-attribute` `instance-attribute` ¤

Metric value to use in bayes search for runs that fail, crash, or are killed

`imputewhilerunning: ImputeWhileRunning | None = None` `class-attribute` `instance-attribute` ¤

Appends a calculated metric even when epochs are in a running state.

`name: str` `instance-attribute` ¤

Name of metric.

`target: float | None = None` `class-attribute` `instance-attribute` ¤

The sweep will finish once any run achieves this value.

`repr()` ¤

Representation of this object.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

@final
def __repr__(self) -> str:
    """Representation of this object."""
    return self.__str__()

`str()` ¤

String representation of this object.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

@final
def __str__(self) -> str:
    """String representation of this object."""
    items_representation = []
    for key, value in self.__dict__.items():
        if value is not None:
            items_representation.append(f"{key}={value}")
    joined_items = ", ".join(items_representation)

    class_name = self.__class__.__name__

    return f"{class_name}({joined_items})"

`NestedParameter` `dataclass` ¤

Bases: ABC

Nested Parameter.

Example

from dataclasses import field @dataclass(frozen=True) ... class MyNestedParameter(NestedParameter): ... a: int = field(default=Parameter(1)) ... b: int = field(default=Parameter(2)) MyNestedParameter().to_dict() {'parameters': {'a': {'value': 1}, 'b': {'value': 2}}}

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

@dataclass(frozen=True)
class NestedParameter(ABC):  # noqa: B024 (abstract so that we can check against its type)
    """Nested Parameter.

    Example:
        >>> from dataclasses import field
        >>> @dataclass(frozen=True)
        ... class MyNestedParameter(NestedParameter):
        ...     a: int = field(default=Parameter(1))
        ...     b: int = field(default=Parameter(2))
        >>> MyNestedParameter().to_dict()
        {'parameters': {'a': {'value': 1}, 'b': {'value': 2}}}
    """

    def to_dict(self) -> dict[str, Any]:
        """Return dict representation of this object."""

        def dict_without_none_values(obj: Any) -> dict:  # noqa: ANN401
            """Return dict without None values.

            Args:
                obj: The object to convert to a dict.

            Returns:
                The dict representation of the object.
            """
            dict_none_removed = {}
            dict_with_none = dict(obj)
            for key, value in dict_with_none.items():
                if value is not None:
                    dict_none_removed[key] = value
            return dict_none_removed

        return {"parameters": asdict(self, dict_factory=dict_without_none_values)}

    def __dict__(self) -> dict[str, Any]:  # type: ignore[override]
        """Return dict representation of this object."""
        return self.to_dict()

`dict()` ¤

Return dict representation of this object.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

def __dict__(self) -> dict[str, Any]:  # type: ignore[override]
    """Return dict representation of this object."""
    return self.to_dict()

`to_dict()` ¤

Return dict representation of this object.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

def to_dict(self) -> dict[str, Any]:
    """Return dict representation of this object."""

    def dict_without_none_values(obj: Any) -> dict:  # noqa: ANN401
        """Return dict without None values.

        Args:
            obj: The object to convert to a dict.

        Returns:
            The dict representation of the object.
        """
        dict_none_removed = {}
        dict_with_none = dict(obj)
        for key, value in dict_with_none.items():
            if value is not None:
                dict_none_removed[key] = value
        return dict_none_removed

    return {"parameters": asdict(self, dict_factory=dict_without_none_values)}

`Parameter` `dataclass` ¤

Bases: Generic[ParamType]

Sweep Parameter.

https://docs.wandb.ai/guides/sweeps/define-sweep-configuration#parameters

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

@dataclass(frozen=True)
class Parameter(Generic[ParamType]):
    """Sweep Parameter.

    https://docs.wandb.ai/guides/sweeps/define-sweep-configuration#parameters
    """

    value: ParamType | None = None
    """Single value.

    Specifies the single valid value for this hyperparameter. Compatible with grid.
    """

    max: ParamType | None = None
    """Maximum value."""

    min: ParamType | None = None
    """Minimum value."""

    distribution: Distribution | None = None
    """Distribution

    If not specified, will default to categorical if values is set, to int_uniform if max and min
    are set to integers, to uniform if max and min are set to floats, or to constant if value is
    set.
    """

    q: float | None = None
    """Quantization parameter.

    Quantization step size for quantized hyperparameters.
    """

    values: list[ParamType] | None = None
    """Discrete values.

    Specifies all valid values for this hyperparameter. Compatible with grid.
    """

    probabilities: list[float] | None = None
    """Probability of each value"""

    mu: float | None = None
    """Mean for normal or lognormal distributions"""

    sigma: float | None = None
    """Std Dev for normal or lognormal distributions"""

    @final
    def __str__(self) -> str:
        """String representation of this object."""
        items_representation = []
        for key, value in self.__dict__.items():
            if value is not None:
                items_representation.append(f"{key}={value}")
        joined_items = ", ".join(items_representation)

        class_name = self.__class__.__name__

        return f"{class_name}({joined_items})"

    @final
    def __repr__(self) -> str:
        """Representation of this object."""
        return self.__str__()

`distribution: Distribution | None = None` `class-attribute` `instance-attribute` ¤

Distribution

If not specified, will default to categorical if values is set, to int_uniform if max and min are set to integers, to uniform if max and min are set to floats, or to constant if value is set.

`max: ParamType | None = None` `class-attribute` `instance-attribute` ¤

Maximum value.

`min: ParamType | None = None` `class-attribute` `instance-attribute` ¤

Minimum value.

`mu: float | None = None` `class-attribute` `instance-attribute` ¤

Mean for normal or lognormal distributions

`probabilities: list[float] | None = None` `class-attribute` `instance-attribute` ¤

Probability of each value

`q: float | None = None` `class-attribute` `instance-attribute` ¤

Quantization parameter.

Quantization step size for quantized hyperparameters.

`sigma: float | None = None` `class-attribute` `instance-attribute` ¤

Std Dev for normal or lognormal distributions

`value: ParamType | None = None` `class-attribute` `instance-attribute` ¤

Single value.

Specifies the single valid value for this hyperparameter. Compatible with grid.

`values: list[ParamType] | None = None` `class-attribute` `instance-attribute` ¤

Discrete values.

Specifies all valid values for this hyperparameter. Compatible with grid.

`repr()` ¤

Representation of this object.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

@final
def __repr__(self) -> str:
    """Representation of this object."""
    return self.__str__()

`str()` ¤

String representation of this object.

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

@final
def __str__(self) -> str:
    """String representation of this object."""
    items_representation = []
    for key, value in self.__dict__.items():
        if value is not None:
            items_representation.append(f"{key}={value}")
    joined_items = ", ".join(items_representation)

    class_name = self.__class__.__name__

    return f"{class_name}({joined_items})"

`Parameters` `dataclass` ¤

Parameters

Source code in sparse_autoencoder/train/utils/wandb_sweep_types.py

@dataclass
class Parameters:
    """Parameters"""