Replace activations hook
Replace activations hook.
replace_activations_hook(value, hook, sparse_autoencoder, component_idx=None, n_components=None)
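This should be pre-initialised with functools.partial, binding sparse_autoencoder (and, where needed, component_idx and n_components) so that the resulting callable takes only value and hook.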
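The pre-initialisation pattern can be sketched as follows. This is a minimal, standard-library-only illustration, not the library's implementation: `stand_in_hook` and `toy_autoencoder` are hypothetical placeholders that mirror the parameter names in the signature above, and plain lists stand in for tensors.

```python
from functools import partial


def stand_in_hook(value, hook, sparse_autoencoder, component_idx=None, n_components=None):
    """Hypothetical stand-in sharing the parameter names of replace_activations_hook.

    It simply returns whatever the supplied autoencoder produces for `value`.
    """
    return sparse_autoencoder(value)


def toy_autoencoder(activations):
    """Hypothetical placeholder for an SAE: clamps negative activations to zero."""
    return [max(a, 0.0) for a in activations]


# Bind the autoencoder up front, leaving a callable that accepts (value, hook).
bound_hook = partial(stand_in_hook, sparse_autoencoder=toy_autoencoder)

# The caller now supplies only the activations and the hook point (None here).
replaced = bound_hook([-1.0, 2.0], None)
```

With the real function, the same `partial(...)` call would bind an actual sparse autoencoder, and the bound callable would then be registered at a hook point of the source model.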
Parameters:
Name | Type | Description | Default |
---|---|---|---|
value | `Tensor` | The activations to replace. | required |
hook | `HookPoint` | The hook point. | required |
sparse_autoencoder | `SparseAutoencoder \| DataParallel[SparseAutoencoder] \| LitSparseAutoencoder \| Module` | The sparse autoencoder. | required |
component_idx | `int \| None` | The component index to replace the activations with, if just replacing activations for a single component. Requires the model to have a component axis. | `None` |
n_components | `int \| None` | The number of components that the SAE is trained on. | `None` |
Returns:
Type | Description |
---|---|
`Tensor` | Replaced activations. |
Raises:
Type | Description |
---|---|
`RuntimeError` | If |
Source code in sparse_autoencoder/source_model/replace_activations_hook.py