Steering
#15053
Replies: 1 comment
It would be interesting to see this implemented so that, when steering is enabled, llama.cpp inserts synthetic layers between every pair of layers at load time. By default these layers would pass data straight through without modifying it. However, if you pass in a steering vector, a scaling value, and a target layer id at inference time, the synthetic layer at that position applies the steering vector for the duration of the inference call, then goes back to straight passthrough.
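To make the idea concrete, here is a minimal pure-Python sketch of the passthrough-slot design. Everything in it is hypothetical: `SteeringSlot`, `ToyModel`, and the `steering(...)` context manager are illustrative names, not llama.cpp or transformers API, and it assumes the steering vector is added to the hidden state as `hidden + scale * vector`, which is one common formulation rather than the only one. For simplicity a slot sits after every layer, including the last.

```python
from contextlib import contextmanager
from typing import Callable, Iterator, List, Optional, Sequence

Vector = List[float]
Layer = Callable[[Vector], Vector]


class SteeringSlot:
    """Hypothetical synthetic layer: identity unless a steering vector is armed."""

    def __init__(self) -> None:
        self.vector: Optional[Vector] = None
        self.scale: float = 0.0

    def __call__(self, hidden: Vector) -> Vector:
        if self.vector is None:
            return hidden  # straight passthrough, nothing is modified
        if len(self.vector) != len(hidden):
            raise ValueError("steering vector size must match hidden size")
        # Assumed formulation: add the scaled steering vector element-wise.
        return [h + self.scale * v for h, v in zip(hidden, self.vector)]


class ToyModel:
    """Toy stand-in for a model: a list of layers with one slot after each."""

    def __init__(self, layers: Sequence[Layer]) -> None:
        self.layers = list(layers)
        self.slots = [SteeringSlot() for _ in self.layers]

    @contextmanager
    def steering(self, vector: Vector, scale: float, layer_id: int) -> Iterator["ToyModel"]:
        """Arm one slot for the duration of a call, then restore passthrough."""
        slot = self.slots[layer_id]
        slot.vector, slot.scale = list(vector), scale
        try:
            yield self
        finally:
            slot.vector, slot.scale = None, 0.0

    def forward(self, hidden: Vector) -> Vector:
        for layer, slot in zip(self.layers, self.slots):
            hidden = slot(layer(hidden))
        return hidden


def double(hidden: Vector) -> Vector:
    """Placeholder layer that doubles every element."""
    return [2.0 * h for h in hidden]


model = ToyModel([double, double])

plain = model.forward([1.0, 1.0])          # no steering armed
with model.steering([1.0, 0.0], scale=0.5, layer_id=0):
    steered = model.forward([1.0, 1.0])    # slot 0 adds 0.5 * [1, 0]
restored = model.forward([1.0, 1.0])       # back to passthrough

print(plain, steered, restored)
```

Scoping the vector to a context manager means the slot is reset even if the inference call raises, so a model loaded with steering enabled behaves identically to an unmodified one whenever no vector is supplied.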
It seems many recent papers and research efforts deal with a technique called "steering", in which a layer's attention is manipulated using a provided vector. On a more technical level, this involves an additional multiplication operation using an artifact derived from that vector/matrix.
At the moment, the only available tool for this is transformers, as it allows full interaction with, and manipulation of, the model weights. I believe future versions of llama.cpp should support this functionality too. What do others here think?