sinksTensor
An optional attention-sinks tensor of shape [nHeads]. Each element is a virtual-token logit that seeds the online-softmax accumulator for the corresponding query head, so the attention weights over real tokens sum to less than one.
Declaration
var sinksTensor: MPSGraphTensor? { get set }
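The effect of a sink on one query head can be illustrated with a small CPU-side sketch in plain Swift. This is not the MPSGraph kernel, and `softmaxWithSink` is a hypothetical helper written only for this example. It assumes the common attention-sinks formulation, in which the sink logit enters the softmax denominator but has no value vector of its own, so it contributes nothing to the output.

```swift
import Foundation

/// Hypothetical reference helper (not part of MPSGraph).
/// Returns the attention weights for one query head's real-token logits.
/// When `sink` is non-nil, it is treated as the logit of a virtual token
/// that participates in normalization only (an assumption of this sketch).
func softmaxWithSink(_ logits: [Double], sink: Double?) -> [Double] {
    // Take the maximum over real logits and the sink for numerical stability.
    let candidates = logits + (sink.map { [$0] } ?? [])
    guard let maxLogit = candidates.max() else { return [] }

    // Seed the denominator with the sink's contribution (zero if no sink).
    var denominator = sink.map { exp($0 - maxLogit) } ?? 0.0

    // Accumulate the real tokens on top of the seeded denominator.
    let exps = logits.map { exp($0 - maxLogit) }
    denominator += exps.reduce(0, +)

    // Only real tokens receive weights; the sink's share is simply dropped.
    return exps.map { $0 / denominator }
}

let logits = [1.0, 2.0, 3.0]
let plainSum = softmaxWithSink(logits, sink: nil).reduce(0, +)
let sinkSum = softmaxWithSink(logits, sink: 0.0).reduce(0, +)
print("without sink:", plainSum, "with sink:", sinkSum)
```

Without a sink the weights sum to one (up to rounding); with a sink they sum to strictly less than one, and the gap is the share of probability mass absorbed by the virtual token. A larger sink logit absorbs more mass and scales the head's output down accordingly.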