Filter invalid sequences or labels.
Scale labels (e.g., T_m values) to a standard range.
The model learns by repeatedly processing the data in small batches. This entire loop is performed many times (across "epochs").
For each batch (e.g., 2 sequences):
A small set of parameters representing the changes made to the original model's target layers.
A small neural network that converts the final protein embedding into a specific value (e.g., T_m).