ATracking which model and weights serve traffic so you can roll back
BRenaming the model file each time the server happens to restart
CCounting how many tokens each version of the model has generated
DStoring user prompts so the model can be retrained on them later
Why this is the answer
Versioning records exactly which model is live, making rollbacks and comparisons possible; without it you cannot reliably reproduce or revert a deployment.