The Mamba Paper Diaries
This model inherits from PreTrainedModel. Check the superclass documentation for the generic methods the library implements for all its models (such as downloading or saving, resizing the input embeddings, and pruning heads). Use it as a regular PyTorch Module and refer to the PyTorch documentation for all matters related to general usage.