trainweights

A pet project that quantizes PyTorch models and then serializes them in a compact binary format. The aim is to store model checkpoints in a compressed form and cut disk usage substantially.
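As a rough sketch of the core idea (not trainweights' actual on-disk layout, which this README doesn't specify), symmetric int8 quantization plus binary packing could look like this, with plain Python lists standing in for tensors:

```python
import struct

def quantize_int8(values):
    # Symmetric per-tensor quantization: one float scale, int8 payload.
    scale = max(abs(v) for v in values) / 127 or 1.0
    return scale, [round(v / scale) for v in values]

def pack(scale, quantized):
    # Illustrative layout: float32 scale, uint32 count, then int8 values.
    return struct.pack(f"<fI{len(quantized)}b", scale, len(quantized), *quantized)

def unpack(buf):
    scale, n = struct.unpack_from("<fI", buf)
    return [q * scale for q in struct.unpack_from(f"<{n}b", buf, 8)]

weights = [0.5, -1.27, 0.003, 1.1]
scale, q = quantize_int8(weights)
blob = pack(scale, q)
restored = unpack(blob)
# Each restored value lands within one quantization step (scale) of the
# original, and the payload costs 1 byte per value instead of 4.
```

Each float32 value shrinks to a single byte plus a small per-tensor header, which is where most of the disk savings come from.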

Still to do:

  • Now that general quantization and save/load are working, expand the Python package to quantize checkpoints that differ little from some reference checkpoint, so that this can actually be useful during training (small diffs arguably are too unimportant to warrant full precision). This might be implemented in library code as a context manager wrapped around a training loop
  • Allow quantization to be filterable, since full precision usually needs to be preserved for things like optimizer states
  • General tidiness
  • bf16 support
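The checkpoint-diff idea in the first bullet could be prototyped roughly like this (a hypothetical sketch, again with plain lists standing in for tensors; none of these names exist in the package yet):

```python
def quantize_diff(checkpoint, reference, bits=8):
    # Quantize only the delta from a reference checkpoint. Between nearby
    # checkpoints the deltas are much smaller than the weights themselves,
    # so the quantization step (scale) is small and the error stays low.
    qmax = 2 ** (bits - 1) - 1  # 127 for int8
    diff = [c - r for c, r in zip(checkpoint, reference)]
    scale = max(abs(d) for d in diff) / qmax or 1.0
    return scale, [round(d / scale) for d in diff]

def restore(reference, scale, quantized_diff):
    # Rebuild the checkpoint as reference + dequantized delta.
    return [r + q * scale for r, q in zip(reference, quantized_diff)]

reference  = [0.50, -1.20, 0.30]
checkpoint = [0.51, -1.19, 0.28]  # small drift after a few training steps
scale, q = quantize_diff(checkpoint, reference)
approx = restore(reference, scale, q)
# approx matches checkpoint to within half a quantization step per element,
# while only the reference needs to be stored at full precision.
```

A context-manager API around a training loop, as the bullet suggests, could then snapshot a reference on entry and store only quantized diffs at each checkpoint.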

Usage example:

Using the example script in examples/for_readme.py:

import os

from transformers import AutoModelForSequenceClassification
import trainweights

MODEL_NAME = "distilbert-base-uncased"

device = "cpu"
model = AutoModelForSequenceClassification.from_pretrained(
    MODEL_NAME,
    num_labels=2,
).to(device)

# Set dir_name to save somewhere specific; otherwise fall back to $PWD
dir_name = ""
default_dir = os.getenv("PWD") or None
save_dir = dir_name or default_dir

if save_dir is None:
    raise RuntimeError("No save dir was set")

model.save_pretrained(save_dir)

safetensors_filepath = f"{save_dir}/model.safetensors"
trainweights_filepath = f"{save_dir}/trainweights_tensors.tws"

trainweights.save_model(model, trainweights_filepath)

print(f"Safetensors filesize: "
      f"{os.stat(safetensors_filepath).st_size / 1e9:.2f} GB")

print(f"Trainweights int8 quantized filesize: "
      f"{os.stat(trainweights_filepath).st_size / 1e9:.2f} GB")

tw_model = trainweights.load_model(trainweights_filepath, MODEL_NAME)

num_elems = 5
state_dicts = zip(
    list(model.state_dict().items())[:num_elems],
    list(tw_model.state_dict().items())[:num_elems],
)

for (orig_key, orig_value), (tw_key, tw_value) in state_dicts:
    orig_mean = orig_value.mean()
    mean_diff = ((orig_value - tw_value).mean() / orig_mean).item()
    print(f"Proportioned mean difference for key {orig_key}: {mean_diff:.3f}")

With output:

Safetensors filesize: 0.27 GB
Trainweights int8 quantized filesize: 0.10 GB
Proportioned mean difference for key distilbert.embeddings.word_embeddings.weight: 0.088
Proportioned mean difference for key distilbert.embeddings.position_embeddings.weight: 0.297
Proportioned mean difference for key distilbert.embeddings.LayerNorm.weight: -0.011
Proportioned mean difference for key distilbert.embeddings.LayerNorm.bias: 0.298
Proportioned mean difference for key distilbert.transformer.layer.0.attention.q_lin.weight: 0.125
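The spread in these relative differences is expected (an illustration of the arithmetic, not library behavior): per-tensor int8 quantization error is roughly uniform in absolute terms, bounded by half the scale, so tensors whose typical values are small relative to their largest value absorb more proportional error:

```python
def int8_roundtrip(values):
    # Symmetric per-tensor int8 quantize, then dequantize.
    scale = max(abs(v) for v in values) / 127 or 1.0
    return [round(v / scale) * scale for v in values]

def max_rel_err(values):
    approx = int8_roundtrip(values)
    return max(abs(v - a) / abs(v) for v, a in zip(values, approx))

narrow = [0.01, 0.02, 0.015, 1.0]  # mostly tiny values, one large outlier
wide   = [0.90, 1.00, 0.95, 1.05]  # all values near the maximum

# The outlier stretches the scale, so the tiny values in `narrow` absorb
# far more relative error than anything in `wide`.
```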

Getting set up:

Simply navigate to the root directory of the repo and run:

pip install .
