How do I find the size of a deep learning model?

Posted on 2025-01-20 20:56:05


I am working with different quantized implementations of the same model, the main difference being the precision of the weights, biases, and activations. I'd like to know how to find the difference in size, in MB, between a model in, say, 32-bit floating point and one in int8. The models are saved in .pth format.

Comments (5)

痴情 2025-01-27 20:56:05

You can iterate over the model's parameters and buffers, multiplying the number of elements in each tensor by its element size; the sum gives the total size of the model in memory.

import torchvision.models as models

model = models.resnet18()

# Sum the byte size of every parameter tensor
param_size = 0
for param in model.parameters():
    param_size += param.nelement() * param.element_size()

# Buffers (e.g. BatchNorm running stats) also contribute to the size
buffer_size = 0
for buffer in model.buffers():
    buffer_size += buffer.nelement() * buffer.element_size()

size_all_mb = (param_size + buffer_size) / 1024**2
print('Size: {:.3f} MB'.format(size_all_mb))

And it will print:

Size: 361.209 MB
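
Since the question mentions .pth files specifically, the same element-size trick can be applied to a saved checkpoint directly, so the fp32 and int8 files can be compared without rebuilding the models. A minimal sketch, assuming each .pth file holds a flat state_dict of tensors (the file names model_fp32.pth and model_int8.pth are placeholders):

import torch

def state_dict_size_mb(path):
    # Load the checkpoint on CPU and sum the byte size of every tensor in it
    state_dict = torch.load(path, map_location='cpu')
    total_bytes = sum(t.nelement() * t.element_size()
                      for t in state_dict.values() if torch.is_tensor(t))
    return total_bytes / 1024**2

fp32_mb = state_dict_size_mb('model_fp32.pth')  # placeholder file names
int8_mb = state_dict_size_mb('model_int8.pth')
print('fp32: {:.3f} MB, int8: {:.3f} MB, difference: {:.3f} MB'
      .format(fp32_mb, int8_mb, fp32_mb - int8_mb))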

鱼忆七猫命九 2025-01-27 20:56:05

"To calculate the model size in bytes, one multiplies the number of parameters by the size of the chosen precision in bytes. For example, if we use the bfloat16 version of the BLOOM-176B model, we have 176*10**9 x 2 bytes = 352GB!"

This Hugging Face blog post is worth reading: https://huggingface.co/blog/hf-bitsandbytes-integration
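
The quoted arithmetic is easy to reproduce; a quick sanity check (using decimal gigabytes, as the blog post does):

num_params = 176 * 10**9   # BLOOM-176B: 176 billion parameters
bytes_per_param = 2        # bfloat16 stores each parameter in 2 bytes
size_in_gb = num_params * bytes_per_param / 10**9
print('{:.0f} GB'.format(size_in_gb))  # prints: 352 GB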

压抑⊿情绪 2025-01-27 20:56:05

Following @Prajot's code, one can arrive at a precision multiplier and several parameter multipliers, leading to a rule of thumb.

TL;DR - Rule of thumb

stats for 8 bit models:
---
7 billion param = 7 GB
13 billion param = 13 GB
175 billion param = 175 GB

extending the rule of thumb
---
16 bits? 2x
24 bits? 3x
32 bits? 4x

Calculations

Arriving at a Precision multiplier

import numpy as np
import matplotlib.pyplot as plt

def cal_size(precision, num_params=10**9):
    # precision in bits -> bytes per parameter
    bytes_per_param = precision / 8
    size_in_gb = (num_params / 1024**3) * bytes_per_param
    return round(size_in_gb, 2)

# Assuming 1 billion parameters (change num_params accordingly)
precisions = list(range(1, 25))
sizes = [cal_size(precision) for precision in precisions]

plt.plot(precisions, sizes, marker='o')
plt.title('Model Size vs. Precision')
plt.xlabel('Precision (bits)')
plt.ylabel('Size (GB)')
plt.grid(True)
plt.xticks(range(1, 25))
plt.show()

# The relationship is linear, so the slope is the per-bit size multiplier
precisions_arr = np.array(precisions)
sizes_arr = np.array(sizes)
slope, intercept = np.polyfit(precisions_arr, sizes_arr, 1)

print("Slope of the line:", slope, "intercept:", intercept)

(Plot: model size in GB vs. precision in bits; the points fall on a straight line through the origin.)

Result: Precision Multiplier = slope of line = 0.1164

Param multiplier

if precision = 24 bits, param multiplier = 24*0.1164 = 2.79 ~ 3
if precision = 16 bits, param multiplier = 16*0.1164 = 1.86 ~ 2
if precision = 8 bits, param multiplier = 8*0.1164 = 0.93 ~ 1

Model size calculation

model size =  Params in billion x param multiplier
7B-24bits = 7 x 3 = 21 GB
13B-24bits = 13 x 3 = 39 GB
7B-16bits = 7 x 2 = 14 GB
7B-8bits = 7 x 1 = 7 GB
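
Putting the slope and a parameter count together, the whole rule of thumb fits in one helper. A small sketch (the name rough_size_gb is mine; it uses the unrounded slope, so results land slightly below the rounded multipliers above):

def rough_size_gb(params_in_billion, precision_bits, slope=0.1164):
    # size (GB) ~= params (billions) x precision (bits) x per-bit slope
    return round(params_in_billion * precision_bits * slope, 1)

print(rough_size_gb(7, 8))    # 6.5 GB, i.e. the "7 GB" row above
print(rough_size_gb(13, 16))  # 24.2 GB vs. 13 x 2 = 26 GB rounded
print(rough_size_gb(7, 32))   # 26.1 GB vs. 7 x 4 = 28 GB rounded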
铁轨上的流浪者 2025-01-27 20:56:05

I've written a small function that calculates the size of your model from its parameter count and dtype.
It currently supports fp32, fp16, bfloat16, and int8.

def cal_size(num_params, dtype):
    # Size in MB = number of parameters x bytes per element
    if dtype == "float32":
        return (num_params / 1024**2) * 4
    elif dtype == "float16" or dtype == "bfloat16":
        return (num_params / 1024**2) * 2
    elif dtype == "int8":
        return (num_params / 1024**2) * 1
    else:
        return -1

if __name__ == "__main__":
    import torchvision.models as models
    model = models.mobilenet_v2()
    # MobileNetV2 with width multiplier 1 has ~3.4M params
    total_params = sum(p.numel() for p in model.parameters())
    model_size = cal_size(total_params, "float32")
    if model_size != -1:
        print("Size of model is: {:.2f} MB".format(model_size))
    else:
        print("Incorrect dtype")
伤感在游骋 2025-01-27 20:56:05

An alternative is to just measure the size of the folder the weights were downloaded to on your machine (if using something like Hugging Face).

import os
import subprocess

def get_folder_size(model_name):
    start_path = '/home/ubuntu/.cache/huggingface/hub'
    folder_name = 'models--' + model_name.replace('/', '--')
    folder_path = os.path.join(start_path, folder_name)
    # `du -sb` prints "<size_in_bytes>\t<path>"; take the first field
    size = subprocess.check_output(['du', '-sb', folder_path]).split()[0].decode('utf-8')
    size_in_bytes = int(size)
    size_in_gb = round(size_in_bytes / (1024 ** 3), 3)  # Convert bytes to GB and round to 3 decimal places
    return size_in_gb

get_folder_size('thenlper/gte-base')
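
Note that du is Unix-only and the snippet hard-codes one cache location. A portable pure-Python variant of the same idea, sketched with os.walk (skipping symlinks avoids double counting, since the HF cache links snapshots/ entries to blobs/):

import os

def get_folder_size_gb(folder_path):
    total_bytes = 0
    for dirpath, _, filenames in os.walk(folder_path):
        for name in filenames:
            file_path = os.path.join(dirpath, name)
            if not os.path.islink(file_path):  # don't count symlinked blobs twice
                total_bytes += os.path.getsize(file_path)
    return round(total_bytes / (1024 ** 3), 3)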