2 years ago · 033cf9d6e8
--- a/README.md
+++ b/README.md
@@ -1,3 +1,97 @@
 
															 # bitsandbytes
														
 
															-高效量化和低精度训练工具库，减少模型训练和推理的计算资源消耗
														
 
															+高效量化和低精度训练工具库，减少模型训练和推理的计算资源消耗
														
 
															+
														
 
															+### 安装
														
 
															+
														
 
															+可以通过 pip 安装 BitsAndBytes：
														
 
															+
														
 
															+```
														
 
															+pip install bitsandbytes
														
 
															+```
														
 
															+
														
 
															+### 示例代码
														
 
															+
														
 
															+以下是使用 BitsAndBytes 进行 8 位量化和混合精度训练的基本示例：
														
 
															+
														
 
															+#### 8 位量化示例
														
 
															+
														
 
															+```
														
 
															+import torch
														
 
															+from bitsandbytes.optim import Adam8bit
														
 
															+
														
 
															+# 定义一个简单的模型
														
 
															+model = torch.nn.Linear(10, 2)
														
 
															+
														
 
															+# 将模型移动到 GPU 上
														
 
															+model.cuda()
														
 
															+
														
 
															+# 使用 8 位量化的 Adam 优化器
														
 
															+optimizer = Adam8bit(model.parameters(), lr=0.001)
														
 
															+
														
 
															+# 生成一些随机输入数据
														
 
															+inputs = torch.randn(16, 10).cuda()
														
 
															+targets = torch.randint(0, 2, (16,)).cuda()
														
 
															+
														
 
															+# 定义损失函数
														
 
															+criterion = torch.nn.CrossEntropyLoss()
														
 
															+
														
 
															+# 前向传播
														
 
															+outputs = model(inputs)
														
 
															+loss = criterion(outputs, targets)
														
 
															+
														
 
															+# 反向传播和优化
														
 
															+loss.backward()
														
 
															+optimizer.step()
														
 
															+```
														
 
															+
														
 
															+#### 混合精度训练示例
														
 
															+
														
 
															+```
														
 
															+python
														
 
															+Copy code
														
 
															+import torch
														
 
															+from torch.cuda.amp import autocast, GradScaler
														
 
															+from bitsandbytes.optim import AdamW
														
 
															+
														
 
															+# 定义一个简单的模型
														
 
															+model = torch.nn.Linear(10, 2)
														
 
															+
														
 
															+# 将模型移动到 GPU 上
														
 
															+model.cuda()
														
 
															+
														
 
															+# 使用 AdamW 优化器
														
 
															+optimizer = AdamW(model.parameters(), lr=0.001)
														
 
															+
														
 
															+# 混合精度训练的梯度缩放器
														
 
															+scaler = GradScaler()
														
 
															+
														
 
															+# 生成一些随机输入数据
														
 
															+inputs = torch.randn(16, 10).cuda()
														
 
															+targets = torch.randint(0, 2, (16,)).cuda()
														
 
															+
														
 
															+# 定义损失函数
														
 
															+criterion = torch.nn.CrossEntropyLoss()
														
 
															+
														
 
															+# 前向传播和反向传播
														
 
															+with autocast():
														
 
															+    outputs = model(inputs)
														
 
															+    loss = criterion(outputs, targets)
														
 
															+
														
 
															+# 缩放损失
														
 
															+scaler.scale(loss).backward()
														
 
															+
														
 
															+# 优化器步骤
														
 
															+scaler.step(optimizer)
														
 
															+scaler.update()
														
 
															+```
														
 
															+
														
 
															+### 应用场景
														
 
															+
														
 
															+1.  **大规模模型训练**： 通过量化和低精度训练减少内存和计算需求，加速大规模模型的训练过程。
														
 
															+2.  **推理优化**： 在推理阶段使用 8 位量化来减少模型大小，提高推理效率，特别适用于边缘设备和资源受限的环境。
														
 
															+3.  **节省计算资源**： 在现有计算资源不变的情况下，通过更高效的计算实现更大规模的模型训练，节省硬件成本。
														
 
															+
														
 
															+### 
														
 
															+
														
 
															+-   GitHub 仓库: [BitsAndBytes](https://github.com/TimDettmers/bitsandbytes)