当前位置：首页 > 生肖 > 正文内容

五蕴皆空是什么生肖（10分钟了解Pytorch）

2024-09-20 10:43:32生肖267

什么是 Pytorch ？可以把 Pytorch 当做 Numpy 的替代方案使用，做科学计算，而且更强大。做 AI 任务的模型建模、训练、部署等。这方面唯一能与其比肩的是 Google 家的 Tensorflow。

本文是对 Pytorch 非常简短的一个介绍，目录如下：

Tensor 的创建与操作，cuda 加速计算。 Pytorch 的自动微分功能。 Tensor 创建与操作

Tensor 的创建

创建一个未初始化的、形状为 (5,3) 的空 tensor。

输入：

x = torch.empty(5, 3) print(x)

输出：

tensor([[1.0102e-38, 9.0919e-39, 1.0102e-38], [8.9082e-39, 8.4489e-39, 9.6429e-39], [8.4490e-39, 9.6429e-39, 9.2755e-39], [1.0286e-38, 9.0919e-39, 8.9082e-39], [9.2755e-39, 8.4490e-39, 1.0194e-38]])

创建一个随机初始化，形状为 (5,3) 的 tensor。

输入：

x = torch.rand(5, 3) print(x)

输出：

tensor([[0.4133, 0.0885, 0.0503], [0.6771, 0.5543, 0.8236], [0.3047, 0.1217, 0.4441], [0.6269, 0.6820, 0.4217], [0.5631, 0.8517, 0.8708]])

创建一个0填充的、形状为 (5,3) 的 tensor。

输入：

x = torch.zeros(5, 3, dtype=torch.long) print(x)

输出：

tensor([[0, 0, 0], [0, 0, 0], [0, 0, 0], [0, 0, 0], [0, 0, 0]])

直接从列表创建一个 tensor：

x = torch.tensor([5.5, 3]) print(x)

输出：

tensor([5.5000, 3.0000])

在现有 tensor 的基础上新建一个全1 tensor，默认情况下新的 tensor 会继承已有 tensor 的属性，比如形状、数据类型等，当然也可以手动指定。：

x = x.new_ones(5, 3, dtype=torch.double) print(x)

输出：

tensor([[1., 1., 1.], [1., 1., 1.], [1., 1., 1.], [1., 1., 1.], [1., 1., 1.]], dtype=torch.float64)

生成一个与现有 tensor 一样形状的随机 tensor，且数据类型覆盖为 torch.float：

x = torch.randn_like(x, dtype=torch.float) print(x)

输出：

tensor([[ 0.9629, 0.0349, 0.5597], [-2.1172, 1.1874, -0.1596], [ 0.6841, -0.6172, -0.4732], [ 0.0468, -0.3634, 1.1014], [ 0.6064, 0.1740, 0.2344]])

查看 tensor 的形状输入：

print(x.size())

输出：

torch.Size([5, 3])

tensor.size方法返回的是一个tuple对象，可以执行tuple的各种操作。

tensor 的基本操作

详细内容可参考官方文档

1. 创建

torch.rand* 类随机生成 tensor 方法。

torch.rand() torch.rand_like() torch.randn() torch.randn_like() torch.randint() torch.randint_like() torch.randperm() torch.empty()

从其他数据源创建

torch.tensor() torch.from_numpy() torch.full() torch.range() torch.linspace() torch.eye()

2. 索引、切片、合并等转换

# 按指定维度拼接 tensor torch.cat() # 按指定维度将一个 tensor 分割为几个小块 torch.chunk() # 按照深度方向（第三维）将多个 tensor 拼接，若tensor不足三维，则先转为3维 tensor 再行拼接。 torch.dstack() # 将多个 tensor 按第1维进行拼接 torch.hstack() # 按指定维度进行索引查询 torch.index_select() # 根据一个 BoolTensor 进行 mask 查询 torch.masked_select() # 调换 tensor 维度 torch.movedim() # 返回 tensor 中非0值的索引 torch.nonzero() # 改变 tensor 形状 torch.reshape() # 按照指定维度及指定大小将 tensor 分割为几块 torch.split() # 去除所有 tensor 中大小为1的维度，也可以指定哪个维度。 torch.squeeze() # 按照指定维度拼接多个 tensor torch.stack() # tensor 转置 torch.t() # 根据给定的索引从已有 tensor 中抽取出一个新的 tensor。 torch.take() # 将 tensor 的指定两个维度进行互换 torch.transpose() # 在指定地方增加 tensor 维度 torch.unsqueeze() # 类似 dstack，只是是在第二个维度进行拼接 torch.vstack() # 对 tensor 中每个元素根据条件判断如何返回 torch.where()

3. 随机采样机器参数设置

# 设置非确定性随机数的随机种子 torch.seed # 设置生成随机数的随机种子 torch.manual_seed # 查看初始化的随机种子，为 `long` 类型 torch.initial_seed # 查看随机数生成器状态 torch.get_rng_state # 设置随机数生成器状态 torch.set_rng_state

4. 序列化与反序列化

# 对 Pytorch 中的对象进行序列化保存和读取 torch.save() torch.load()

5. 并行计算

# 获取和设置 CPU 并行操作时的线程数 torch.get_num_threads torch.set_num_threads # 获取和设置 CPU 上的互操作并行的线程数 torch.get_num_interop_threads torch.set_num_interop_threads

6. 梯度控制

有多重方法可以控制 tensor 是否计算梯度

输入：

x = torch.zeros(1, requires_grad=True) with torch.no_grad(): y = x * 2 print(y.requires_grad) is_train = False with torch.set_grad_enabled(is_train): y = x * 2 print(y.requires_grad) torch.set_grad_enabled(True) y = x * 2 print(y.requires_grad)

输出：

False False True

7. 数学操作

pytorch 支持大部分常见的数学操作，这里不详细列举，详见官方文档

# Pointwise 操作 torch.abs torch.clip torch.cos torch.sin torch.div torch.exp torch.pow torch.log torch.sigmoid # Reduction 操作 torch.argmax torch.argmin torch.max torch.dist torch.mean torch.norm torch.count_nonzero # 比较操作 torch.allclose torch.argsort torch.eq torch.equal torch.ge torch.gt torch.isinf torch.isfinite torch.isnan torch.isreal torch.isneginf torch.sort torch.topk # 光谱操作及其他操作 ......

Pytorch 的自动微分功能

Pytorch 具备自动微分功能。

requires_grad

这个参数表示这个 tensor是否需要计算梯度。

x = torch.ones(2, 2, requires_grad=True) print(x)

输出：

tensor([[1., 1.], [1., 1.]], requires_grad=True)

grad_fn

指当前这个 tensor 是通过哪个函数得来的，在链式求导时会按照此函数进行计算。一般来讲除了用户自己创建的 tensor 外，如果是 pytorch 内置函数所生成的 tensor 都会有 grad_fn。

输入：

y = x + 2 print(y) print(y.grad_fn) z = y * y * 3 out = z.mean() print(z, out)

输出：

tensor([[3., 3.], [3., 3.]], grad_fn=<AddBackward0>) <AddBackward0 object at 0x000001CEE32753C8> tensor([[27., 27.], [27., 27.]], grad_fn=<MulBackward0>) tensor(27., grad_fn=<MeanBackward0>)

backward & grad

backward 方法用于计算梯度，通过链式求导法则计算好的梯度就存在 tensor 的 grad 属性中。

输入：

out.backward() print(x.grad)

输出：

tensor([[4.5000, 4.5000], [4.5000, 4.5000]])

with torch.no_grad

通过这种方法，可以避免在计算中的梯度计算

输入：

print(x.requires_grad) print((x ** 2).requires_grad) with torch.no_grad(): print((x ** 2).requires_grad)

输出：

True True False

detach

通过 detach 方法避免梯度计算

输入：

print(x.requires_grad) y = x.detach() print(y.requires_grad) print(x.eq(y).all())

输出：

True False tensor(True)

requires_grad_

设置 requires_grad_ 属性可直接更改 tensor 梯度计算配置。

输入：

a = torch.randn(2, 2) a = ((a * 3) / (a - 1)) print(a.requires_grad) a.requires_grad_(True) print(a.requires_grad) b = (a * a).sum() print(b.grad_fn)

输出：

False True <SumBackward0 object at 0x000001CEE32753C8>

结束

本篇记录了 Pytorch 中 tensor 的基本创建方法及基本操作，也介绍了 Pytorch 的自动微分机制。算是简单入门。

下一篇我们来看看怎么使用 Pytorch 来构造数据集、搭建神经网络、模型训练、日志监控等等。

参考https://pytorch.org/tutorials/beginner/blitz/tensor_tutorial.html#sphx-glr-beginner-blitz-tensor-tutorial-py https://pytorch.org/docs/stable/torch.html https://pytorch.org/tutorials/beginner/blitz/autograd_tutorial.html#sphx-glr-beginner-blitz-autograd-tutorial-py

扫码持续关注公众号~