Learn PyTorch: Tensors

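Tensors are a specialized data structure, similar to arrays and matrices. In PyTorch, tensors are used to represent the inputs and outputs of a neural network as well as its parameters.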

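Tensor Initialization

1. Directly from data

A tensor can be created directly from raw data. Its data type is inferred from the type of the data.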

import torch
import numpy as np

data = [[1, 2], [3, 4]]
x_data = torch.tensor(data)
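As a small illustration of how the data type follows the raw data, the sketch below builds tensors from integer and floating-point literals and prints the inferred dtype. The variable names are chosen for this example only, and the float32 result assumes PyTorch's default floating-point type has not been changed.

```python
import torch

# Integer literals are inferred as 64-bit integers.
int_tensor = torch.tensor([[1, 2], [3, 4]])
print(int_tensor.dtype)    # torch.int64

# Floating-point literals use the default float type (float32 unless changed).
float_tensor = torch.tensor([[1.0, 2.0], [3.0, 4.0]])
print(float_tensor.dtype)  # torch.float32

# An explicit dtype argument overrides the inference.
forced = torch.tensor([[1, 2], [3, 4]], dtype=torch.float64)
print(forced.dtype)        # torch.float64
```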

2. From a NumPy array

A tensor can be created from an existing NumPy array. Tensors and NumPy arrays can be converted into one another.

np_array = np.array(data)
x_np = torch.from_numpy(np_array)
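One detail worth checking when converting is that torch.from_numpy keeps the array's data type and shape rather than converting to a PyTorch default. The sketch below uses explicitly typed arrays (names are illustrative) so the result does not depend on the platform's default integer width.

```python
import numpy as np
import torch

# Explicit dtypes make the result independent of platform defaults.
np_f64 = np.array([[1, 2], [3, 4]], dtype=np.float64)
np_i32 = np.array([[1, 2], [3, 4]], dtype=np.int32)

t_f64 = torch.from_numpy(np_f64)
t_i32 = torch.from_numpy(np_i32)

print(t_f64.dtype)  # torch.float64
print(t_i32.dtype)  # torch.int32
print(t_f64.shape)  # torch.Size([2, 2])
```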

3. From another tensor

The new tensor inherits the properties (shape and data type) of the existing tensor, unless a new data type is specified explicitly.

x_ones = torch.ones_like(x_data)   # retains the properties of x_data
print(f"Ones Tensor: \n {x_ones} \n")

x_rand = torch.rand_like(x_data, dtype=torch.float)   # overrides the data type of x_data
                                                      # int -> float
print(f"Random Tensor: \n {x_rand} \n")

output:

Ones Tensor:
 tensor([[1, 1],
         [1, 1]])

Random Tensor:
 tensor([[0.0381, 0.5780],
         [0.3963, 0.0840]])
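To make the inheritance concrete, the following sketch compares the shape and dtype of the derived tensors with those of the source tensor. torch.zeros_like is used here as one more member of the same family; it does not appear in the text above.

```python
import torch

x_data = torch.tensor([[1, 2], [3, 4]])

# Shape and dtype are both copied from x_data.
x_zeros = torch.zeros_like(x_data)
print(x_zeros.shape == x_data.shape, x_zeros.dtype)  # True torch.int64

# The shape is copied, but the dtype is overridden explicitly.
x_rand = torch.rand_like(x_data, dtype=torch.float)
print(x_rand.shape == x_data.shape, x_rand.dtype)    # True torch.float32
```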

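4. From a given shape

shape is a tuple that describes the dimensions of the tensor. The three functions below take shape as an argument and return a tensor with those dimensions.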

shape = (2,3,)
rand_tensor = torch.rand(shape)
ones_tensor = torch.ones(shape)
zeros_tensor = torch.zeros(shape)

print(f"Random Tensor: \n {rand_tensor} \n")
print(f"Ones Tensor: \n {ones_tensor} \n")
print(f"Zeros Tensor: \n {zeros_tensor}")

output:

Random Tensor:
 tensor([[0.0266, 0.0553, 0.9843],
         [0.0398, 0.8964, 0.3457]])

Ones Tensor:
 tensor([[1., 1., 1.],
         [1., 1., 1.]])

Zeros Tensor:
 tensor([[0., 0., 0.],
         [0., 0., 0.]])
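The same pattern extends beyond two dimensions. As a brief sketch (the shape values are arbitrary), a three-element tuple produces a three-dimensional tensor, and the sizes can also be passed as separate arguments instead of a tuple.

```python
import torch

shape = (2, 3, 4)
zeros_3d = torch.zeros(shape)

print(zeros_3d.shape)    # torch.Size([2, 3, 4])
print(zeros_3d.ndim)     # 3
print(zeros_3d.numel())  # 24

# Passing the sizes as separate arguments gives the same shape.
same_shape = torch.zeros(2, 3, 4)
print(same_shape.shape == zeros_3d.shape)  # True
```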

Tensor Attributes

Tensor attributes describe a tensor's shape, its data type, and the device (CPU or GPU) on which it is stored.

tensor = torch.rand(3,4)

print(f"Shape of tensor: {tensor.shape}")
print(f"Datatype of tensor: {tensor.dtype}")
print(f"Device tensor is stored on: {tensor.device}")

output:

Shape of tensor: torch.Size([3, 4])   # shape
Datatype of tensor: torch.float32     # data type
Device tensor is stored on: cpu       # storage device
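Because torch.Size behaves like a regular Python tuple, the shape can be indexed or unpacked directly. The sketch below shows this; the names rows and cols are illustrative.

```python
import torch

tensor = torch.rand(3, 4)

# torch.Size supports indexing and unpacking like a tuple.
rows, cols = tensor.shape
print(rows, cols)           # 3 4
print(tensor.shape[0])      # 3
print(tuple(tensor.shape))  # (3, 4)
```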

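Tensor Operations

The tensor operations in this section can all run on the GPU, which is typically faster than running them on the CPU.

# Check whether a GPU is available, then move the tensor to the GPU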
if torch.cuda.is_available():
  tensor = tensor.to('cuda')
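A common variation, sketched here as one possible pattern rather than a required one, is to pick the device once and reuse it, so the same script runs with or without a GPU. Note that .to() returns a new tensor instead of moving the original in place, which is why the result is assigned back.

```python
import torch

# Select the device once; fall back to the CPU when no GPU is present.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

tensor = torch.rand(3, 4)
tensor = tensor.to(device)  # .to() returns a tensor on the target device
print(tensor.device.type == device.type)  # True

# Tensors can also be created directly on the chosen device.
on_device = torch.ones(3, 4, device=device)
print(on_device.device.type == device.type)  # True
```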

1. Indexing and slicing

tensor = torch.ones(4, 4)
tensor[:,1] = 0            # set every value in column 1 (zero-based) to 0
print(tensor)

output:

tensor([[1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.]])
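A few more indexing forms follow the same NumPy-style rules. The sketch below uses a tensor with distinct values (built with torch.arange, which is not used elsewhere in this article) so that each selection is easy to verify.

```python
import torch

# A 4x4 tensor holding 0..15 so every element is distinguishable.
t = torch.arange(16).reshape(4, 4)

print(t[0])         # first row:   tensor([0, 1, 2, 3])
print(t[:, 0])      # first column: tensor([ 0,  4,  8, 12])
print(t[..., -1])   # last column:  tensor([ 3,  7, 11, 15])
print(t[1:3, 1:3])  # inner 2x2 block: rows 1-2, columns 1-2
```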

2. Joining tensors

torch.cat concatenates a sequence of tensors along a given dimension.

t1 = torch.cat([tensor, tensor, tensor], dim=1)   # dim=1 concatenates horizontally
t2 = torch.cat([tensor, tensor, tensor], dim=0)   # dim=0 concatenates vertically
print(t1)
print(t2)

output:

tensor([[1., 0., 1., 1., 1., 0., 1., 1., 1., 0., 1., 1.],
        [1., 0., 1., 1., 1., 0., 1., 1., 1., 0., 1., 1.],
        [1., 0., 1., 1., 1., 0., 1., 1., 1., 0., 1., 1.],
        [1., 0., 1., 1., 1., 0., 1., 1., 1., 0., 1., 1.]])
        
tensor([[1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.]])
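Checking the resulting shapes is a quick way to confirm which dimension grew. The sketch below repeats the concatenation and also joins two tensors of different widths (a and b are illustrative names); only the concatenated dimension may differ between the inputs.

```python
import torch

tensor = torch.ones(4, 4)

t1 = torch.cat([tensor, tensor, tensor], dim=1)
t2 = torch.cat([tensor, tensor, tensor], dim=0)
print(t1.shape)  # torch.Size([4, 12]): columns grew
print(t2.shape)  # torch.Size([12, 4]): rows grew

# Inputs may differ along the concatenated dimension only.
a = torch.ones(4, 4)
b = torch.zeros(4, 2)
print(torch.cat([a, b], dim=1).shape)  # torch.Size([4, 6])
```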

3. Element-wise product and matrix multiplication

Element-wise product

# Element-wise product
print(f"tensor.mul(tensor): \n {tensor.mul(tensor)} \n")

# Equivalent syntax:
print(f"tensor * tensor: \n {tensor * tensor}")

output:

tensor.mul(tensor):
 tensor([[1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.]])

tensor * tensor:
 tensor([[1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.]])

Matrix multiplication between two tensors

print(f"tensor.matmul(tensor.T): \n {tensor.matmul(tensor.T)} \n")
# Equivalent syntax:
print(f"tensor @ tensor.T: \n {tensor @ tensor.T}")

output:

tensor.matmul(tensor.T):
 tensor([[3., 3., 3., 3.],
        [3., 3., 3., 3.],
        [3., 3., 3., 3.],
        [3., 3., 3., 3.]])

tensor @ tensor.T:
 tensor([[3., 3., 3., 3.],
        [3., 3., 3., 3.],
        [3., 3., 3., 3.],
        [3., 3., 3., 3.]])
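With a tensor of ones and zeros, the two kinds of product are hard to tell apart. As a worked example with distinct values (the matrix a is chosen for illustration), the sketch below shows that * multiplies matching elements while @ performs row-by-column matrix multiplication.

```python
import torch

a = torch.tensor([[1., 2.],
                  [3., 4.]])

# Element-wise: each entry is multiplied by the entry in the same position.
elementwise = a * a
print(elementwise)   # [[1., 4.], [9., 16.]]

# Matrix product: rows of the left operand times columns of the right one.
matrix_product = a @ a
print(matrix_product)  # [[7., 10.], [15., 22.]]

# Multiplying by the transpose, as in the example above.
with_transpose = a @ a.T
print(with_transpose)  # [[5., 11.], [11., 25.]]
```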

4. In-place operations

In-place operations are usually denoted by a _ suffix on the method name. For example, x.copy_(y) and x.t_() change the value of x.

print(tensor, "\n")
tensor.add_(5)
print(tensor)

output:

tensor([[1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.],
        [1., 0., 1., 1.]])

tensor([[6., 5., 6., 6.],
        [6., 5., 6., 6.],
        [6., 5., 6., 6.],
        [6., 5., 6., 6.]])

In-place operations save memory, but they can cause problems when computing derivatives because the intermediate values are lost.
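The difference between the two forms, and one way the autograd problem shows up, can be sketched as follows. The variable names are illustrative, and the second half relies on PyTorch rejecting in-place changes to a leaf tensor that requires gradients, which it reports as a RuntimeError.

```python
import torch

x = torch.ones(2, 2)

# Out-of-place: returns a new tensor and leaves x unchanged.
y = x.add(5)
print(x.sum().item(), y.sum().item())  # 4.0 24.0

# In-place: modifies x itself.
x.add_(5)
print(x.sum().item())  # 24.0

# Autograd refuses in-place changes to a leaf tensor that requires gradients.
w = torch.ones(2, requires_grad=True)
try:
    w.add_(1)
    in_place_rejected = False
except RuntimeError:
    in_place_rejected = True
print(in_place_rejected)  # True
```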

Converting Between Tensors and NumPy

Tensors on the CPU and NumPy arrays can share the same underlying memory, so changing one also changes the other.

1. Tensor to NumPy array

t = torch.ones(5)
print(f"t: {t}")
n = t.numpy()
print(f"n: {n}")

output:

t: tensor([1., 1., 1., 1., 1.])
n: [1. 1. 1. 1. 1.]

Modifying the tensor also changes the NumPy array.

t.add_(1)
print(f"t: {t}")
print(f"n: {n}")

output:

t: tensor([2., 2., 2., 2., 2.])
n: [2. 2. 2. 2. 2.]
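When the shared memory is not wanted, one option is to copy the tensor before converting it. The sketch below contrasts a shared array with an independent one made through clone(); the names shared and independent are illustrative.

```python
import torch

t = torch.ones(5)

shared = t.numpy()               # shares memory with t
independent = t.clone().numpy()  # backed by a separate copy

t.add_(1)
print(shared)       # [2. 2. 2. 2. 2.]
print(independent)  # [1. 1. 1. 1. 1.]
```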

2. NumPy array to tensor

n = np.ones(5)
t = torch.from_numpy(n)
print(f"t: {t}")
print(f"n: {n}")

output:

t: tensor([1., 1., 1., 1., 1.], dtype=torch.float64)
n: [1. 1. 1. 1. 1.]

Modifying the NumPy array also changes the tensor.

np.add(n, 1, out=n)
print(f"t: {t}")
print(f"n: {n}")

output:

t: tensor([2., 2., 2., 2., 2.], dtype=torch.float64)
n: [2. 2. 2. 2. 2.]
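The sharing comes from torch.from_numpy specifically. As a short comparison (names are illustrative), torch.tensor copies the array's data instead, so later changes to the array do not reach the copy.

```python
import numpy as np
import torch

n = np.ones(5)

shared = torch.from_numpy(n)  # shares memory with n
copied = torch.tensor(n)      # copies the data

np.add(n, 1, out=n)
print(shared)  # tensor([2., 2., 2., 2., 2.], dtype=torch.float64)
print(copied)  # tensor([1., 1., 1., 1., 1.], dtype=torch.float64)
```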
