shard_tensor
paddle.distributed.shard_tensor(data, mesh, placements, dtype=None, place=None, stop_gradient=True)
Constructs a paddle.Tensor with distributed attributes from data, which can be a scalar, tuple, list, numpy.ndarray, or paddle.Tensor. If data is already a Tensor, it is transformed into a distributed Tensor.

Parameters
data (scalar|tuple|list|ndarray|Tensor) – Initial data for the tensor. Can be a scalar, list, tuple, numpy.ndarray, or paddle.Tensor.
mesh (paddle.distributed.ProcessMesh) – A ProcessMesh object describing the Cartesian topology of the processes in use.
placements (list[paddle.distributed.Placement]) – The placements describe how to place the tensor on the ProcessMesh. Each entry can be Shard, Replicate, or Partial.
dtype (str|np.dtype, optional) – The desired data type of the returned tensor. Can be ‘bool’, ‘float16’, ‘float32’, ‘float64’, ‘int8’, ‘int16’, ‘int32’, ‘int64’, ‘uint8’, ‘complex64’, or ‘complex128’. Default: None, in which case the dtype is inferred from data, except for Python float numbers, which get their dtype from get_default_dtype.
place (CPUPlace|CUDAPinnedPlace|CUDAPlace|str, optional) – The place on which to allocate the Tensor. Can be CPUPlace, CUDAPinnedPlace, or CUDAPlace. Default: None, meaning the global place. If place is a string, it can be cpu, gpu:x, or gpu_pinned, where x is the index of the GPU.
stop_gradient (bool, optional) – Whether to block gradient propagation in Autograd. Default: True.
Returns
A Tensor constructed from data with distributed attributes.
Return type
Tensor
Examples
>>> import paddle
>>> import paddle.distributed as dist
>>> mesh = dist.ProcessMesh([[2, 4, 5], [0, 1, 3]], dim_names=['x', 'y'])
>>> # dense tensor
>>> a = paddle.to_tensor([[1, 2, 3],
...                       [5, 6, 7]])
>>>
>>> # distributed tensor
>>> d_tensor = dist.shard_tensor(a, mesh, [dist.Shard(0), dist.Shard(1)])
>>> print(d_tensor)
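
As a rough illustration of what the placements in the example above mean, the following plain-Python sketch (no Paddle calls) computes which block of the tensor each process would hold. It assumes that the i-th placement applies to the i-th mesh dimension and that every sharded tensor dimension divides evenly across the mesh. The names local_shard and shard_dims are hypothetical, invented for this sketch, and are not part of the Paddle API. It does not describe how Paddle implements sharding internally.

```python
# Illustration only: plain Python, no Paddle calls.
# Models how Shard(dim) placements could split a 2-D tensor over a 2-D mesh.

def local_shard(data, mesh, shard_dims, rank):
    """Return the block of `data` that process `rank` would hold.

    shard_dims[i] is the tensor dimension split along mesh dimension i,
    or None if the tensor is replicated along that mesh dimension.
    Assumption: each split tensor dimension divides evenly.
    """
    # Find the coordinate of `rank` inside the 2-D mesh.
    coord = next(
        (i, j)
        for i, row in enumerate(mesh)
        for j, r in enumerate(row)
        if r == rank
    )
    mesh_shape = (len(mesh), len(mesh[0]))

    # Row and column indices of `data` still owned by this rank.
    rows = list(range(len(data)))
    cols = list(range(len(data[0])))

    for mesh_dim, tensor_dim in enumerate(shard_dims):
        if tensor_dim is None:  # replicated along this mesh dimension
            continue
        index = rows if tensor_dim == 0 else cols
        chunk = len(index) // mesh_shape[mesh_dim]
        start = coord[mesh_dim] * chunk
        kept = index[start:start + chunk]
        if tensor_dim == 0:
            rows = kept
        else:
            cols = kept

    return [[data[r][c] for c in cols] for r in rows]


# Same mesh layout and values as the example above.
mesh = [[2, 4, 5], [0, 1, 3]]
data = [[1, 2, 3], [5, 6, 7]]

# [0, 1] plays the role of [Shard(0), Shard(1)].
for rank in range(6):
    print(rank, local_shard(data, mesh, [0, 1], rank))
```

Under these assumptions, each of the six processes ends up with a single element of the 2 x 3 tensor. Passing [0, None] instead (the analogue of one Shard(0) entry and one Replicate entry) would give every process in a mesh row the full corresponding tensor row.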