ConcatDataset¶
- class paddle.io. ConcatDataset ( datasets: Iterable[paddle.io.dataloader.dataset.Dataset] ) [source]
-
Dataset as a concatenation of multiple datasets.
This class is useful to assemble different existing datasets.
- Parameters
-
datasets (sequence) – List of datasets to be concatenated
- Returns
-
A Dataset which concatenated by multiple datasets.
- Return type
-
Dataset
Examples
>>> import numpy as np >>> import paddle >>> from paddle.io import Dataset, ConcatDataset >>> # define a random dataset >>> class RandomDataset(Dataset): ... def __init__(self, num_samples): ... self.num_samples = num_samples ... ... def __getitem__(self, idx): ... image = np.random.random([32]).astype('float32') ... label = np.random.randint(0, 9, (1, )).astype('int64') ... return image, label ... ... def __len__(self): ... return self.num_samples ... >>> dataset = ConcatDataset([RandomDataset(10), RandomDataset(10)]) >>> for i in range(len(dataset)): ... image, label = dataset[i] ... # do something