DatasetPipeline API
Constructor
|
Implements a pipeline of Datasets. |
Basic Transformations
|
Apply |
|
Apply |
|
Apply |
Apply a transform to each dataset/window in this pipeline. |
|
|
Apply |
|
Apply |
|
Apply |
|
Apply |
Sorting, Shuffling, Repartitioning
|
Apply |
Apply |
|
Apply |
|
Apply |
Splitting DatasetPipelines
|
Split the pipeline into |
|
Split the datasets within the pipeline at the given indices (like np.split). |
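
The "like np.split" behavior described above can be illustrated without Ray. The following is a minimal sketch, not the library's implementation: `split_at_indices` here is a hypothetical stand-alone helper that cuts one window's rows (modeled as a plain list) at the given indices. Applying it to every window in turn would, under that assumption, yield one output stream per segment.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")


def split_at_indices(rows: Sequence[T], indices: Sequence[int]) -> List[List[T]]:
    """Hypothetical helper: cut `rows` at each index, as np.split(rows, indices) would."""
    if any(i < 0 for i in indices):
        raise ValueError("indices must be non-negative")
    if list(indices) != sorted(indices):
        raise ValueError("indices must be sorted in ascending order")
    # N indices produce N + 1 segments: [0, i0), [i0, i1), ..., [iN-1, len(rows)).
    bounds = [0, *indices, len(rows)]
    return [list(rows[start:end]) for start, end in zip(bounds, bounds[1:])]


parts = split_at_indices(list(range(8)), [2, 5])
print(parts)  # [[0, 1], [2, 3, 4], [5, 6, 7]]
```

An index past the end of the rows simply produces an empty trailing segment, which matches how np.split treats out-of-range indices.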
Creating DatasetPipelines
|
Repeat this pipeline a given number of times, or indefinitely. |
|
Change the windowing (blocks per dataset) of this pipeline. |
|
Create a pipeline from a sequence of Dataset-producing functions. |
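
To make the repeat and re-windowing entries above concrete, here is a rough pure-Python sketch. It assumes a pipeline can be modeled as an iterable of windows, where each window is a list of blocks. The names `repeat_windows` and `rewindow` are hypothetical and are not the library's API; the real implementation may differ, for example in how it handles laziness.

```python
from itertools import chain, islice
from typing import Iterable, Iterator, List, Optional


def repeat_windows(windows: Iterable[List[str]], times: Optional[int] = None) -> Iterator[List[str]]:
    """Yield every window `times` times over, or forever when `times` is None."""
    cached = list(windows)  # materialize once so the windows can be replayed
    if not cached:
        return
    if times is None:
        while True:
            yield from cached
    else:
        for _ in range(times):
            yield from cached


def rewindow(windows: Iterable[List[str]], blocks_per_window: int) -> Iterator[List[str]]:
    """Regroup the same blocks into windows of `blocks_per_window` blocks each."""
    if blocks_per_window < 1:
        raise ValueError("blocks_per_window must be at least 1")
    blocks = chain.from_iterable(windows)  # flatten the old windowing
    while True:
        window = list(islice(blocks, blocks_per_window))
        if not window:
            return
        yield window


windows = [["b0", "b1"], ["b2", "b3"]]
print(list(rewindow(windows, 3)))             # [['b0', 'b1', 'b2'], ['b3']]
print(len(list(repeat_windows(windows, 2))))  # 4
```

In this sketch the last window after re-windowing may be smaller than the requested size, since the blocks are regrouped rather than split.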
Consuming DatasetPipelines
|
Call |
|
Print up to the given number of records from each window/dataset. |
|
Call |
|
Call |
Return a |
|
|
Return a local row iterator over the data in the pipeline. |
|
Return a local batched iterator over the data in the pipeline. |
|
Call |
|
Call |
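
The local row and batched iterators listed above can be pictured with a small sketch. It assumes each window is an in-memory list of rows, and that batches do not span window boundaries; the second point is an assumption made for simplicity, not a statement about the library. The function names are illustrative stand-ins.

```python
from typing import Iterable, Iterator, List


def iter_rows(windows: Iterable[List[int]]) -> Iterator[int]:
    """Yield rows one at a time, window after window."""
    for window in windows:
        yield from window


def iter_batches(windows: Iterable[List[int]], batch_size: int) -> Iterator[List[int]]:
    """Yield lists of up to `batch_size` rows, restarting at each window."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for window in windows:
        for start in range(0, len(window), batch_size):
            yield window[start:start + batch_size]


windows = [[1, 2, 3], [4, 5]]
print(list(iter_rows(windows)))        # [1, 2, 3, 4, 5]
print(list(iter_batches(windows, 2)))  # [[1, 2], [3], [4, 5]]
```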
I/O and Conversion
|
Call |
|
Call |
|
Call |
|
Call |
|
Call |
|
Call |
Inspecting Metadata
|
Return the schema of the dataset pipeline. |
Count the number of records in the dataset pipeline. |
|
|
Returns a string containing execution timing information. |
Sum the records in the dataset pipeline. |