Data Representations#

Block API#

block.Block

The central part of the internal API.

block.BlockExecStats()

Execution stats for this block.

block.BlockMetadata(num_rows, size_bytes, ...)

Metadata about the block.

block.BlockAccessor(*args, **kwds)

Provides accessor methods for a specific block.
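As a rough illustration of the accessor pattern, the sketch below wraps a block and derives metadata from it. It does not use the library's classes: `ListBlockAccessor` and `SimpleBlockMetadata` are hypothetical stand-ins, the block is assumed to be a plain list of dict rows, and only the `num_rows` and `size_bytes` field names are taken from the signature listed above.

```python
import sys
from dataclasses import dataclass
from typing import Any, Dict, List

# Hypothetical stand-in for a block: a list of dict rows.
Rows = List[Dict[str, Any]]


@dataclass
class SimpleBlockMetadata:
    """Hypothetical metadata record; field names mirror the signature above."""

    num_rows: int
    size_bytes: int


class ListBlockAccessor:
    """Hypothetical accessor exposing uniform methods over one block."""

    def __init__(self, block: Rows) -> None:
        self._block = block

    def num_rows(self) -> int:
        return len(self._block)

    def size_bytes(self) -> int:
        # Shallow estimate: the list plus each row dict, not the values inside.
        return sys.getsizeof(self._block) + sum(
            sys.getsizeof(row) for row in self._block
        )

    def get_metadata(self) -> SimpleBlockMetadata:
        return SimpleBlockMetadata(self.num_rows(), self.size_bytes())


block = [{"id": 0, "label": "a"}, {"id": 1, "label": "b"}]
meta = ListBlockAccessor(block).get_metadata()
print(meta.num_rows)  # 2
```

The point of the pattern is that callers ask the accessor for row counts or sizes and never inspect the underlying block format directly.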

Batch API#

block.DataBatch

The central part of the internal API.

Row API#

row.TableRow(row)

A dict-like row of a tabular Dataset.
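To make "dict-like" concrete, here is a minimal sketch of a read-only row view built on `collections.abc.Mapping`. `DictLikeRow` is a hypothetical class for illustration only and is not the implementation of `TableRow`; it assumes the row data is already available as a plain dict.

```python
from collections.abc import Mapping
from typing import Any, Dict, Iterator


class DictLikeRow(Mapping):
    """Hypothetical read-only row view; not the library's TableRow."""

    def __init__(self, row: Dict[str, Any]) -> None:
        self._row = row

    def __getitem__(self, key: str) -> Any:
        return self._row[key]

    def __iter__(self) -> Iterator[str]:
        # Iterating a mapping yields its keys (here, column names).
        return iter(self._row)

    def __len__(self) -> int:
        return len(self._row)


row = DictLikeRow({"id": 7, "name": "x"})
print(row["id"], list(row.keys()), dict(row))
```

Implementing only `__getitem__`, `__iter__`, and `__len__` is enough for `Mapping` to supply `keys()`, `items()`, `get()`, and `in` checks, which is what lets a row be used wherever a read-only dict is expected.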

Tensor Column Extension API#

extensions.tensor_extension.create_ragged_ndarray(values)

Create an array that contains arrays of different lengths.
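A ragged array of this kind can be sketched in plain NumPy as a one-dimensional array with `dtype=object`. The helper `make_ragged` below is a hypothetical name used for illustration; it shows the general technique and is not claimed to match the library function's implementation.

```python
import numpy as np


def make_ragged(values):
    """Hypothetical helper: pack arrays of differing lengths into one object array."""
    # Allocate first, then assign element by element, so NumPy never tries
    # to broadcast the inputs into a regular multi-dimensional array.
    out = np.empty(len(values), dtype=object)
    for i, v in enumerate(values):
        out[i] = np.asarray(v)
    return out


ragged = make_ragged([[1, 2, 3], [4, 5]])
print(ragged.shape, ragged.dtype)  # (2,) object
```

Each element keeps its own shape, so `ragged[0]` has three entries and `ragged[1]` has two.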

extensions.tensor_extension.TensorDtype(...)

Pandas extension type for a column of homogeneous-typed tensors.

extensions.tensor_extension.TensorArray(values)

Pandas ExtensionArray representing a tensor column, i.e., a column whose elements are ndarrays.

extensions.tensor_extension.ArrowTensorType(...)

Arrow ExtensionType for an array of fixed-shape, homogeneous-typed tensors.

extensions.tensor_extension.ArrowTensorArray

An array of fixed-shape, homogeneous-typed tensors.

extensions.tensor_extension.ArrowVariableShapedTensorType(...)

Arrow ExtensionType for an array of heterogeneous-shaped, homogeneous-typed tensors.

extensions.tensor_extension.ArrowVariableShapedTensorArray

An array of heterogeneous-shaped, homogeneous-typed tensors.
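The difference between the fixed-shape and variable-shaped tensor types can be illustrated with NumPy alone. In the sketch below, `pack_tensors` is a hypothetical helper, not part of the extension API: it stacks tensors into one contiguous array when every element has the same shape, and otherwise falls back to an object array that holds each tensor separately.

```python
import numpy as np


def pack_tensors(tensors):
    """Hypothetical helper: stack same-shape tensors, else use an object array."""
    arrays = [np.asarray(t) for t in tensors]
    shapes = {a.shape for a in arrays}
    if len(shapes) == 1:
        # Fixed shape: one contiguous array of shape (N, *element_shape).
        return np.stack(arrays)
    # Variable shape: a 1-D object array holding each tensor separately.
    out = np.empty(len(arrays), dtype=object)
    for i, a in enumerate(arrays):
        out[i] = a
    return out


fixed = pack_tensors([np.zeros((2, 2)), np.ones((2, 2))])
variable = pack_tensors([np.zeros((2, 2)), np.ones((3, 1))])
print(fixed.shape, variable.shape)  # (2, 2, 2) (2,)
```

In both cases the element dtype is the same across tensors; only the per-element shape is allowed to vary in the second case.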