ray.data.Dataset.with_columns

Dataset.with_columns(exprs: Dict[str, Expr]) → Dataset

Add new columns to the dataset, with each column's values defined by an expression.

Examples

>>> import ray
>>> from ray.data.expressions import col
>>> ds = ray.data.range(100)
>>> ds.with_columns({"new_id": col("id") * 2, "new_id_2": col("id") * 3}).schema()
Column    Type
------    ----
id        int64
new_id    int64
new_id_2  int64

Parameters:

exprs – A dictionary mapping column names to expressions that define the new column values.

Returns:

A new dataset containing the existing columns plus the added columns, each evaluated from its expression.
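To make the shape of the exprs argument and the returned result concrete, here is a minimal pure-Python sketch of the semantics described above. It is a toy model, not Ray's implementation: the name with_columns_sketch and the RowExpr alias are hypothetical, a plain callable stands in for an Expr, and a list of dicts stands in for a Dataset. The sketch also assumes each expression sees only the original columns, which this page does not confirm for Ray.

```python
# Toy model of the with_columns semantics; NOT Ray's implementation.
from typing import Any, Callable, Dict, List

Row = Dict[str, Any]
# Hypothetical stand-in for an Expr: a function from a row to a value.
RowExpr = Callable[[Row], Any]


def with_columns_sketch(rows: List[Row], exprs: Dict[str, RowExpr]) -> List[Row]:
    """Return new rows with the original columns plus one column per entry in exprs."""
    result = []
    for row in rows:
        new_row = dict(row)  # copy, so the input rows are left untouched
        for name, expr in exprs.items():
            # Assumption of this sketch: expressions are evaluated
            # against the original row only.
            new_row[name] = expr(row)
        result.append(new_row)
    return result


rows = [{"id": i} for i in range(3)]
out = with_columns_sketch(
    rows,
    {"new_id": lambda r: r["id"] * 2, "new_id_2": lambda r: r["id"] * 3},
)
print(out[2])  # {'id': 2, 'new_id': 4, 'new_id_2': 6}
```

As in the schema shown in the example above, the result keeps the id column and gains new_id and new_id_2, while the input rows are not modified.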

PublicAPI (alpha): This API is in alpha and may change before becoming stable.