ray.data.aggregate.ApproximateTopK#

class ray.data.aggregate.ApproximateTopK(on: str, k: int, log_capacity: int = 15, alias_name: str | None = None)[source]#

Bases: AggregateFnV2

PublicAPI (alpha): This API is in alpha and may change before becoming stable.

Methods

__init__

Computes the approximate top k items in a column by using a datasketches frequent_strings_sketch.