flox: fast & furious GroupBy reductions for dask.array
Overview
flox mainly provides strategies for fast GroupBy reductions with dask.array. flox uses the MapReduce paradigm (also called a “tree reduction”)
to run the GroupBy operation in a parallel-native way, avoiding a sort or shuffle operation entirely. It was motivated by
See a presentation (video, slides) about this package, from the Pangeo Showcase.
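To make the MapReduce (tree reduction) idea from the overview concrete, here is a minimal pure-Python sketch. It is not flox's implementation: the helper names `chunk_sums`, `combine`, and `tree_reduce`, the sample data, and the chunk size are all hypothetical and chosen only for illustration. Each chunk is reduced to per-group partial sums independently, and the partial results are then merged pairwise, so no sort or shuffle of the full dataset is needed.

```python
from collections import defaultdict


def chunk_sums(values, labels):
    """Map step: reduce one chunk to per-group partial sums."""
    partial = defaultdict(float)
    for value, label in zip(values, labels):
        partial[label] += value
    return dict(partial)


def combine(left, right):
    """Combine step: merge two dictionaries of partial sums."""
    merged = dict(left)
    for label, subtotal in right.items():
        merged[label] = merged.get(label, 0.0) + subtotal
    return merged


def tree_reduce(partials):
    """Merge partial results pairwise until a single result remains."""
    while len(partials) > 1:
        partials = [
            combine(partials[i], partials[i + 1])
            if i + 1 < len(partials)
            else partials[i]
            for i in range(0, len(partials), 2)
        ]
    return partials[0]


# Hypothetical sample data, split into chunks of two elements each.
values = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
labels = ["a", "b", "a", "b", "c", "c"]
chunk_size = 2

chunks = [
    (values[i:i + chunk_size], labels[i:i + chunk_size])
    for i in range(0, len(values), chunk_size)
]
partials = [chunk_sums(v, l) for v, l in chunks]  # independent per chunk
totals = tree_reduce(partials)
print(totals)
```

Because the map step only ever touches one chunk, it can run in parallel across chunks; only the small per-group partial results are passed between steps.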
Why flox?
flox.groupby_reduce() wraps the numpy-groupies package for performant GroupBy reductions on nD arrays.
flox.groupby_reduce() provides parallel-friendly strategies for GroupBy reductions by wrapping numpy-groupies for dask arrays.
flox integrates with xarray to provide more performant GroupBy and Resampling operations.