Data Processing with Optimus : Supercharge big data preparation tasks for analytics and machine learning with Optimus using Dask and PySpark
(Reklamlänk)
Optimus is a Python library that works as a unified API for data cleaning, processing, and merging data. It can be used for handling small and big data on your local laptop or on remote clusters using CPUs or GPUs. The book begins by covering the internals of Optimus and how it works in tandem with