Apache MADlib Tutorial
Introduction
Apache MADlib is an open-source library for scalable in-database analytics. It provides data-parallel implementations of mathematical, statistical and machine learning algorithms.
Apache MADlib is primarily for data scientists (that is you) who are working with very large datasets. MADlib really shines when datasets are really large and need MPP (Massively Parallel Processing) Architecture in which to operate them.
MAD in MADlib mean : Magnetic Agile & Deep
