New features include:
Deep learning - New grid and random search methods.
Deep learning - AutoML methods Hyperband and Hyperopt.
Deep learning - Custom loss functions and custom metrics.
Deep learning - TensorBoard support.
Deep learning - Multi-input and output support for fit and evaluate.
DBSCAN - Density based clustering (phase 1).
Improvements:
Deep learning - Implement cache logic to speed performance.
Deep learning - Reduce GPU idle time when moving model state between workers.
Deep learning - Use Keras version from TensorFlow.
Deep learning - Add top n to evaluate.
Graph - Support BIGINT for all graph methods.
Infra - Switch to CloudBees (was Jenkins).
You are invited to download the 1.18.0 release and review the release notes. Also please refer to the list of supported databases and OS.
New features include:
Deep learning - Model selection framework for Keras with Tensorflow backend with GPU acceleration, for model architecture search and hyperparameter optimization.
Deep learning - Support for heterogeneous clusters where GPUs are attached to only certain segment hosts.
Deep learning - Support inference for imported models not trained in MADlib ("bring your own model").
Deep learning - Support transfer learning for multiple model fit function.
Deep learning - Generate model selection table for grid search or random search.
Deep learning - Helper function to get GPU type and configuration in a database cluster.
k-Means clustering - Select optimal number of centroids using elbow or silhouette methods.
PostgreSQL 12 support.
Improvements:
Association rules - Add option to set number of posterior rules.
Correlation and covariance - Improve memory usage with large number of groups.
Deep learning - Improve performance of mini-batch preprocessor and fit functions.
Docs - Inprove installation guide on wiki.
Graph - SSSP should not show vertices in output table that are unreachable.
LDA - Add stopping criteria on perplexity.
You are invited to download the 1.17.0 release and review the release notes. For more details about the new deep learning feature, please refer to the Apache MADlib deep learning notes and the Jupyter notebook examples.
New features include:
Deep learning - Early stage support for Keras with Tensorflow backend with GPU acceleration. Focus on image classification use cases.
Deep learning utilities - Load model architectures and weights, parallel loading of images from NumPy arrays or file system, preprocess images for gradient descent optimization algorithms.
Greenplum 6 support.
PostgreSQL 11 support.
Improvements:
K-nearest neighbors - Improve performance with kd-tree approximate method.
Association rules - Set default maximum itemset rules to 10 to reduce runtime.
You are invited to download the 1.16 release and review the release notes. For more details about the new deep learning feature, please refer to the Apache MADlib deep learning notes and the Jupyter notebook examples.
MADlib entered incubation in the fall of 2015 and made five releases as an incubating project. Along the way, the MADlib community has worked hard to ensure that the project is being developed according to the principles of the The Apache Way. We will continue to do so in the future as a TLP, to the best of our ability.
Thank you to all who have contributed to the project so far, and we look forward more innovation in machine learning in the future as a TLP!
Downloads for Apache MADlib releases. This also includes links to pre-Apache MADlib releases.