NEW! Now with Kubeflow 1.3!
Create an AutoML workflow with the click of a button. Start from a dataset, define a task, then discover, train, and optimize a model from inside your notebook automagically.
This is a maintenance release of 20210428.0.0
New MiniKF release including:
- Kubeflow 1.3
- Rok 1.2-pre-315-g8029fc3c1
New Features
- Upgrade Rok to the latest pre-release version.
- Upgrade Kubeflow to version 1.3.
- Upgrade NVIDIA drivers to version 460.
- Upgrade Linux Mainline Kernel to version 5.4.104.
- Upgrade Istio to version 1.9.0.
- Brand new TensorBoard UI for logging of training jobs.
- Updated Katib UI for a better hyperparameter tuning experience.
- Updated Volume Manager UI for a better experience when managing Kubernetes PVCs.
- Updated Kubeflow Pipelines UI with the ability to manage recurring runs.
- Updated Kale UI for an enhanced data science experience.
Improvements
- Support fine-grained authorization in Rok with Kubernetes RBAC.
- Remove custom kernel modules, use Arrikto
dm-clone
module from the upstream Linux kernel. - Use SubjectAccessReview to authorize requests performed inside the Kubernetes cluster.
- Improve authentication mechanism in the Rok API and client.
- Use kustomizations for all applied manifests.
- Use upstream NGINX controller instead of Minikube’s addon.
- Improve pre-pulling of Docker images to auto-generate the list of images based on the kustomizations to be applied.
- Support token-based authentication as the sole method of programmatic authentication in the Rok client and CLI.
- Introduce pagination in the Rok UI.
- Feature a new landing page for MiniKF on Vagrant.
- Produce a smaller Vagrant box by excluding non-critical images from the pre-pull list.
Fixed Issues
- Fix a bug that could lead to data corruption when snapshotting a filesystem that needs recovery.
- Fix an issue with the Go runtime which could lead to CSI pods crashing due to max locked memory.
- Fix an issue with Rok becoming ready after an unscheduled node reboot.
- Fix pre-clone verification steps that could lead to errors.
- Fix a KFP API server issue causing pod crashes if the default parameter values are empty.
- Reduce the amount of time that the provisioning script might wait for APT in an air-gapped environment.
- [20210428.0.1] Remove stale stopped containers from the final MiniKF image.
Known issues
- The OpenVaccine Codelab does not currently run end-to-end inside an airgapped environment, due to an issue with the Knative Serving controller.
- Under heavy load Kubernetes might delay starting pods even if Rok has successfully provisioned the underlying storage.