Last week we hosted the July 2022 “Data Science, Machine Learning and Kubeflow” Meetup. In this blog post we’ll recap some highlights from the Meetup and preview what’s next. Ok, let’s dig in.
Join a Meetup near you
First, if you missed last week’s Meetup, there’s no need to suffer from FOMO. Here’s a list of the Meetups that are part of the “Data Science, Machine Learning and Kubeflow” Meetup network. Please join the one that is the most time friendly to your location.
Get involved in the Kubeflow community
- Join Kubeflow Community Slack
- Are you interested in speaking at a future Meetup?
- Is your company interested in sponsoring a Meetup?
- Would you like to be a co-organizer of a local Meetup?
If you answered yes to any of the above, Send one of the organizers/hosts a message on Meetup.com.
First, thanks for voting for your favorite charity!
With the unprecedented circumstances facing our global community, Arrikto is looking for even more ways to contribute. With this in mind, we thought that in lieu of swag we could give workshop attendees the opportunity to vote for their favorite charity and help guide our monthly donation to charitable causes. The charity that won this workshop’s voting was the International Committee of the Red Cross (ICRC). Since 1863, the mission of the International Committee of the Red Cross (ICRC) has been to protect and assist victims of armed conflict and promote understanding and respect for international humanitarian law. We are pleased to be making a donation of $100 to them on behalf of the workshop attendees. Again, thanks to all of you who attended and voted!
Talk #1: Jet Energy Corrections with GNN Regression using Kubeflow at CERN
The Large Hadron Collider is the world’s largest particle accelerator measuring 27 km in circumference. It accelerates beams of particles in opposite directions almost to the speed of light before making them collide. The particles emerging from the collisions are then measured in large detectors such as the Compact Muon Solenoid. An especially important object of study are so-called jets composed of multiple particles shooting out in the same direction from the collision point. Data-driven methods are used to correct the energy values for these jets, and what we’ll present here is the utilization of Kubeflow to enable state-of-the-art graph neural network based corrections. Kubeflow’s pipeline component allows us to define our machine learning workflow in a well-structured and reproducible manner, and its built-in training operators are used to scale up the training with ease. This work is expected to pave the way for future adoption of Kubeflow among the physics community at CERN.
Daniel Holberg is a technical student at CERN investigating deep learning applications for the CMS experiment. Dejan Golubović is a CERN software engineer with experience in machine learning.
Talk #2: FiftyOne – the open-source tool for building high-quality datasets and computer vision models
Nothing hinders the success of machine learning systems more than poor quality data. And without the right tools, improving a model can be time-consuming and inefficient.
In this talk, we’ll do a brief overview and technical demo of FiftyOne, an emerging open source tool that provides the building blocks for dataset analysis and integrates with complementary solutions across the ML stack to solve the biggest existential threat to any ML project: dataset quality. Tens of thousands of engineers and scientists use FiftyOne everyday to get hands-on with their data, visualize complex labels, evaluate models, explore scenarios, identify failure modes, improve annotations, and much more.
Brian Moore is CTO/Co-founder of Voxel51, an AI software company that enables machine learning and computer vision scientists to rapidly curate and experiment with their datasets in order to build higher performing machine learning systems.
Lightning Talks
There was also one short lightning talk at the Meetup worth checking out.
- A 10 Minute Introduction to Kubeflow: Basics, Architecture & Components – Jimmy Guerrero, VP Developer Relations (Arrikto)
Upcoming August 2022 Meetup
Details about the talks are coming soon! Sign up for the Aug 4 MLOps Meetup to attend more interesting talks of the Kubeflow community.
Ready to get started with Kubeflow?
Arrikto’s Kubeflow as a Service is the easiest way to get deployed and have a pipeline running in under 5 minutes. Comes with a 14-day free trial with no credit card required. Click to get started.
FREE Kubeflow courses and certifications
We are excited to announce the first of several free instructor-led and on-demand Kubeflow courses! The “Introduction to Kubeflow” series of courses will start with the fundamentals, then go on to deeper dives of various Kubeflow components. Each course will be delivered over Zoom with the opportunity to earn a certificate upon successful completion of an exam. Visit us to learn more.
We hope to see you at a future Meetup!