How Snorkel Flow users can register custom models to Databricks

January 9, 2024

Snorkel AI is thrilled to announce our partnership with Databricks and seamless end-to-end integration across the Databricks Data Intelligence Platform. 

This integration gives Snorkel Flow users access to data within Databricks with just a few clicks, and it streamlines the registration of custom, use-case-specific models to the Databricks Workspace Model Registry.

The synergy between Snorkel and Databricks enables data scientists to navigate their entire machine learning pipeline—from data access to model deployment—all within Snorkel Flow. 

Closing the loop with end-to-end integration across the Databricks platform

Snorkel Flow integrates seamlessly into existing enterprise workflows. Snorkel offers a full suite of third-party data connectors, making data stored in popular cloud repositories like Databricks quickly and easily accessible for data-centric AI development with Snorkel Flow. 

The new Databricks Model Registry integration lets Snorkel Flow users automatically register custom, use-case-specific models trained in Snorkel Flow to the Databricks platform, which provides a unified service for deploying, governing, querying, and monitoring models.

Data-centric AI development with Snorkel Flow

One of the most painstaking and time-consuming issues with developing AI applications is the process of curating and labeling unstructured data. Snorkel AI eases this bottleneck with the Snorkel Flow AI data development platform.

Data science and machine learning teams use Snorkel Flow to intelligently capture knowledge from various sources—such as previously labeled data (even when imperfect), heuristics from subject matter experts, business logic, and even the latest foundation models and large language models—and then scale this knowledge to label large quantities of data.
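To make the idea of scaling heuristics into labels concrete, here is a minimal sketch in plain Python. It is not Snorkel Flow's API: the labeling functions, the label names, and the simple majority vote are all assumptions chosen for illustration, and the majority vote stands in for whatever aggregation the platform actually performs, which this article does not describe.

```python
from collections import Counter
from typing import Callable, List, Optional

# None means "this heuristic has no opinion about the example".
ABSTAIN = None
LabelingFunction = Callable[[str], Optional[str]]


# Hypothetical heuristics a subject matter expert might write.
def lf_refund_keyword(text: str) -> Optional[str]:
    return "complaint" if "refund" in text.lower() else ABSTAIN


def lf_broken_keyword(text: str) -> Optional[str]:
    return "complaint" if "broken" in text.lower() else ABSTAIN


def lf_thanks_keyword(text: str) -> Optional[str]:
    return "praise" if "thank" in text.lower() else ABSTAIN


def majority_vote(text: str, lfs: List[LabelingFunction]) -> Optional[str]:
    """Combine heuristic votes; abstain when nothing fires or the top votes tie."""
    votes = [v for lf in lfs if (v := lf(text)) is not ABSTAIN]
    if not votes:
        return ABSTAIN
    ranked = Counter(votes).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return ABSTAIN
    return ranked[0][0]


LFS = [lf_refund_keyword, lf_broken_keyword, lf_thanks_keyword]

if __name__ == "__main__":
    for doc in ["I want a refund, the item arrived broken", "Thank you so much!"]:
        print(doc, "->", majority_vote(doc, LFS))
```

Each heuristic is cheap to write and can be applied to every unlabeled document, which is what lets a handful of rules label a large dataset; examples where the heuristics disagree or abstain are natural candidates for error analysis.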

As users integrate more sources of knowledge, the platform enables them to rapidly improve training data quality and model performance using integrated error analysis tools. Once they have completed the data labeling process, Snorkel Flow users can apply their labeled data to train predictive models or filter data for generative AI applications.

Snorkel Flow + Databricks Model Registry

Snorkel further streamlines the machine learning development process for organizations that rely on Databricks through a native integration with Databricks Model Registry built directly into the platform. After training, adapting, or distilling a model using the Snorkel Flow data development platform, users can register their custom, use-case-specific models to the Databricks Workspace Model Registry with just a few clicks.

Here’s how it works:

  1. Register a new model registry in Snorkel Flow using your Databricks workspace and access token.
  2. Fill out the experiment name in the format /Users/<your-username>/<experiment_name>, where <your-username> should be your Databricks username.
  3. Upon clicking the “Deploy” button, Snorkel Flow registers a model to your Databricks Workspace Model Registry.
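The experiment name format in step 2 is easy to get wrong, so a small helper for assembling and checking it before pasting it into the form can be useful. The sketch below is illustrative only: the function names are hypothetical, and the character rules in the pattern are an assumption rather than Databricks' official validation.

```python
import re
from typing import Tuple

# Assumed pattern for the format quoted in step 2:
#   /Users/<your-username>/<experiment_name>
# The username may not contain slashes or whitespace; the experiment
# name may not contain slashes. These rules are illustrative only.
_EXPERIMENT_PATH = re.compile(r"/Users/(?P<username>[^/\s]+)/(?P<experiment>[^/]+)")


def build_experiment_path(username: str, experiment_name: str) -> str:
    """Assemble an experiment name and reject obviously malformed input."""
    path = f"/Users/{username}/{experiment_name}"
    if _EXPERIMENT_PATH.fullmatch(path) is None:
        raise ValueError(f"not a valid experiment path: {path!r}")
    return path


def parse_experiment_path(path: str) -> Tuple[str, str]:
    """Split an experiment path back into (username, experiment_name)."""
    match = _EXPERIMENT_PATH.fullmatch(path)
    if match is None:
        raise ValueError(f"expected /Users/<your-username>/<experiment_name>, got {path!r}")
    return match.group("username"), match.group("experiment")


if __name__ == "__main__":
    print(build_experiment_path("jane.doe@example.com", "churn_model"))
```

The example username is a placeholder; substitute your own Databricks username.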

Once users register a model to the Databricks Workspace Model Registry, they can deploy it to Databricks Model Serving or use it on a Spark cluster.
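For readers who want to pick up a registered model from code, one possible route is the open-source MLflow client. The sketch below is not part of the Snorkel Flow integration. It assumes the model was registered in a format MLflow can load as a pyfunc model, that the mlflow package is installed, and that Databricks credentials are already configured (for example through the DATABRICKS_HOST and DATABRICKS_TOKEN environment variables). The model name and version are placeholders.

```python
def registered_model_uri(name: str, version: int) -> str:
    """Build an MLflow registry URI of the form models:/<name>/<version>."""
    if not name or version < 1:
        raise ValueError("name must be non-empty and version must be >= 1")
    return f"models:/{name}/{version}"


def load_registered_model(name: str, version: int):
    """Load a model version from the Databricks Workspace Model Registry."""
    # Imported lazily so the URI helper above works without mlflow installed.
    import mlflow.pyfunc

    # "databricks" points MLflow at the workspace tracking server and the
    # Workspace Model Registry (as opposed to Unity Catalog).
    mlflow.set_tracking_uri("databricks")
    mlflow.set_registry_uri("databricks")
    return mlflow.pyfunc.load_model(registered_model_uri(name, version))


if __name__ == "__main__":
    # Placeholder name and version; printing the URI needs no credentials.
    print(registered_model_uri("my_snorkel_model", 1))
```

On a Spark cluster, the same URI can be passed to mlflow.pyfunc.spark_udf to apply the model to a DataFrame column, again assuming the model is pyfunc-compatible.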

In an upcoming release, Snorkel will expand this integration to allow registering a model to the Databricks Unity Catalog.

Learn More

Follow Snorkel AI on LinkedIn, Twitter, and YouTube to be the first to see new posts and videos!

Hiromu Hota
Machine Learning Engineer

Hiromu Hota is a Staff Engineer at Snorkel AI, where he brings extensive expertise in applied machine learning as the Tech Lead Manager and Lead Machine Learning Engineer. Prior to Snorkel AI, he held roles as a Senior Researcher and Researcher at Hitachi, focusing on advanced research and development. Hiromu also serves as a Visiting Scholar at Stanford University’s School of Engineering, where he contributes to academic advancements in computational science and engineering.

With a background that includes software engineering at Hitachi Data Systems and internships, Hiromu holds a Ph.D. in Computational Science and Engineering and a Master of Engineering from Nagoya University, underscoring his deep technical knowledge and academic achievements.

Connect with Hiromu to discuss machine learning, computational science, or collaborative opportunities in applied research and engineering.
