How Snorkel Flow users can register custom models to Databricks

January 9, 2024

Snorkel AI is thrilled to announce our partnership with Databricks and a seamless end-to-end integration with the Databricks Data Intelligence Platform.

This integration gives Snorkel Flow users access to data stored in Databricks in just a few clicks. It also streamlines the registration of custom, use-case-specific models to the Databricks Workspace Model Registry.

The synergy between Snorkel and Databricks enables data scientists to navigate their entire machine learning pipeline—from data access to model deployment—all within Snorkel Flow. 

Closing the loop with end-to-end integration across the Databricks platform

Snorkel Flow integrates seamlessly into existing enterprise workflows. Snorkel offers a full suite of third-party data connectors, making data stored in popular cloud repositories like Databricks quickly and easily accessible for data-centric AI development with Snorkel Flow. 

The new Databricks Model Registry integration lets Snorkel Flow users automatically register custom, use-case-specific models trained in Snorkel Flow to the Databricks platform, which provides a unified service for deploying, governing, querying, and monitoring models.

Data-centric AI development with Snorkel Flow

One of the most painstaking and time-consuming issues with developing AI applications is the process of curating and labeling unstructured data. Snorkel AI eases this bottleneck with the Snorkel Flow AI data development platform.

Data science and machine learning teams use Snorkel Flow to intelligently capture knowledge from various sources—such as previously labeled data (even when imperfect), heuristics from subject matter experts, business logic, and even the latest foundation models and large language models—and then scale this knowledge to label large quantities of data.

As users integrate more sources of knowledge, the platform enables them to rapidly improve training data quality and model performance using integrated error analysis tools. Once they have completed the data labeling process, Snorkel Flow users can apply their labeled data to train predictive models or filter data for generative AI applications.

Snorkel Flow + Databricks Model Registry

Snorkel further streamlines the machine learning development process for organizations that rely on Databricks through a native integration with the Databricks Model Registry, built directly into the platform. After training, adapting, or distilling a model in Snorkel Flow, users can register their custom, use-case-specific models to the Databricks Workspace Model Registry with just a few clicks.

Here’s how it works:

  1. Register a new model registry in Snorkel Flow by providing your Databricks workspace and access token.
  2. Enter the experiment name in the format /Users/<your-username>/<experiment_name>, where <your-username> is your Databricks username.
  3. Click the “Deploy” button. Snorkel Flow then registers the model to your Databricks Workspace Model Registry.
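
The experiment name format in step 2 is easy to get wrong, so a small helper can assemble and check it before it is pasted into Snorkel Flow. This is a minimal sketch, not part of Snorkel Flow or Databricks: the function names and the validation rule (exactly two non-empty path segments after /Users, with no extra slashes) are assumptions made for illustration, and Snorkel Flow's own validation may differ.

```python
import re

# Assumed rule: /Users/<your-username>/<experiment_name>, where both parts
# are non-empty and contain no "/" characters. This mirrors the format
# described in the steps above; it is not Snorkel Flow's actual validator.
_EXPERIMENT_PATH = re.compile(r"/Users/[^/]+/[^/]+")


def is_valid_experiment_path(path: str) -> bool:
    """Return True if the path has the /Users/<username>/<experiment> shape."""
    return _EXPERIMENT_PATH.fullmatch(path) is not None


def build_experiment_path(username: str, experiment_name: str) -> str:
    """Assemble an experiment name from a Databricks username and a name."""
    path = f"/Users/{username.strip()}/{experiment_name.strip()}"
    if not is_valid_experiment_path(path):
        raise ValueError(f"Not a valid experiment path: {path!r}")
    return path


if __name__ == "__main__":
    # Hypothetical username and experiment name, for illustration only.
    print(build_experiment_path("jane.doe@example.com", "claims-classifier"))
```

Databricks usernames are typically email addresses, which is why the sketch allows "@" and "." in the username segment.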

Once users register a model to the Databricks Workspace Model Registry, they can deploy it to Databricks Model Serving or use it on a Spark cluster.
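
For readers who want to pick up a registered model from code, the following sketch shows one way this might look with the open-source MLflow client. It assumes that the model was registered in an MLflow-compatible format that loads as a generic pyfunc model, that MLflow is installed, and that the DATABRICKS_HOST and DATABRICKS_TOKEN environment variables point at your workspace. The helper names and the model name in the example are hypothetical.

```python
def registered_model_uri(model_name: str, version: int) -> str:
    """Build an MLflow registry URI of the form models:/<name>/<version>."""
    if not model_name or "/" in model_name:
        raise ValueError("model_name must be non-empty and contain no '/'")
    if version < 1:
        raise ValueError("version must be a positive integer")
    return f"models:/{model_name}/{version}"


def load_registered_model(model_name: str, version: int):
    """Load a registered model version as a generic MLflow pyfunc model.

    Assumes the registered model is loadable via mlflow.pyfunc; this is an
    assumption about the registered artifact, not a documented guarantee.
    """
    # Imported lazily so the URI helper above works without MLflow installed.
    import mlflow

    # "databricks" tells MLflow to use the workspace configured through
    # DATABRICKS_HOST / DATABRICKS_TOKEN (or a Databricks CLI profile).
    mlflow.set_tracking_uri("databricks")
    # Target the Workspace Model Registry rather than Unity Catalog.
    mlflow.set_registry_uri("databricks")
    return mlflow.pyfunc.load_model(registered_model_uri(model_name, version))


if __name__ == "__main__":
    # Hypothetical model name and version, for illustration only.
    print(registered_model_uri("claims_classifier", 1))
```

On a Spark cluster, the same URI could be passed to `mlflow.pyfunc.spark_udf` to score a DataFrame column by column, again assuming the model loads as a pyfunc model.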

In an upcoming release, Snorkel will expand this integration to allow registering a model to the Databricks Unity Catalog.

Learn More

Follow Snorkel AI on LinkedIn, Twitter, and YouTube to be the first to see new posts and videos!

Hiromu Hota
Machine Learning Engineer

Hiromu Hota is a Staff Engineer at Snorkel AI, where he brings extensive expertise in applied machine learning as the Tech Lead Manager and Lead Machine Learning Engineer. Prior to Snorkel AI, he held roles as a Senior Researcher and Researcher at Hitachi, focusing on advanced research and development. Hiromu also serves as a Visiting Scholar at Stanford University’s School of Engineering, where he contributes to academic advancements in computational science and engineering.

With a background that includes software engineering at Hitachi Data Systems and internships, Hiromu holds a Ph.D. in Computational Science and Engineering and a Master of Engineering from Nagoya University, underscoring his deep technical knowledge and academic achievements.

Connect with Hiromu to discuss machine learning, computational science, or collaborative opportunities in applied research and engineering.
