All articles on
Research

Prompting and weak supervision to build better, smaller models

Snorkel AI co-founder and CEO Alex Ratner recently interviewed several Snorkel researchers about their published academic papers. In this video, Alex talks with Ryan Smith, Senior Applied Scientist at Snorkel, about the work he did on using foundation models to build compact, deployable, and effective models.

Dr. Bubbles, Snorkel AI's mascot
January 19, 2023

FM Summit shows Foundation Model hurdles and potential

Snorkel AI held its Foundation Model Summit on Jan. 17, bringing together 12 presenters and over 600 attendees across 10 virtual sessions. The event drew registrants from many sectors, including the tech industry, healthcare, and financial services.

January 18, 2023

Contrastive Learning boosts Foundation Model specialization

Snorkel AI co-founder and CEO Alex Ratner talks with Ananya Kumar about the work he did on improving the effectiveness of foundation models by using contrastive learning, image augmentations, and labeled subsamples.

Dr. Bubbles, Snorkel AI's mascot
January 13, 2023

Ask Me Anything approach bolsters foundation models

Researcher Simran Arora tells Snorkel CEO Alex Ratner how she improved foundation model effectiveness by using “Ask Me Anything”-style questions.

Dr. Bubbles, Snorkel AI's mascot
January 4, 2023

Combining human and artificial intelligence with human-in-the-loop ML | FDCAI

More components in an ML lifecycle are designed to run on autopilot, but some tasks require human-in-the-loop ML, an active research topic that has seen an increasing number of publications in the last 10 years.

Dr. Bubbles, Snorkel AI's mascot
December 28, 2022

Seven research papers push foundation model boundaries

The recent debut of ChatGPT astounded the public with the power and speed of foundation models, but their enterprise use remains hampered by adaptation and deployment challenges. In the past year, Snorkel AI has researched several ways to overcome those challenges. 

December 15, 2022

Snorkel AI Team presents research at NeurIPS 2022

The Snorkel AI team will present five research papers advancing weak supervision and programmatic labeling at the NeurIPS 2022 conference, which started this week.

Dr. Bubbles, Snorkel AI's mascot
November 29, 2022

What can Data-Centric AI learn from data & ML engineering?

Databricks’ Chief Technologist: Data-Centric AI can learn from Data Engineering and ML Engineering in five ways: continuous updates, versioning, code-centric deployment, data privatization and actionable monitoring.

Dr. Bubbles, Snorkel AI's mascot
November 5, 2022

Improving upon Precision, Recall, and F1 with Gain metrics

This blog post introduces variants of the Precision, Recall, and F1 metrics called Precision Gain, Recall Gain, and F1 Gain. The gain variants have desirable properties, such as meaningful linear interpolation of PR curves and a universal baseline across tasks. The post explains what these benefits mean for you, shows how the gain metrics are calculated, and outlines some examples for intuitive comparison.

September 8, 2022

The Future of Data-Centric AI 2022 day 1 highlights

Snorkel AI just hosted the first day of The Future of Data-Centric AI conference 2022. This conference brings together data scientists, ML engineers, and AI leaders to share insights, best practices, and research on how to evolve the ML lifecycle from model-centric to data-centric approaches. This conference takes place over two days with 40+ sessions, 50+ speakers, and thousands of…

Louis Bouchard
August 4, 2022

Clinical entity classification in electronic health records

Research recap: Ontology-driven weak supervision for clinical entity classification in electronic health records (EHRs). In this post, I summarize the research published in the academic paper "Ontology-driven weak supervision for clinical entity classification in electronic health records" by Jason Fries et al. The paper was published in Nature Communications in 2021. Problem statement: Electronic health records (EHR) contain a rich…

Nazanin Makkinejad
June 17, 2022

Uncovering the unknowns of deep neural networks by Sharon Li

Learning about the challenges and opportunities behind deep neural networks. In this talk, Assistant Professor in Computer Science Sharon Li shares her work on uncovering the unknowns of deep neural networks, along with the challenges and opportunities in this domain. If you would like to watch Sharon’s presentation, we have included it below, or you can find…

Dr. Bubbles, Snorkel AI's mascot
June 8, 2022

A data-centric perspective on trustworthy and interpretable AI

The future of data-centric AI talk series. In this talk, James Zou, Assistant Professor of Biomedical Data Science at Stanford University, discusses the work he and his team have been doing to approach trustworthy and interpretable AI from a data-centric perspective. If you would like to watch James’ presentation, we have included it below, or you can find the entire event…

Dr. Bubbles, Snorkel AI's mascot
June 6, 2022

MLOps: Towards DevOps for data-centric AI with Ce Zhang

The future of data-centric AI talk series. Don’t miss the opportunity to gain an in-depth understanding of data-centric AI and learn best practices from real-world implementations. Connect with fellow data scientists, machine learning engineers, and AI leaders from academia and industry across more than 30 virtual sessions. Save your seat at The Future of Data-Centric AI, happening on August 3-4, 2022…

Dr. Bubbles, Snorkel AI's mascot
June 2, 2022

What to expect at The Future of Data-Centric AI 2022

30+ sessions by 40+ speakers in 2 action-packed days. Last year we organized The Future of Data-Centric AI conference to explore the shift from model-centric to data-centric AI. Speakers included researchers and industry experts such as Andrew Ng (Landing AI), Anima Anandkumar (NVIDIA), Chris Ré (Stanford AI Lab), Michael D'Andrea (Genentech), Skip McCormick (BNY Mellon), Imen Grida Ben Yahia (Orange)…

Devang Sachdev
June 1, 2022
