Document classification

Build AI-powered document classification applications in a fraction of the time without hand-labeling data using Snorkel Flow.

Request a demo
Image

Data-centric AI technology developed at the Stanford AI Lab and proven at world-leading companies.

How Snorkel Flow works

One size fits you, not all

Achieve greater performance gains by exploiting domain-specific text features of your own data.
Image

Faster, lower-cost development

Use programmatic labeling to develop high-quality AI applications in hours instead of spending weeks or months on expensive hand-labeling.
Image

Higher-accuracy models

Iterate on your application, using a closed-loop approach with intermediate results and analysis at every step to zero in on errors.
Image

Flexible integrations

Easily integrate labeling, training and analysis pipelines defined over diverse input types–text, PDF, HTML, and more–with downstream applications using APIs or a Python SDK.
Image

Easier SME collaboration

Build complex classification apps intuitively while preserving natural information about data taxonomies with subject matter expert (SME) collaboration.

Document classification

Programmatically label training data across complex data types and build multi-model document classification applications with ease.
Image
Image

An end-to-end ML platform

Designed for collaboration

Image

For data scientists

  • Ready-to-use model zoo
  • Auto-generated analysis tools
  • Integrated Python notebooks
Image

For domain experts

  • Rich data annotation suite
  • Intuitive, no-code labeling UI
  • Model error analysis reports
Image

For developers

  • Fully interoperable API and web UI
  • Write custom operators with Python SDK
  • Integrations to deploy models at scale
Image


Case study

Google

Google used Snorkel to replace 100K+ hand-annotated labels in critical ML pipelines for text classification.
Read more



Problem

Content, product, and event classification problems change too fast to hand-label, even with significant annotation budget.

Solution

Google deployed early versions of Snorkel's core technology with three high-impact teams, repurposing many resources as labeling functions.

Results

Hours of labeling function development replaced 10-100K+ hand labels, significantly impacting the bottom line and accelerating of ML adoption.

6 months

of hand-labeling data replaced in 30 mins

52%

performance improvement

100k+

hand labels replaced with a programmatic approach


Case Study

Google

Google used Snorkel to replace 100K+ hand-annotated labels in critical ML pipelines for text classification.
Read more



Problem

Content, product, and event classification problems change too fast to hand-label, even with significant annotation budget.

6 Months

of hand-labeling data replaced in 30 mins

Solution

Google deployed early versions of Snorkel's core technology with three high-impact teams, repurposing many resources as labeling functions.

52%

performance improvement

Results

Hours of labeling function development replaced 10-100K+ hand labels, significantly impacting the bottom line and accelerating of ML adoption.

100k+

hand labels replaced with programmatic approach


Case Study

Google

Google used Snorkel to replace 100K+ hand-annotated labels in critical ML pipelines for text classification.
Read More



Problem

Content, product, and event classification problems change too fast to hand-label, even with significant annotation budget.

Solution

Google deployed early versions of Snorkel's core technology with three high-impact teams, repurposing many resources as labeling functions.

Results

Hours of labeling function development replaced 10-100K+ hand labels, significantly impacting the bottom line and accelerating of ML adoption.

6 Months

of hand-labeling data replaced in 30 mins

52%

performance improvement

100k+

hand labels replaced with programmatic approach


Dive in

[get_press_posts]
Press
Blog
Research
Case studies
Press
Image
September 20, 2021
Snorkel AI welcomes industry leaders to the team

Image
August 9, 2021
This hot startup is now valued at $1 billion for its A.I. skills

Image
February 24, 2021
The Data-First Enterprise AI Revolution

Image
July 14, 2020
Meet The Stanford AI Lab Alums That Raised $15 Million To Optimize Machine Learning

Blog
Image
February 4, 2022
Making Automated Data Labeling a Reality in Modern AI

Image
Date: Jan 25, 2022
The Principles of Data-Centric AI Development

Image
Date: Jan 5, 2022
Meet the Snorkelers

Image
Date: Jul 9, 2021
How to Use Snorkel to Build AI Applications

Research
Image
2022
Universalizing Weak Supervision

Image
2021
Ontology-driven weak supervision for clinical entity classification in electronic health records

Image
2017
Rapid Training Data Creation with Weak Supervision

Image
2016
Data Programming: Creating Large Datasets Quickly

Customer Stories
Image
February 26, 2022
Genentech used Snorkel Flow to extract information from clinical trials

Image
February 18, 2022
Google used Snorkel to build and adapt content classification models

Image
2019
Intel used Snorkel to accelerate sales and marketing agents

Image
2019
Apple built a Snorkel-based system to answer billions of queries in multiple languages

Image

Let’s connect

Speed time to value, reduce costs, and unlock more AI possibility 
with the Snorkel Flow platform.
Request a demo