While a majority of Natural Language Processing (NLP) models focus on English, the real world requires solutions that work with languages across the globe. This demo shows how effectively users can build cross-language models in Snorkel Flow.
Using Snorkel Flow, Pixability can now quickly build classifiers for massive amounts of YouTube data, a task that was previously out of reach.
Sirisha Rella, Technical Product Marketing Manager at Nvidia, recently gave a Lightning Talk presentation on “demystifying” speech AI at Snorkel AI’s Future of Data-Centric AI virtual conference.
Snorkel AI will hold a free Foundation Model Virtual Summit on Tuesday, January 17, where speakers from across the technology industry, including some from Google and Stanford University, will discuss the enterprise use of foundation models.
Snorkel Flow debuts a new integration with Microsoft Azure Form Recognizer to help organizations leverage Azure AI services.
A central innovation team at a top US bank wanted to modernize its AI development and data annotation processes in order to create a custom natural language processing (NLP) model that could extract important financial information from 10-Ks. Manually reviewing these documents was taking up valuable time that could be better spent assisting customers. The team used Snorkel Flow’s data-centric AI development process and programmatic labeling to train a customized NLP model that could accurately extract information on interest rate swaps.
MIT Technology Review reported this week that workers in Venezuela contracted by an outsourced data annotation services provider shared customer data with each other on social media. The data consisted of low-angled pictures intended to be labeled, including one that featured a woman in a private moment in the bathroom. Programmatic labeling could have minimized this exposure.
Georgetown University’s CSET is building next-generation NLP applications using Snorkel Flow to classify complex research documents. Snorkel Flow drastically reduced labeling, model training, and iteration time and better equipped CSET’s data science team to collaborate closely with analysts to gather, process, and interpret data at scale.
Snorkel AI is delighted to announce a partnership with Aimpoint Digital, a premier analytics firm specializing in AI application development that builds, operationalizes, and scales data science solutions for biopharma, manufacturing, retail, and other major industries. Aimpoint Digital leads the industry in solving complex challenges and exploiting value-generating opportunities for organizations of all sizes through data. The company helps clients…
Labeling data manually can be a grind. Snorkel Flow slashes labeling time from months to minutes by allowing data scientists and domain experts to collaborate through labeling functions. Snorkel Flow offers two unique capabilities that further supercharge that collaboration: Comments and Tags.
Snorkel AI is excited to build on our partnership with Microsoft Azure to help enterprises and government agencies solve their most impactful problems and unlock value from their data using AI. Learn how Azure customers can easily deploy Snorkel Flow on their Azure cloud infrastructure to accelerate AI application development with data-centric workflows and programmatic labeling.
Introducing new capabilities for Data-centric Foundation Model Development in Snorkel Flow. Powerful new large language or foundation models (FMs) like GPT-3, Stable Diffusion, BERT, and more have taken the AI space by storm, going viral (even beyond technical practitioners) thanks to incredible capabilities around text generation, image synthesis, and more. However, enterprises face fundamental barriers to using these foundation models on real,…
We created Data-centric Foundation Model Development to bridge the gaps between foundation models and enterprise AI. New Snorkel Flow capabilities (Foundation Model Fine-tuning, Warm Start, and Prompt Builder) give data science and machine learning teams the tools they need to effectively put foundation models (FMs) to use for performance-critical enterprise use cases. The need is clear: despite undeniable excitement about…
Create a data-centric AI application using Snorkel Flow to save your analysts the time spent manually labeling and extracting information on environmental, social, and governance (ESG) factors from earnings call transcripts. Rapidly and accurately extract all existing and new factors from the transcripts to make the right investment decision.
AI is generally accepted as necessary for organizations across private and public sectors to build (or maintain) a competitive advantage. However, a major challenge to adopting AI successfully is building reliable, predictable, and equitable solutions. A critical flaw with traditional approaches to developing AI is the reliance on hand-labeled training datasets and/or “pre-trained” black-box models that are effectively ungovernable and unauditable. In this article, we explore the motivations and challenges for Trustworthy AI that we’ve encountered and discuss how core tenets of Data-Centric AI, including programmatic labeling, help ameliorate them.