AI case study

Improving AI model evaluation

by
Labelbox
Context

Labelbox has built a fully managed AI model evaluation solution directly integrated into the Vertex AI platform, allowing Google Cloud users to seamlessly launch human evaluation jobs and set specific criteria for evaluation, such as question-answering and summarization.

Results

This eases and accelerates the ability to deploy human-in-the-loop AI systems with higher levels of trust and authority.

Results not reported in the source
Published
September 24, 2024
Agent type
Code Agents
AI provider
Google
Models/tools
Not disclosed
ICE score
392
The ICE framework in this database provides a quick way to assess the feasibility and potential impact of AI use cases, with higher scores signaling more actionable opportunities.

Impact: Potential benefits to the business.

Confidence: Likelihood of achieving expected results.

Ease: Simplicity of implementation in terms of resources and time.

ICE Score: Calculated by multiplying the component scores.

Note:
Each score is AI-generated based on available data and should be viewed merely as a general guideline for deeper exploration of the use cases.
Impact
7
Confidence
8
Ease
7

41

AI use cases in

Artificial Intelligence

See All

Synechron

Artificial Intelligence
Use case
Secure chat workflows
Context

Synechron implemented an enterprise-grade AI chat platform, Synechron Nexus Chat, powered by Azure OpenAI to enable secure and scalable conversational AI. The platform was deployed within an Azure private landing zone and integrated various language models, customizable personas, file uploads, and plugin agents to support natural language interactions and specialized tasks like diagram generation and image analysis. This solution enhanced internal business processes across HR, marketing, legal, and compliance while safeguarding sensitive data.

Models/tools
No items found.
...

Hume AI

Artificial Intelligence
Use case
Empathetic voice interactions
Context

Hume AI, a leader in emotionally intelligent AI systems, utilizes Anthropic's Claude to power natural and empathetic voice conversations through their EVI platform. This integration enables Hume's clients in healthcare, customer service, and consumer applications to build trust with users by providing emotionally aware and responsive interactions.

Models/tools
...
1

Decagon

Artificial Intelligence
Use case
Handle complex customer inquiries automatically
Context

Decagon, a company focused on automating customer support, uses OpenAI's suite of GPT models, including GPT-3.5 and GPT-4, to manage large volumes of support inquiries without human intervention. The models are configured for tasks such as query rewriting, complex decision-making, and API request processing, offering scalable, nuanced responses tailored to each customer's needs.

Models/tools
Explore industries

62

companies using

Code Agents

See All
Use case
Real-time code support
Context

Bito implemented AI-powered developer agents by integrating Anthropic's Claude and leveraging Claude 3.7 Sonnet for advanced reasoning into its code review and coding workflows. They utilized Anthropic’s robust API and developer-friendly infrastructure to embed an AI Code Review Agent and Bito Wingman directly within developers’ Git workflows and popular IDEs, enabling automated analysis of pull request diffs and code architecture. This integration streamlined code review, error detection, and code generation processes while upholding security standards.

Models/tools
...
2
Use case
Complex code navigation
Context

Augment code integrated Anthropic's Claude within Google Cloud's Vertex AI to develop an AI-powered code assistant that provides expert-level contextual understanding of complex software systems. By automating code comprehension, debugging, documentation, and change propagation within development workflows, the solution dramatically reduced project timelines and sped up developer onboarding while ensuring SOC 2 Level 2 compliant security protocols.

Models/tools
...
2
Use case
Faster coding workflows
Context

Tata Elxsi integrated Microsoft GitHub Copilot into its video distribution platform development and testing processes to provide intelligent code suggestions, optimize existing code, assist in both manual and automated testing, simplify documentation, and enable smoother code translation. The integration streamlined coding and debugging workflows across the development cycle, enhancing productivity and ensuring compliance with strict security standards.

Models/tools
...
1
Explore agents

263

solutions powered by

Google

See All
Use case
Faster retail transformation
Context

TCS partnered with Google Cloud to integrate advanced AI and generative AI capabilities into retail service offerings. They launched the Google Cloud Gemini Experience Center at their Retail Innovation Lab in Chennai, enabling retail clients to ideate, prototype, and co-develop tailored AI solutions that optimize supply chain, warehouse receiving, customer insights, and content creation. This approach automated processes using tools like Vertex AI Vision for warehouse receiving and leveraged Vertex AI with Gemini 1.5 Pro and speech-to-text to transform service centers.

Models/tools
...
4
Use case
Faster research reports
Context

Deutsche Bank developed DB Lumina, an AI-powered research agent built on Gemini and Vertex AI through a partnership with Google Cloud. The solution automates the creation of financial research reports by rapidly condensing extensive market data—such as converting a 400-page report into a three-page summary—thereby streamlining analysis workflows while maintaining rigorous data privacy standards.

Models/tools
...
2
Use case
Secure on-premises AI
Context

NVIDIA partnered with Google Cloud to enable on-premises agentic AI by integrating Google Gemini models with NVIDIA Blackwell platforms and Confidential Computing, ensuring data sovereignty and regulatory compliance for sensitive enterprise operations. The solution further optimizes AI inference and observability by deploying a GKE Inference Gateway alongside NVIDIA Triton Inference Server, NVIDIA NeMo Guardrails, and NVIDIA Dynamo to enhance secure routing and load balancing for enterprise workloads.

Models/tools
...
7
Explore AI providers

284

AI use cases in

North America

See All
Use case
Fast content creation
Context

Cox Automotive integrated Claude via Amazon Bedrock into its portfolio by first creating a sandbox environment to evaluate performance metrics and then selecting Claude 3.5 Sonnet for complex tasks and Claude 3.5 Haiku for high-volume content generation. They automated personalized dealer-consumer communications, generated engaging vehicle listing descriptions, and produced SEO-optimized blog posts, while also streamlining internal data governance through automated metadata generation. This integration optimized operational efficiency across marketing and internal data processes.

Models/tools
...
2
Use case
Faster tax form processing
Context

Intuit integrated Google Cloud’s Document AI and Gemini models into its GenOS platform to automate the autofill of ten common U.S. tax forms, including complex 1099 and 1040 forms. The solution extracts and categorizes data from uploaded documents, drastically reducing manual data entry for TurboTax customers. This integration streamlines tax preparation workflows and improves speed and accuracy.

Models/tools
...
2
Use case
Democratized data access
Context

Block implemented Anthropic’s Claude models (Claude 3.5 Sonnet and Claude 3.7 Sonnet) on its Databricks platform to power its internal AI agent, codename goose. They integrated the LLM using secure OAuth-enabled connections and a custom MCP server to connect internal databases and tools, enabling employees across all roles to auto-generate SQL queries, analyze complex data, and automate workflows. This agentic integration streamlined software development, design prototyping, and data analysis by translating user intents into actionable insights.

Models/tools
...
2
Explore regions
Thoughts & ideas