AI case study

Benchmarking performance of LLMs

by
ML Commons
Context

ML Commons has integrated Meta's Llama 2 70B parameter model into version 4.0 of its MLPerf Inference benchmark. The published results demonstrate the performance potential of various platforms for running one of the most demanding and capable large language models, highlighting the efficiency and scalability of Meta’s Llama 2 in high-performance computing environments.

Results
Results not reported in the source
Region
Global
Published
September 30, 2023
Agent type
Data Agents
AI provider
Meta
Models/tools
Not disclosed
ICE score
448
The ICE framework in this database provides a quick way to assess the feasibility and potential impact of AI use cases, with higher scores signaling more actionable opportunities.

Impact: Potential benefits to the business.

Confidence: Likelihood of achieving expected results.

Ease: Simplicity of implementation in terms of resources and time.

ICE Score: Calculated by multiplying the component scores.

Note:
Each score is AI-generated based on available data and should be viewed merely as a general guideline for deeper exploration of the use cases.
Impact
7
Confidence
8
Ease
8

13

AI use cases in

Data & analytics

See All

Quillit

Data & analytics
Use case
Reduce report writing burden
Context

Quillit integrated Anthropic’s Claude to automate qualitative research tasks by summarizing interview transcripts, generating contextual citations, and threading conversation data into comprehensive reports. They implemented the AI tool into their existing research workflow within three months, streamlining report writing, transcription, and analysis while ensuring data security and high precision.

Models/tools

MetaLearner

Data & analytics
Data Agents
Quick win
Use case
Simplifying ERP data access
Context

MetaLearner uses Meta's Llama 3.1 to make ERP systems like SAP and Oracle easier to work with.

Models/tools

Ipsos

Data & analytics
Data Agents
Quick win
Use case
Streamlining market research analysis
Context

Ipsos used Gemini 1.5 Pro and Flash to create an internal tool that allows market researchers to pull real-world data from Google Search for analysis.

Explore industries

151

companies using

Data Agents

See All
Use case
Scalable contract detection
Context

wealthAPI implemented a next‐gen contract detection solution by integrating DataStax Astra DB on Google Cloud and leveraging Google Gemini models for AI‐powered analysis. They deployed DataStax’s vector search and real‐time insights capabilities to scale contract detection across millions of users in less than three months, streamlining wealth management workflows by dramatically reducing response times and efficiently handling massive data volumes.

Models/tools
...
1
Use case
Automated job title classification
Context

Aura Intelligence integrated Anthropic's Claude via Amazon Bedrock into its data pipeline to automatically classify over 200 million job titles and industry pairings from multi-language data, replacing manual lookups and fuzzy matching. They fine-tuned foundation models on proprietary datasets and leveraged AWS infrastructure, including SageMaker and prompt management, to automate QA, report generation, anomaly detection, and real-time hiring trend analysis.

Models/tools
...
2
Use case
Efficient engineering team management
Context

LaunchNotes leverages Claude in Amazon Bedrock in their product 'Graph' to transform engineering data into actionable insights. Graph functions as an ETL platform with Claude managing data pipelines, helping engineering managers understand development metrics, reduce incident identification time, automate updates, and generate customized release notes and technical documentation.

Models/tools
...
2
Explore agents

49

solutions powered by

Meta

See All
Use case
Automate image segmentation
Context

Roboflow uses Meta's Segment Anything Model (SAM) to enable users to automatically segment objects in images and videos, significantly reducing the time required to create training datasets for computer vision models.

Models/tools
...
2
Use case
Protecting customer privacy in personalized recommendations
Context

Untukmu.AI, an online gifting site in Indonesia, uses Meta's Llama 3.1 8B model with split inference processing to protect customer privacy. By running part of the AI model on customers' devices and the rest on their servers, they deliver personalized gift recommendations without accessing or storing personal data. This ensures customer privacy while still providing high-quality, tailored suggestions, enhancing trust and satisfaction.

Models/tools
...
1
Use case
Understanding complex codebases
Context

CodeGPT, a popular coding assistant with over 1.4 million downloads, integrates Meta's Llama models to enhance developer productivity. By using Llama 3.2 (90B), CodeGPT helps developers not just generate code but also answer questions about their codebase, debug code, and onboard new team members. It includes a codebase graph mechanism that lets Llama understand entire repositories, allowing developers to effectively "talk" with their code. This integration leads to at least a 30% increase in productivity and accelerates onboarding from months to days.

Models/tools
...
1
Explore AI providers

174

AI use cases in

Global

See All
Use case
Reduce report writing burden
Context

Quillit integrated Anthropic’s Claude to automate qualitative research tasks by summarizing interview transcripts, generating contextual citations, and threading conversation data into comprehensive reports. They implemented the AI tool into their existing research workflow within three months, streamlining report writing, transcription, and analysis while ensuring data security and high precision.

Models/tools
...
2
Use case
Secure on-premises AI
Context

NVIDIA partnered with Google Cloud to enable on-premises agentic AI by integrating Google Gemini models with NVIDIA Blackwell platforms and Confidential Computing, ensuring data sovereignty and regulatory compliance for sensitive enterprise operations. The solution further optimizes AI inference and observability by deploying a GKE Inference Gateway alongside NVIDIA Triton Inference Server, NVIDIA NeMo Guardrails, and NVIDIA Dynamo to enhance secure routing and load balancing for enterprise workloads.

Models/tools
...
7
Use case
Faster internal workflows
Context

Quantium deployed Anthropic's Claude across its organization to empower over 1200 employees in functions such as coding, proposal drafting, training development, and leadership coaching. They implemented the AI solution by launching an "ALL IN on AI" strategy with clear guidelines, practical guardrails, and comprehensive hands-on training programs integrated into daily workflows. This approach streamlined routine tasks and enabled teams to focus on strategic initiatives.

Models/tools
...
1
Explore regions
Thoughts & ideas