Empowering AI at the Edge with Foundational Models

7:50 am
September 21, 2023

In the world of machine learning (ML) and artificial intelligence (AI), foundational models (FMs) are revolutionizing the way AI is developed and applied. These models, trained on a broad set of unlabeled data, can be adapted to a wide range of tasks and applications. By combining the power of FMs with edge computing, enterprises can run AI workloads for FM fine-tuning and inferencing at the operational edge, enabling faster deployment, near-real-time predictions, and reduced costs.

What are Foundational Models (FMs)?

FMs are AI models trained on large amounts of unlabeled data, enabling them to learn more generally and work across domains and problems. Unlike traditional AI models that specialize in specific tasks within a single domain, FMs serve as a foundation for a multitude of applications. They address the challenges of scaling AI adoption by ingesting vast amounts of unlabeled data and using self-supervised techniques for training, eliminating the need for extensive human labeling and annotation.

How Do Large Language Models (LLMs) Fit into Foundational Models?

Large language models (LLMs) are a class of FMs that consist of neural networks trained on massive amounts of unlabeled data. These models can perform various natural language processing (NLP) tasks and have become integral to AI applications. LLMs learn to understand language in a way similar to humans, making them invaluable in a wide range of industries.

Scaling AI at the Edge

Deploying AI models at the edge, where data is generated and work is performed, allows for near-real-time predictions while maintaining data sovereignty and privacy. However, scaling AI deployments at the edge comes with challenges such as deployment time and cost, as well as day-to-day management of numerous edge locations.

IBM has developed an edge architecture that addresses these challenges by introducing an integrated hardware/software (HW/SW) appliance model. This model enables zero-touch provisioning of software, continuous monitoring of edge system health, and centralized management of software, security, and configuration updates. The architecture follows a hub-and-spoke deployment configuration, with a central cloud acting as the hub and edge-in-a-box appliances serving as spokes at each edge location.

The Value of FM Fine-Tuning and Inference at the Edge

By pre-training FMs in the cloud and fine-tuning them for specific downstream tasks at the edge, enterprises can optimize the use of compute resources and reduce operational costs. Fine-tuning requires fewer labeled data samples and can be performed using a few GPUs at the enterprise edge, allowing sensitive data to stay within the operational environment. Serving the fine-tuned AI model at the edge also reduces latency and data transfer costs.

To demonstrate the value of FM fine-tuning and inference at the edge, IBM deployed a vision-transformer-based FM for civil infrastructure on a three-node edge cluster. This deployment showcased the reduction in time-to-insight and cost associated with defect detection using drone imagery inputs.

Summary

Combining IBM watsonx data and AI platform capabilities with edge-in-a-box appliances empowers enterprises to run AI workloads at the edge, reducing deployment time, enabling near-real-time predictions, and optimizing resource utilization. FM fine-tuning and inference at the edge provide significant advantages in terms of reduced latency, data transfer costs, and improved security. With FMs and edge computing, the possibilities for AI deployments are expanding, opening doors to faster and more efficient AI adoption across industries.

FAQs

What are foundational models (FMs)?

Foundational models (FMs) are AI models that are trained on a broad set of unlabeled data, allowing them to be adapted to various downstream tasks and applications. Unlike traditional AI models, FMs learn more generally and work across domains and problems.

What are large language models (LLMs)?

Large language models (LLMs) are a class of foundational models that consist of neural networks trained on massive amounts of unlabeled data. These models excel in natural language processing (NLP) tasks and have become integral to AI applications.

What is the advantage of running AI workloads at the edge?

Running AI workloads at the edge enables near-real-time predictions while abiding by data sovereignty and privacy requirements. It reduces latency and data transfer costs, allowing for faster and more efficient data analysis and insights.

How does edge-in-a-box architecture facilitate AI deployments at the edge?

The edge-in-a-box architecture provides an integrated hardware/software (HW/SW) appliance model for AI deployments at the edge. It enables zero-touch provisioning, continuous monitoring of edge system health, and centralized management of software, security, and configuration updates, making AI deployments at the edge more scalable and efficient.

How does fine-tuning AI models at the edge reduce operational costs?

Fine-tuning AI models at the edge reduces the time required and data transfer costs associated with inferencing tasks. It allows sensitive data to stay within the operational environment and optimizes resource utilization by utilizing a few GPUs at the enterprise edge instead of extensive cloud compute resources.


Share:

More in this category ...

7:27 pm April 30, 2024

Ripple companions with SBI Group and HashKey DX for XRPL answers in Japan

Featured image for “Ripple companions with SBI Group and HashKey DX for XRPL answers in Japan”
6:54 pm April 30, 2024

April sees $25M in exploits and scams, marking historic low ― Certik

Featured image for “April sees $25M in exploits and scams, marking historic low ― Certik”
5:21 pm April 30, 2024

MSTR, COIN, RIOT and different crypto shares down as Bitcoin dips

Featured image for “MSTR, COIN, RIOT and different crypto shares down as Bitcoin dips”
10:10 am April 30, 2024

EigenLayer publicizes token release and airdrop for the group

Featured image for “EigenLayer publicizes token release and airdrop for the group”
7:48 am April 30, 2024

VeloxCon 2024: Innovation in knowledge control

Featured image for “VeloxCon 2024: Innovation in knowledge control”
6:54 am April 30, 2024

Successful Beta Service release of SOMESING, ‘My Hand-Carry Studio Karaoke App’

Featured image for “Successful Beta Service release of SOMESING, ‘My Hand-Carry Studio Karaoke App’”
2:58 am April 30, 2024

Dogwifhat (WIF) large pump on Bybit after record reasons marketplace frenzy

Featured image for “Dogwifhat (WIF) large pump on Bybit after record reasons marketplace frenzy”
8:07 pm April 29, 2024

How fintech innovation is riding virtual transformation for communities around the globe  

Featured image for “How fintech innovation is riding virtual transformation for communities around the globe  ”
7:46 pm April 29, 2024

Wasabi Wallet developer bars U.S. customers amidst regulatory considerations

Featured image for “Wasabi Wallet developer bars U.S. customers amidst regulatory considerations”
6:56 pm April 29, 2024

Analyst Foresees Peak In Late 2025

Featured image for “Analyst Foresees Peak In Late 2025”
6:59 am April 29, 2024

Solo Bitcoin miner wins the three.125 BTC lottery, fixing legitimate block

Featured image for “Solo Bitcoin miner wins the three.125 BTC lottery, fixing legitimate block”
7:02 pm April 28, 2024

Ace Exchange Suspects Should Get 20-Year Prison Sentences: Prosecutors

Featured image for “Ace Exchange Suspects Should Get 20-Year Prison Sentences: Prosecutors”
7:04 am April 28, 2024

Google Cloud's Web3 portal release sparks debate in crypto trade

Featured image for “Google Cloud's Web3 portal release sparks debate in crypto trade”
7:08 pm April 27, 2024

Bitcoin Primed For $77,000 Surge

Featured image for “Bitcoin Primed For $77,000 Surge”
5:19 pm April 27, 2024

Bitbot’s twelfth presale level nears its finish after elevating $2.87 million

Featured image for “Bitbot’s twelfth presale level nears its finish after elevating $2.87 million”
10:07 am April 27, 2024

PANDA and MEW bullish momentum cool off: traders shift to new altcoin

Featured image for “PANDA and MEW bullish momentum cool off: traders shift to new altcoin”
9:51 am April 27, 2024

Commerce technique: Ecommerce is useless, lengthy are living ecommerce

Featured image for “Commerce technique: Ecommerce is useless, lengthy are living ecommerce”
7:06 am April 27, 2024

Republic First Bank closed by way of US regulators — crypto neighborhood reacts

Featured image for “Republic First Bank closed by way of US regulators — crypto neighborhood reacts”
2:55 am April 27, 2024

China’s former CBDC leader is beneath executive investigation

Featured image for “China’s former CBDC leader is beneath executive investigation”
10:13 pm April 26, 2024

Bigger isn’t all the time higher: How hybrid Computational Intelligence development permits smaller language fashions

Featured image for “Bigger isn’t all the time higher: How hybrid Computational Intelligence development permits smaller language fashions”
7:41 pm April 26, 2024

Pantera Capital buys extra Solana (SOL) from FTX

Featured image for “Pantera Capital buys extra Solana (SOL) from FTX”
7:08 pm April 26, 2024

Successful Beta Service release of SOMESING, ‘My Hand-Carry Studio Karaoke App’

Featured image for “Successful Beta Service release of SOMESING, ‘My Hand-Carry Studio Karaoke App’”
12:29 pm April 26, 2024

SEC sues Bitcoin miner Geosyn Mining for fraud; Bitbot presale nears $3M

Featured image for “SEC sues Bitcoin miner Geosyn Mining for fraud; Bitbot presale nears $3M”
10:34 am April 26, 2024

Business procedure reengineering (BPR) examples

Featured image for “Business procedure reengineering (BPR) examples”
7:10 am April 26, 2024

85% Of Altcoins In “Opportunity Zone,” Santiment Reveals

Featured image for “85% Of Altcoins In “Opportunity Zone,” Santiment Reveals”
5:17 am April 26, 2024

Sam Altman’s Worldcoin eyeing PayPal and OpenAI partnerships

Featured image for “Sam Altman’s Worldcoin eyeing PayPal and OpenAI partnerships”
10:55 pm April 25, 2024

Artificial Intelligence transforms the IT strengthen enjoy

Featured image for “Artificial Intelligence transforms the IT strengthen enjoy”
10:04 pm April 25, 2024

Franklin Templeton tokenizes $380M fund on Polygon and Stellar for P2P transfers

Featured image for “Franklin Templeton tokenizes $380M fund on Polygon and Stellar for P2P transfers”
7:13 pm April 25, 2024

Meta’s letting Xbox, Lenovo, and Asus construct new Quest metaverse {hardware}

Featured image for “Meta’s letting Xbox, Lenovo, and Asus construct new Quest metaverse {hardware}”
2:52 pm April 25, 2024

Shiba Inu (SHIB) unveils bold Shibarium plans as Kangamoon steals the display

Featured image for “Shiba Inu (SHIB) unveils bold Shibarium plans as Kangamoon steals the display”