Sponsored Content by Microsoft Azure

Confidential GPUs for AI are the future of secure computing

Efficiency and innovation are often touted as hallmark attributes of generative AI. But as more enterprise businesses look to integrate the technology into their workflows, confidentiality — in data processing and sharing — is of utmost importance. 

The recent introduction of AI-specific policies, such as the U.S. Executive Order on the Safe, Secure and Trustworthy AI and the European Union’s AI Act, is a regulatory step forward for developers and users alike. These policies set compliance standards for AI developers to ensure that sensitive, proprietary, or confidential data is protected. They also nod to the inherent value of AI models as intellectual property, wherein training data, algorithms, model architecture, and weights should be secured against unauthorized access.

How confidential computing protects data at scale

Cloud services providers (CSPs) have been helping their customers keep their sensitive code and data secure in transit on the network using TLS and HTTPS encryption, and secure at rest on disk using encryption with customer managed keys. However, one area of data protection that has not been addressed until more recently is the protection of data in use in server memory. This changed in 2019 when Microsoft and other industry leaders founded the Confidential Computing Consortium (CCC), a project community at the Linux Foundation, to accelerate the development and adoption of confidential computing. The CCC defines confidential computing as the protection of data in use by performing computations in a hardware-based and attested Trusted Execution Environment (TEE). 

As a pioneer in this space, Microsoft Azure became one of the first CSPs to introduce confidential virtual machines, which are virtual machines running on confidential computing enabled CPUs. With confidential VMs, only the CPU hardware and the contents of the confidential VM are trusted—all other components of the software stack, including the hypervisor and host OS, are considered outside of this trust boundary and can be breached without exposing sensitive data in memory. And, in compliance with the CCC definition of confidential computing, Microsoft provides attestation tools to allow the user to verify the good state of the CPU and their VM before disk encryption keys are released and sensitive data is loaded into the VM.

The need for confidential GPUs

“We’ve worked very closely with customers to get their feedback on what types of AI models they hope to run, what security posture they are looking for, what use cases they want to enable,” said Vikas Bhatia, Head of Product for Azure Confidential Computing. “With answers including AI models such as Stable Diffusion, Zephyr, Llama2, and GPT2, it became very clear that GPU-enhanced confidential computing would be needed. Our introduction of Azure confidential VMs with NVIDIA H100 Tensor Core GPUs is our first step at addressing this market.”

“Our collaboration with NVIDIA has been a multi-year effort,” said Bhatia, “but this has been necessary to ensure that the TEE of the confidential VM can be securely extended to include the GPU and the communications channel that connects the two. Any AI applications uploaded, built, and deployed on this stack will remain protected from end to end.”

With these new GPU-enhanced confidential VMs, existing Azure customers can redeploy their CUDA models and the code that they’ve written already in an AI ML space in a confidential GPU environment to achieve what Bhatia calls a “unified confidentiality.” This establishes a secure channel with the GPU, wherein all subsequent data transfers between the VM and GPU are protected. Furthermore, the attestation process will verify that the VMs and GPUs are running a correctly configured TEE before any sensitive applications are launched.

The diverse applications of confidential GPUs

The effectiveness of generative AI models hinges on two factors: quality and quantity in training data. Despite training progress made with publicly available datasets, access to proprietary data is essential to leveraging the full potential of enterprise models. Through confidential GPUs computing, businesses can securely authorize the use of specialized data to perform more complex and targeted tasks, such as private data analysis, joint modeling, secure voting, or multi-party computation.

Bhatia identified three major use-cases for confidential GPUs: 

  • Confidential multi-party computation: Organizations can collaborate to train and run inferences on models without sharing proprietary data. Only the final result of a computation would be revealed to the participants.
  • Confidential inferencing: Inferencing occurs when a query or input is sent to a machine learning model to obtain a prediction or response. Confidential GPUs protect data in all stages of the inferencing process from clients, the model developer, service operations, and cloud providers.
  • Confidential training: Model algorithms and weights won’t be visible outside of TEEs set up by AI developers. Models can be securely trained on encrypted, distributed datasets that remain confidential to each party within a hardware-enforced boundary.

Azure’s healthcare customers, for example, are interested in employing confidential inferencing to analyze medical images, like X-rays, CT scans, and MRIs, without disclosing sensitive patient data or proprietary algorithms. Advanced image processing can improve the likelihood of diagnosis and treatment in identifying tumors, fractures, or anomalies in scans — without placing patient data at risk.

As an example, confidential GPUs are valuable in scenarios where data privacy is crucial but collaborative computation is still necessary. Researchers can run simulations of sensitive data (e.g. government data, scientific data) without sharing datasets or code to unauthorized parties. In the finance sector, confidential multi-party computation can be useful in fraud prevention work. Finance institutions can perform analyses or computations in a protected data clean room without disclosing individual financial details.

“Before confidential computing, companies struggled to securely implement this kind of data-sharing technology,” Bhatia said. “While in preview, clients have tested the VMs and found that the security enhancements help to address some of the challenges they’re facing with respect to compliance, governance and security.”

A new security standard for the AI era

As a leader in confidential computing, Azure’s robust security platform caters to the privacy needs of businesses worldwide. Innovative hardware is essential to maintaining a confidential GPU ecosystem of applications and AI models, which Azure is building towards. Bhatia’s hope is that this level of confidentiality will one day be standard across all industries. Data privacy and AI confidentiality should be a convention of everyday computing. 

“Our initial offering is best suited for use with smaller language models,” Bhatia said. “And while work is underway to scale this technology to support LLMs, we know customers will benefit from the current version by discovering the possibilities this technology will bring.”

Similar to how the early internet was once run on unsecure HTTP sites, security standards are always evolving. With more organizations processing sensitive data for AI models, there’s a great need for confidential NVIDIA GPU-powered AI. Azure’s latest VMs are a necessary, innovative introduction to secure GPU computing, which Azure is working to scale up to multiple GPUs.

“We want to set a new security standard with our confidential VMs,” Bhatia said. “We build from the mindset that a rising tide lifts all boats.”

Curious about Azure confidential VMs with NVIDIA H100 Tensor Core GPUs? Sign up to preview Azure’s hardware-based security enhancements and protect your GPU data-in-use.


This article is presented by TC Brand Studio. This is paid content, TechCrunch editorial was not involved in the development of this article. Reach out to learn more about partnering with TC Brand Studio.

More TechCrunch

The Appellate Court of Montenegro ruled on Thursday that Terraform Labs co-founder, Do Kwon, should be returned to his home country, South Korea. The ruling confirmed an earlier decision in…

Terraform Labs co-founder and crypto fugitive, Do Kwon, set for extradition to South Korea

A day after Meta CEO Mark Zuckerberg talked about his newest social media experiment Threads reaching “almost” 200 million users on the company’s Q2 2024 earnings call, the platform has…

Meta’s Threads crosses 200 million active users

TechCrunch Disrupt 2024 will be in San Francisco on October 28–30, and we’re already excited! Disrupt brings innovation for every stage of your startup journey, and we could not bring you this…

Connect with Google Cloud, Aerospace, Qualcomm and more at Disrupt 2024

The tech layoff wave is still going strong in 2024. Following significant workforce reductions in 2022 and 2023, this year has already seen 60,000 job cuts across 254 companies, according…

A comprehensive list of 2024 tech layoffs

Intel announced it would layoff more than 15% of its staff, or 15,000 employees, in a memo to employees on Thursday. The massive headcount is part of a large plan…

Intel to lay off 15,000 employees

Following the recent lawsuit filed by the Recording Industry Association of America (RIAA) against music generation startups Udio and Suno, Suno admitted in a court filing on Thursday that it did, in…

AI music startup Suno claims training model on copyrighted music is ‘fair use’

In spite of a drop for the quarter, iPhone remained Apple’s most important category by a wide margin.

iPad sales help bail out Apple amid a continued iPhone slide

Molly Alter wears a lot of hats. She’s a mocumentary filmmaker working on a project about an alternate reality where charades is big business. She’s a caesar salad connoisseur and…

How filming a cappella concerts and dance recitals led Northzone’s newest partner Molly Alter to a career in VC

Microsoft has a long and tangled history with OpenAI, having invested a reported $13 billion in the ChatGPT maker as part of a long-term partnership. As part of the deal,…

Microsoft now lists OpenAI as a competitor in AI and search

The San Jose-based startup raised $60 million in a round that values it lower than the $500 million valuation it garnered in its most recent round, according to multiple sources.

Sequoia-backed Knowde raises Series C at a valuation cut

Self-driving technology company Aurora Innovation is looking to raise hundreds of millions in additional capital as it races toward a driverless commercial launch by the end of 2024.  Aurora is…

Self-driving truck startup Aurora Innovation to sell up to $420M in shares ahead of commercial launch

X (formerly Twitter) can no longer be accessed in the Mac App Store, suggesting that it has been officially delisted.  Searches for both “Twitter” and “X” on Apple’s platform no…

Twitter disappears from Mac App Store

Google Thursday said that it is introducing new Gemini-powered features for Chrome’s desktop version, including Lens for desktop, tab compare for shopping assistance, and natural language integration for search history.…

Google brings Gemini-powered search history and Lens to Chrome desktop

When Xiaoyin Qu was growing up in China, she was obsessed with learning how to build paper airplanes that could do flips in the air. Her parents, though, didn’t have…

Heeyo built an AI chatbot to be a billion kids’ interactive tutor and friend

While the company was awarded a massive, $4.2 billion contract to accelerate Starliner development in 2014, it was structured as a “fixed-price” model.

Boeing bleeds another $125M on Starliner program, bringing total losses to $1.6B

Welcome back to TechCrunch Mobility — your central hub for news and insights on the future of transportation. Sign up here for free — just click TechCrunch Mobility! Summer road…

Anthony Levandowski bets on off-road autonomy, Nuro plots a comeback and Applied Intuition gets more investor love

Google’s new features include Gemini in BigQuery and Looker to help users with data engineering and analysis.

Google Cloud expands its database portfolio with new AI capabilities

Rad Power Bikes, the Seattle-based e-bike startup that has raised more than $300 million from investors, went through another round of layoffs in July, TechCrunch has exclusively learned. This is…

VC darling Rad Power Bikes hit with another round of layoffs

Five years ago, as robotaxis and self-driving truck startups were still raking in millions in venture capital, Anthony Levandowski turned to off-road autonomy. Now, that decision — which brought the…

Why Anthony Levandowski returned to his off-road autonomous vehicle roots with AV startup Pronto

Commercial space station company Vast is building a private microgravity research lab as part of its wider Haven-1 station plans. The module is set to launch no earlier than the…

Vast plans microgravity lab on its Haven-1 private space station

Google Cloud is giving Y Combinator startups access to a dedicated, subsidized cluster of Nvidia graphics processing units and Google tensor processing units to build AI models. It’s part of…

Google Cloud now has a dedicated cluster of Nvidia GPUs for Y Combinator startups

Open source compliance and security platform FOSSA has acquired developer community platform StackShare, the company confirmed to TechCrunch.  StackShare is one of the more popular platforms for developers to discuss,…

Open source startup FOSSA is buying StackShare, a site used by 1.5M developers

Ola Electric and FirstCry are set to test investor appetite with public listing, both pricing their shares below their previous valuation asks.

Indian startups gut valuations ahead of IPO push

The European Union’s risk-based regulation for applications of artificial intelligence has come into force starting from today.

The EU’s AI Act is now in force

The company also said it has received regulatory clearance to start Phase 2 clinical trials for a new drug in the U.S. later this year.

Healx, an AI-enabled drug discovery platform for rare diseases, raises $47M

The European Commission (EC) has given the go-ahead to HPE’s planned megabucks acquisition of Juniper Networks.

EU greenlights HPE’s $14B Juniper Networks acquisition

Meta, which develops one of the biggest foundational open source large language models, Llama, believes it will need significantly more computing power to train models in the future. Mark Zuckerberg…

Zuckerberg says Meta will need 10x more computing power to train Llama 4 than Llama 3

Axle Energy is a B2B, back-end infrastructure business focused on connecting flexible assets, such as electric vehicles and home batteries, to energy markets that aren’t otherwise available for consumers to…

Axle Energy’s sprint to decarbonize the grid lights up with $9M seed led by Accel

OpenAI CEO Sam Altman says that OpenAI is working with the U.S. AI Safety Institute, a federal government body that aims to assess and address risks in AI platforms, on…

OpenAI pledges to give U.S. AI Safety Institute early access to its next model

WhatsApp’s massive 500 million users in India have supercharged Meta’s AI ambitions. Meta CFO Susan Li said Wednesday that India is the largest market in terms of Meta AI usage,…

Meta says India is the largest market for Meta AI usage