What Is Trustworthy AI?

Artificial intelligence, like any transformative technology, is a work in progress — continually growing in its capabilities and its societal impact. Trustworthy AI initiatives recognize the real-world effects that AI can have on people and society, and aim to channel that power responsibly for positive change.

What Is Trustworthy AI?

Trustworthy AI is an approach to AI development that prioritizes safety and transparency for those who interact with it. Developers of trustworthy AI understand that no model is perfect, and take steps to help customers and the general public understand how the technology was built, its intended use cases and its limitations.

In addition to complying with privacy and consumer protection laws, trustworthy AI models are tested for safety, security and mitigation of unwanted bias. They’re also transparent — providing information such as accuracy benchmarks or a description of the training dataset — to various audiences including regulatory authorities, developers and consumers.

Principles of Trustworthy AI

Trustworthy AI principles are foundational to NVIDIA’s end-to-end AI development. They have a simple goal: to enable trust and transparency in AI and support the work of partners, customers and developers.

Privacy: Complying With Regulations, Safeguarding Data

AI is often described as data hungry. Often, the more data an algorithm is trained on, the more accurate its predictions.

But data has to come from somewhere. To develop trustworthy AI, it’s key to consider not just what data is legally available to use, but what data is socially responsible to use.

Developers of AI models that rely on data such as a person’s image, voice, artistic work or health records should evaluate whether individuals have provided appropriate consent for their personal information to be used in this way.

For institutions like hospitals and banks, building AI models means balancing the responsibility of keeping patient or customer data private while training a robust algorithm. NVIDIA has created technology that enables federated learning, where researchers develop AI models trained on data from multiple institutions without confidential information leaving a company’s private servers.

NVIDIA DGX systems and NVIDIA FLARE software have enabled several federated learning projects in healthcare and financial services, facilitating secure collaboration by multiple data providers on more accurate, generalizable AI models for medical image analysis and fraud detection.

Safety and Security: Avoiding Unintended Harm, Malicious Threats

Once deployed, AI systems have real-world impact, so it’s essential they perform as intended to preserve user safety.

The freedom to use publicly available AI algorithms creates immense possibilities for positive applications, but also means the technology can be used for unintended purposes.

To help mitigate risks, NVIDIA NeMo Guardrails keeps AI language models on track by allowing enterprise developers to set boundaries for their applications. Topical guardrails ensure that chatbots stick to specific subjects. Safety guardrails set limits on the language and data sources the apps use in their responses. Security guardrails seek to prevent malicious use of a large language model that’s connected to third-party applications or application programming interfaces.

NVIDIA Research is working with the DARPA-run SemaFor program to help digital forensics experts identify AI-generated images. Last year, researchers published a novel method for addressing social bias using ChatGPT. They’re also creating methods for avatar fingerprinting — a way to detect if someone is using an AI-animated likeness of another individual without their consent.

To protect data and AI applications from security threats, NVIDIA H100 and H200 Tensor Core GPUs are built with confidential computing, which ensures sensitive data is protected while in use, whether deployed on premises, in the cloud or at the edge. NVIDIA Confidential Computing uses hardware-based security methods to ensure unauthorized entities can’t view or modify data or applications while they’re running — traditionally a time when data is left vulnerable.

Transparency: Making AI Explainable

To create a trustworthy AI model, the algorithm can’t be a black box — its creators, users and stakeholders must be able to understand how the AI works to trust its results.

Transparency in AI is a set of best practices, tools and design principles that helps users and other stakeholders understand how an AI model was trained and how it works. Explainable AI, or XAI, is a subset of transparency covering tools that inform stakeholders how an AI model makes certain predictions and decisions.

Transparency and XAI are crucial to establishing trust in AI systems, but there’s no universal solution to fit every kind of AI model and stakeholder. Finding the right solution involves a systematic approach to identify who the AI affects, analyze the associated risks and implement effective mechanisms to provide information about the AI system.

Retrieval-augmented generation, or RAG, is a technique that advances AI transparency by connecting generative AI services to authoritative external databases, enabling models to cite their sources and provide more accurate answers. NVIDIA is helping developers get started with a RAG workflow that uses the NVIDIA NeMo framework for developing and customizing generative AI models.

NVIDIA is also part of the National Institute of Standards and Technology’s U.S. Artificial Intelligence Safety Institute Consortium, or AISIC, to help create tools and standards for responsible AI development and deployment. As a consortium member, NVIDIA will promote trustworthy AI by leveraging best practices for implementing AI model transparency.

And on NVIDIA’s hub for accelerated software, NGC, model cards offer detailed information about how each AI model works and was built. NVIDIA’s Model Card ++ format describes the datasets, training methods and performance measures used, licensing information, as well as specific ethical considerations.

Nondiscrimination: Minimizing Bias

AI models are trained by humans, often using data that is limited by size, scope and diversity. To ensure that all people and communities have the opportunity to benefit from this technology, it’s important to reduce unwanted bias in AI systems.

Beyond following government guidelines and antidiscrimination laws, trustworthy AI developers mitigate potential unwanted bias by looking for clues and patterns that suggest an algorithm is discriminatory, or involves the inappropriate use of certain characteristics. Racial and gender bias in data are well-known, but other considerations include cultural bias and bias introduced during data labeling. To reduce unwanted bias, developers might incorporate different variables into their models.

Synthetic datasets offer one solution to reduce unwanted bias in training data used to develop AI for autonomous vehicles and robotics. If data used to train self-driving cars underrepresents uncommon scenes such as extreme weather conditions or traffic accidents, synthetic data can help augment the diversity of these datasets to better represent the real world, helping improve AI accuracy.

NVIDIA Omniverse Replicator, a framework built on the NVIDIA Omniverse platform for creating and operating 3D pipelines and virtual worlds, helps developers set up custom pipelines for synthetic data generation. And by integrating the NVIDIA TAO Toolkit for transfer learning with Innotescus, a web platform for curating unbiased datasets for computer vision, developers can better understand dataset patterns and biases to help address statistical imbalances.

Learn more about trustworthy AI on NVIDIA.com and the NVIDIA Blog. For more on tackling unwanted bias in AI, watch this talk from NVIDIA GTC and attend the trustworthy AI track at the upcoming conference, taking place March 18-21 in San Jose, Calif, and online.

Live at GTC: Hear From Industry Leaders Using AI to Drive Innovation and Agility

Interest in new AI applications reached a fever pitch last year as business leaders began exploring AI pilot programs. This year, they’re focused on strategically implementing these programs to create new value and sharpen their competitive advantage.

GTC, NVIDIA’s conference on AI and accelerated computing, set for March 18-21 at the San Jose Convention Center, will feature leaders across a broad swath of industries discussing how they’re charting the path to AI-driven innovation.

Execs from Bentley Systems, Lowe’s, Siemens and Verizon are among those sharing their companies’ AI journeys.

Don’t miss NVIDIA founder and CEO Jensen Huang’s GTC keynote on Monday, March 18, at 1 p.m. PT.

AI Takes Center Stage in Enterprise Technology Priorities

Nearly three-quarters of C-suite executives plan to increase their company’s tech investments this year, according to a BCG survey of C-suite executives, and 89% rank AI and generative AI among their top three priorities. More than half expect AI to deliver cost savings, primarily through productivity gains, improved customer service and IT efficiencies.

However, challenges to driving value with AI remain, including reskilling workers, prioritizing the right AI use cases and developing a strategy to implement responsible AI.

Join us in person or online to learn how industry leaders are overcoming these challenges to thrive with AI.

Here’s a preview of top industry sessions:

Financial Services

Navigating the Opportunity for Generative AI in Financial Services, featuring speakers from NVIDIA, MasterCard, Capital One and Goldman Sachs.

Enterprise AI in Banking: How One Leader Is Investing in “AI First,” featuring Alexandra V. Mousavizadeh, CEO of Evident, and Chintan Mehta, chief information officer and head of digital technology and innovation at Wells Fargo.

How PayPal Reduced Cloud Costs by up to 70% With Spark RAPIDS, featuring Illay Chen, software engineer at PayPal.

Public Sector

Generative AI Adoption and Operational Challenges in Government, featuring speakers from Microsoft, NVIDIA and the U.S. Army.

How to Apply Generative AI to Improve Cybersecurity, featuring Bartley Richardson, director of cybersecurity engineering at NVIDIA.

Healthcare

Healthcare Is Adopting Generative AI, Becoming One of the Largest Tech Industries, featuring Kimberly Powell, vice president of healthcare and life sciences at NVIDIA.

The Role of Generative AI in Modern Medicine, featuring speakers from ARK Investment Management, NVIDIA, Microsoft and Scripps Research.

How Artificial Intelligence Is Powering the Future of Biomedicine, featuring Priscilla Chan, cofounder and co-CEO of the Chan Zuckerberg Initiative, and Mona Flores, global head of medical AI at NVIDIA.

Retail and Consumer Packaged Goods

Augmented Marketing in Beauty With Generative AI, featuring Asmita Dubey, chief digital and marketing officer at L’Oréal.

AI and the Radical Transformation of Marketing, featuring Stephan Pretorius, chief technology officer at WPP.

How Lowe’s Is Driving Innovation and Agility With AI, featuring Azita Martin, vice president of artificial intelligence for retail and consumer packaged goods at NVIDIA, and Seemantini Godbole, executive vice president and chief digital and information officer at Lowe’s.

Telecommunications

Special Address: Three Ways Artificial Intelligence Is Transforming Telecommunications, featuring Ronnie Vasishta, senior vice president of telecom at NVIDIA.

Generative AI as an Innovative Accelerator in Telcos, featuring Asif Hasan, cofounder of Quantiphi; Lilach Ilan, global head of business development, telco operations at NVIDIA; and Chris Halton, vice president of product strategy and innovation at Verizon.

How Telcos Are Enabling National AI Infrastructure and Platforms, featuring speakers from Indosat, NVIDIA, Singtel and Telconet.

Manufacturing

Accelerating Aerodynamics Analysis at Mercedes-Benz, featuring Liam McManus, technical product manager at Siemens; Erich Jehle-Graf of Mercedes Benz; and Ian Pegler, global business development, computer-aided design at NVIDIA.

Omniverse-Based Fab Digital Twin Platform for Semiconductor Industry, featuring Seokjin Youn, corporate vice president and head of the management information systems team at Samsung Electronics.

Digitalizing Global Manufacturing Supply Chains With Digital Twins, Powered by OpenUSD, featuring Kirk Fleischhaue, senior vice president at Foxconn.

Automotive

Applying AI & LLMs to Transform the Luxury Automotive Experience, featuring Chrissie Kemp, chief data and digital product officer at JLR (Jaguar Land Rover).

Accelerating Automotive Workflows With Large Language Models, featuring Bryan Goodman, director of artificial intelligence at Ford Motor Co.

How LLMs and Generative AI Will Enhance the Way We Experience Self-Driving Cars, featuring Alex Kendall, cofounder and CEO of Wayve.

Robotics 

Robotics and the Role of AI: Past, Present and Future, featuring Marc Raibert, executive director at The AI Institute, and Dieter Fox, senior director of robotics research at NVIDIA.

Breathing Life into Disney’s Robotic Characters With Deep Reinforcement Learning, featuring Mortiz Bächer, associate lab director of robotics at Disney Research.

Media and Entertainment 

Unlocking Creative Potential: The Synergy of AI and Human Creativity, featuring Andrea Gagliano, senior director of data science, AI/ML at Getty Images.

Beyond the Screen: Unraveling the Impact of AI in the Film Industry, featuring Nikola Todorovic, cofounder and CEO at Wonder Dynamics; Chris Jacquemin, head of digital strategy at WME; and Sanja Fidler, vice president of AI research at NVIDIA.

Revolutionizing Fan Engagement: Unleashing the Power of AI in Software-Defined Production, featuring ​​Lewis Smithingham, senior vice president of innovation and creative solutions at Media.Monks.

Energy

Panel: Building a Lower-Carbon Future With HPC and AI in Energy, featuring speakers from NVIDIA, Shell, ExxonMobil, Schlumberger and Petrobas.

The Increasing Complexity of the Electric Grid Demands Edge Computing, featuring Marissa Hummon, chief technology officer at Utilidata.

Browse a curated list of GTC sessions for business leaders of every technical level and area of interest.  

Battle.net Leaps Into the Cloud With GeForce NOW

GFN Thursday celebrates this leap day with the addition of a popular game store to the cloud.

Stream the first titles from Blizzard Entertainment’s Battle.net, including Diablo IV, Overwatch 2, Call of Duty HQ and Hearthstone, now playable across more devices than ever.

They’re all part of the 30 new games coming to GeForce NOW in March, with eight available this week.

Plus, Day Passes, announced at CES, are coming to the cloud next week, enabling gamers to experience the benefits of GeForce NOW Ultimate and Priority memberships for 24 hours at a time.

Welcome to the Cloud

More cloud gaming friends.

Battle.net is Blizzard’s digital storefront, a gateway to adventures in the Blizzard universe and home to a vibrant gaming community.

Members who own Diablo IV, Overwatch 2, Call of Duty HQ and Hearthstone on Battle.net can now stream these triple-A titles from NVIDIA GeForce RTX-powered servers in the cloud without worrying about hardware specs or long download times.

Cloud gamers have heart.

Battle the forces of evil in the dark, treacherous world of Diablo IV’s Sanctuary at up to 4K resolution and 120 frames per second with an Ultimate membership, even on under-powered devices. Assemble a deck to cast legendary spells in Hearthstone, and engage in epic firefights in Overwatch 2 and Call of Duty HQ at ultra-low latency thanks to the power of NVIDIA Reflex technology. Read this article and search for Hearthstone for more details on supported devices for this title.

Get ready to play Blizzard and Activision’s top-quality games anytime, anywhere. Battle.net joins supported platforms on GeForce NOW, including Steam, Epic Games Store, Xbox, Ubisoft Connect and GOG.com.

Not Mad at March

“A wasteland I like to call my home.”

Imagine a paradise … infested with zombies! In Welcome to ParadiZe — now available for members to stream — capture, control and teach zombies to farm or fight in the beautiful country of ParadiZe. Explore the world’s unique flora and fauna while using the zombies to defend the camp and do the dirty work.

In addition, members can look for the following this week:

STAR WARS: Dark Forces Remaster (New release on Steam, Feb. 28)
Space Engineers (New release on Xbox, available on PC Game Pass, Feb. 29)
Welcome to ParadiZe (New release on Steam, Feb. 29)
Call of Duty HQ (Battle.net)
Diablo IV (Battle.net)
Fort Solis (Steam)
Hearthstone (Battle.net)
Overwatch 2 (Battle.net)

Plus, check out what the rest of March looks like:

The Thaumaturge (New release on Steam, Mar. 4)
Classified: France ’44 (New release on Steam, Mar. 5)
Expeditions: A MudRunner Game (New release on Steam, Mar. 5)
Warhammer 40,000: Boltgun (New release on Xbox, available on PC Game Pass, Mar. 5)
Winter Survival (New release on Steam, Mar. 6)
Taxi Life: A City Driving Simulator (New release on Steam, Mar. 7)
Hellbreach: Vegas (New release on Steam, Mar. 11)
Crown Wars: The Black Prince (New release on Steam, Mar. 14)
Outcast – A New Beginning (New release on Steam, Mar. 15)
Alone in the Dark (New release on Steam, Mar. 20)
Breachway (New release on Steam, Mar. 22)
Palia (New release on Steam, Mar. 25)
Bulwark: Falconeer Chronicles (New release on Steam, Mar. 26)
Millennia (New release on Steam, Mar. 26)
Outpost: Infinity Siege (New release on Steam, Mar. 26)
SOUTH PARK: SNOW DAY! (New release on Steam, Mar. 26)
Balatro (Steam)
PARANORMASIGHT: The Seven Mysteries of Honjo (Steam)
Portal: Revolution (Steam)
STAR OCEAN THE SECOND STORY R (Steam)
STAR OCEAN THE SECOND STORY R – DEMO (Steam)
Undisputed (Steam)

Fantastic February

In addition to the 27 games announced last month, five more joined the GeForce NOW library:

Deep Rock Galactic: Survivor (New release on Steam, Feb. 14)
Goat Simulator 3 (New release on Steam, Feb. 15)
Le Mans Ultimate (New release on Steam, Feb. 20)
art of rally (Xbox, available on Microsoft Store)
Halo Infinite (Steam and Xbox, available on PC Game Pass)

The Thaumaturge didn’t make it in February due to a shift in its launch date, and is included in the March games list.

What are you planning to play this weekend? Let us know on X or in the comments below.

Wait for it…. pic.twitter.com/tXdyoeuQSP

— NVIDIA GeForce NOW (@NVIDIAGFN) February 28, 2024

What Is Sovereign AI?

Nations have long invested in domestic infrastructure to advance their economies, control their own data and take advantage of technology opportunities in areas such as transportation, communications, commerce, entertainment and healthcare.

AI, the most important technology of our time, is turbocharging innovation across every facet of society. It’s expected to generate trillions of dollars in economic dividends and productivity gains.

Countries are investing in sovereign AI to develop and harness such benefits on their own. Sovereign AI refers to a nation’s capabilities to produce artificial intelligence using its own infrastructure, data, workforce and business networks.

Why Sovereign AI Is Important

The global imperative for nations to invest in sovereign AI capabilities has grown since the rise of generative AI, which is reshaping markets, challenging governance models, inspiring new industries and transforming others — from gaming to biopharma. It’s also rewriting the nature of work, as people in many fields start using AI-powered “copilots.”

Sovereign AI encompasses both physical and data infrastructures. The latter includes sovereign foundation models, such as large language models, developed by local teams and trained on local datasets to promote inclusiveness with specific dialects, cultures and practices.

For example, speech AI models can help preserve, promote and revitalize indigenous languages. And LLMs aren’t just for teaching AIs human languages, but for writing software code, protecting consumers from financial fraud, teaching robots physical skills and much more.

In addition, as artificial intelligence and accelerated computing become increasingly critical tools for combating climate change, boosting energy efficiency and protecting against cybersecurity threats, sovereign AI has a pivotal role to play in equipping every nation to bolster its sustainability efforts.

Factoring In AI Factories

Comprising new, essential infrastructure for AI production are “AI factories,” where data comes in and intelligence comes out. These are next-generation data centers that host advanced, full-stack accelerated computing platforms for the most computationally intensive tasks.

Nations are building up domestic computing capacity through various models. Some are procuring and operating sovereign AI clouds in collaboration with state-owned telecommunications providers or utilities. Others are sponsoring local cloud partners to provide a shared AI computing platform for public- and private-sector use.

“The AI factory will become the bedrock of modern economies across the world,” NVIDIA founder and CEO Jensen Huang said in a recent media Q&A.

Sovereign AI Efforts Underway

Nations around the world are already investing in sovereign AI.

Since 2019, NVIDIA’s AI Nations initiative has helped countries spanning every region of the globe to build sovereign AI capabilities, including ecosystem enablement and workforce development, creating the conditions for engineers, developers, scientists, entrepreneurs, creators and public sector officials to pursue their AI ambitions at home.

France-based Scaleway, a subsidiary of the iliad Group, is building Europe’s most powerful cloud-native AI supercomputer. The NVIDIA DGX SuperPOD comprises 127 DGX H100 systems, representing 1,016 NVIDIA H100 Tensor Core GPUs interconnected by NVIDIA NVLink technology and the NVIDIA Quantum-2 InfiniBand platform. NVIDIA DGX systems also include NVIDIA AI Enterprise software for secure, supported and stable AI development and deployment.

Swisscom Group, majority-owned by the Swiss government, recently announced its Italian subsidiary, Fastweb, will build Italy’s first and most powerful NVIDIA DGX-powered supercomputer — also using NVIDIA AI Enterprise software — to develop the first LLM natively trained in the Italian language.

With these NVIDIA technologies and its own cloud and cybersecurity infrastructures, Fastweb plans to launch an end-to-end system with which Italian companies, public-administration organizations and startups can develop generative AI applications for any industry.

The government of India has also announced sovereign AI initiatives promoting workforce development, sustainable computing and private-sector investment in domestic compute capacity. India-based Tata Group, for example, is building a large-scale AI infrastructure powered by the NVIDIA GH200 Grace Hopper Superchip, while Reliance Industries will develop a foundation LLM tailored for generative AI and trained on the diverse languages of the world’s most populous nation. NVIDIA is also working with India’s top universities to support and expand local researcher and developer communities.

Japan is going all in with sovereign AI, collaborating with NVIDIA to upskill its workforce, support Japanese language model development, and expand AI adoption for natural disaster response and climate resilience. These efforts include public-private partnerships that are incentivizing leaders like SoftBank Corp. to collaborate with NVIDIA on building a generative AI platform for 5G and 6G applications as well as a network of distributed AI factories.

Finally, Singapore is fostering a range of sovereign AI programs, including by partnering with NVIDIA to upgrade its National Super Computer Center, or NSCC, with NVIDIA H100 GPUs. In addition, Singtel, a leading communications services provider building energy-efficient AI factories across Southeast Asia, is accelerated by NVIDIA Hopper architecture GPUs and NVIDIA AI reference architectures.

Read more about sovereign AI and its transformative potential.

Time to Skill Up: Game Reviewer Ralph Panebianco Wields NVIDIA RTX for the Win

Editor’s note: This post is part of our weekly In the NVIDIA Studio series, which celebrates featured artists, offers creative tips and tricks, and demonstrates how NVIDIA Studio technology improves creative workflows. We’re also deep diving on new GeForce RTX 40 Series GPU features, technologies and resources, and how they dramatically accelerate content creation.

YouTube content creator Ralph Panebianco really, really loves video games.

Since getting an original Nintendo Entertainment System at the age of four, Panebianco, this week’s featured In the NVIDIA Studio creator, has spent much of his free time playing video games. He pursued a career in gaming in his native country of Australia before pivoting to content creation, opening a YouTube channel called Skill Up, where he reviews the latest video games.

“When I wasn’t playing video games, I was reading about them, and now I get to talk about them for a living,” he said.

And calling all art fans: the latest Studio Standouts video features film noir-themed artwork brought to life with dramatic, monochromatic flair.

Video Editing Skillz

Panebianco works with his partner to create in-depth, insightful reviews of the latest video games on his Skill Up YouTube channel, which has garnered nearly 1 million subscribers. Below is a recent video reviewing Pacific Drive, a title available on the NVIDIA GeForce NOW cloud gaming service, powered by GeForce RTX GPUs.

“Creatively, we don’t view game reviews as functional buying guides with a list of pros and cons,” said Panebianco. “We view reviews as a chance to crack a game open and really show the audience what makes it tick. They’re sort of mini-essays on game design, delving deep into why specific game mechanics do or don’t work.”

The content creation process begins with booting up the game on his PC, powered by the recently launched GeForce RTX 4080 SUPER graphics card. This allows the Skill Up team to tap RTX ray tracing and NVIDIA DLSS — breakthrough technologies that use AI to create additional frames and improve image quality.

He records video footage primarily using GeForce Experience, a companion to NVIDIA GeForce GPUs that enables users to capture assets, optimize game settings and keep drivers up to date, among other features.

When footage requires high-dynamic range, the team uses the OBS Studio open-source software with AV1 hardware encoding to achieve 40% more efficient encoding on average than H.264 and deliver higher quality than competing GPUs.

“The AV1 encoder is ridiculously efficient in terms of file size,” he said.

NVIDIA GPUs and OBS Studio software work in synergy.

Once the footage is ready, Panebianco writes a video script in Microsoft Word and then records himself, using Audacity. He uses the AI-powered NVIDIA Broadcast app, free for RTX GPU owners, to eliminate background noise and achieve professional studio quality.

Panebianco then hands off the files to his editor for production in Adobe Premiere Pro, where a number of GPU-accelerated, AI-powered features such as Enhance Speech, Auto Reframe and Unsharp Mask help speed the video editing process.

NVIDIA’s GPU-accelerated video decoder (NVDEC) enables smooth playback and scrubbing of high-resolution videos.

Next, Panebianco exports the final files twice as fast thanks to the dual AV1 encoders in his RTX GPU. Lastly, his editor creates a YouTube thumbnail in Adobe Photoshop, and then the video is ready for publishing.

Adobe Photoshop has over 30 GPU-accelerated features that help modify and adjust images smoothly and quickly.

“Almost my entire workflow was enhanced by NVIDIA’s hardware,” Panebianco shared. “It’s not just about the hardware making for efficient encoding or lightning-fast, hardware-enabled rendering — it’s about the end-to-end toolset.”

Panebianco has words of wisdom for aspiring content creators.

“Worry less about the numbers and more about the quality,” he said. “The metrics grind pays little in the way of dividends, but putting out truly excellent content is an almost failure-proof path to growth.”

Video game content creator Ralph Panebianco.

Catch Panebianco’s video game reviews on the Skill Up YouTube channel.

Access tutorials on the Studio YouTube channel and get updates directly in your inbox by subscribing to the Studio newsletter

And … Action! Cuebric CEO Provides Insights Into Filmmaking Using AI

These days, just about everyone is a content creator. But can generative AI help make people create high-quality films and other content affordably? Find out from Pinar Seyhan Demirdag, cofounder and CEO of Cuebric, during his conversation with NVIDIA AI Podcast host Noah Kravitz.

Cuebric is on a mission to offer new solutions in filmmaking and content creation through immersive, two-and-a-half-dimensional cinematic environments. Its AI-powered application aims to help creators quickly bring their ideas to life, making high-quality production more accessible.

Demirdag discusses how Cuebric uses generative AI to enable the creation of engaging environments affordably. Listen in to find out about the current landscape of content creation, the role of AI in simplifying the creative process, and Cuebric’s participation in NVIDIA’s GTC technology conference.

The AI Podcast · Exploring Filmmaking with Cuebric’s AI: Insights from Pinar Seyhan Demirdag – Ep. 314

Time Stamps:

1:15: Getting to know Pinar Seyhan Demirdag and Cuebric
2:30: The beginnings and goals of Cuebric
4:45: How Cuebric’s AI application works for filmmakers
9:00: Advantages of AI in content creation
13:20: Making high-quality production budget-friendly
17:35: The future of AI in creative endeavors
22:00: Cuebric at NVIDIA GTC

You Might Also Like…

MIT’s Anant Agarwal on AI in Education – Ep. 197

AI could help students work smarter, not harder. Anant Agarwal, founder of edX and chief platform officer at 2U, shares his vision for the future of online education and the impact of AI in revolutionizing the learning experience.

UF Provost Joe Glover on Building a Leading AI University – Ep. 186

Joe Glover, provost and senior vice president of academic affairs at the University of Florida, discusses the university’s efforts to implement AI across all aspects of higher education, including a public-private partnership with NVIDIA that has helped transform UF into one of the leading AI universities in the country.

NVIDIA’s Marc Hamilton on Building the Cambridge-1 Supercomputer During a Pandemic – Ep. 137

Cambridge-1, the U.K.’s most powerful supercomputer, ranks among the world’s top 3 most energy-efficient supercomputers and was built to help healthcare researchers make new discoveries. Marc Hamilton, vice president of solutions architecture and engineering at NVIDIA, speaks on how he remotely oversaw its construction.

Subscribe to the AI Podcast

Get the AI Podcast through iTunes, Google Podcasts, Google Play, Amazon Music, Castbox, DoggCatcher, Overcast, PlayerFM, Pocket Casts, Podbay, PodBean, PodCruncher, PodKicker, Soundcloud, Spotify, Stitcher and TuneIn.

Make the AI Podcast better: Have a few minutes to spare? Fill out this listener survey.

Rack ‘n’ Roll: NVIDIA Grace Hopper Systems Gather at GTC

The spirit of Grace Hopper will live on at NVIDIA GTC.

Accelerated systems using powerful processors — named in honor of the pioneer of software programming — will be on display at the global AI conference running March 18-21, ready to take computing to the next level.

System makers will show more than 500 servers in multiple configurations across 18 racks, all packing NVIDIA GH200 Grace Hopper Superchips. They’ll form the largest display at NVIDIA’s booth in the San Jose Convention Center, filling the MGX Pavilion.

MGX Speeds Time to Market

NVIDIA MGX is a blueprint for building accelerated servers with any combination of GPUs, CPUs and data processing units (DPUs) for a wide range of AI, high performance computing and NVIDIA Omniverse applications. It’s a modular reference architecture for use across multiple product generations and workloads.

GTC attendees can get an up-close look at MGX models tailored for enterprise, cloud and telco-edge uses, such as generative AI inference, recommenders and data analytics.

The pavilion will showcase accelerated systems packing single and dual GH200 Superchips in 1U and 2U chassis, linked via NVIDIA BlueField-3 DPUs and NVIDIA Quantum-2 400Gb/s InfiniBand networks over LinkX cables and transceivers.

The systems support industry standards for 19- and 21-inch rack enclosures, and many provide E1.S bays for nonvolatile storage.

Grace Hopper in the Spotlight

Here’s a sampler of MGX systems now available:

ASRock RACK’s MECAI, measuring 450 x 445 x 87mm, accelerates AI and 5G services in constrained spaces at the edge of telco networks.
ASUS’s MGX server, the ESC NM2N-E1, slides into a rack that holds up to 32 GH200 processors and supports air- and water-cooled nodes.
Foxconn provides a suite of MGX systems, including a 4U model that accommodates up to eight NVIDIA H100 NVL PCIe Tensor Core GPUs.
GIGABYTE’s XH23-VG0-MGX can accommodate plenty of storage in its six 2.5-inch Gen5 NVMe hot-swappable bays and two M.2 slots.
Inventec’s systems can slot into 19- and 21-inch racks and use three different implementations of liquid cooling.
Lenovo supplies a range of 1U, 2U and 4U MGX servers, including models that support direct liquid cooling.
Pegatron’s air-cooled AS201-1N0 server packs a BlueField-3 DPU for software-defined, hardware-accelerated networking.
QCT can stack 16 of its QuantaGrid D74S-IU systems, each with two GH200 Superchips, into a single QCT QoolRack.
Supermicro’s ARS-111GL-NHR with nine hot-swappable fans is part of a portfolio of air- and liquid-cooled GH200 and NVIDIA Grace CPU systems.
Wiwynn’s SV7200H, a 1U dual GH200 system, supports a BlueField-3 DPU and a liquid-cooling subsystem that can be remotely managed.
Wistron’s MGX servers are 4U GPU systems for AI inference and mixed workloads, supporting up to eight accelerators in one system.

The new servers are in addition to three accelerated systems using MGX announced at COMPUTEX last May — Supermicro’s ARS-221GL-NR using the Grace CPU and QCT’s QuantaGrid S74G-2U and S74GM-2U powered by the GH200.

Grace Hopper Packs Two in One

System builders are adopting the hybrid processor because it packs a punch.

GH200 Superchips combine a high-performance, power-efficient Grace CPU with a muscular NVIDIA H100 GPU. They share hundreds of gigabytes of memory over a fast NVIDIA NVLink-C2C interconnect.

The result is a processor and memory complex well-suited to take on today’s most demanding jobs, such as running large language models. They have the memory and speed needed to link generative AI models to data sources that can improve their accuracy using retrieval-augmented generation, aka RAG.

Recommenders Run 4x Faster

In addition, the GH200 Superchip delivers greater efficiency and up to 4x more performance than using the H100 GPU with traditional CPUs for tasks like making recommendations for online shopping or media streaming.

In its debut on the MLPerf industry benchmarks last November, GH200 systems ran all data center inference tests, extending the already leading performance of H100 GPUs.

In all these ways, GH200 systems are taking to new heights a computing revolution their namesake helped start on the first mainframe computers more than seven decades ago.

Register for NVIDIA GTC, the conference for the era of AI, running March 18-21 at the San Jose Convention Center and virtually.

And get the 30,000-foot view from NVIDIA CEO and founder Jensen Huang in his GTC keynote

Meet the Omnivore: Mode Maison Harnesses OpenUSD to Drive Innovations in Retail With High-Fidelity Digital Twins

Editor’s note: This post is a part of our Meet the Omnivore series, which features individual creators and developers who use OpenUSD to build tools, applications and services for 3D workflows and physically accurate virtual worlds.

A failed furniture-shopping trip turned into a business idea for Steven Gay, cofounder and CEO of company Mode Maison.

Gay grew up in Houston and studied at the University of Texas before working in New York as one of the youngest concept designers at Ralph Lauren. He was inspired to start his own company after a long day of trying — and failing — to pick out a sofa.

The experience illuminated how the luxury home-goods industry has traditionally lagged in adopting digital technologies, especially those for creating immersive, interactive experiences for consumers.

Gay founded Mode Maison in 2018 with the goal of solving this challenge and paving the way for scalability, creativity and a generative future in retail. Using the Universal Scene Description framework, aka OpenUSD, and the NVIDIA Omniverse platform, Gay, along with Mode Maison Chief Technology Officer Jakub Cech and the Mode Maison team, are helping enhance and digitalize entire product lifecycle processes — from design and manufacturing to consumer experiences.

Register for NVIDIA GTC, which takes place March 17-21, to hear how leading companies are using the latest innovations in AI and graphics. And join us for OpenUSD Day to learn how to build generative AI-enabled 3D pipelines and tools using Universal Scene Description.

They developed a photometric scanning system, called Total Material Appearance Capture, which offers an unbiased, physically based approach to digitizing the material world that’s enabled by real-world embedded sensors.

TMAC captures proprietary data and the composition of any material, then turns it into input that serves as a single source of truth, which can be used for creating a fully digitized retail model. Using the system, along with OpenUSD and NVIDIA Omniverse, Mode Maison customers can create highly accurate digital twins of any material or product.

“By enabling this, we’re effectively collapsing and fostering a complete integration across the entire product lifecycle process — from design and production to manufacturing to consumer experiences and beyond,” said Gay.

Mode Maison developed a photometric scanning system called Total Material Appearance Capture.

Streamlining Workflows and Enhancing Productivity With Digital Twins

Previously, Mode Maison faced significant challenges in creating physically based, highly flexible and scalable digital materials. The limitations were particularly noticeable when rendering complex materials and textures, or integrating digital models into cohesive, multilayered environments.

Using Omniverse helped Gay and his team overcome these challenges by offering advanced rendering capabilities, physics simulations and extensibility for AI training that unlock new possibilities in digital retail.

Before using Omniverse and OpenUSD, Mode Maison used disjointed processes for digital material capture, modeling and rendering, often leading to inconsistencies, the inability to scale and minimal interoperability. After integrating Omniverse, the company experienced a streamlined, coherent workflow where high-fidelity digital twins can be created with greater efficiency and interoperability.

The team primarily uses Autodesk 3ds Max for design, and they import the 3D data using Omniverse Connectors. Gay says OpenUSD is playing an increasingly critical role in its workflows, especially when developing composable, flexible, interoperable capabilities across asset creation.

This enhanced pipeline starts with capturing high-fidelity material data using TMAC. The data is then processed and formatted into OpenUSD for the creation of physically based, scientifically accurate, high-fidelity digital twins.

“OpenUSD allows for an unprecedented level of collaboration and interoperability in creating complex, multi-layered capabilities and advanced digital materials,” Gay said. “Its ability to seamlessly integrate diverse digital assets and maintain their fidelity across various applications is instrumental in creating realistic, interactive digital twins for retail.”

OpenUSD and Omniverse have sped Mode Maison and their clients’ ability to bring products to market, reduced costs associated with building and modifying digital twins, and enhanced productivity through streamlined creation.

“Our work represents a major step toward a future where digital and physical realities will be seamlessly integrated,” said Gay. “This shift enhances consumer engagement and paves the way for more sustainable business practices by reducing the need for physical prototyping while enabling more precise manufacturing.”

As for emerging technological advancements in digital retail, Gay says AI will play a central role in creating hyper-personalized design, production, sourcing and front-end consumer experiences — all while reducing carbon footprints and paving the way for a more sustainable future in retail.

Join In on the Creation

Anyone can build their own Omniverse extension or Connector to enhance 3D workflows and tools.

Learn more about how OpenUSD and NVIDIA Omniverse are transforming industries at NVIDIA GTC, a global AI conference running March 18-21, online and at the San Jose Convention Center.

Join OpenUSD Day at GTC on Tuesday, March 19, to learn more about building generative AI-enabled 3D pipelines and tools using USD.

Get started with NVIDIA Omniverse by downloading the standard license free, access OpenUSD resources, and learn how Omniverse Enterprise can connect your team. Stay up to date on Instagram, Medium and X. For more, join the Omniverse community on the  forums, Discord server, Twitch and YouTube channels.

NVIDIA RTX 500 and 1000 Professional Ada Generation Laptop GPUs Drive AI-Enhanced Workflows From Anywhere

With generative AI and hybrid work environments becoming the new standard, nearly every professional, whether a content creator, researcher or engineer, needs a powerful, AI-accelerated laptop to help users tackle their industry’s toughest challenges — even on the go.

The new NVIDIA RTX 500 and 1000 Ada Generation Laptop GPUs will be available in new, highly portable mobile workstations, expanding the NVIDIA Ada Lovelace architecture-based lineup, which includes the RTX 2000, 3000, 3500, 4000 and 5000 Ada Generation Laptop GPUs.

AI is rapidly being adopted to drive efficiencies across professional design and content creation workflows and everyday productivity applications, underscoring the importance of having powerful local AI acceleration and sufficient processing power in systems.

The next generation of mobile workstations with Ada Generation GPUs, including the RTX 500 and 1000 GPUs, will include both a neural processing unit (NPU), a component of the CPU, and an NVIDIA RTX GPU, which includes Tensor Cores for AI processing. The NPU helps offload light AI tasks, while the GPU provides up to an additional 682 TOPS of AI performance for more demanding day-to-day AI workflows.

The higher level of AI acceleration delivered by the GPU is useful for tackling a wide range of AI-based tasks, such as video conferencing with high-quality AI effects, streaming videos with AI upscaling, or working faster with generative AI and content creation applications.

The new RTX 500 GPU delivers up to 14x the generative AI performance for models like Stable Diffusion, up to 3x faster photo editing with AI and up to 10x the graphics performance for 3D rendering compared with a CPU-only configuration — bringing massive leaps in productivity for traditional and emerging workflows.

Enhancing Professional Workflows Across Industries

The RTX 500 and 1000 GPUs elevate workflows with AI for laptop users everywhere in compact designs. Video editors can streamline tasks such as removing background noise with AI. Graphic designers can bring blurry images to life with AI upscaling. Professionals can work on the go while using AI for higher-quality video conferencing and streaming experiences.

For users looking to tap AI for advanced rendering, data science and deep learning workflows, NVIDIA also offers the RTX 2000, 3000, 3500, 4000 and 5000 Ada Generation Laptop GPUs. 3D creators can use AI denoising and deep learning super sampling (DLSS) to visualize photorealistic renders in real time. Businesses can query their internal knowledge base with chatbot-like interfaces using local large language models. And researchers and scientists can experiment with data science, AI model training and tuning, and development projects.

Performance and Portability With NVIDIA RTX

The RTX 500 and 1000 GPUs, based on the NVIDIA Ada Lovelace architecture, bring the latest advancements to thin and light laptops, including:

Third-generation RT Cores: Up to 2x the ray tracing performance of the previous generation for high-fidelity, photorealistic rendering.
Fourth-generation Tensor Cores: Up to 2x the throughput of the previous generation, accelerating deep learning training, inferencing and AI-based creative workloads.
Ada Generation CUDA cores: Up to 30% the single-precision floating point (FP32) throughput compared to the previous generation for significant performance improvements in graphics and compute workloads.
Dedicated GPU memory: 4GB GPU memory with the RTX 500 GPU and 6GB with the RTX 1000 GPU allows users to run demanding 3D and AI-based applications, as well as tackle larger projects, datasets and multi-app workflows.
DLSS 3: Delivers a breakthrough in AI-powered graphics, significantly boosting performance by generating additional high-quality frames.
AV1 encoder: Eighth-generation NVIDIA encoder, aka NVENC, with AV1 support is up to 40% more efficient than H.264, enabling new possibilities for broadcasting, streaming and video calling.

Availability

The new NVIDIA RTX 500 and 1000 Ada Generation Laptop GPUs will be available this spring in mobile workstations from global manufacturing partners including Dell Technologies, HP, Lenovo and MSI.

Learn more about the latest NVIDIA RTX Laptop GPUs.

Add It to the Toolkit: February Studio Driver and NVIDIA App Beta Now Available

Editor’s note: This post is part of our weekly In the NVIDIA Studio series, which celebrates featured artists, offers creative tips and tricks, and demonstrates how NVIDIA Studio technology improves creative workflows. We’re also deep diving on new GeForce RTX 40 Series GPU features, technologies and resources, and how they dramatically accelerate content creation.

The February NVIDIA Studio Driver, designed specifically to optimize creative apps, is now available for download. Developed in collaboration with app developers, Studio Drivers undergo extensive testing to ensure seamless compatibility with creative apps while enhancing features, automating processes and speeding workflows.

Creators can download the latest driver on the public beta of the new NVIDIA app, the essential companion for creators and gamers with NVIDIA GPUs in their PCs and laptops. The NVIDIA app beta is a first step to modernize and unify the NVIDIA Control Panel, GeForce Experience and RTX Experience apps.

The NVIDIA App offers easy access to the latest Studio Drivers, a suite of AI-powered Studio apps, games and more.

The NVIDIA app simplifies the process of keeping PCs updated with the latest NVIDIA drivers, enables quick discovery and installation of NVIDIA apps like NVIDIA Broadcast and NVIDIA Omniverse, unifies the GPU control center, and introduces a redesigned in-app overlay for convenient access to powerful recording tools. Download the NVIDIA app beta today.

Adobe Premiere Pro’s AI-powered Enhance Speech tool is now available in general release. Accelerated by NVIDIA RTX, the new feature removes unwanted noise and improves the quality of dialogue clips so they sound professionally recorded. It’s 75% faster on a GeForce RTX 4090 laptop GPU compared with an RTX 3080 Ti.

Adobe Premiere Pro’s AI-powered Enhance Speech tool removes unwanted noise and improves dialogue quality.

Have a Chat with RTX, the tech demo app that lets GeForce RTX owners personalize a large language model connected to their own content. Results are fast and secure since it runs locally on a Windows RTX PC or workstation. Download Chat with RTX today.

And this week In the NVIDIA Studio, filmmaker James Matthews shares his short film, Dive, which was created with an Adobe Premiere Pro-powered workflow supercharged by his ASUS ZenBook Pro NVIDIA Studio laptop with a GeForce RTX 4070 graphics card.

Going With the Flow

Matthews’ goal with Dive was to create a visual and auditory representation of what it feels like to get swallowed up in the creative editing process.

Talk about a dream content creation location.

“When I’m really deep into an edit, I sometimes feel like I’m fully immersed into the film and the editing process itself,” he said. “It’s almost like a flow state, where time stands still and you are one with your own creativity.”

To capture and visualize that feeling, Matthews used the power of his ASUS ZenBook Pro NVIDIA Studio laptop equipped with a GeForce RTX 4070 graphics card.

He started by brainstorming — listening to music and sketching conceptual images with pencil and paper. Then, Matthews added a song to his Adobe Premiere Pro timeline and created a shot list, complete with cuts and descriptions of focal range, speed, camera movement, lighting and other details.

Next, he planned location and shooting times, paying special attention to lighting conditions.

“I always have my Premiere Pro timeline up so I can really see and feel what I need to create from the images I originally drew while building the concept in my head,” Matthews said. “This helps get the pacing of each shot right, by watching it back and possibly adding it into the timeline for a test.”

Then, Matthews started editing the footage in Premiere Pro, aided by his Studio laptop. His dedicated GPU-based NVIDIA video encoder (NVENC) enabled buttery-smooth playback and scrubbing of his high-resolution and multi-stream footage, saving countless hours.

Matthews’ RTX GPU accelerated a variety of AI-powered Adobe video editing tools, such as Enhance Speech, Scene Edit Detection and Auto Color, which applies color corrections with just a few clicks.

Finally, Matthews added sound design before exporting the final files twice as fast thanks to NVENC’s dual AV1 encoders.

“The entire edit used GPU acceleration,” he shared. “Effects in Premiere Pro, along with the NVENC video encoders on the GPU, unlocked a seamless workflow and essentially allowed me to get into my flow state faster.”

Filmmaker James Matthews.

Watch Matthews’ content on YouTube.

Follow NVIDIA Studio on Facebook, Instagram and X . Access tutorials on the Studio YouTube channel and get updates directly in your inbox by subscribing to the Studio newsletter