Denoising Diffusion Models: Generative Models of Modern Deep Learning Era (Arash Vahdat, NVIDIA)

Sdílet
Vložit
  • čas přidán 5. 09. 2024
  • Date: Mar 17, 2023
    Abstract:
    Diffusion models are revolutionizing the way we train deep generative models. Comprising a forward process that adds Gaussian noise iteratively to data and a reverse process that learns to generate data by denoising, these models exhibit exceptional sample quality and diversity. However, their iterative nature often results in slow sampling. In this talk, I will provide a brief overview of denoising diffusion models and highlight some of the successful frameworks we have recently developed at NVIDIA using these models, including text-to-image models, 3D shape models, and adversarially robust classification frameworks. Additionally, I will delve into the sampling challenges from diffusion models and introduce three frameworks we have created to address them. These include latent score-based generative models that train diffusion models in a latent space, denoising diffusion GANs that employ complex multimodal distributions for denoising, and higher-order solvers that solve the sampling differential equations in diffusion models in fewer steps.
    Bio:
    Arash Vahdat is a principal research scientist at NVIDIA research specializing in generative AI technologies. Before joining NVIDIA, he was a research scientist at D-Wave Systems where he worked on generative learning and its applications in efficient training. Before D-Wave, Arash was a research faculty member at Simon Fraser University (SFU), where he led deep learning-based video analysis research and taught master courses on machine learning for big data. Arash's current areas of research include generative learning, representation learning, and efficient deep learning.

Komentáře •