Artwork

Content provided by Deep Learning Deep Dive. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Deep Learning Deep Dive or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://ro.player.fm/legal.
Player FM - Aplicație Podcast
Treceți offline cu aplicația Player FM !

Episode #2: DALL-E and friends in image generation

1:51:27
 
Distribuie
 

Manage episode 336489917 series 3274640
Content provided by Deep Learning Deep Dive. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Deep Learning Deep Dive or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://ro.player.fm/legal.

Andrej Karpathy and Justin Johnson deep dive into OpenAI's DALL-E and use it as an anchor point to recurse into some of the recent work in AI on image generation. Approximate agenda:

DALL-E Blog Post:
https://openai.com/blog/dall-e/

ImageGPT
https://openai.com/blog/image-gpt/

VQ-VAE
https://arxiv.org/abs/1711.00937

VQ-VAE-2
https://arxiv.org/abs/1906.00446

Gumbel-Softmax / Concrete Distribution
https://arxiv.org/abs/1611.01144
https://arxiv.org/abs/1611.00712

VQGAN
https://arxiv.org/abs/2012.09841

Andrej's attempted re-implementation of VQVAE and GumbelSoftmax:
https://github.com/karpathy/deep-vector-quantization/blob/main/model.py

You can see a video version of this episode on YouTube:
https://www.youtube.com/watch?v=gMc90bqHMSM

We reached out to all speakers and obtained their written consent to appear in this recording.

  continue reading

2 episoade

Artwork
iconDistribuie
 
Manage episode 336489917 series 3274640
Content provided by Deep Learning Deep Dive. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Deep Learning Deep Dive or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://ro.player.fm/legal.

Andrej Karpathy and Justin Johnson deep dive into OpenAI's DALL-E and use it as an anchor point to recurse into some of the recent work in AI on image generation. Approximate agenda:

DALL-E Blog Post:
https://openai.com/blog/dall-e/

ImageGPT
https://openai.com/blog/image-gpt/

VQ-VAE
https://arxiv.org/abs/1711.00937

VQ-VAE-2
https://arxiv.org/abs/1906.00446

Gumbel-Softmax / Concrete Distribution
https://arxiv.org/abs/1611.01144
https://arxiv.org/abs/1611.00712

VQGAN
https://arxiv.org/abs/2012.09841

Andrej's attempted re-implementation of VQVAE and GumbelSoftmax:
https://github.com/karpathy/deep-vector-quantization/blob/main/model.py

You can see a video version of this episode on YouTube:
https://www.youtube.com/watch?v=gMc90bqHMSM

We reached out to all speakers and obtained their written consent to appear in this recording.

  continue reading

2 episoade

Toate episoadele

×
 
Loading …

Bun venit la Player FM!

Player FM scanează web-ul pentru podcast-uri de înaltă calitate pentru a vă putea bucura acum. Este cea mai bună aplicație pentru podcast și funcționează pe Android, iPhone și pe web. Înscrieți-vă pentru a sincroniza abonamentele pe toate dispozitivele.

 

Ghid rapid de referință

Listen to this show while you explore
Play