philip lelyveld The world of entertainment technology

17Jul/20Off

OpenAI’s fiction-spewing AI is learning to generate images

Screen-Shot-2020-07-15-at-1.05.23-PMAt its core, GPT-2 is a powerful prediction engine. It learned to grasp the structure of the English language by looking at billions of examples of words, sentences, and paragraphs, scraped from the corners of the internet. With that structure, it could then manipulate words into new sentences by statistically predicting the order in which they should appear.

So researchers at OpenAI decided to swap the words for pixels and train the same algorithm on images in ImageNet, the most popular image bank for deep learning. Because the algorithm was designed to work with one-dimensional data (i.e., strings of text), they unfurled the images into a single sequence of pixels. They found that the new model, named iGPT, was still able to grasp the two-dimensional structures of the visual world. Given the sequence of pixels for the first half of an image, it could predict the second half in ways that a human would deem sensible.

The results are startlingly impressive and demonstrate a new path for using unsupervised learning, which trains on unlabeled data, in the development of computer vision systems.

At the same time, the method presents a concerning new way to create deepfake images.

See the full story here: https://www.technologyreview.com/2020/07/16/1005284/openai-ai-gpt-2-generates-images/?truid=33b587ecf0755237a213721d72ba90e8&utm_source=the_download&utm_medium=email&utm_campaign=the_download.unpaid.engagement&utm_term=subs&utm_content=07-17-2020

Comments (0) Trackbacks (0)

Sorry, the comment form is closed at this time.

Trackbacks are disabled.