Generative AI: The new power tool for creative pros
- 21 December, 2017 22:00
Artificial intelligence can generate fresh patterns in data with astonishing speed, efficiency, and verisimilitude. Over the past few years, it has become commonplace for AI to algorithmically generate any object that can be rendered digitally.
Increasingly, this application of AI is being referred to as “generative.” Generative AI will drive the next generation of apps for autoprogramming, content development, visual arts, and other creative, design, and engineering activities.
By 2019, most leading AI providers will offer tools and libraries for building AI-powered natural-language generation, image manipulation, and other generative use cases. Already, generative AI has proven itself—in both research and in commercial applications—in the following areas:
- Generative graphics: AI can abstract visual patterns from artwork and then apply those patterns in the fanciful rerendering of photographic images with the hallmark features of that artwork. These algorithms can also transform any rough doodle into an impressive drawing that seems to have been created by expert human artists depicting real-world models. They can take hand-drawn sketches of human faces and algorithmically transform them into photorealistic images. They can instruct a computer to render any image so it appears as if it were composed by a specific human artist in a specific style. And they can organically conjure from any image any patterns, figures, and other details that were not present in the source.
- Generative photos: AI can autocorrect photos by generating and superimposing onto the original any visual elements that were missing, obscure, or misleading. It can also transform any low-resolution original image into a natural-looking high-resolution version. It can generate natural-looking but synthetic human faces by blending existing portraits or abstracting features from any specific portrait. And it can generate photo-realistic images from semantic label maps.
- Generative audio: AI can render any computer-generated voice into one that truly sounds like it was naturally produced in a human vocal tract. It can translate text to speech with astonishing naturalness. It can also compose music that feels like it expresses some authentic feeling deep in the soul of an actual human musician.
- Generative video: AI can tweak frames in a motion picture to conform with whatever style, lighting, or other effect the director wants to achieve.
- Generative text: AI can generate natural language content rapidly in large varieties at scale at top speed. It can generate captions, annotations, and other narratives from images and other content. And it can blend existing fonts into new font designs.
- Generative code: AI can generate builds of program code that address an application domain of interest.
- Generative materials: AI can even render prosthetic limbs, organic molecules, and other items from scratch when actuated through 3D printing, CRISPR, and other technologies.
In 2018 and beyond, more solutions will come to market—in all verticals—that use leading-edge AI approaches known as generative adversarial networks (GANs) to algorithmically create digital and analog objects of all sorts with astonishing accuracy.
Before 2019, more providers will roll out GAN-driven tools and workbenches for software programming, computer-aided design, web content development, music composition, image manipulation, video production, and other creative disciplines. And generative photo apps are likely to come to every smart camera application on mobile devices and other mass-market IoT endpoints.
GANs were a massive global research focus in the AI community in 2017. The pace of advances in GAN technology is likely to accelerate in the coming year. And generative design techniques are likely to come into the core curricula of data science, creative, and engineering professions globally.