Nils Durner's Blog Ahas, Breadcrumbs, Coding Epiphanies

Amazon's Titan Image Generator: A Comparison with DALL-E

Amazon have launched a preview of Titan Image Generator - three screencaps from the AI/ML keynote attached below. This is very similar to the DALL-E that is available through labs.openai.com (DALL-E 2?):

  • only very limited resolutions available, both for Generation and Editing (screenshot #4).
  • result is always PNG

Unlike DALL-E:

  • I haven’t been able to gotten the Edit to really work (screenshot #5 -> #6, #7 -> #8 face changed even though I had only defined the editing rectangle around the shirt and asked for a blue one)
  • You cannot work on a full resolution photo (DALL-E just restricts the editing area)

I couldn’t find any watermarks (screencap #3) in the finished result. Either they are not part of the Preview, or they are at least in a different format & style from the ones applied by Adobe Firefly.

Speaking of Adobe Firefly: I had success with one photo by using it on iPad: the brush bug didn’t show, and neither did the image noise in the result download. Either there was a major bugfix release or it’s simply a “Mobile First” app. 😉

Screenshot #1
Screenshot #7
Screenshot #7
Screenshot #6
Screenshot #5
Screenshot #4
Screenshot #2
Screenshot #3

The Verge on the watermark:

The catch, though, comes with how the invisible watermark is detected. Amazon created an API that people can connect to and then feed the image to check the image’s provenance. Philomin said this is by design; after all, Titan is not an end product but a model, so developers building with Titan Image Generator will choose how to provide that information to users. (Source)