Working on the red flag interpretation image

June 08, 2023

Working on the red flag interpretation image

Using iconic photos and translating them into an AI context pendant was the initial idea of this project.

One of the images that came out throughout the hundreds of iteration is this one here:

This image is impressive, but it is not what I wanted. The surrounding buildings are higher than the object the flag carrier has climbed on. It is nice for a first try, but does not really match my consistent AI style.

Here I will show some of the steps taken to receive the desired outcome. You will see, that the work was far more than just a click on the AI generator. In fact, achieving the desired visual output is often a hard and long-winded process.

Starting with some of the original photos made on 2nd of May 1945 on the Reichstag building by photographer Jewgeni Ananjewitsch Chaldei, one would think it is all easy.

Step 1: Use one of the black and white photos and have it run through an AI colorizer. In this case, https://deepai.org/machine-learning-model/colorizer was used, but the outcome was rather poor.

There were very few colors added. I manually recolored the uniforms of the soldiers with the original colors.

Working with this as a template, I imported it to Leonardo.Ai and used it as a seed for my individual prompt. The results were disappointing. Soon I came to the conclusion, that a large portion of that iconic "photo" has been hand-painted. Only a few parts in the foreground are from the initial photo. You can tell that by either enlarging the photo or carefully examining the background details. In addition, there is a wrong perspective of the buildings to the left. This wrong perspective gets even more dominant as I expanded the image using outpainting in Leonardos.AIs canvas tool.

No matter which setting, I never achieved my desired crystal-clear high-detail image. I spend around 2000 token experimenting, editing the colorized image in Gimp, manually adding colors, and trying to outpaint the image so it will have a 16:9 ratio at the end of teh procedure. I achieved impressive images if I let teh guidance to a very low value and reduced the init strength value, but all that came out had little to to with the original scene and setting.

I wanted to image to have the right perspective, which would be, looking down from the highest building in the area. I never achieved that. In addition, the soldiers in the image became blurry, combined with the building, or disappeared. The AI was not able to tell the soldiers from parts of the building as the colors and the details got crispier and more detailed.

After 2000 toke I was about to give up. Again: A large portion of the "photo" is painted and the perspectives are wrong. The AI will be influenced and return paintings instead of a hyperrealistic image and create buildings, that are much higher than the one the flag is placed on.

Then I started one last try and researched some Russian sites for additional photo material. My plan was to replace the parts that were painted by the real photographed images. I learned that Jewge Chaldei did take more than 20 photos.

That was a new starting point for me. Now at least I had a more detailed representation of te soldiers and also a higher contrast. Also, the color space is now as much broader, which should finally allow the AI to create a more realistic version.

Still, the painted or repainted background was a problem. The AI tended to create monumental drawings of the scene, instead of a photorealistic representation.

Here are some steps in the process:

Cutting out the background because it influenced the drawing style of the AI and resulted in paint style images.

Carefully outpainting the missing parts. Instructing it will the keywords like: "Berlin 1945, war, burning buildings, avenue" and some others.

In the end, I had a similar image, but with much more photorealistic details, which served as a seed image for Leonardo's image-to-image creation.

I tried several settings which came out quite nice. One is shown here, but not the final image which required some more manual editing using gimp and Leonardo.AIs canvas:

Through the usage of the manually edited image, the AI was able to create a more consistent image regarding details, and color space, while maintaining a high degree of detail. Still, I am not fully happy with it. I will work on and until n that magical moment comes and you know "THIS is it"

If you like the scene and/or want to support my work: The final heavily edited version will be available as a 16:9 ratio poster on Artstation using my unique signature in a ultra high resolution.

Check it out:

https://www.artstation.com/m-a-buth/prints

Search This Blog

The Prompt Temple

Working on the red flag interpretation image

Comments

Post a Comment

Popular Posts

DeepSeek-All you need to know by M. A. Buth - Great jump start into AI revolution