Same prompt First Frame Chat Gpt Second Frame Grok Which one is better "Create an ultra realistic cinematic image with the following constraints: A rainy cyberpunk Lagos street at night, viewed from a low angle, shot on a 50mm lens with shallow depth of field. Neon signs written in accurate Yoruba text reflect on wet asphalt. The main subject is a young Nigerian man wearing a slightly worn black hoodie, standing under a flickering streetlight. His face must show subtle emotion exhaustion mixed with quiet determination. Skin texture must be natural with visible pores, light scars, and realistic subsurface scattering. Behind him, in the midground, a transparent holographic interface floats, displaying floating charts, code fragments, and African inspired geometric UI elements. The hologram light must correctly illuminate nearby surfaces and cast faint colored reflections on his face and hoodie. In the background, moving traffic creates realistic motion blur, while pedestrians holding umbrellas appear partially blurred due to depth of field. Rain droplets must be visible in mid air, some sharply in focus, others blurred. Puddles on the ground should show accurate reflections of neon signs, buildings, and headlights with physically correct distortion. Lighting must include a three point setup: key light from neon signage, soft fill light from ambient city glow, and rim light from passing car headlights. Shadows must be soft and physically consistent. Color grading should follow a cinematic teal and orange palette while preserving natural skin tones. No over sharpening. No plastic skin. No extra fingers. No distorted faces. Text must be legible and spelled correctly. The image should feel like a still from a high budget sci fi film shot on IMAX, with extreme attention to realism, composition, perspective, lighting physics, and cultural accuracy. Aspect ratio 16:9. Ultra high resolution. Photorealistic. Zero artifacts."