Each row is a gundam image + a text description. The original project used BLIP to auto-caption the images but that didn't really work for this dataset so instead I asked BLIP to only describe the colors and inserted them into a generic description: "A robot, humanoid, futuristic, <colors>". One could likely get better results with more fine-grained captions.
Out of curiosity did you you have to pay to host your demos on HuggingFace? I looked around for some free options with GPUs but only found Google Colab which isn't very convenient for Gradio apps.
OnlineGrab t1_jcr8xd6 wrote
Reply to comment by hgaterms in Hibernation, a closely studied option for extended space travel by LeMonde_en
Could be worse, imagine if >!your sun was dying and you were the only hope of your species! !<