If you don’t insist on having the LLM complete everything by itself, this may be simpler:
Short answer
I would treat this less as a “find the perfect generative model” problem and more as a pipeline design problem.
For this kind of description-generation task, I would probably use vLLM as the inference backend, run one or more reputable Hugging Face models behind it, and put…