Image In Words is a generative model for producing ultra-detailed text descriptions of images. It uses GPT-4o for image recognition and description, which lets it handle more complex scenarios, and it supports English only. Trained on 100,000 hours of English data, Image In Words produces high-quality, natural-sounding text.