2026-01-15
Chinese company Z.ai announces open-source image generation AI "GLM-Image", a hybrid of autoregressive and diffusion models
Chinese AI company Z.ai has announced "GLM-Image", a discrete autoregressive image generation model that is open source yet delivers industrial-grade performance. GLM-Image is a hybrid of an autoregressive model and a diffusion model.
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation
GLM-Image, the image generation model Z.ai announced on January 14, 2026 (local time), is a hybrid that combines an autoregressive model with a diffusion model. The autoregressive model is initialized from "GLM-4-9B-041", which has 9 billion parameters. The diffusion model reportedly follows "CogView4" and adopts a single-stream DiT structure with 7 billion parameters.
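As a rough back-of-the-envelope check on the two parameter counts quoted above, the sketch below adds them up and estimates the raw weight storage. The precision of 2 bytes per parameter (bf16/fp16) is an assumption, since the article does not state how the weights are stored, and the helper name `weight_memory_gb` is hypothetical.

```python
# Parameter counts as reported in the article (approximate).
AR_PARAMS = 9e9          # autoregressive model
DIFFUSION_PARAMS = 7e9   # single-stream DiT diffusion model


def weight_memory_gb(num_params: float, bytes_per_param: int) -> float:
    """Raw weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


total_params = AR_PARAMS + DIFFUSION_PARAMS

# Assumption: 2 bytes per parameter (bf16/fp16); not stated in the article.
print(f"total parameters: {total_params / 1e9:.0f}B")
print(f"weights at 2 bytes/param: {weight_memory_gb(total_params, 2):.0f} GB")
```

This counts only the two components named in the article. Any other components, as well as activations at inference time, would add to the figure.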
GLM-Image's hybrid design offers major advantages in text rendering and knowledge-intensive generation. In particular, it performs well on tasks that require accurate semantic understanding and complex information representation, while also producing output that stays faithful to the prompt.
Below is a collection of example images generated by GLM-Image.
Diffusion models have become the mainstream approach to image generation thanks to their training stability and strong generalization ability. Although diffusion models and variational autoencoders (VAEs) improve year by year, diffusion models still struggle with complex instructions and knowledge-intensive scenarios.
Meanwhile, a growing number of image generation AIs in recent years adopt autoregressive models, which enable output that is faithful to the prompt, but autoregressive models have the drawback of slow execution. GLM-Image was developed as a hybrid model that combines the strengths of diffusion models and autoregressive models.
In GLM-Image, the autoregressive generator produces tokens that carry low-frequency semantic signals, and the diffusion decoder refines the high-frequency details to deliver the final image. With this hybrid architecture, GLM-Image not only handles general image generation tasks reliably but also shows notable advantages in creative work that requires complex knowledge representation.
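The division of labor between a coarse, low-frequency stage and a fine, high-frequency stage can be illustrated with a toy one-dimensional signal. This is only an analogy under simple assumptions, not GLM-Image's actual implementation: block averaging stands in for the coarse semantic layout and the residual stands in for the detail that a decoder would have to supply. The function names `coarse_pass` and `detail_pass` are hypothetical.

```python
from typing import List


def coarse_pass(signal: List[float], block: int) -> List[float]:
    """Low-frequency part: replace each block of samples with its mean."""
    out: List[float] = []
    for start in range(0, len(signal), block):
        chunk = signal[start:start + block]
        mean = sum(chunk) / len(chunk)
        out.extend([mean] * len(chunk))
    return out


def detail_pass(signal: List[float], coarse: List[float]) -> List[float]:
    """High-frequency part: what is left after removing the coarse layout."""
    return [s - c for s, c in zip(signal, coarse)]


signal = [1.0, 3.0, 2.0, 6.0, 10.0, 12.0, 9.0, 13.0]
coarse = coarse_pass(signal, block=4)   # analogous to the coarse semantic stage
detail = detail_pass(signal, coarse)    # analogous to the detail a decoder adds

# Adding the detail back onto the coarse layout recovers the original signal.
reconstructed = [c + d for c, d in zip(coarse, detail)]
print(coarse)
print(detail)
```

In this toy setting the coarse part fixes the overall layout and the detail part carries only the local variation, which is the intuition behind letting one stage handle semantics and another handle fine texture.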
The figure below shows GLM-Image's image generation mechanism.
The table below compares the text-rendering accuracy of image generation models. Despite being an open-source model, GLM-Image achieves the highest NED and the highest average word accuracy among the models compared.
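| Model | Open source | NED | CLIPScore | Word Accuracy (2 regions) | Word Accuracy (3 regions) | Word Accuracy (4 regions) | Word Accuracy (5 regions) | Word Accuracy (average) |
|---|---|---|---|---|---|---|---|---|
| GLM-Image | Yes | 0.9557 | 0.7877 | 0.9103 | 0.9209 | 0.9169 | 0.8975 | 0.9116 |
| Seedream 4.5 | No | 0.9483 | 0.8069 | 0.8778 | 0.8952 | 0.9083 | 0.9008 | 0.899 |
| Z-Image | Yes | 0.9367 | 0.7969 | 0.9006 | 0.8722 | 0.8652 | 0.8512 | 0.8671 |
| Qwen-Image-2512 | Yes | 0.929 | 0.7819 | 0.863 | 0.8571 | 0.861 | 0.8618 | 0.8604 |
| Z-Image-Turbo | Yes | 0.9281 | 0.8048 | 0.8872 | 0.8662 | 0.8628 | 0.8347 | 0.8585 |
| GPT Image 1 [High] | No | 0.9478 | 0.7982 | 0.8779 | 0.8659 | 0.8731 | 0.8218 | 0.8569 |
| Seedream 4.0 | No | 0.9224 | 0.7975 | 0.8585 | 0.8484 | 0.8538 | 0.8269 | 0.8451 |
| Qwen-Image | Yes | 0.9116 | 0.8017 | 0.837 | 0.8364 | 0.8313 | 0.8158 | 0.8288 |
| Nano Banana 2.0 | No | 0.8754 | 0.7372 | 0.7368 | 0.7748 | 0.7863 | 0.7926 | 0.7788 |
| TextCrafter | Yes | 0.8679 | 0.7868 | 0.7628 | 0.7628 | 0.7406 | 0.6977 | 0.737 |
| SD3.5 Large | Yes | 0.847 | 0.7797 | 0.7293 | 0.6825 | 0.6574 | 0.594 | 0.6548 |
| Seedream 3.0 | No | 0.8537 | 0.7821 | 0.6282 | 0.5962 | 0.6043 | 0.561 | 0.5924 |
| FLUX.1 [dev] | Yes | 0.6879 | 0.7401 | 0.6089 | 0.5531 | 0.4661 | 0.4316 | 0.4965 |
| 3DIS | Yes | 0.6505 | 0.7767 | 0.4495 | 0.3959 | 0.388 | 0.3303 | 0.3813 |
| RAG-Diffusion | Yes | 0.4498 | 0.7797 | 0.4388 | 0.3316 | 0.2116 | 0.191 | 0.2648 |
| TextDiffuser-2 | Yes | 0.4353 | 0.6765 | 0.5322 | 0.3255 | 0.1787 | 0.0809 | 0.2326 |
| AnyText | Yes | 0.4675 | 0.7432 | 0.0513 | 0.1739 | 0.1948 | 0.2249 | 0.1804 |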
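The article does not define the NED and Word Accuracy columns used in the comparison above. The sketch below shows one common way such scores are computed, assuming NED is reported as a similarity (1 minus the Levenshtein distance divided by the longer string's length, so higher is better) and word accuracy is the fraction of target words reproduced exactly. The benchmark's real definitions may differ, for example in case handling or in how recognized text is matched to regions, and the function names here are hypothetical.

```python
from typing import List


def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit-cost insert, delete, and substitute."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete from a
                            curr[j - 1] + 1,      # insert into a
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def ned_similarity(pred: str, target: str) -> float:
    """Assumed NED form: 1 - distance / max length (1.0 means identical)."""
    longest = max(len(pred), len(target))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(pred, target) / longest


def word_accuracy(pred_words: List[str], target_words: List[str]) -> float:
    """Fraction of target words that the rendered text reproduces exactly."""
    if not target_words:
        return 1.0
    hits = sum(1 for p, t in zip(pred_words, target_words) if p == t)
    return hits / len(target_words)


# Hypothetical example: text read back from a generated image vs. the prompt.
print(ned_similarity("OPAN", "OPEN"))
print(word_accuracy(["GRAND", "OPENNING"], ["GRAND", "OPENING"]))
```

Under these assumptions a single wrong character lowers NED only slightly but counts the whole word as a miss for word accuracy, which is one reason a model's NED can be noticeably higher than its word accuracy in the table.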
Because GLM-Image is an open-source image generation model, its code and model data are published on GitHub and Hugging Face.
GitHub - zai-org/GLM-Image: GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation.
https://github.com/zai-org/GLM-Image
zai-org/GLM-Image · Hugging Face
https://huggingface.co/zai-org/GLM-Image