ChatGPT¤Ê¤É¤Ë»È¤ï¤ì¤ëÂ絬ÌϸÀ¸ì¥â¥Ç¥ë¤ò½¾Íè¤Î¥·¥¹¥Æ¥à¤è¤ê¤â15Çܹ⮡¦Ä㥳¥¹¥È¤Ç³Ø½¬¤Ç¤¤ë¡ÖDeepSpeed-Chat¡×¤òMicrosoft¤¬¸ø³«
OpenAI¤¬Ä󶡤¹¤ë¡ÖChatGPT¡×¤Ê¤É¤Î¥Á¥ã¥Ã¥ÈAI¤ÏÍ×Ìó¤ä¥³¡¼¥Ç¥£¥ó¥°¡¢ËÝÌõ¤Ê¤É¤ò¿Í´Ö¤ÎÀìÌç²È°Ê¾å¤ÎÀºÅ٤Ǽ¹ԤǤ¤ë¤ÈÊó¹ð¤µ¤ì¤Æ¤¤¤Þ¤¹¡£¤·¤«¤·¥Á¥ã¥Ã¥ÈAI¤Î·±Îý¤ËɬÍפʿʹ֤Υե£¡¼¥É¥Ð¥Ã¥¯¤Ë´ð¤Å¤¤¤¿¶¯²½³Ø½¬(RLHF)¤ò¼Â¹Ô¤¹¤ë¥¨¥ó¥É¥Ä¡¼¥¨¥ó¥É¤Ê¥Ñ¥¤¥×¥é¥¤¥ó¤¬Â¸ºß¤»¤º¡¢ºÇÀèü¤Î¥Á¥ã¥Ã¥ÈAI¤Î·±Îý¤ò¹Ô¤¦¤³¤È¤Ïº¤Æñ¤Ç¤·¤¿¡£¤·¤«¤·Microsoft¤¬È¯É½¤·¤¿¡ÖDeepSpeed-Chat¡×¤Ç¤Ïï¤Ç¤âChatGPT¤Î¤è¤¦¤Ê¥â¥Ç¥ë¤òºîÀ®²Äǽ¤Ç¤¹¡£
https://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-chat/japanese
¤³¤ì¤Þ¤ÇChatGPT¤Î¤è¤¦¤Ê¥â¥Ç¥ë¤Î·±Îý¤ÇɬÍפȤʤëRLHF¤ò¡¢´Êñ¤«¤Ä¹â¤¤¸úΨ¤Ç¼Â¹Ô¤Ç¤¤ë¥Ñ¥¤¥×¥é¥¤¥ó¤Ï¸ºß¤·¤Æ¤¤¤Þ¤»¤ó¤Ç¤·¤¿¡£¤Þ¤¿¡¢ChatGPT¤Î¤è¤¦¤ÊAI¥â¥Ç¥ë¤ò¥È¥ì¡¼¥Ë¥ó¥°¤¹¤ë¤Ë¤Ï¹â²Á¤ÊGPU¤¬Ê£¿ôɬÍפˤʤ뤿¤á¡¢°ìÈ̤γ«È¯¼Ô¤Ë¤Ï¤³¤Î¼ï¤ÎAI¥â¥Ç¥ë¤ò³«È¯¤¹¤ë¤³¤È¤¬º¤Æñ¤Ç¤·¤¿¡£¤Þ¤¿¡¢GPU¤òÍÑ°Õ¤·¤¿¤È¤·¤Æ¤â½¾Íè¤Î¥½¥Õ¥È¥¦¥§¥¢¤Ç¤Ï¥Ï¡¼¥É¥¦¥§¥¢¤Î5¡ó̤Ëþ¤ÎÀǽ¤·¤«°ú¤½Ð¤»¤º¡¢´Êñ¤«¤Ä¹â®¤Ë¡¢¤«¤ÄÄ㥳¥¹¥È¤Ç¿ôÀ鲯¤Î¥Ñ¥é¥á¡¼¥¿¤ò»ý¤Ä¥â¥Ç¥ë¤Î·±Îý¤ÏÉÔ²Äǽ¤À¤Ã¤¿¤³¤È¤¬Êó¹ð¤µ¤ì¤Æ¤¤¤Þ¤¹¡£
¤½¤³¤ÇMicrosoft¤Ï¡¢³«È¯¼Ô¤¬¤è¤ê¼ê¤´¤í¤Ê²Á³Ê¤Ç¥Á¥ã¥Ã¥ÈAI¤ò³«È¯¤Ç¤¤ë¤è¤¦¤Ë¤¹¤ë¤³¤È¤òÌÜŪ¤È¤·¤¿¥Õ¥ì¡¼¥à¥ï¡¼¥¯¡ÖDeepSpeed-Chat¡×¤òȯɽ¤·¤Þ¤·¤¿¡£
ChatGPT¥¹¥¿¥¤¥ë¤Î¥â¥Ç¥ë¤ò·±Îý¤Ç¤¤ëDeepSpeed-Chat¤ò¸ø³«¤·¤Þ¤·¤¿¡ª GPU1Âæ¤Ç100²¯Ä¶¥Ñ¥é¥á¡¼¥¿¤ò¡¢Ê£¿ôGPU¤Ê¤é1000²¯¥Ñ¥é¥á¡¼¥¿Ä¶¤Î¥â¥Ç¥ë¤ò³Ø½¬¤Ç¤¤Þ¤¹¡£SoTA¤Î15Çܰʾå¤Î¹â®¤Ê³Ø½¬¤ò¥¹¥¯¥ê¥×¥È°ì¤Ä¤Ç¼Â¹Ô¤Ç¤¡¢´Êñ¤«¤ÄÄ㥳¥¹¥È¤Ç¤¹¡ªÆüËܸ쵻ö¤â¸ø³«¤·¤Þ¤·¤¿¡ªhttps://t.co/3OtxlLCA5t https://t.co/AYlITILqIT pic.twitter.com/UwaEMomaMm— ¥Þ¥¤¥¯¥í¥½¥Õ¥ÈDeepSpeed (@MSFTDeepSpeedJP) April 12, 2023
DeepSpeed-Chat¤ÏChatGPT¤Î¸µ¤È¤Ê¤Ã¤¿InstructGPT¤Ç¹Ô¤ï¤ì¤¿¡Ö¶µ»ÕÉÕ¤¥Õ¥¡¥¤¥ó¥Á¥å¡¼¥Ë¥ó¥°¡×¤ä¡ÖÊó½·¥â¥Ç¥ë¤Î¥Õ¥¡¥¤¥ó¥Á¥å¡¼¥Ë¥ó¥°¡×¡ÖRLHF¤Î·±Îý¡×¤Î3¥¹¥Æ¥Ã¥×¤ò¼Â¹Ô¤·¡¢Æȼ«¤ÎChatGPT¥é¥¤¥¯¤Ê¥â¥Ç¥ë¤òÀ¸À®¤Ç¤¤ë¥¹¥¯¥ê¥×¥È¤òÄ󶡤·¤Þ¤¹¡£¤Þ¤¿¡¢³Ø½¬¸å¤Î²ñÏ÷Á¼°¤ò¥Æ¥¹¥È¤¹¤ë¤¿¤á¤Î¿äÏÀAPI¤âÄ󶡤¹¤ë¤È¤Î¤³¤È¡£
¤µ¤é¤ËDeepSpeed-Chat¤ËÅëºÜ¤µ¤ì¤Æ¤¤¤ë¡ÖDeepSpeed-RLHF ¥Ñ¥¤¥×¥é¥¤¥ó¡×¤Ï¡¢¡Ö¶µ»ÕÉÕ¤¥Õ¥¡¥¤¥ó¥Á¥å¡¼¥Ë¥ó¥°¡×¡ÖÊó½·¥â¥Ç¥ë¤Î¥Õ¥¡¥¤¥ó¥Á¥å¡¼¥Ë¥ó¥°¡×¡ÖRLHF¤Î·±Îý¡×¤ò¹Ô¤¦¤È¤È¤â¤Ë¡¢¸¦µæ¼Ô¤ä³«È¯¼Ô¤¬Ê£¿ô¤Î¥Ç¡¼¥¿¥ê¥½¡¼¥¹¤òÍѤ¤¤ÆÆȼ«¤ÎRLHF¥â¥Ç¥ë¤ò·±Îý¤¹¤ë¤Î¤ò¼ê½õ¤±¤¹¤ë¤¿¤á¡¢¡Ö¥Ç¡¼¥¿¤ÎÃê¾Ý²½¡×¤ä¡Ö¥Ö¥ì¥ó¥Éµ¡Ç½¡×¤ò¼Â¹Ô¤¹¤ë¤³¤È¤¬²Äǽ¤Ç¤¹¡£¡Ö¥Ç¡¼¥¿¤ÎÃê¾Ý²½¡×¤Ç¤Ï°Û¤Ê¤ë¥Ç¡¼¥¿¥»¥Ã¥È¤Î·Á¼°¤òÅý°ì¤¹¤ë¤¿¤á¤ËÃê¾Ý²½¤·¤¿¥Ç¡¼¥¿¥»¥Ã¥È¤òºîÀ®¤·¡¢¡Ö¥Ö¥ì¥ó¥Éµ¡Ç½¡×¤ÏÊ£¿ô¤Î¥Ç¡¼¥¿¥»¥Ã¥È¤òŬÀÚ¤ËÍ»¹ç¤·¡¢¡Ö¶µ»ÕÉÕ¤¥Õ¥¡¥¤¥ó¥Á¥å¡¼¥Ë¥ó¥°¡×¤Ê¤É¤Î3¤Ä¤Î¥È¥ì¡¼¥Ë¥ó¥°¤Ëʬ³ä¤·¤Þ¤¹¡£
¤Þ¤¿¡¢¡ÖDeepSpeed-RLHF ¥Ñ¥¤¥×¥é¥¤¥ó¡×¤Ë¤è¤ë³Ø½¬¤òÉý¹¤¤¥Ï¡¼¥É¥¦¥§¥¢¤Ç¹â®¤«¤ÄÄ㥳¥¹¥È¤Ç¼Â¹Ô¤¹¤ë¤¿¤á¤Ë¡¢¤³¤ì¤Þ¤ÇDeepSpeed¤¬È¯É½¤·¤¿ZeRO¤Ê¤É¤Î¿äÏÀ¤È³Ø½¬¤Î¤¿¤á¤ÎÁ´¥·¥¹¥Æ¥à¤òÍ»¹ç¤·¤¿¡ÖDeepSpeed¥Ï¥¤¥Ö¥ê¥Ã¥É¥¨¥ó¥¸¥ó¡×¤¬¹½À®¤µ¤ì¤Æ¤¤¤Þ¤¹¡£
DeepSpeed¥Ï¥¤¥Ö¥ê¥Ã¥É¥¨¥ó¥¸¥ó¤òÅëºÜ¤·¤¿DeepSpeed-Chat¤ò»ÈÍѤ·¡¢Microsoft Azure¾å¤Ç¥Ç¡¼¥¿¥»¥ó¥¿¡¼ÍѤÎGPU¡ÖNVIDIA A100¡×¤ò64ÂæÍѤ¤¤Æ³Ø½¬¤ò¹Ô¤Ã¤¿¾ì¹ç¡¢¡ÖOPT-13B¡×¥â¥Ç¥ë¤ÏÌó7.5»þ´Ö¤Ç·±Îý¤¬´°Î»¤·¤Þ¤¹¡£¤Þ¤¿¡¢¤½¤ÎºÝ¤ÎÈñÍѤÏ1920¥É¥ë(Ìó25Ëü±ß)¤Ç¤¹¡£¤µ¤é¤Ë¡ÖBLOOM¡×¥â¥Ç¥ë¤Ç¤ÏÌó20»þ´Ö¡¢5120¥É¥ë(Ìó68Ëü±ß)¤Ç·±Îý¤¬´°Î»¤¹¤ë¤È¤Î¤³¤È¡£¤³¤ì¤é¤Î¿ô»ú¤Ï´û¸¤ÎRLHF¥·¥¹¥Æ¥à¤è¤ê¤â¤Ï¤ë¤«¤Ë¹â®¤«¤ÄÄ㥳¥¹¥È¤Ç³Ø½¬¤ò¹Ô¤¦¤³¤È¤¬²Äǽ¤Ç¤¢¤ë¤³¤È¤ò¼¨¤·¤Æ¤¤¤Þ¤¹¡£
¤Þ¤¿DeepSpeed-Chat¤Ç¤Ï¡¢¿ô½½²¯¤«¤é1ÃûÄøÅ٤Υѥé¥á¡¼¥¿¤ò»ý¤ÄÂ絬ÌϤʥâ¥Ç¥ë¤Î·±Îý¤È¿äÏÀ¤¬²Äǽ¤Ç¡¢¸Â¤é¤ì¤¿GPU¥ê¥½¡¼¥¹´Ä¶¤Ë¤ª¤¤¤Æ¤â·±Îý¤È¿äÏÀ¤ò¹Ô¤¦¤³¤È¤¬²Äǽ¤Ë¤Ê¤ë¤È¤µ¤ì¤Æ¤¤¤Þ¤¹¡£
Hacker News¤Ç¤Ï¡ÖDeepSpeed-Chat¤Ë¤è¤Ã¤ÆGPT-4¤ÎºÆ¸½¤¬´Êñ¤Ë¤Ê¤ë¤È¤¤¤¦¤ï¤±¤Ç¤Ï¤¢¤ê¤Þ¤»¤ó¤¬¡¢ºÆ¸½¤Ë¸þ¤±¤¿¤¤¤¯¤Ä¤«¤ÎÂ礤ʥϡ¼¥É¥ë¤Ï´Ö°ã¤¤¤Ê¤¯±Û¤¨¤ë¤³¤È¤¬²Äǽ¤Ç¤¹¡×¤È½Ò¤Ù¤é¤ì¤Æ¤¤¤Þ¤¹¡£¤Þ¤¿¡¢Microsoft¤ÏDeepSpeed-Chat¤ò³«È¯¤¹¤ëDeepSpeed¤Ë̵½þ¤Ç100²¯¥É¥ë(Ìó1.3Ãû±ß)¤ò½Ð»ñ¤·¤ÆChatGPT¤Î¤è¤¦¤Êµ¡Ç½¤òMicrosoft¤ÎÀ½ÉʤËÁȤ߹þ¤à¸¦µæ¤ò»Ù±ç¤·¤Æ¤¤¤ë¤³¤È¤¬½Ò¤Ù¤é¤ì¤Æ¤¤¤Þ¤¹¡£
DeepSpeed-Chat¤Î¥½¡¼¥¹¥³¡¼¥É¤Ê¤É¤ÏGitHub¾å¤Ç¸ø³«¤µ¤ì¤Æ¤¤¤Þ¤¹¡£
GitHub - microsoft/DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
https://github.com/microsoft/DeepSpeed/