OpenAI¤¬Ä󶡤¹¤ë¡ÖChatGPT¡×¤Ê¤É¤Î¥Á¥ã¥Ã¥ÈAI¤ÏÍ×Ìó¤ä¥³¡¼¥Ç¥£¥ó¥°¡¢ËÝÌõ¤Ê¤É¤ò¿Í´Ö¤ÎÀìÌç²È°Ê¾å¤ÎÀºÅ٤Ǽ¹ԤǤ­¤ë¤ÈÊó¹ð¤µ¤ì¤Æ¤¤¤Þ¤¹¡£¤·¤«¤·¥Á¥ã¥Ã¥ÈAI¤Î·±Îý¤ËɬÍפʿʹ֤Υե£¡¼¥É¥Ð¥Ã¥¯¤Ë´ð¤Å¤¤¤¿¶¯²½³Ø½¬(RLHF)¤ò¼Â¹Ô¤¹¤ë¥¨¥ó¥É¥Ä¡¼¥¨¥ó¥É¤Ê¥Ñ¥¤¥×¥é¥¤¥ó¤¬Â¸ºß¤»¤º¡¢ºÇÀèü¤Î¥Á¥ã¥Ã¥ÈAI¤Î·±Îý¤ò¹Ô¤¦¤³¤È¤Ïº¤Æñ¤Ç¤·¤¿¡£¤·¤«¤·Microsoft¤¬È¯É½¤·¤¿¡ÖDeepSpeed-Chat¡×¤Ç¤Ïï¤Ç¤âChatGPT¤Î¤è¤¦¤Ê¥â¥Ç¥ë¤òºîÀ®²Äǽ¤Ç¤¹¡£

DeepSpeed/blogs/deepspeed-chat/japanese at master · microsoft/DeepSpeed · GitHub

https://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-chat/japanese



¤³¤ì¤Þ¤ÇChatGPT¤Î¤è¤¦¤Ê¥â¥Ç¥ë¤Î·±Îý¤ÇɬÍפȤʤëRLHF¤ò¡¢´Êñ¤«¤Ä¹â¤¤¸úΨ¤Ç¼Â¹Ô¤Ç¤­¤ë¥Ñ¥¤¥×¥é¥¤¥ó¤Ï¸ºß¤·¤Æ¤¤¤Þ¤»¤ó¤Ç¤·¤¿¡£¤Þ¤¿¡¢ChatGPT¤Î¤è¤¦¤ÊAI¥â¥Ç¥ë¤ò¥È¥ì¡¼¥Ë¥ó¥°¤¹¤ë¤Ë¤Ï¹â²Á¤ÊGPU¤¬Ê£¿ôɬÍפˤʤ뤿¤á¡¢°ìÈ̤γ«È¯¼Ô¤Ë¤Ï¤³¤Î¼ï¤ÎAI¥â¥Ç¥ë¤ò³«È¯¤¹¤ë¤³¤È¤¬º¤Æñ¤Ç¤·¤¿¡£¤Þ¤¿¡¢GPU¤òÍÑ°Õ¤·¤¿¤È¤·¤Æ¤â½¾Íè¤Î¥½¥Õ¥È¥¦¥§¥¢¤Ç¤Ï¥Ï¡¼¥É¥¦¥§¥¢¤Î5¡ó̤Ëþ¤ÎÀ­Ç½¤·¤«°ú¤­½Ð¤»¤º¡¢´Êñ¤«¤Ä¹â®¤Ë¡¢¤«¤ÄÄ㥳¥¹¥È¤Ç¿ôÀ鲯¤Î¥Ñ¥é¥á¡¼¥¿¤ò»ý¤Ä¥â¥Ç¥ë¤Î·±Îý¤ÏÉÔ²Äǽ¤À¤Ã¤¿¤³¤È¤¬Êó¹ð¤µ¤ì¤Æ¤¤¤Þ¤¹¡£

¤½¤³¤ÇMicrosoft¤Ï¡¢³«È¯¼Ô¤¬¤è¤ê¼ê¤´¤í¤Ê²Á³Ê¤Ç¥Á¥ã¥Ã¥ÈAI¤ò³«È¯¤Ç¤­¤ë¤è¤¦¤Ë¤¹¤ë¤³¤È¤òÌÜŪ¤È¤·¤¿¥Õ¥ì¡¼¥à¥ï¡¼¥¯¡ÖDeepSpeed-Chat¡×¤òȯɽ¤·¤Þ¤·¤¿¡£





DeepSpeed-Chat¤ÏChatGPT¤Î¸µ¤È¤Ê¤Ã¤¿InstructGPT¤Ç¹Ô¤ï¤ì¤¿¡Ö¶µ»ÕÉÕ¤­¥Õ¥¡¥¤¥ó¥Á¥å¡¼¥Ë¥ó¥°¡×¤ä¡ÖÊó½·¥â¥Ç¥ë¤Î¥Õ¥¡¥¤¥ó¥Á¥å¡¼¥Ë¥ó¥°¡×¡ÖRLHF¤Î·±Îý¡×¤Î3¥¹¥Æ¥Ã¥×¤ò¼Â¹Ô¤·¡¢Æȼ«¤ÎChatGPT¥é¥¤¥¯¤Ê¥â¥Ç¥ë¤òÀ¸À®¤Ç¤­¤ë¥¹¥¯¥ê¥×¥È¤òÄ󶡤·¤Þ¤¹¡£¤Þ¤¿¡¢³Ø½¬¸å¤Î²ñÏ÷Á¼°¤ò¥Æ¥¹¥È¤¹¤ë¤¿¤á¤Î¿äÏÀAPI¤âÄ󶡤¹¤ë¤È¤Î¤³¤È¡£

¤µ¤é¤ËDeepSpeed-Chat¤ËÅëºÜ¤µ¤ì¤Æ¤¤¤ë¡ÖDeepSpeed-RLHF ¥Ñ¥¤¥×¥é¥¤¥ó¡×¤Ï¡¢¡Ö¶µ»ÕÉÕ¤­¥Õ¥¡¥¤¥ó¥Á¥å¡¼¥Ë¥ó¥°¡×¡ÖÊó½·¥â¥Ç¥ë¤Î¥Õ¥¡¥¤¥ó¥Á¥å¡¼¥Ë¥ó¥°¡×¡ÖRLHF¤Î·±Îý¡×¤ò¹Ô¤¦¤È¤È¤â¤Ë¡¢¸¦µæ¼Ô¤ä³«È¯¼Ô¤¬Ê£¿ô¤Î¥Ç¡¼¥¿¥ê¥½¡¼¥¹¤òÍѤ¤¤ÆÆȼ«¤ÎRLHF¥â¥Ç¥ë¤ò·±Îý¤¹¤ë¤Î¤ò¼ê½õ¤±¤¹¤ë¤¿¤á¡¢¡Ö¥Ç¡¼¥¿¤ÎÃê¾Ý²½¡×¤ä¡Ö¥Ö¥ì¥ó¥Éµ¡Ç½¡×¤ò¼Â¹Ô¤¹¤ë¤³¤È¤¬²Äǽ¤Ç¤¹¡£¡Ö¥Ç¡¼¥¿¤ÎÃê¾Ý²½¡×¤Ç¤Ï°Û¤Ê¤ë¥Ç¡¼¥¿¥»¥Ã¥È¤Î·Á¼°¤òÅý°ì¤¹¤ë¤¿¤á¤ËÃê¾Ý²½¤·¤¿¥Ç¡¼¥¿¥»¥Ã¥È¤òºîÀ®¤·¡¢¡Ö¥Ö¥ì¥ó¥Éµ¡Ç½¡×¤ÏÊ£¿ô¤Î¥Ç¡¼¥¿¥»¥Ã¥È¤òŬÀÚ¤ËÍ»¹ç¤·¡¢¡Ö¶µ»ÕÉÕ¤­¥Õ¥¡¥¤¥ó¥Á¥å¡¼¥Ë¥ó¥°¡×¤Ê¤É¤Î3¤Ä¤Î¥È¥ì¡¼¥Ë¥ó¥°¤Ëʬ³ä¤·¤Þ¤¹¡£



¤Þ¤¿¡¢¡ÖDeepSpeed-RLHF ¥Ñ¥¤¥×¥é¥¤¥ó¡×¤Ë¤è¤ë³Ø½¬¤òÉý¹­¤¤¥Ï¡¼¥É¥¦¥§¥¢¤Ç¹â®¤«¤ÄÄ㥳¥¹¥È¤Ç¼Â¹Ô¤¹¤ë¤¿¤á¤Ë¡¢¤³¤ì¤Þ¤ÇDeepSpeed¤¬È¯É½¤·¤¿ZeRO¤Ê¤É¤Î¿äÏÀ¤È³Ø½¬¤Î¤¿¤á¤ÎÁ´¥·¥¹¥Æ¥à¤òÍ»¹ç¤·¤¿¡ÖDeepSpeed¥Ï¥¤¥Ö¥ê¥Ã¥É¥¨¥ó¥¸¥ó¡×¤¬¹½À®¤µ¤ì¤Æ¤¤¤Þ¤¹¡£



DeepSpeed¥Ï¥¤¥Ö¥ê¥Ã¥É¥¨¥ó¥¸¥ó¤òÅëºÜ¤·¤¿DeepSpeed-Chat¤ò»ÈÍѤ·¡¢Microsoft Azure¾å¤Ç¥Ç¡¼¥¿¥»¥ó¥¿¡¼ÍѤÎGPU¡ÖNVIDIA A100¡×¤ò64ÂæÍѤ¤¤Æ³Ø½¬¤ò¹Ô¤Ã¤¿¾ì¹ç¡¢¡ÖOPT-13B¡×¥â¥Ç¥ë¤ÏÌó7.5»þ´Ö¤Ç·±Îý¤¬´°Î»¤·¤Þ¤¹¡£¤Þ¤¿¡¢¤½¤ÎºÝ¤ÎÈñÍѤÏ1920¥É¥ë(Ìó25Ëü±ß)¤Ç¤¹¡£¤µ¤é¤Ë¡ÖBLOOM¡×¥â¥Ç¥ë¤Ç¤ÏÌó20»þ´Ö¡¢5120¥É¥ë(Ìó68Ëü±ß)¤Ç·±Îý¤¬´°Î»¤¹¤ë¤È¤Î¤³¤È¡£¤³¤ì¤é¤Î¿ô»ú¤Ï´û¸¤ÎRLHF¥·¥¹¥Æ¥à¤è¤ê¤â¤Ï¤ë¤«¤Ë¹â®¤«¤ÄÄ㥳¥¹¥È¤Ç³Ø½¬¤ò¹Ô¤¦¤³¤È¤¬²Äǽ¤Ç¤¢¤ë¤³¤È¤ò¼¨¤·¤Æ¤¤¤Þ¤¹¡£

¤Þ¤¿DeepSpeed-Chat¤Ç¤Ï¡¢¿ô½½²¯¤«¤é1ÃûÄøÅ٤Υѥé¥á¡¼¥¿¤ò»ý¤ÄÂ絬ÌϤʥâ¥Ç¥ë¤Î·±Îý¤È¿äÏÀ¤¬²Äǽ¤Ç¡¢¸Â¤é¤ì¤¿GPU¥ê¥½¡¼¥¹´Ä¶­¤Ë¤ª¤¤¤Æ¤â·±Îý¤È¿äÏÀ¤ò¹Ô¤¦¤³¤È¤¬²Äǽ¤Ë¤Ê¤ë¤È¤µ¤ì¤Æ¤¤¤Þ¤¹¡£

Hacker News¤Ç¤Ï¡ÖDeepSpeed-Chat¤Ë¤è¤Ã¤ÆGPT-4¤ÎºÆ¸½¤¬´Êñ¤Ë¤Ê¤ë¤È¤¤¤¦¤ï¤±¤Ç¤Ï¤¢¤ê¤Þ¤»¤ó¤¬¡¢ºÆ¸½¤Ë¸þ¤±¤¿¤¤¤¯¤Ä¤«¤ÎÂ礭¤Ê¥Ï¡¼¥É¥ë¤Ï´Ö°ã¤¤¤Ê¤¯±Û¤¨¤ë¤³¤È¤¬²Äǽ¤Ç¤¹¡×¤È½Ò¤Ù¤é¤ì¤Æ¤¤¤Þ¤¹¡£¤Þ¤¿¡¢Microsoft¤ÏDeepSpeed-Chat¤ò³«È¯¤¹¤ëDeepSpeed¤Ë̵½þ¤Ç100²¯¥É¥ë(Ìó1.3Ãû±ß)¤ò½Ð»ñ¤·¤ÆChatGPT¤Î¤è¤¦¤Êµ¡Ç½¤òMicrosoft¤ÎÀ½ÉʤËÁȤ߹þ¤à¸¦µæ¤ò»Ù±ç¤·¤Æ¤¤¤ë¤³¤È¤¬½Ò¤Ù¤é¤ì¤Æ¤¤¤Þ¤¹¡£

DeepSpeed-Chat¤Î¥½¡¼¥¹¥³¡¼¥É¤Ê¤É¤ÏGitHub¾å¤Ç¸ø³«¤µ¤ì¤Æ¤¤¤Þ¤¹¡£

GitHub - microsoft/DeepSpeed: DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

https://github.com/microsoft/DeepSpeed/