Large language models such as GPT-4, Llama, and Claude are built on the Transformer, an architecture published by Google researchers in 2017. Transformer Explainer, a tool that visualizes how AI models based on the Transformer work, has been released by the Polo Club of Data Science at the Georgia Institute of Technology.

Transformer Explainer

https://poloclub.github.io/transformer-explainer/

The video below gives a quick overview of how to use Transformer Explainer.

Transformer Explainer: Learn How LLM Transformer Models Work - YouTube

This is what you see when you open Transformer Explainer.



The "Temperature" control at the top right is a parameter that affects the probability distribution used to predict the next word. Moving the slider left or right changes the distribution, and with it the word that is output next.
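
As a rough illustration of how a temperature setting can reshape a probability distribution, here is a minimal sketch of a temperature-scaled softmax. The function name `softmax_with_temperature` and the example scores are hypothetical and are not taken from Transformer Explainer's code; the sketch only assumes the common convention of dividing the raw scores (logits) by the temperature before normalizing.

```python
import math

def softmax_with_temperature(logits, temperature=1.0):
    """Turn raw scores (logits) into probabilities, scaled by temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical logits for three candidate next words.
logits = [2.0, 1.0, 0.1]
for t in (0.5, 1.0, 2.0):
    probs = softmax_with_temperature(logits, t)
    print(t, [round(p, 3) for p in probs])
```

With these scores, a lower temperature concentrates probability on the highest-scoring word, while a higher temperature spreads it more evenly across the candidates.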



You can also type your own text directly into the input field at the top.



Clicking the "Generate" button to the right of the input field computes the probability distribution using the current Temperature setting and outputs the next word.
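
The generation step can be pictured as drawing one word at random according to that distribution. The sketch below is an assumption about the general technique, not the tool's actual implementation: `sample_next_token`, the candidate words, and their scores are all made up for illustration.

```python
import math
import random

def sample_next_token(logits_by_token, temperature=1.0, rng=None):
    """Sample one token from temperature-scaled softmax probabilities."""
    rng = rng or random.Random()
    tokens = list(logits_by_token)
    scaled = [logits_by_token[t] / temperature for t in tokens]
    m = max(scaled)
    # Unnormalized probabilities; random.choices normalizes the weights itself.
    weights = [math.exp(s - m) for s in scaled]
    return rng.choices(tokens, weights=weights, k=1)[0]

# Hypothetical scores for the word that follows some prompt.
candidates = {"visualize": 3.1, "empower": 2.4, "replace": 1.2}
rng = random.Random(0)  # fixed seed so repeated runs give the same draws
print([sample_next_token(candidates, 0.2, rng) for _ in range(5)])
print([sample_next_token(candidates, 2.0, rng) for _ in range(5)])
```

At a low temperature the draws are dominated by the top-scoring word; at a high temperature the lower-scoring words are picked more often.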



To see how this probability distribution is produced, start from "Embedding" on the left. In this stage, the input string is split into units called tokens, and each token is converted into a vector. Clicking the "Embedding" section in Transformer Explainer visualizes how the tokens are converted.
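
To make the idea of "tokens converted into vectors" concrete, here is a toy sketch. It makes two loud simplifications: it splits text on whitespace, whereas real language models use subword tokenizers, and it fills the lookup table with random numbers, whereas a real model learns these values during training. The names `build_embedding_table` and `embed` and the five-word vocabulary are hypothetical.

```python
import random

def build_embedding_table(vocab, dim, seed=0):
    """Assign each token a fixed vector (random here; learned in a real model)."""
    rng = random.Random(seed)
    return {tok: [rng.uniform(-1, 1) for _ in range(dim)] for tok in vocab}

def embed(text, table):
    """Split text into tokens and look up one vector per token."""
    tokens = text.lower().split()  # toy tokenizer: whitespace only
    return tokens, [table[tok] for tok in tokens]

vocab = ["data", "visualization", "empowers", "users", "to"]
table = build_embedding_table(vocab, dim=4)
tokens, vectors = embed("Data visualization empowers users to", table)
for tok, vec in zip(tokens, vectors):
    print(tok, [round(v, 2) for v in vec])
```

The point of the lookup is that the same token always maps to the same vector, so later stages work on numbers rather than on text.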



Next, three inputs called Query, Key, and Value are computed from these vectors. Clicking the blue lines extending from a token displays this computation.
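
A common way to obtain Query, Key, and Value is to multiply each token's vector by three separate weight matrices. The sketch below assumes that approach; the sizes `d_model` and `d_head`, the matrix names `W_q`, `W_k`, `W_v`, and the random weights are illustrative assumptions (a real model learns the weights and typically also adds a bias term).

```python
import random

def matvec(matrix, vector):
    """Multiply a (rows x cols) matrix by a vector of length cols."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]

def random_matrix(rows, cols, rng):
    """Build a rows x cols matrix of random values in [-1, 1]."""
    return [[rng.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
d_model, d_head = 4, 2  # hypothetical sizes

# Three separate weight matrices (random here; learned in a real model).
W_q = random_matrix(d_head, d_model, rng)
W_k = random_matrix(d_head, d_model, rng)
W_v = random_matrix(d_head, d_model, rng)

embedding = [0.5, -0.2, 0.1, 0.9]  # one token's vector
query, key, value = (matvec(W, embedding) for W in (W_q, W_k, W_v))
print(query, key, value)
```

Each token therefore ends up with three different views of the same vector, one for each role it plays in the attention computation.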



The core of the Transformer is "Attention", a mechanism that weights the parts of the data that are useful for prediction and focuses on them. The "Multi-Head Attention" section in the center shows how the dot product of Query and Key is normalized with the Softmax function to obtain weights, which are then multiplied with Value to produce the output.
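
The computation described above can be sketched for a single attention head as follows. This is a simplified illustration, not the tool's code: the example vectors are made up, the division by the square root of the key dimension follows the original Transformer formulation rather than anything stated in this article, and the masking that language models apply to hide later tokens is omitted.

```python
import math

def softmax(xs):
    """Normalize scores into weights that sum to 1."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """For each query: score every key, softmax the scores, mix the values."""
    d_k = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Dot product of this query with each key, scaled by sqrt(d_k).
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d_k) for k in keys]
        weights = softmax(scores)
        # Weighted sum of the value vectors, one output dimension at a time.
        out = [sum(w * v[j] for w, v in zip(weights, values))
               for j in range(len(values[0]))]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights

# Hypothetical 2-dimensional Query/Key/Value vectors for three tokens.
Q = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
K = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
V = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
outputs, weights = attention(Q, K, V)
for row in weights:
    print([round(w, 3) for w in row])
```

Each printed row shows how strongly one token attends to every token. Multi-head attention runs several such computations in parallel, each with its own Query, Key, and Value projections, and combines the results.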



Transformer Explainer is developed as open source, and its source code is published on GitHub under the MIT License.

GitHub - poloclub/transformer-explainer: Transformer Explained: Learn How LLM Transformer Models Work with Interactive Visualization

https://github.com/poloclub/transformer-explainer