IBM¥ê¥µ¡¼¥Á¤¬¥«¡¼¥Í¥®¡¼¥á¥í¥óÂç³Ø¤ä¥×¥ê¥ó¥¹¥È¥óÂç³Ø¡¢¥¤¥ê¥Î¥¤Âç³Ø¤È¶¦Æ±¤Ç¡¢¥ª¡¼¥×¥ó¥½¡¼¥¹¤ÎÂ絬ÌϸÀ¸ì¥â¥Ç¥ë¡ÖBamba¡×¤ò¹½ÃÛ¤·¡¢¤½¤Î¥Ð¡¼¥¸¥ç¥ó2¤ò¥ª¡¼¥×¥ó¥½¡¼¥¹¤È¤·¤Æ¥ê¥ê¡¼¥¹¤·¤Þ¤·¤¿¡£

Meet Bamba, IBM¡Çs new attention-state space model - IBM Research

https://research.ibm.com/blog/bamba-ssm-transformer-model



Bamba¤Ï97.8²¯¤Î¥Ñ¥é¥á¡¼¥¿¡¼¤ò»ý¤ÄÂ絬ÌϸÀ¸ì¥â¥Ç¥ë¤Ç¡¢¥Ù¡¼¥¹¤È¤Ê¤ë¥¢¡¼¥­¥Æ¥¯¥Á¥ã¤¬°ìÈÌŪ¤ÊÂ絬ÌϸÀ¸ì¥â¥Ç¥ë¤È¾¯¤·°ã¤¦ÅÀ¤¬ÆÃħ¤Ç¤¹¡£

IBM¥ê¥µ¡¼¥Á¤Ë¤è¤ë¤È¡¢°ìÈÌŪ¤ÊÂ絬ÌϸÀ¸ì¥â¥Ç¥ë¤ÏTransformer¤È¤¤¤¦¥¢¡¼¥­¥Æ¥¯¥Á¥ã¤òÍøÍѤ·¤Æ¤¤¤Þ¤¹¤¬¡¢±þÅú¤ÎºÝ¤Ë¼Â¹ÔÃæ¤Î¥·¡¼¥±¥ó¥¹¤ò¥á¥â¥ê¤ËÊÝ»ý¤¹¤ë´Ø·¸¾å¡¢¥×¥í¥ó¥×¥È¤¬Ä¹¤¯¤Ê¤ë¤Ë¤Ä¤ì¤ÆÀ¸À®¤Î¥³¥¹¥È¤¬»Ø¿ô´Ø¿ôŪ¤ËÁýÂ礹¤ë¤È¤Î¤³¤È¡£¤¿¤È¤¨¤Ð¥³¥ó¥Æ¥­¥¹¥È¥¦¥£¥ó¥É¥¦¤Î¥µ¥¤¥º¤¬2Çܤˤʤë¤È¡¢¤½¤ì¤ò½èÍý¤·¤Æ±þÅú¤òÀ¸À®¤¹¤ë¥³¥¹¥È¤Ï2Çܤɤ³¤í¤«4Çܤˤʤ뤽¤¦¤Ç¤¹¡£

¤³¤ÎÌäÂê¤Ï¡Ö2¼¡¥Ü¥È¥ë¥Í¥Ã¥¯¡×¤È¸Æ¤Ð¤ì¡¢¥æ¡¼¥¶¡¼¤¬AI¤Ë¼ÁÌä¤ò¤·¤Æ¤«¤éÅú¤¨¤òÆÀ¤ë¤Þ¤Ç¤Î¥¿¥¤¥à¥é¥°¤Î¸¶°ø¤Î1¤Ä¤Ë¤Ê¤Ã¤Æ¤¤¤ë¤È¤¤¤¤¤Þ¤¹¡£

¿·¤·¤¯Åо줷¤¿Bamba-9B¤Ï¡¢Transformer¥¢¡¼¥­¥Æ¥¯¥Á¥ã¤È¡¢¾õÂÖ¶õ´Ö¥â¥Ç¥ë(SSM)¤È¤¤¤¦¥¢¡¼¥­¥Æ¥¯¥Á¥ã¤òÁȤ߹ç¤ï¤»¤Ä¤Ä¡¢¥á¥â¥ê¤ËÅö¤¿¤ëKV¥­¥ã¥Ã¥·¥å¤Î´ÉÍý¤òTransformer¥¢¡¼¥­¥Æ¥¯¥Á¥ã¤«¤éº¬ËÜŪ¤ËÊѤ¨¤¿¥â¥Ç¥ë¤Ç¤¹¡£Ä̾Transformer¤¬±þÅú¤ò½ÐÎϤ¹¤ëºÝ¡¢¥³¥ó¥Æ¥­¥¹¥È¥¦¥£¥ó¥É¥¦Æâ¤Î¤¹¤Ù¤Æ¤Îñ¸ì¤ËÃí°Õ¤òʧ¤¦¤Î¤ËÂФ·¡¢SSM¤Ï²áµî¤Î¾ðÊó¤òÍ×Ìó¤·¤¿¡Ö±£¤ì¾õÂ֡פò°Ý»ý¤¹¤ë¤È¤Î¤³¤È¡£¾ðÊó¤òÁªÂòŪ¤ËÊÝ»ý¤¹¤ë¤³¤Î¼êË¡¤ò»È¤¦¤³¤È¤Ç¡¢¥á¥â¥ê¤Î¥ª¡¼¥Ð¡¼¥Ø¥Ã¥É¤¬¾¯¤Ê¤¯¤Ê¤ê¡¢¿äÏÀ®ÅÙ¤¬Â®¤¯¤Ê¤ë¤½¤¦¤Ç¤¹¡£

¾Ü¤·¤¯¤Ï°Ê²¼¤Î¥µ¥¤¥È¤Ëµ­ºÜ¤µ¤ì¤Æ¤¤¤Þ¤¹¡£

Bamba-9B-v2 - Fast and powerful!

https://huggingface.co/blog/ibm-ai-platform/bamba-9b-v2



IBM¥ê¥µ¡¼¥Á¤Ë¤è¤ë¤È¡¢Bamba-9B¤ÏKV¥­¥ã¥Ã¥·¥å¤Î¥á¥â¥êÍ×·ï¤òÂçÉý¤Ëºï¸º¤¹¤ë¤³¤È¤Ç¡¢Æ±¥µ¥¤¥º¤ÎTransformer¥Ù¡¼¥¹¤Î¥â¥Ç¥ë¤ÈÈæ¤Ù¤Æ¡¢Æ±Åù¤ÎÀºÅÙ¤òÊݤÁ¤Ê¤¬¤é¾¯¤Ê¤¯¤È¤â2ÇܤήÅÙ¤ÇÆ°ºî¤Ç¤­¤ë¤È¤Î¤³¤È¡£Transformer¤ÎǽÎϤȡ¢SSM¤Î¼Â¹Ô®ÅÙ¤òÁȤ߹ç¤ï¤»¤ë¤³¤È¤Ç¡¢¥Ü¥È¥ë¥Í¥Ã¥¯¤ò²ò¾Ã¤·¤Ä¤Ä±þÅúÀºÅÙ¤ò°Ý»ý¤·¤¿¥â¥Ç¥ë¤È¤Ê¤Ã¤Æ¤¤¤Þ¤¹¡£

Bamba¤ÏApache 2.0 ¥é¥¤¥»¥ó¥¹¤Î²¼¡¢¥ª¡¼¥×¥ó¥½¡¼¥¹¤Ç¸ø³«¤µ¤ì¤Æ¤¤¤Þ¤¹¡£

GitHub - foundation-model-stack/bamba: Train, tune, and infer Bamba model

https://github.com/foundation-model-stack/bamba