¥Ç¡¼¥¿Ê¬Àϥġ¼¥ë¤Ê¤É¤òÄ󶡤¹¤ë´ë¶È¤ÎDatabricks¤¬¡¢2024ǯ3·î27Æü¤Ë¥ª¡¼¥×¥ó¤ÊÈÆÍÑÂ絬ÌϸÀ¸ì¥â¥Ç¥ë(LLM)¤Ç¤¢¤ë¡ÖDBRX¡×¤òȯɽ¤·¤Þ¤·¤¿¡£¥ª¡¼¥×¥ó¥é¥¤¥»¥ó¥¹¤Ç¤ÎÇÛÉۤȤʤäƤª¤ê¡¢·î´Ö¥¢¥¯¥Æ¥£¥Ö¥æ¡¼¥¶¡¼¤¬7²¯¿Í°Ê²¼¤Î´ë¶È¤Ï̵ÎÁ¤Ç¾¦ÍÑÍøÍѤ¬²Äǽ¤È¤Ê¤Ã¤Æ¤¤¤Þ¤¹¡£

Introducing DBRX: A New State-of-the-Art Open LLM | Databricks

https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm



DBRX¤Ï¥È¥é¥ó¥¹¥Õ¥©¡¼¥Þ¡¼¤Î¥Ç¥³¡¼¥À¡¼¤ò»ÈÍѤ¹¤ëLLM¤Ç¡¢¡Ömixture-of-experts(MoE)¡×¥¢¡¼¥­¥Æ¥¯¥Á¥ã¤¬ºÎÍѤµ¤ì¤Æ¤¤¤Þ¤¹¡£¥Ñ¥é¥á¡¼¥¿¤Î¹ç·×¿ô¤Ï1320²¯¤È¤Ê¤Ã¤Æ¤¤¤Þ¤¹¤¬¡¢Á´¤Æ¤ÎÆþÎϤËÈ¿±þ¤¹¤ë¤Î¤Ï360²¯¥Ñ¥é¥á¡¼¥¿¤Î¤ß¤Ç¡¢»Ä¤ê¤Î¥Ñ¥é¥á¡¼¥¿¤Ï¡ÖÀìÌç²È¡×¤È¤·¤ÆÉ¬Íפ˱þ¤¸¤Æ¥¢¥¯¥Æ¥£¥Ù¡¼¥È¤µ¤ì¤Þ¤¹¡£MoE¥¢¡¼¥­¥Æ¥¯¥Á¥ã¤òºÎÍѤ¹¤ë¤³¤È¤Ç¡¢¥µ¥¤¥º¤òÍÞ¤¨¤Æ¹â¸úΨ¤Î³Ø½¬¤ä¿äÏÀ¤ò²Äǽ¤Ë¤·¤Ä¤ÄÀ­Ç½¤ò¹â¤á¤¿¤È¤Î¤³¤È¡£

Ʊ¤¸¤¯MoE¥¢¡¼¥­¥Æ¥¯¥Á¥ã¤òºÎÍѤ·¤¿Mixtral¤äGrok-1¤¬8¿Í¤ÎÀìÌç²È¤òÅëºÜ¤·¡¢ÆþÎϤ´¤È¤Ë2¿Í¤ò¥¢¥¯¥Æ¥£¥Ù¡¼¥È¤¹¤ë¤Î¤ËÂФ·¡¢DBRX¤Ï16¿Í¤ÎÀìÌç²È¤òÅëºÜ¤·¤ÆÆþÎϤ´¤È¤Ë4¿Í¤ò¥¢¥¯¥Æ¥£¥Ù¡¼¥È¤·¤Þ¤¹¡£ÀìÌç²È¤ÎÁȤ߹ç¤ï¤»¿ô¤¬65Çܤˤʤ俤³¤È¤Ç¥â¥Ç¥ë¤ÎÉʼÁ¤¬¸þ¾å¤·¤¿¤È½Ò¤Ù¤é¤ì¤Æ¤¤¤Þ¤¹¡£

¤Þ¤¿¡¢DBRX¤ÏºÇÂ祳¥ó¥Æ¥¯¥¹¥ÈĹ3Ëü2000¥È¡¼¥¯¥ó¤Î¹ç·×12Ãû¥È¡¼¥¯¥ó¤Î¥Ç¡¼¥¿¤Ç¥È¥ì¡¼¥Ë¥ó¥°¤µ¤ì¤Þ¤·¤¿¡£¡ÖÀìÌç²È¡×¤¬º®ºß¤¹¤ëMoE¥â¥Ç¥ë¤ò¥È¥ì¡¼¥Ë¥ó¥°¤¹¤ë¤Î¤ÏÆñ¤·¤«¤Ã¤¿¤â¤Î¤Î¡¢¸úΨŪ¤ÊÊýË¡¤Ç·«¤êÊÖ¤·¥È¥ì¡¼¥Ë¥ó¥°¤Ç¤­¤ë·øÏ´¤Ê¥Ñ¥¤¥×¥é¥¤¥ó¤ò³«È¯¤·¡¢Ã¯¤Ç¤âDBRX¥ì¥Ù¥ë¤ÎMoE´ðÁÃ¥â¥Ç¥ë¤ò¥¼¥í¤«¤é¥È¥ì¡¼¥Ë¥ó¥°¤Ç¤­¤ë¤è¤¦¤Ë¤·¤¿¤È½Ò¤Ù¤é¤ì¤Æ¤¤¤Þ¤¹¡£

²¼¿Þ¤Ï¡Ö¸À¸ìÍý²ò¡×¡Ö¥×¥í¥°¥é¥ß¥ó¥°¡×¡Ö¿ô³Ø¡×¤Î¥Ù¥ó¥Á¥Þ¡¼¥¯·ë²Ì¤Ç¡¢DBRX¤ÏLLaMa2-70B¤äMixtral¡¢Grok-1¤Ê¤É¤Î¥â¥Ç¥ë¤è¤ê¤âÎɤ¤·ë²Ì¤ò½Ð¤·¤Æ¤¤¤ë¤³¤È¤¬Ê¬¤«¤ê¤Þ¤¹¡£¤Þ¤¿¡¢ÀìÌç²È¤òÅëºÜ¤¹¤ë¤³¤È¤ÇÈÆÍÑÀ­Ç½¤ò³ÎÊݤ·¤Ê¤¬¤é¥×¥í¥°¥é¥ß¥ó¥°¤Ê¤ÉÆÃ²½¤¬É¬ÍפÊÎΰè¤Ë¤ª¤±¤ëÀ­Ç½¤â³ÎÊݤǤ­¤Æ¤¤¤Þ¤¹¡£



¤Þ¤¿¡¢¡ÖGPT-3.5¡×¡ÖGPT-4¡×¡ÖClaude 3¥·¥ê¡¼¥º¡×¡ÖGemini 1.0 Pro¡×¡ÖGemini 1.5 Pro¡×¡ÖMistral¡×¤È¤¤¤Ã¤¿¥¯¥í¡¼¥º¥É¥½¡¼¥¹¤ÎLLM¤È¤ÎÈæ³Ó·ë²Ì¤Ï²¼¿Þ¤ÎÄ̤ꡣ¤¶¤Ã¤¯¤ê¸«¤ë¤ÈDBRX Instruct¤ÏGemini 1.0 Pro¤äMistral Medium¤Ê¤É¤ÈƱÅù¤Î¥ì¥Ù¥ë¤Ë¸«¤¨¤Þ¤¹¡£

ModelDBRX
InstructGPT-3.5GPT-4Claude 3 HaikuClaude 3 SonnetClaude 3 OpusGemini 1.0 ProGemini 1.5 ProMistral MediumMistral LargeMT Bench (Inflection corrected, n=5)8.39 ± 0.08——8.41 ± 0.04 8.54 ± 0.099.03 ± 0.068.23 ± 0.08—8.05 ± 0.128.90 ± 0.06MMLU 5-shot73.7%70.0%86.4%75.2%79.0%86.8%71.8%81.9%75.3%81.2%HellaSwag 10-shot89.0%85.5%95.3%85.9%89.0%95.4%84.7%92.5%88.0%89.2%HumanEval 0-Shot
pass@1
(Programming)70.1% 

temp=0, N=148.1%67.0%75.9%73.0%84.9%67.7%71.9%38.4%45.1%GSM8k CoT maj@172.8% (5-shot)57.1% (5-shot)92.0% (5-shot)88.9%92.3%95.0%86.5%

(maj1@32)91.7% (11-shot)66.7% (5-shot)81.0% (5-shot)WinoGrande 5-shot81.8%81.6%87.5%—————88.0%86.7%

¿äÏÀ¤Î¸úΨ¤ò¼¨¤¹¡Ö1Éá¦1¥æ¡¼¥¶¡¼¤¢¤¿¤ê¤Î½ÐÎϥȡ¼¥¯¥ó¿ô¡×¤Ï²¼¿Þ¤ÎÄ̤ꡣ¤¤¤º¤ì¤Î¥æ¡¼¥¶¡¼¿ô¤Ë¤ª¤¤¤Æ¤â¡¢¤è¤ê¾®¤µ¤¤¥â¥Ç¥ë¤Ç¤¢¤ëMixtral-8x7B¤¬ºÇÂç¤Î½ÐÎϤò¸Ø¤Ã¤Æ¤¤¤Þ¤¹¡£°ìÊý¡¢ÀÖÀþ¤Ç¼¨¤µ¤ì¤Æ¤¤¤ëDBRX¤Î¿äÏÀ¸úΨ¤ÏLLaMa2-70B¤äDense-132B¤òÂ礭¤¯¾å²ó¤ê¤Þ¤·¤¿¡£



´ðÁÃ¥â¥Ç¥ë¤È¤Ê¤ë¡ÖDBRX Base¡×¤È¡¢¥Õ¥¡¥¤¥ó¥Á¥å¡¼¥Ë¥ó¥°ºÑ¤ß¤Î¥â¥Ç¥ë¡ÖDBRX Instruct¡×¤Ï¤È¤â¤ËHugging Face¤Ë¤Æ¥ª¡¼¥×¥ó¥é¥¤¥»¥ó¥¹¤ÇÇÛÉÛ¤µ¤ì¤Æ¤¤¤ë¤Û¤«¡¢Databricks¤ÎFoundation Model API·Ðͳ¤Ç´Êñ¤ËÍøÍѲÄǽ¤È¤Î¤³¤È¤Ç¤¹¡£