ÆüËܸì¤ÎAI¤Ç»È¤¦Â絬ÌϸÀ¸ì¥â¥Ç¥ëÀǽ¤òɾ²Á¤¹¤ë¡Ö¥ª¡¼¥×¥óÆüËܸìLLM¥ê¡¼¥À¡¼¥Ü¡¼¥É¡×¸ø³«

16¼ïÎà°Ê¾å¤ÎNLP(¼«Á³¸À¸ì½èÍý)¥¿¥¹¥¯¤òÍѤ¤¤ÆÆüËܸì¤ÎÂ絬ÌϸÀ¸ì¥â¥Ç¥ë(LLM)¤ÎÀǽɾ²Á¤ÈʬÀϤò¹Ô¤¦¡Ö¥ª¡¼¥×¥óÆüËܸìLLM¥ê¡¼¥À¡¼¥Ü¡¼¥É¡×¤¬¸ø³«¤µ¤ì¤Þ¤·¤¿¡£¹½ÃۤˤϹñΩ¾ðÊ󳨏¦µæ½ê¤ò¤Ï¤¸¤á¤È¤¹¤ëÆüËܸìLLM¤Î¸¦µæ³«È¯¤ò¹Ô¤¦ÁÈ¿¥²£ÃÇ¥×¥í¥¸¥§¥¯¥È¡ÖLLM-jp¡×¤¬·È¤ï¤Ã¤Æ¤¤¤Þ¤¹¡£
Open Japanese LLM Leaderboard - a Hugging Face Space by llm-jp
Introducing the Open Leaderboard for Japanese LLMs!
https://huggingface.co/blog/leaderboard-japanese
Open Japanese LLM Leaderboard ¤Î¸ø³« - LLM ÊÙ¶¯²ñ
https://llm-jp.github.io/llm/2024/11/20/open-japanese-llm-leaderboard.html
LLM¤Ï±Ñ¸ì¤Ç¤Ï¹¤¯µ¡Ç½¤·¤Æ¤¤¤Þ¤¹¤¬¡¢¤½¤Î¾¤Î¸À¸ì¤Ç¤¦¤Þ¤¯µ¡Ç½¤·¤Æ¤¤¤ë¤«¤É¤¦¤«¤òÃΤ뤳¤È¤¬º¤Æñ¤Ç¤·¤¿¡£¤³¤Î¡Ö¥ª¡¼¥×¥óÆüËܸìLLM¥ê¡¼¥À¡¼¥Ü¡¼¥É¡×¤Ï¡¢ÆüËܸì¤ÎLLM¤Î¼«Æ°É¾²Á¥Ä¡¼¥ë¤Ç¤¢¤ë¡Öllm-jp-eval¡×¤ò³èÍѤ·¤Æ¡¢LLM¤ÎÀǽɾ²Á¤ò¹Ô¤¦¤â¤Î¤Ç¤¹¡£
llm-jp-eval: ÆüËܸìÂ絬ÌϸÀ¸ì¥â¥Ç¥ë¤Î¼«Æ°É¾²Á¥Ä¡¼¥ë
(PDF¥Õ¥¡¥¤¥ë)https://www.anlp.jp/proceedings/annual_meeting/2024/pdf_dir/A8-2.pdf

Âбþ¤·¤Æ¤¤¤ëɾ²Á¥Ç¡¼¥¿¥»¥Ã¥È¤Ï°Ê²¼¤ÎÄ̤ꡣ
¡¦¼«Á³¸À¸ì¿äÏÀ(Natural Language Inference¡§NLI)¡§Jamp¡¢JaNLI¡¢JNLI¡¢JSeM¡¢JSICK
¡¦¼ÁÌä±þÅú(Question Answering¡§QA)¡§JEMHopQA¡¢NIILC
¡¦ÆÉ²ò(Reading Comprehension¡§RC)¡§JSQuAD
¡¦ÁªÂò¼°¼ÁÌä±þÅú(Multiple Choice question answering¡§MC)¡§JCommonsenseQA
¡¦¥¨¥ó¥Æ¥£¥Æ¥£¥ê¥ó¥¥ó¥°(Entity Linking¡§EL)¡§chABSA
¡¦´ðÁòòÀÏ(Fundamental Analysis¡§FA)¡§Wikipedia Annotated Corpus
¡¦¿ô³ØÅª¿äÏÀ(Mathematical Reasoning¡§MR)¡§MAWPS
¡¦°Ọ̃Ū¥Æ¥¥¹¥ÈÎà»÷ÅÙ(Semantic Textual Similarity¡§STS)¡§JSTS
¡¦µ¡³£ËÝÌõ(Machine Learning¡§MT)¡§ALT¡¢WikiCorpus
¡¦»î¸³ÌäÂê(HE)¡§MMLU¡¢JMMLU
¡¦¥³¡¼¥ÉÀ¸À®(CG)¡§MBPP
¡¦Í×Ìó(SUM)¡§XL-Sum
ÆüËܸì¤Ï¤Ò¤é¤¬¤Ê¡¢¥«¥¿¥«¥Ê¡¢´Á»ú¡¢¥í¡¼¥Þ»ú¤È¤¤¤¦4¼ïÎà¤Îɽµ¤¬º®ºß¤¹¤ë¡¢¤È¤Æ¤âÊ£»¨¤ÊɽµÂηϤò»ý¤Á¤Þ¤¹¡£¤µ¤é¤Ë¡¢Ã±¸ì¤Èñ¸ì¤Î´Ö¤Ë¥¹¥Ú¡¼¥¹¤òÆþ¤ì¤Ê¤¤¤¿¤á¡¢¥È¡¼¥¯¥ó²½¤âÆñ¤·¤¤¤Î¤À¤½¤¦¤Ç¤¹¡£
¤½¤ì¤Ç¤â¡¢ÆüËܤμ«Á³¸À¸ì½èÍý¤ÎÆÃÀ¤ò¼è¤êÆþ¤ì¤¿ÆüËܸìLLM¤¬³«È¯¤µ¤ì¤Æ¤¤¤Þ¤¹¤¬¡¢ÌäÂê¤Ï¡¢LLM¤òÈæ³Ó¤¹¤ë¤¿¤á¤Î°ì¸µ²½¤µ¤ì¤¿¥ª¡¼¥×¥ó¥·¥¹¥Æ¥à¤¬Â¸ºß¤·¤Ê¤«¤Ã¤¿¤³¤È¤Ç¤¹¡£
¤½¤³¤Ç¸¦µæ¤ÎÆ©ÌÀ²½¤ò¹â¤á¡¢¥ª¡¼¥×¥ó¥½¡¼¥¹¤Î¥â¥Ç¥ë³«È¯¤ÎÊý¿Ë¤ò¾©Î夹¤ë¤¿¤á¤Ë¡¢Hugging Face¤Èllm-jp¤¬¶¨ÎϤ·¤Æ¡¢¥ª¡¼¥×¥óÆüËܸì¥ê¡¼¥À¡¼¥Ü¡¼¥É¤ò¹½ÃÛ¤·¤¿¤È¤Î¤³¤È¡£
¤³¤Î¼è¤êÁȤߤϡ¢¹ñÆâ³°¤Î¸¦µæ¼Ô¤ÎÏ¢·È¤Ë¤è¤Ã¤ÆÆüËܸìLLM¤òɾ²Á¤·¶¯²½¤·¤Æ¤¤¤¯¥×¥é¥Ã¥È¥Õ¥©¡¼¥à¤È¤Ê¤ë¤³¤È¤¬´üÂÔ¤µ¤ì¤Æ¤¤¤Þ¤¹¡£
