Google DeepMind has published a paper and a demo video showing that, by using the long context window of Gemini 1.5 (up to 1 million tokens), tasks inside an office can now be solved through natural-language instructions.

[2407.07775v1] Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs

https://arxiv.org/abs/2407.07775v1



AI models with a large context window can handle a large amount of information at once. In this work, DeepMind filmed video tours of an office and a home and had the AI watch them, so that it learned information about the environment, namely what is located where.

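According to DeepMind, when a user gives the AI an instruction in natural language, the AI successfully guides the user to the destination based on the information from the tour video and the input from its camera. Below is a demo video in which it actually carries out the task "Show me somewhere I can draw something."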

Demo video: a post shared on Instagram by Google DeepMind (@googledeepmind)


When the AI-equipped robot is told by voice, "Show me somewhere I can draw something," it replies, "Thinking with Gemini. Please wait a moment."



After a short while, the robot slowly began to move.



It successfully guided the user to the front of a whiteboard.



The paper gives an overview of the model. As further examples of tasks it can handle, the paper lists a user holding an object and asking "Where should I return this?" and a user showing a smartphone and asking "Where can I charge this?"



The research team highlighted the results as a major success, stating that the system "achieved end-to-end success rates of up to 90 percent on previously infeasible navigation tasks involving complex reasoning and multimodal user instructions in a large real-world environment."