None in particular.
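In terms of performance, the method scores better than existing intervention methods.
The features extracted by the SAE turned out to separate the classes well.
The ablation study shows that the loss functions are necessary.
Without training the probe, the output text loses consistency and logical coherence.
Without the language-model loss, it becomes difficult to preserve the base language model's responses.
The proposed method can find a good intervention direction, but the appropriate magnitude is still unknown.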
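The direction-versus-magnitude point can be illustrated with a minimal sketch. It assumes the intervention is additive steering of a hidden state, h' = h + alpha * d, which these notes do not confirm. The names `steer` and `alpha` and the example vectors are hypothetical. The direction is normalized so that `alpha` alone sets the size of the shift, which is the quantity the notes describe as unknown.

```python
import math

def steer(hidden, direction, alpha):
    """Shift a hidden state along a unit-normalized direction by alpha.

    Assumption: additive steering; not confirmed as the paper's method.
    """
    norm = math.sqrt(sum(x * x for x in direction))
    if norm == 0.0:
        raise ValueError("direction must be non-zero")
    return [h + alpha * x / norm for h, x in zip(hidden, direction)]

# Hypothetical hidden state and intervention direction.
hidden = [1.0, 2.0, 3.0]
direction = [0.0, 3.0, 4.0]

# The direction is fixed; only the magnitude alpha has to be chosen.
for alpha in (0.5, 2.0, 8.0):
    shifted = steer(hidden, direction, alpha)
    shift_size = math.sqrt(sum((s - h) ** 2 for s, h in zip(shifted, hidden)))
    print(alpha, shift_size)
```

In this sketch the shift size always equals `alpha`, so picking `alpha` is a separate decision from finding the direction, for example by sweeping values on held-out data.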
Data generation used ToolBench as the base data.
When the generation pipeline was tested with various LLMs, the smaller models often called invalid APIs.
The trained models are evaluated on the Berkeley Function-Calling dataset.
LLMs trained for function calling (FC) outperform models such as GPT-4o.
At each filtering step, a model was trained on the data remaining after that filter and then evaluated.
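The following is a minimal sketch of a check that catches invalid API calls, and of keeping the surviving data after each filtering step. The API registry, the sample format, and the two filter stages (`is_known_api`, `has_required_args`) are hypothetical. The notes do not say which filters the paper actually uses.

```python
# Hypothetical API registry: API name -> set of required argument names.
API_REGISTRY = {
    "get_weather": {"city"},
    "search_flights": {"origin", "destination"},
}

def is_known_api(sample):
    """Keep samples whose call targets an API that exists in the registry."""
    return sample["call"]["name"] in API_REGISTRY

def has_required_args(sample):
    """Keep samples whose call supplies every required argument."""
    required = API_REGISTRY.get(sample["call"]["name"], set())
    return required <= set(sample["call"]["arguments"])

def run_filters(samples, filters):
    """Apply filters in order and return the surviving data after each step."""
    stages = []
    current = list(samples)
    for keep in filters:
        current = [s for s in current if keep(s)]
        stages.append((keep.__name__, current))
    return stages

# Hypothetical generated samples: one valid, one with a misspelled API name,
# one with a missing required argument.
samples = [
    {"call": {"name": "get_weather", "arguments": {"city": "Tokyo"}}},
    {"call": {"name": "get_wether", "arguments": {"city": "Tokyo"}}},
    {"call": {"name": "search_flights", "arguments": {"origin": "NRT"}}},
]

stages = run_filters(samples, [is_known_api, has_required_args])
for name, data in stages:
    print(name, len(data))
```

Each entry in `stages` is a dataset one could train and evaluate on to measure what that filtering step contributes.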