ðè«ææ
å ±
ðãã®è«æã®ããŒã¡ãã»ãŒãž
- äºååŠç¿ã«ã³ãŒãã£ã³ã°ã¿ã¹ã¯ãæ··ããããšã§ãå®çšçãªLLMãåŠç¿ããããšãã§ããã
ðã©ãããåé¡ã«åãçµãã ã®ã
- äžã€ã®LLMã§ãèšèªãšããã°ã©ãã³ã°ã®èœåããã©ã³ã¹è¯ãåŠç¿ããããš
ð§âðãã®åé¡ã«åãçµãããšããªãéèŠãªã®ã
- LLMã®å®ç°å¢çšã®ã€ã³ã¿ã©ã¯ã·ã§ã³ã«ãããŠãAPIãæŽ»çšããå¿
èŠããã
- ãã£ããæ©èœãæšè«ã ãã§ã¯ãªããããã°ã©ã ãé©åã«æ§ç¯ããèœåãå¿
èŠã«ãªã£ãŠãã
- ChatGPTãªã©ã®ClosedãªLLMã§ã¯ããã£ããæ©èœãšããã°ã©ã ã®äž¡æ¹ãå®çŸã§ããŠãã
- äžæ¹ã§ãLlamaãªã©ã®ã¢ãã«ã¯ãããããã®ã¿ã¹ã¯å°çšã®LLMãå
¬éããã«çãŸã£ãŠãã
ð¡åé¡è§£æ±ºã«åããããŒã¢ã€ãã¢ã¯äœã
- äºååŠç¿ãšã€ã³ã¹ãã©ã¯ã·ã§ã³ãã¥ãŒãã³ã°ã«åããŠåŠç¿ããã
- äºååŠç¿ã§ã¯ãã³ãŒãã£ã³ã°ã¿ã¹ã¯ãšããã¹ãã®å²åã10:1ã«ããŠåŠç¿ããŠãã
- åŠç¿ã«äœ¿çšããããã°ã©ã ã¯The StackããŒã¿ã»ãããæŽ»çšããŠãã
- ã€ã³ã¹ãã©ã¯ã·ã§ã³ãã¥ãŒãã³ã°ã¯æ¢åã®è€æ°ã®ããŒã¿ã»ãããæ··ããŠäœ¿çšããŠãã
- ããã®ããŒã¿ã¯ããã¹ãã®ã¿
- ãããŸã§å·¥å€«ããŠããç¹ã§ã¯ãªãã®ããïŒ
ðæ°ãã«åãã£ãããšã¯äœã
- äºååŠç¿ã«ã³ãŒãã£ã³ã°ã¿ã¹ã¯ãæ··ããŠåŠç¿ããã€ã³ã¹ãã©ã¯ã·ã§ã³ãã¥ãŒãã³ã°ãé©çšããããšã§ãèªç¶èšèªã察象ãšããæšè«èœåãç¶æãã€ã€ãã³ãŒãã£ã³ã°ã¿ã¹ã¯ã«ãããæ§èœãåäžããŠãã
- ããã°ã©ãã³ã°ã ãã§ã¯ãªããAPIã䜿çšããæšè«ã«ã€ããŠãæ§èœãåäžããŠããã
- äºååŠç¿ãæå¹ã§ãããšèšããã®ãããããªãã
âçåç¹ã¯äœã
- æ»èªã§ãææãããŠãããã©ãäºååŠç¿ã®ã¿ã¹ã¯ã®å²åãã©ããããæå¹ãªã®ãåãããªã
- èè
ãã¯ç¶ç¶åŠç¿ããŠã調ã¹ãŠããããã
- ãã®èŸºã®ãã©ã³ã¹ãè¯ãæãã«èª¿æŽã§ãããç±ãæ°ããã
paper
Created
Mon, 12 Jan 2026 00:00:00 +0900 ðè«ææ
å ±
ðãã®è«æã®ããŒã¡ãã»ãŒãž
- ããŒã¿æ¡åŒµã«APIãçšããããšã§ãäžæµã¿ã¹ã¯ã«ãããæ§èœãåäžããã
ðã©ãããåé¡ã«åãçµãã ã®ã
- èªå·±æåž«æãåŠç¿ã«ãããå€éšããŒã«ãLLMã䜿çšããæ¹æ³ãåŠç¿ãã
- è©äŸ¡ãšéãç¹ããããããèªã¿éããŠãããã
- ããã¹ãããŒã¿ãããTool CallingããŒã¿ã»ãããäœæãã
ð§âðãã®åé¡ã«åãçµãããšããªãéèŠãªã®ã
- æ¢åã®ææ³ã¯ãå€§èŠæš¡ãªäººæã¢ãããŒã·ã§ã³ãå¿
èŠã§ããããšããç¹å®ã®ã¿ã¹ã¯ã®ã¿ã察象ãšããŠãã
- ãã®ãããåŠç¿ããLLMã®å¿çšå
ãéããããšãã課é¡ããã
ð¡åé¡è§£æ±ºã«åããããŒã¢ã€ãã¢ã¯äœã
- ããŒã¿æ¡åŒµã«ã¯ããã¹ãããŒã¿ãçšãã
- ç䌌APIãLLMãçšããŠçæãã
- ç䌌APIã®åœ¢åŒã¯
QA(c)-> r ã®ãããªæã - 䜿çšããã¿ã¹ã¯ãšããŠã¯ãQAãèšç®åé¡ãçšããŠãã
- çæãããç䌌APIãå®è¡ãã
- å®è¡ã¯ãLLMãPythonãªã©ã®ç°å¢ã§ããããšãæ³å®ããŠãã
- ãã£ã«ã¿ãªã³ã°ã«ã¯ãéã¿ä»ãã¯ãã¹ãšã³ããããŒãåºæºãšããŠçšããŠãã
- çæãããããŒã¯ã³ã®æå€±ãšãäœãçæãããªãã£ãæã®æå€±ã®å·®ãéŸå€ããã倧ããæãããŒã¿ã»ããã«å«ãããã«ããŠãã
- æšè«æã«ã¯ã
-> ããŒã¯ã³ãçæããããŸã§ãã³ãŒãã£ã³ã°ãè¡ã- ããã§APIã®åŒã³åºããå
¥ããããAPIã®çµæãæ¿å
¥ããŠããã³ãŒãã£ã³ã°ãåéãã
ðæ°ãã«åãã£ãããšã¯äœã
- QAãæ°åŠã¿ã¹ã¯ã«ãããæ§èœãåäžãã
- äžæµã¿ã¹ã¯ã«ãããæ§èœãåäžããŠãã
- Tool Callã®è©äŸ¡ãç¡ãã®ãæ°ã«ãªã
- ç®çãTool Callingã®æ§èœåäžã§ã¯ãªããTool Callingãéããæ§èœåäžã«ãªãã®ãã
âçåç¹ã¯äœã
- è©äŸ¡ã®ããæ¹ãé©åã§ãããã©ããåãããªã
- API callãå®éã«ããŠããã®ãïŒ
- äžæµã¿ã¹ã¯ã«ãããŠå€éšããŒã«ãåŒã³åºãããªãæ§èœã®åäžã¯å¿
ç¶ã§ã¯ïŒãšãæãã
- ãã®ããŒã¿æ¡åŒµã®ããæ¹ã¯é¢çœããšæã£ãã
paper
Created
Mon, 12 Jan 2026 00:00:00 +0900 ðè«ææ
å ±
ðãã®è«æã®ããŒã¡ãã»ãŒãž
- LLMãçæããå¿çã人éãæžããå¿çãæ¯èŒããããšã§ãLLMãçæããæç« ã®ã¹ã¿ã€ã«ãåãããããšãã§ããã
ðã©ãããåé¡ã«åãçµãã ã®ã
- LLMã®å
éšè¡šçŸã«ä»å
¥ããŠãçæããæç« ãåäººã«æé©åããããš
ð§âðãã®åé¡ã«åãçµãããšããªãéèŠãªã®ã
- ãŠãŒã¶ãŒæ¯ã«æé©åãããããã¹ããçæããéèŠãé«ãŸã£ãŠãã
- æ¢åã®ææ³ã¯RAGãPEFTã«ããææ³ã泚ç®ãããŠãã
- ãããã®ææ³ã¯èšç®ã³ã¹ããé«ãããšãããŠãŒã¶ãŒç¹æã®èšãåãã«åœ±é¿ããããã
ð¡åé¡è§£æ±ºã«åããããŒã¢ã€ãã¢ã¯äœã
- LLMã®é ãå±€ã«ä»å
¥ããããšã§ãåäººã«æé©åãããæç« ãçæããããš
- åæãšããŠããããŠãŒã¶ãŒã®ããã³ãããšããã«å¯Ÿããå¿çããã
- æåã«ãLLMã¯ããã³ããã«å¯Ÿããå¿çãçæãã
- ããã³ãããšå¿çãç¹ããæç« ã®æçµããŒã¯ã³ã«è©²åœããç¹åŸŽéã䜿ã£ãŠä»å
¥ããæ¹åã®èšç®ããã
- ãŠãŒã¶ãŒã®å¿çãç¹ããå Žåã®ç¹åŸŽéãããžãã£ããLLMã®å¿çãç¹ããå Žåã®ç¹åŸŽéããã¬ãã£ããšããŠãã
- ããããããšã§ãä»ã®LLMããã®äººã«åãããããã«ã©ããããä»å
¥ããã°è¯ããèšç®ã§ãã
- æ¹åã®èšç®ã«ã¯ãæ§ã
ãªæ¹æ³ã䜿çšããŠãã
- PCAãšããMean Differenceãªã©ãªã©
- ããã§èšç®ãããã¯ãã«ãçšããŠä»å
¥ãã
ðæ°ãã«åãã£ãããšã¯äœã
- å奿é©åãã³ãããŒã¯ã®LaMPã§è©äŸ¡ãã
- çæãè©äŸ¡ãããã®ãšé·æãè©äŸ¡ãããã®ã®äºã€ããã
- ææ¡ææ³ã¯ãRAGãPEFTã®ææ³ãããè¯ãæ§èœã瀺ããŠãã
- ä»å
¥éã«ããæ§èœã倧ããå€ãã
- ææ¡ææ³ã«ããæšå®ãããä»å
¥ãã¯ãã«ã«ãããŠæ£ã®æ¹åã«ä»å
¥ãããšãŠãŒã¶ãŒã®ã¹ã¿ã€ã«ãåæ ãããããªãããè² ã®æ¹åã«ãããšã¹ã¿ã€ã«ãé¢ä¿ç¡ããªã£ãŠããŸã
âçåç¹ã¯äœã
- å®éšã®ã¹ã¿ã€ã«ãã¯ãã«ã®èšç®ã«ã¯äœã䜿çšããã®ã ãããïŒ
- ã¹ã¿ã€ã«ãã¯ãã«ã®èšç®æ¹æ³ã«ãã£ãŠæ§èœãå€ãã£ããããã®ããª
paper
Created
Thu, 25 Dec 2025 00:00:00 +0900 ðè«ææ
å ±
ðãã®è«æã®ããŒã¡ãã»ãŒãž
- ïŒ1, 2æã§ãŸãšããïŒ
ðã©ãããåé¡ã«åãçµãã ã®ã
- Vison-Languageã¢ãã«(VLM)ã®å
éšè¡šçŸã«ä»å
¥ããããšã§ããã«ã·ããŒã·ã§ã³ãé²ãããš
- ããã§ã®ãã«ã·ããŒã·ã§ã³ã¯ãç»åã«åã£ãŠããªãç©äœã«ã€ããŠã¢ãã«ãèšåããçŸè±¡ãæã
ð§âðãã®åé¡ã«åãçµãããšããªãéèŠãªã®ã
- VLMã®ãã«ã·ããŒã·ã§ã³ãé²ãããšã¯ãå®çšäžéèŠ
- æ¢åã®ææ³ã¯ããŒã¿ã®å質ãæå€±é¢æ°ã®å·¥å€«ãªã©ã§ããã«å¯ŸåŠããŠãã
- åŠç¿ã«å¿
èŠãªèšç®ã³ã¹ãã倧ãããããå®å°ã«é©å¿ããããã«æéãããã
- ä»ã®åŠç¿ç¡ãã®ææ³ã§ã¯ç»åã®åªå
床ãäžããããã«ããŠããããç»åå
ã®Attentionãªã©ã®ç¹å®ã®ä»®å®ã«äŸåããŠãã
ð¡åé¡è§£æ±ºã«åããããŒã¢ã€ãã¢ã¯äœã
- MSCOCOã䜿çšããŠãVLMã®äžé衚çŸãProbingããã
- ãã«ã·ããŒã·ã§ã³ãããŠãããã©ããã¯ãããŒã¯ã³ãMSCOCOã®ã¯ã©ã¹ã®åèªãããã®é¡çŸ©èªãå«ããã©ããã§å€å®ããŠãã
- ä»å
¥ãããã¯ãã«ã¯ããã«ã·ããŒã·ã§ã³ãç¡ãããŒã¯ã³ã®äžé衚çŸã®å¹³åãã¯ãã«ãããã«ã·ããŒã·ã§ã³ããŠããããŒã¯ã³ã®äžé衚çŸã®å¹³åãã¯ãã«ãåŒãããã¯ãã«ãçšãã
- æç« ã®çææã«ã¯ãæ£ã®æ¹åãšè² ã®æ¹åã«ä»å
¥ããäºã€ã®ã¢ãã«ãçæããããžãããè¶³ãããã®ã䜿çšããŠãã
- ä»å
¥ããéã¯åå¥ã«èšå®ããŠãã
ðæ°ãã«åãã£ãããšã¯äœã
- ãã³ãããŒã¯ã«ãããè©äŸ¡ã§ã¯ãæ£è§£çãšF1ã¹ã³ã¢ãæ¹åããŠãã
- 䜿çšããããŒã¿ã¯ãã«ã·ããŒã·ã§ã³ã®ãã³ãããŒã¯
- æ¢åã®ãã«ã·ããŒã·ã§ã³å¯Ÿçãããææ³ãããè¯ããªã£ãŠãã
- æ¢åã®ç»åçè§£ãã³ãããŒã¯ã«ãããŠãä»ã®ææ³ãšåçã®æ§èœã«ãªã£ãŠãã
- ä»å
¥éæ¯ã®æ§èœãèŠããšãæ£ã®æ¹åãžã®ä»å
¥éã¯ããã©ãŒãã³ã¹ã«å€§ãã圱é¿ãã
âçåç¹ã¯äœã
- ãã€ãã©ã®éãå¢ããŠããã®ã¯è¯ãã®ãïŒ
- èè
ããèšåããŠããããã«ããŽãªåãªã©ã¯MSCOCOã«äŸåããŠãã
- ããé£ããåé¡ã ãšæã£ã
paper
Created
Wed, 24 Dec 2025 00:00:00 +0900 ðè«ææ
å ±
ðãã®è«æã®ããŒã¡ãã»ãŒãž
- 察ç
§åŠç¿ã䜿çšããããšã§ãLLMã®ç¹åŸŽãšå¿çã®äžèŽåºŠãåäžãã
ðã©ãããåé¡ã«åãçµãã ã®ã
- LLMã®å
éšè¡šçŸãšåºåæç« ã®äžèŽåºŠãæããããã«ã¢ãã«ã調æŽããããš
ð§âðãã®åé¡ã«åãçµãããšããªãéèŠãªã®ã
- LLMãå²ãåœãŠã確çã«ã¯ãééãã®å¿çã«é«ã確çãå²ãåœãŠããªã©ã®èª²é¡ããã
- æ£ç¢ºãªå¿çã«é«ã確çãå²ãåœãŠãããã«ãå
éšè¡šçŸã調æŽããææ³ã§ã¯è€æ°ã®å¥œãŸããç¹æ§ã«å¯Ÿå¿ããããšãé£ãã
ð¡åé¡è§£æ±ºã«åããããŒã¢ã€ãã¢ã¯äœã
- ããŒã¯ã³åäœã§LLMã®æçµå±€ã«æåãå ãã
- åŸé
ã䜿çšããŠæåãå ãã
- æå€±é¢æ°ã¯é ãå±€ã®ç¶æ
ããæ£è§£ããŒã¯ã³ãäºæž¬ãã確çã®ã¯ãã¹ãšã³ããããŒã䜿çšããŠãã
- ãã©ã¡ãŒã¿ãæŽæ°ããæãšéæ¹åã®åŸé
ãæåãšããŠå ãã
- æåãå ããæã®ããžããã2.ã§äœ¿çšãã
- ãã®æåãå ããã¹ãããã¯Såè¡ã
- ãã®æåã«åœ±é¿ã®ããç¹åŸŽãæœåºãã
- ç¹åŸŽéã¯ããŒã¯ã³æ¯ã«æœåºãã
- ãã®ã§ãç¹åŸŽéãšã¯ãããžãããåŸé
ã®L2ãã«ã ãªã©ãæã
- ãã®ç¹åŸŽéã«å¯ŸããŠå¹³åãªã©ã®çµ±èšåŠçãå ããå€ãæçµçãªç¹åŸŽéãšãã
- ãã®ç¹åŸŽéãã確信床(確çã®ããšãïŒ)ãäºæž¬ããåé¡åšãåŠç¿ãã
- åé¡åšã¯äºã€çšãã
- ããŒã¯ã³ã®ç¹åŸŽéæ¯ã«æ£è§£ãäžæ£è§£ãäºæž¬ããåé¡åšãšæç« åäœã§äºæž¬ããåé¡åš
- æç« åäœã§äºæž¬ããåé¡åšã¯ãããŒã¯ã³åäœã®åé¡åšãæœåºããç¹åŸŽéãé£çµããç¹åŸŽéã䜿çšããŠäºæž¬ããŠãã
- ãã®ã§ã®ç¹åŸŽéã¯ãã¢ãã«ã®æçµåºåãæã
- ããŒã¯ã³åäœã®åé¡åšã¯MLPãæç« åäœã®åé¡åšã¯ç³ã¿èŸŒã¿ã䜿çšããŠãã
- ç®ç颿°ã¯max-marginæå€±ã䜿çšãã
- è² äŸã¯ããžããã®å€ãåºã«æ±ºããŠãã
ðæ°ãã«åãã£ãããšã¯äœã
- è©äŸ¡ã¯éžæåé¡ã察象ãšãã
- è©äŸ¡ææšã«ã¯ãExpected Calibration ErrorãšBarier Scoreã䜿çšããŠãã
- åé¡åé¡ã«ãããŠã¯ãECEãä»ã®ææšãããããæ¹åããŠãã
- å ããŠãæ£è§£çãªã©ã®ææšãæ¹åããããšãåãã£ã
- çæã¿ã¹ã¯ã«ãããŠãæ¹åã§ããããšãåãã£ã
âçåç¹ã¯äœã
- ä»ã®ãã¡ã€ã³ã«ãããæå¹æ§ãæ°ã«ãªã
- æçµå±€ã ãã§æå¹ã ã£ãã®ããª
- ä»ã®ã¬ã€ã€ãŒã®å¹æãæ°ã«ãªã
paper
Created
Tue, 23 Dec 2025 00:00:00 +0900 ðè«ææ
å ±
ðãã®è«æã®ããŒã¡ãã»ãŒãž
- ãã£ãŒãããã¯ãçšããŠã¢ãã«ãæŽæ°ããããšã§ãéå»ã®æšè«çµæã掻ããã€ã€æšè«ã®æ§èœãåäžãã
ðã©ãããåé¡ã«åãçµãã ã®ã
- ãã¹ãæã«ããããã£ãŒãããã¯ããLLMãæŽæ°ãã
- ãã¹ãæã«ãããŠæšè«ãè¡ãããã®çµæãçšããŠå床æšè«ãããšããã¿ã¹ã¯ã«ãªã
- ãã®æã«ãéå»ã®çµéšãäžæãæŽ»çšããŠLLMãæŽããããšãç®æã
ð§âðãã®åé¡ã«åãçµãããšããªãéèŠãªã®ã
- åŸæ¥ã®ææ³ã§ã¯ãSequential RevisionãšParallel samplingããã
- Sequential Revisionã¯ãéå»ã®ãã©ã€ã¢ã«çµæãããã³ããã«å«ããæ¹æ³
- Parallel Samplingã¯éå»ã®çµæã«é¢ããããäœåºŠãäºæž¬ããæ¹æ³ã«ãªã
- Sequential Revisionã¯ã³ã³ããã¹ãé·ãé·ããªãããããããèšç®ã³ã¹ããé«ããªããããäœçœ®ãã€ã¢ã¹ã®åœ±é¿ããã
- Parallel samplingã¯å¹ççã§ããããéå»ã®ãšã©ãŒãèæ
®ããªã課é¡ããã
ð¡åé¡è§£æ±ºã«åããããŒã¢ã€ãã¢ã¯äœã
- éå»ã®ãã©ã€ã¢ã«ããããã¢ãã«ã®éã¿ã«éç¹ã眮ããææ³ãææ¡ããŠãã
- æå€±é¢æ°ãšå¹ççãªOptimizerãææ¡ããŠãã
- LLMã¯åé¡ã«å¯Ÿããè§£çããããšãæ€èšŒã¢ãã«ãæ£è§£ãã©ãããå€å®ãã
- äžæ£è§£ã§ããå Žåãæ€èšŒã¢ãã«ã¯äžæ£è§£ã§ãããšããåºå®ã®æç« ãçæãã
- 远å ã®ãã£ãŒãããã¯ãšããŠLLMãæç« çæãã
- ãããã®äºã€ã®ãã£ãŒãããã¯ã«å¯ŸããŠã¯ãã¹ãšã³ããããŒãæå°ã«ãªãããã«åŠç¿ãé²ãã
- ã¢ãã«ã®ãã©ã¡ãŒã¿å
ã«éå»ã®çµéšãä¿åããããšãã話
- Optimizerã«ã€ããŠã¯ããåãããªãã£ã
- PEFTãåèã«ããã¿ãããããïŒ
ðæ°ãã«åãã£ãããšã¯äœã
- Parallel Samplingã§ã¯20GPU/hã ã£ãã®ã«å¯ŸããŠãææ¡ææ³ã§ã¯4GPU/hã«æ¹åããã
- ãã©ã€ã¢ã«ã®åæ°æ¯ã«æ¯èŒãããšãææ¡ææ³ã¯åæ°ãå¢ããçšæ§èœãè¯ããªã£ãŠãã
- ææ³ã«ãã£ãŠã¯ãäœäžããŠãããã®ããã
- Optimizerã®æ¯èŒã§ã¯ãLoRAãšæ¯èŒããŠå°ãªããã©ã¡ãŒã¿ã§è¯ãæ§èœã«ãªã£ãŠãã
âçåç¹ã¯äœã
- Sequential Samplingãšææ¡ææ³ã®èšç®ã³ã¹ããéããããªãã
- Optmizerã®ç«ã¡äœçœ®ãåãããªã
- ããå¥ã®ææ³ã§ã¯ãªãïŒ
paper
Created
Fri, 28 Nov 2025 00:00:00 +0900 ðè«ææ
å ±
ðãã®è«æã®ããŒã¡ãã»ãŒãž
- DPOãåºã«ããå ±é
¬ã掻çšããŠæç« ã®ãã³ãŒãã£ã³ã°ãè² äŸã®éžæãããããšã¯ãããŒãœãã©ã€ãºã«ãããŠæå¹ã§ãã
ðã©ãããåé¡ã«åãçµãã ã®ã
- LLMãæç« ãçæããæã«ããŠãŒã¶ãŒã®æå³ãæšå®ããªããæç« ãçæããããã«ãã
ð§âðãã®åé¡ã«åãçµãããšããªãéèŠãªã®ã
- ãŠãŒã¶ãŒã®æå³ã«æ²¿ãå¿çãçæããããšã¯LLMã®å®çšäžéèŠã§ãã
- çŸç¶ã¯ãããã³ããããŒã¹ã®æ¹æ³ãšLoRAãªã©ã¢ãã«ã®ãã©ã¡ãŒã¿ãæŽæ°ããæ¹æ³ã®äºçš®é¡ããã
- ããã³ããããŒã¹ã®ææ³ã§ã¯ããŠãŒã¶ãŒã®ããŒã¿ããåŠç¿ããããšãç¡ããã广ãéå®çã§ãã課é¡ããã
- ãã©ã¡ãŒã¿ãæŽæ°ããææ³ã§ã¯ãç Žæ»
çå¿åŽãèšç®ã³ã¹ãã®é¢ãã課é¡ããã
ð¡åé¡è§£æ±ºã«åããããŒã¢ã€ãã¢ã¯äœã
- åºæ¬çã«LoRAãæ³å®ããææ³ã«ãªã£ãŠãã
- æç« ã®ãã³ãŒãã£ã³ã°ã«ã¯ãå ±é
¬ããŒã¹ã®ææ³ã䜿çšããŠãã
- éŸå€ãã倧ããªç¢ºçã®ããŒã¯ã³éåãåŸã
- åºã¢ãã«ãšLoRAãé©çšããã¢ãã«ããã®ããŒã¯ã³ãçæãã確çã®æ¯ãå ±é
¬ãšãã
- ãã®å ±é
¬ãæå€§ã«ãªãããŒã¯ã³ãéžæããŠãã³ãŒãã£ã³ã°ãã
- ã¢ãã«ã®åŠç¿ã«ã¯ãDPOã䜿çšããŠãã
- ããŒã¿ã»ããã®æ§ç¯ã®ããã«ã¯ãLLMãçæããããã€ãã®äŸã®äžããäžèšã®å ±é
¬ãæãå°ãããã®ãè² äŸãšããŠãã
ðæ°ãã«åãã£ãããšã¯äœã
- ããã³ããããŒã¹ã®ææ³ã¯ãæ§èœåäžãéå®çã§ããããš
- ããŒã¹ã¢ãã«ãããæªããªãããšããã
- ç¹ã«é·æã«ãããŠæ§èœãäœäžããããšã確èªã§ãã
- ææ¡ææ³ã¯ãåŠç¿ããŒã¹ã®ææ³ãããè¯ãã¢ãã«ãåŠç¿ã§ããŠãã
- å ±é
¬ããŒã¹ã®ãã³ãŒããšDPOã®å¹æã¯åçšåºŠã§ãã£ã
âçåç¹ã¯äœã
paper
Created
Tue, 25 Nov 2025 00:00:00 +0900 ðè«ææ
å ±
ðãã®è«æã®ããŒã¡ãã»ãŒãž
- LLMã®Function Callingã¿ã¹ã¯ã®ããŒã¿ã®æ¡åŒµã®ããã«ã¯ãåŒã³åºãããŠããã¿ã¹ã¯ã®äžèŽåºŠãªã©ãå
¥ãããšè¯ã
ðã©ãããåé¡ã«åãçµãã ã®ã
- LLMãå€éšAPIãšé£æºããã¿ã¹ã¯ã§ããFunction Callingã®æ§èœãåäžãããããªåŠç¿ãã
- åŠç¿ã«äœ¿çšããããã³ããã«å«ããäŸã工倫ããææ³ã«ããŠãã
ð§âðãã®åé¡ã«åãçµãããšããªãéèŠãªã®ã
- åŠç¿ããŒã¿ãã¢ãã«ã®ãã©ã¡ãŒã¿æ°ãåã«å¢ãããŠããå®äžçã€ã³ã¿ã©ã¯ã·ã§ã³ã¯è§£æ±ºããããšãã§ããªã
- æ¢åã®Function Callingã®åŠç¿ææ³ã¯ãå
·äœäŸãæåã§ä»äžããŠããããå€§èŠæš¡ã«ãã¥ãã課é¡ããã
ð¡åé¡è§£æ±ºã«åããããŒã¢ã€ãã¢ã¯äœã
- é¡äŒŒã®äŸãååŸããããã®æ¹æ³ãšããŠã以äžã®äžçš®é¡ã®ææšãçšãã
- ãŠãŒã¶ãŒã®ã¯ãšãªãšè»è·¡ã®åã蟌ã¿è¡šçŸã®é¡äŒŒåºŠ
- è»è·¡ãšã¯ããŠãŒã¶ãŒã®å
¥åãšåŒã³åºãããããŒã«ã®å¿çãè€æ°ã¹ãããç¹°ãè¿ãããã®ãæã
- é¡äŒŒåºŠã®ææšã«ã¯ãæ£èŠåã³ãµã€ã³é¡äŒŒåºŠã䜿çšãã
- åŒã³åºãããŒã«ã®äžèŽåºŠ
- å®éã«äœ¿çšãããŠããããŒã«ã®äžèŽåºŠã䜿çšããŠãã
- æå³ã¢ã©ã€ã³ã¡ã³ã
- 䜿çšããæå³ã¯ãäºåã«å®çŸ©ãããŠããã¯ã©ã¹ã«åé¡ãããŠãã
- é¡äŒŒåºŠã®æ€çŽ¢ã«äœ¿çšããå±¥æŽãäžããããæã«æå³ãäœããã®æ¹æ³ã§æšå®ããŠããã®ããïŒ
- æçµçãªé¡äŒŒåºŠã¯ããããã®éã¿ä»ãåã«ãªã£ãŠãã
- é¡äŒŒåºŠãèšæž¬ããããã®ããŒã¿éåã¯æ°ããªè»è·¡ãåŸãããæã«æŽæ°ãã
- LLMã§ãŠãŒã¶ãŒã®æå³ãéæã§ãããšåé¡ãããæã«ããŒã¿éåã«è¿œå ãã
ðæ°ãã«åãã£ãããšã¯äœã
- ToolQAãÏ-benchã«ããè©äŸ¡ã§ã¯ãæ¢åææ³ãããæŠãè¯ãæ§èœã§ãã£ã
- ããŒã¹ã©ã€ã³ææ³ã¯Tool Augmented LLMããã
- Ablation Studyã§ã¯ã2ãš3ã®ææšã®ã©ã¡ããéèŠã£ãœãããšã瀺ãããŠãã
- ToolQAã®Easyã§ã¯3ãç¡ãããšã¹ã³ã¢ã倧ããäžãããHardã§ã¯2ãç¡ããæã倧ããã¹ã³ã¢ãäžãã£ã
- å
šäœçã«ã¯3ã®åœ±é¿åºŠã倧ãããã ãã©ãããã¯è¯ãåãããªããªã
âçåç¹ã¯äœã
ç¹ã«ãªã
paper
Created
Mon, 24 Nov 2025 00:00:00 +0900 ðè«ææ
å ±
ðãã®è«æã®ããŒã¡ãã»ãŒãž
- ïŒ1, 2æã§ãŸãšããïŒ
ðã©ãããåé¡ã«åãçµãã ã®ã
- LLMã®å
éšè¡šçŸã«ä»å
¥ããææ³ã®è©äŸ¡ãããããã®ãã³ãããŒã¯ããŒã¿ã»ãããæ§ç¯ãã
ð§âðãã®åé¡ã«åãçµãããšããªãéèŠãªã®ã
- LLMã®å
éšè¡šçŸã«ä»å
¥ããæ§ã
ãªææ³ãææ¡ãããŠããã
- ã ããçµ±äžãããã³ãããŒã¯ãååšããªãããå
¬å¹³ãªè©äŸ¡ãã§ããŠããªããšãã課é¡ãããã
ð¡åé¡è§£æ±ºã«åããããŒã¢ã€ãã¢ã¯äœã
- Concept DetectionãšModel Steeringã®äºã€ã®ææšãè©äŸ¡ããããã®ããŒã¿ã»ãããæ§ç¯ãã
- Concept Detectionã¯ã·ã³ãã«ãªåé¡åé¡
- Model Steeringã¯ãçæããæç« ãLLMãè©äŸ¡ãããã®ã«ãªã
- ããŒã¿ã®çšæã®ããã«ãGPT-4oã䜿çšããããŒã¿æ¡åŒµãè¡ãªãããŠãã
- Concept Dataset Generation
- ããŒã¿ã»ããã®åœ¢åŒã¯PreferenceããŒã¿ã»ãããšåã圢åŒã«ãªã£ãŠãã
- æç€ºãšããžãã£ããªããŒã¿ã¯LLMã«ããçæãããŠãã
- ãã¬ãã£ããªããŒã¿ã«ã¯ãç°ãªãã³ã³ã»ããã«å±ããã¬ã¹ãã³ã¹ã䜿çšããŠãã
- ã¿ã¹ã¯ã®è©äŸ¡ææšã«ã¯ãç¹å®ã®ã¬ã€ã€ãŒã®åããŒã¯ã³ã®äžé衚çŸãçšããŠåé¡åšãäºæž¬ãã確çã®æå€§å€ãçšããŠãã
- åé¡åšã®äºæž¬ã¯[0-1]ã®äžæ¬¡å
ã®åºåã«ãªã
- Model Steering
- è©äŸ¡ææš
- LLMãå¿çã0ã1ã2ã®ããããã§è©äŸ¡ãã
- ã¹ã³ã¢ã¯ãConceptãInstructoinãFluencyã®3ã€ã䜿çšãã
- æçµã¹ã³ã¢ã¯ã調åå¹³åã䜿çšããŠãã
- è«æäžã§å ±åãããŠããã®ã¯ãç¹å®ã®ã¬ã€ã€ãŒã«ãããã¹ã³ã¢ã«ãªã£ãŠãã
- Model Steeringã§ã¯ç¹å®ã®ã¬ã€ã€ãŒã«ä»å
¥ããæã®ã¹ã³ã¢ã«ãªã£ãŠãã
ðæ°ãã«åãã£ãããšã¯äœã
- Concept Detectionã§ã¯ProbeããŒã¹ã®ææ³ããSAEã䜿çšããææ³ãããè¯ãæ§èœã§ãã£ã
- è©äŸ¡ææšã¯ãAUROCãçšããŠãã
- ç¹ã«ãSAEã¯ããŒã¿ã®ãã©ã³ã¹ãæªããšæ§èœãäœäžããåŸåããã
- Model Steeringã«ãããŠã¯ãSAEã®æ¹ãè¯ãæ§èœã§ãããLoRAãSFTãããæ§èœãäœãçµæã§ãã£ã
âçåç¹ã¯äœã
- Model Steeringã®ã¹ã³ã¢ã«ãããŠãå®éçãªãã®ãæ¡çšãããŠããªãã®ãæ°ã«ãªã
- LLMã«ããè©äŸ¡ã ãã§è¯ãã®ãã¯ãšãŠãçå
- Gemma以å€ã®ã¢ãã«ã®æ§èœã¯ã©ããªã®ã ãã
paper
Created
Sat, 22 Nov 2025 00:00:00 +0900 ðè«ææ
å ±
ðãã®è«æã®ããŒã¡ãã»ãŒãž
- ïŒ1, 2æã§ãŸãšããïŒ
ðã©ãããåé¡ã«åãçµãã ã®ã
- SAEãçšããç¹åŸŽééžæã«ãããŠãå
¥åãšåºåã®ç¹åŸŽéã®ããããã«åœ±é¿ãããç¹åŸŽéãèŠã€ããããš
ð§âðãã®åé¡ã«åãçµãããšããªãéèŠãªã®ã
- Sparse AutoEncoder(SAE)ã¯ä»å
¥ããããã®ç¹åŸŽéãéžæããæã«æå¹ãªææ³ã§ãã
- ã ããä»å
¥ã®ããã«æå¹ãªç¹åŸŽãéžæããããšã¯ãŸã æªç¥ã®åé¡ã§ãã
ð¡åé¡è§£æ±ºã«åããããŒã¢ã€ãã¢ã¯äœã
- ç¹åŸŽéã以äžã®äºçš®é¡ã«åé¡ããåé¡ããããã®ææšãææ¡ãã
- Input featuresïŒã¢ãã«ã«å
¥åããããã¿ãŒã³ãèªèããç¹åŸŽé
- Output featuresïŒã¢ãã«ãçæããããŒã¯ã³ã«åœ±é¿ããç¹åŸŽé
- ãããã®åæã«ã¯ãLogit Lensã䜿çšãããŠãã
- Logit Lengsã¯ã¢ãã«ã®ãã©ã¡ãŒã¿ãèªåœç©ºéã«å°åœ±ãããã®åºåååžãèŠãŠãã©ã¡ãŒã¿ãåæããæ¹æ³ã®ããš
- Input featuresã®ã¹ã³ã¢ã®èšç®ã«ã¯ãä»»æã®æç« éåãçšãã
- ãã®æç« éåã«ãããŠæã倧ããSAEã®ããŒã¯ã³ãçºç«ãããããŒã¯ã³ãšãLogit Lensã«ããäºæž¬ãããããŒã¯ã³ã®äžèŽçãã¹ã³ã¢ãšããŠãã
- Output Featuresã®ã¹ã³ã¢ã®èšç®ã«ã¯Logit Lensã«ããäºæž¬ãããããŒã¯ã³ã®ã¹ã³ã¢ãšé äœã確çã䜿çšãã
- ãã®ç¹åŸŽéã«ä»å
¥ãè¡ã£ãæã®ã¢ãã«ã®åºåååžãšä»å
¥ãããåã®ååžã®å·®ãã¹ã³ã¢ãšããŠãã
- Logit Lensã«ããäºæž¬çµæãçšããŠä»å
¥ããåã®åºåååžãèšç®ããŠããããããåãããªãã£ã
ðæ°ãã«åãã£ãããšã¯äœã
- äžèšã®ã¹ã³ã¢ãGemmaãLlamaã«é©çšããæãGemmaã«ãããŠã¯å
¥åã«è¿ãå±€ã§ã¯Input featuresãåºåã«è¿ãå±€ã§ã¯Output Featuresã®ã¹ã³ã¢ã倧ãããªã£ãŠããã
- ãã以å€ã®ã¢ãã«ã«ãããŠã¯ããã®åŸåã¯åœãŠã¯ãŸã£ãŠããªã
- Output featuresãé«ããã©ã¡ãŒã¿ã«ä»å
¥ããããšã«ããåºåæç« ã®å€åãèšç®ãã
- å®éšã§ã¯ãã¹ã³ã¢ã«éŸå€ãçšæãä»å
¥ããç¹åŸŽéãéžæããŠãã
- è©äŸ¡ã«ã¯ãGeneration Success@Kã䜿çšããŠããã
- Logit Lensã«ããäºæž¬ãããTop-kã®ããŒã¯ã³ãšæç« ã«å«ãŸããããŒã¯ã³ã®äžèŽçãèšç®ããŠããã
- éŸå€ãäžãããšãGeneration Success@Kãäžæããããšãåãã£ã
âçåç¹ã¯äœã
- ã¹ã³ã¢ã®èšç®çµæã§ãããããªçµæãåºãŠããã®ãGemmaã ããªã®ãæ°ã«ãªã
- ä»å
¥ã®çµæã¯åæ§ã®åŸåã瀺ããŠãã
- çµå±Output featuresãé«ããã®ãè¯ãç¹åŸŽã§ããã®ãïŒ
- ä»å
¥ã®æ¹æ³ãè¯ãåãããªãã£ã
- æ¹åãæ±ºããæ¹æ³ãç¥ããã
paper
Created
Tue, 18 Nov 2025 00:00:00 +0900