LLM Agent Self-evolution, Multi-Objective Optimal Decision, ๊ทธ๋ฆฌ๊ณ GRPO RL ๋ถ์ผ๋ฅผ ์ฐ๊ตฌํ๊ณ ์์ต๋๋ค.
๐ Recent Publications (์ต๊ทผ ์ฐ๊ตฌ ์ค์ ) ์ฃผ์ ์ฐ๊ตฌ ๋ถ์ผ์ธ AI Agent, LLM, Scalable Reasoning, Self-Evolution ๊ด๋ จ ๋ ผ๋ฌธ ๋ชฉ๋ก
๐ 2025 (The Latest) ๊ฐ์ฅ ์ต๊ทผ์ ๋ฐํ๋์๊ฑฐ๋ ์งํ ์ค์ธ ์ฐ๊ตฌ๋ค์ ๋๋ค.
๐ Scalable Reasoning via Task Decomposition and Model Routing: Achieving Near-Linear Cost in End-to-End Pipelines [PDF]
๋ถ์ผ: LLM, Reasoning, System Architecture
๋ฐํ ์ฐ๋: 2025
โ๏ธ MO-GRPO for Mitigating Reward Hacking in Multi-Objective LLM RL through a Variance-Equalizing Whitening Layer [PDF]
๋ถ์ผ: Multi-Objective Optimization (MOO), LLM RL
๋ฐํ ์ฐ๋: 2025
๐ค LLM Agent์ ํฉ๋ฆฌ์ ์์ฌ๊ฒฐ์ ์ ์ํ Prompt ํ์ฉ ์ฐ๊ตฌ [PDF]
๋ถ์ผ: AI Agent, Prompt Engineering
๋ฐํ ์ฐ๋: 2025
๐ก๏ธ ์์ ํ๊ณ ํด์ ๊ฐ๋ฅํ AI๋ฅผ ์ํ GRPO(Group Relative Policy Optimization) BDI ๋ณด์ ๋ชจ๋ธ [PDF]
๋ถ์ผ: Interpretable AI (XAI), Reinforcement Learning (RL)
๋ฐํ ์ฐ๋: 2025
๐ฐ 2024 ์ง๋ ํด ์ฃผ์ ์ฐ๊ตฌ ์ฑ๊ณผ ๋ชฉ๋ก์ ๋๋ค.
๐ An Empirical Study of the Structural Understanding Capabilities of LLMs on Financial Document Tables [PDF]
๋ถ์ผ: LLM, Financial AI, Table Understanding
๋ฐํ ์ฐ๋: 2024
๐ LLM ๋ฉํํ๋กฌํ ์ ํ์ฉํ ๊ธฐ์ ๊ณต์๋ฌธ์ ๋ถ์์ ๊ดํ ์ฐ๊ตฌ [PDF]
๋ถ์ผ: LLM, Meta-Prompting, Finance
๋ฐํ ์ฐ๋: 2024
โ๏ธ Design and Implementation of a Condition-Based Operation (CBO) using LLM-Based Multi-Agent Systems [PDF]
๋ถ์ผ: Multi-Agent System, CBO, IoT
๋ฐํ ์ฐ๋: 2024
๐ผ๏ธ Custom Hairstyle Image Generation and Similarity Analysis Using Diffuser and PEFT [PDF]
๋ถ์ผ: Generative AI, Diffuser, PEFT
๋ฐํ ์ฐ๋: 2024
| Platform | Link |
|---|---|
[email protected] |
