Trendar

“reinforcement learning”

이 키워드와 관련된 논문 · GitHub · 뉴스를 한곳에 모았습니다.

논문 12

전체 →
  1. OpenAlex자연어·LLM인용 1.4K
    대규모 언어 모델의 발전과 활용에 대한 종합적 조사A Survey of Large Language Models
  2. Semantic Scholar자연어·LLM인용 515
    추론·에이전트 성능과 효율성을 동시에 높인 오픈 LLMDeepSeek-V3.2: Pushing the Frontier of Open Large Language Models
  3. Semantic Scholar자연어·LLM인용 392
    LLM 추론 효율화를 위한 오버싱킹 문제와 해결 방안 종합 분석Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models
  4. Semantic Scholar인용 259
    단 하나의 학습 예제로 LLM 수학 추론 능력을 향상시키는 강화학습Reinforcement Learning for Reasoning in Large Language Models with One Training Example
  5. Semantic Scholar인용 233
    Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models
  6. Semantic Scholar인용 151
    d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning
  7. Semantic Scholar인용 103
    Artificial Intelligence in Cybersecurity
  8. Semantic Scholar인용 70
    Machine Learning and Deep Learning Paradigms: From Techniques to Practical Applications and Research Frontiers
  9. Semantic Scholar인용 61
    Deep Learning for Time Series Forecasting: Review and Applications in Geotechnics and Geosciences
  10. arXiv인용 0
    Shape Formation for the Cooperative Transportation of Arbitrary Objects Using Multi-Agent Reinforcement Learning
  11. arXiv인용 0
    Safe-RULE: Safe Reinforcement UnLEarning
  12. arXiv인용 0
    An Agency-Transferring Model-Free Policy Enhancement Technique