- formatting
- images
- links
- math
- code
- blockquotes
- external-services
- ON THE EXPRESSIVENESS OF SOFTMAX ATTENTION: A RECURRENT NEURAL NETWORK PERSPECTIVE
  Paper review: research related to Efficient Transformers
- BLOCK DIFFUSION: INTERPOLATING BETWEEN AUTOREGRESSIVE AND DIFFUSION LANGUAGE MODELS
  Paper review: research related to DiffusionLM and LLMs
- Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
  Paper review: research related to Reinforcement Learning