- formatting
- images
- links
- math
- code
- blockquotes
- external-services
- SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories
  Paper review: research related to Autonomous-Agents, Code Generation
- Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling
  Paper review: research related to Knowledge Distillation, LLM, Limited Budget
- Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models
  Paper review: research related to VLM, Safety
- Many-shot jailbreaking
  Paper review: research related to ICL, Safety
- LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
  Paper review: research related to Embeddings, LLM