Jul 15, 2025 Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models Oct 17, 2024 Rule Based Rewards for Language Model Safety Mar 19, 2024 Unveiling the Generalization Power of Fine-Tuned Large Language Models Feb 20, 2024 KNOWLEDGE CARD: FILLING LLMS’ KNOWLEDGE GAPS WITH PLUG-IN SPECIALIZED LANGUAGE MODELS Jan 16, 2024 Mistral 7B & Mixtral (Mixtral of Experts) Jan 02, 2024 DETECTING PRETRAINING DATA FROM LARGE LANGUAGE MODELS May 25, 2023 Rethinking the Role of Demonstrations: What Makes In-Context Learning Work? Apr 20, 2023 FALSESUM : Generating Document-level NLI Examples for Recognizing Factual Inconsistency in Summarization