Jun 03, 2025 TextGrad: Automatic “Differentiation” via Text
Apr 15, 2025 Universal and Transferable Adversarial Attacks on Aligned Language Models
Feb 04, 2025 SSM → HIPPO → LSSL → S4 → Mamba → Mamba2
Oct 17, 2024 Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition
Jul 23, 2024 PySpark - How to preprocess Large Scale Data with Python
Apr 16, 2024 Understanding Emergent Abilities of Language Models from the Loss Perspective
Apr 02, 2024 Preference-free Alignment Learning with Regularized Relevance Reward
Mar 05, 2024 Beyond Memorization: Violating Privacy Via Inferencing With LLMs
Jan 16, 2024 Mistral 7B & Mixtral (Mixtral of Experts)
Oct 31, 2023 In-Context Learning Learns Label Relationships but Is Not Conventional Learning
Jun 15, 2023 Do Prompt-Based Models Really Understand the Meaning of Their Prompts?
May 25, 2023 Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?
Mar 16, 2023 Calibrating Factual Knowledge in Pretrained Language Models
Feb 09, 2023 AdapterHub: A Framework for Adapting Transformers, Parameter-Efficient Transfer Learning for NLP
Jan 26, 2023 Task-aware Retrieval with Instructions