| Aug 05, 2025 | Impact of Fine-Tuning Methods on Memorization in Large Language Models |
| Jul 15, 2025 | Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models |
| Jun 10, 2025 | DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models |
| Apr 22, 2025 | Fine-tuning Vision-Language-Action Models: Optimizing Speed and Success |
| Apr 08, 2025 | On the Biology of a Large Language Model |
| Mar 04, 2025 | Contextual Document Embeddings |
| Feb 18, 2025 | DeepSeek v3 |
| Jan 14, 2025 | OpenVLA: An Open-Source Vision-Language-Action Model |
| Jan 02, 2025 | TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies |
| Jan 02, 2025 | Inferring from Logits: Exploring Best Practices for Decoding-Free Generative Candidate Selection |
| Jan 02, 2025 | Diffusion Language Model-Mathematical foundations & inference optimization |
| Sep 09, 2024 | Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models |
| Sep 02, 2024 | LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders |
| Aug 13, 2024 | Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process |
| Aug 13, 2024 | Knowledge conflict survey |
| Jul 30, 2024 | In-Context Retrieval-Augmented Language Models |
| Jul 02, 2024 | RL-JACK: Reinforcement Learning-powered Black-box Jailbreaking Attack against LLMs |
| Jul 02, 2024 | Llama3 Tokenizer |
| Jun 11, 2024 | Contextual Position Encoding: Learning to Count What’s Important |
| May 07, 2024 | How to Train LLM? - From Data Parallel To Fully Sharded Data Parallel |
| Apr 30, 2024 | Training diffusion modelse with reinforcement learning |
| Mar 12, 2024 | A Simple and Effective Pruning Approach for Large Language Models |
| Mar 11, 2024 | BitNet: Scaling 1-bit Transformers for Large Language Models |
| Jan 23, 2024 | IN-CONTEXT PRETRAINING: LANGUAGE MODELING BEYOND DOCUMENT BOUNDARIES |
| Jan 09, 2024 | Making Large Language Models A Better Foundation For Dense Retrieval |
| Dec 19, 2023 | Learning to Tokenize for Generative Retrieval |
| Oct 31, 2023 | A Survey on Large Language Model based Autonomous Agents |
| Oct 10, 2023 | LongLoRA: Efficient Fine-Tuning of Long-Context Large Language Models |
| Aug 29, 2023 | Code Llama: Open Foundation Models for Code |
| May 11, 2023 | Measuring Association Between Labels and Free-Text Rationales |
| Apr 13, 2023 | P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks |
| Mar 30, 2023 | GPT Understands, Too |
| Jan 19, 2023 | KALA: Knowledge-Augmented Language Model Adaptation |