| Dec 26, 2023 | Are Emergent Abilities of Large Language Models a Mirage? |
| Dec 19, 2023 | Learning to Tokenize for Generative Retrieval |
| Dec 19, 2023 | Break the Sequential Dependency of LLM Inference Using Lookahead Decoding |
| Dec 12, 2023 | Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning |
| Oct 31, 2023 | In-Context Learning Learns Label Relationships but Is Not Conventional Learning |
| Oct 31, 2023 | EFFICIENT STREAMING LANGUAGE MODELS WITH ATTENTION SINKS |
| Oct 31, 2023 | A Survey on Large Language Model based Autonomous Agents |
| Oct 17, 2023 | Resolving Interference When Merging Models |
| Oct 10, 2023 | LongLoRA: Efficient Fine-Tuning of Long-Context Large Language Models |
| Oct 03, 2023 | DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models |
| Sep 19, 2023 | The CRINGE Loss: Learning what language not to model |
| Sep 19, 2023 | LARGE LANGUAGE MODELS AS OPTIMIZERS |
| Sep 12, 2023 | SILO LANGUAGE MODELS: ISOLATING LEGAL RISK IN A NONPARAMETRIC DATASTORE |
| Sep 12, 2023 | A Systematic Study of Knowledge Distillation for Natural Language Generation with Pseudo-Target Training |
| Aug 29, 2023 | Code Llama: Open Foundation Models for Code |
| Jun 29, 2023 | QLoRA: Eficient Finetuning of Quantized LLMs |
| Jun 22, 2023 | The False Promise of Imitating Proprietary LLMs |
| Jun 15, 2023 | Do Prompt-Based Models Really Understand the Meaning of Their Prompts? |
| May 25, 2023 | Rethinking the Role of Demonstrations: What Makes In-Context Learning Work? |
| May 11, 2023 | Measuring Association Between Labels and Free-Text Rationales |
| Apr 27, 2023 | Automatic chain of thought prompting in large language models |
| Apr 20, 2023 | FALSESUM : Generating Document-level NLI Examples for Recognizing Factual Inconsistency in Summarization |
| Apr 13, 2023 | P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks |
| Apr 13, 2023 | AdapterDrop: On the Efficiency of Adapters in Transformers |
| Mar 30, 2023 | GPT Understands, Too |
| Mar 16, 2023 | Calibrating Factual Knowledge in Pretrained Language Models |
| Feb 09, 2023 | AdapterHub: A Framework for Adapting Transformers, Parameter-Efficient Transfer Learning for NLP |
| Feb 02, 2023 | Measuring and Improving Semantic Diversity of Dialogue Generation |
| Jan 26, 2023 | Task-aware Retrieval with Instructions |
| Jan 19, 2023 | KALA: Knowledge-Augmented Language Model Adaptation |
| Jan 12, 2023 | A Survey for In-context Learning |