embedding

an archive of posts with this tag

Aug 05, 2025 Impact of Fine-Tuning Methods on Memorization in Large Language Models
Jul 15, 2025 Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models
Jun 10, 2025 DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models
Apr 22, 2025 Fine-tuning Vision-Language-Action Models: Optimizing Speed and Success
Apr 08, 2025 On the Biology of a Large Language Model
Mar 04, 2025 Contextual Document Embeddings
Feb 18, 2025 DeepSeek v3
Jan 14, 2025 OpenVLA: An Open-Source Vision-Language-Action Model
Jan 02, 2025 TraceVLA: Visual Trace Prompting Enhances Spatial-Temporal Awareness for Generalist Robotic Policies
Jan 02, 2025 Inferring from Logits: Exploring Best Practices for Decoding-Free Generative Candidate Selection
Jan 02, 2025 Diffusion Language Model-Mathematical foundations & inference optimization
Sep 09, 2024 Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models
Sep 02, 2024 LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders
Aug 13, 2024 Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process
Aug 13, 2024 Knowledge conflict survey
Jul 30, 2024 In-Context Retrieval-Augmented Language Models
Jul 02, 2024 RL-JACK: Reinforcement Learning-powered Black-box Jailbreaking Attack against LLMs
Jul 02, 2024 Llama3 Tokenizer
Jun 11, 2024 Contextual Position Encoding: Learning to Count What’s Important
May 07, 2024 How to Train LLM? - From Data Parallel To Fully Sharded Data Parallel
Apr 30, 2024 Training diffusion modelse with reinforcement learning
Mar 12, 2024 A Simple and Effective Pruning Approach for Large Language Models
Mar 11, 2024 BitNet: Scaling 1-bit Transformers for Large Language Models
Jan 23, 2024 IN-CONTEXT PRETRAINING: LANGUAGE MODELING BEYOND DOCUMENT BOUNDARIES
Jan 09, 2024 Making Large Language Models A Better Foundation For Dense Retrieval
Dec 19, 2023 Learning to Tokenize for Generative Retrieval
Oct 31, 2023 A Survey on Large Language Model based Autonomous Agents
Oct 10, 2023 LongLoRA: Efficient Fine-Tuning of Long-Context Large Language Models
Aug 29, 2023 Code Llama: Open Foundation Models for Code
May 11, 2023 Measuring Association Between Labels and Free-Text Rationales
Apr 13, 2023 P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks
Mar 30, 2023 GPT Understands, Too
Jan 19, 2023 KALA: Knowledge-Augmented Language Model Adaptation