How LoRA Remembers? A Parametric Memory Law for LLM Finetuning
How LoRA Remembers? A Parametric Memory Law for LLM Finetuning
Topic · 记忆
仅有原始 MD
Quick Read
LLM failed, fallback used
Same Evidence, Different Answers: Canonical-Context On-Policy Distillation for Multi-Turn Language Models
Same Evidence, Different Answers: Canonical-Context On-Policy Distillation for Multi-Turn Language Models
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Reinforcement Learning with Robust Rubric Rewards
Reinforcement Learning with Robust Rubric Rewards
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
Do Language Models Track Entities Across State Changes?
Do Language Models Track Entities Across State Changes?
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Beyond 3D VQAs: Injecting 3D Spatial Priors into Vision-Language Models for Enhanced Geometric Reasoning
Beyond 3D VQAs: Injecting 3D Spatial Priors into Vision-Language Models for Enhanced Geometric Reasoning
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Unifying Temporal and Structural Credit Assignment in LLM-Based Multi-Agent Prompt Optimization
Unifying Temporal and Structural Credit Assignment in LLM-Based Multi-Agent Prompt Optimization
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
BORA: Bridging Offline Reinforcement Learning and Online Residual Adaptation for Real-World Dexterous VLA Models
BORA: Bridging Offline Reinforcement Learning and Online Residual Adaptation for Real-World Dexterous VLA Models
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
Automating Low-Risk Code Review at Meta: RADAR, Risk Calibration, and Review Efficiency
Automating Low-Risk Code Review at Meta: RADAR, Risk Calibration, and Review Efficiency
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
HPO: Hysteretic Policy Optimization for Stable and Efficient Training under Sparse-Reward Regime
HPO: Hysteretic Policy Optimization for Stable and Efficient Training under Sparse-Reward Regime
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
What drives performance in molecular MPNNs? An operator-level factorial benchmark
What drives performance in molecular MPNNs? An operator-level factorial benchmark
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Token-Level Generalization in LoRA Adapter Backdoors: Attack Characterization and Behavioral Detection
Token-Level Generalization in LoRA Adapter Backdoors: Attack Characterization and Behavioral Detection
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
CalArena: A Large-Scale Post-Hoc Calibration Benchmark
CalArena: A Large-Scale Post-Hoc Calibration Benchmark
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
iLoRA: Bayesian Low-Rank Adaptation with Latent Interaction Graphs for Microbiome Diagnosis
iLoRA: Bayesian Low-Rank Adaptation with Latent Interaction Graphs for Microbiome Diagnosis
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Dissociative Identity: Language Model Agents Lack Grounding for Reputation Mechanisms
Dissociative Identity: Language Model Agents Lack Grounding for Reputation Mechanisms
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
On Distributional Reinforcement Learning in Chaotic Dynamical Systems
On Distributional Reinforcement Learning in Chaotic Dynamical Systems
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
Neural Network Verification using Partial Multi-Neuron Relaxation
Neural Network Verification using Partial Multi-Neuron Relaxation
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Do Proactive Agents Really Need an LLM to Decide When to Wake and What to Anchor?
Do Proactive Agents Really Need an LLM to Decide When to Wake and What to Anchor?
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Overcoming Forgetting in LLM Fine-Tuning with Evolution Strategies
Overcoming Forgetting in LLM Fine-Tuning with Evolution Strategies
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
DAMEL: Dual-Axis Multi-Expert Learning for Class-Imbalanced Learning
DAMEL: Dual-Axis Multi-Expert Learning for Class-Imbalanced Learning
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
PARCEL: Pool-Anchored Resampling with Conditioned Elastic Queries for Efficient Vision-Language Understanding
PARCEL: Pool-Anchored Resampling with Conditioned Elastic Queries for Efficient Vision-Language Understanding
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used