Research Stream

2026-05-29 · 354 papers

Showing 161-180 of 354

How LoRA Remembers? A Parametric Memory Law for LLM Finetuning

Topic · Memory

Same Evidence, Different Answers: Canonical-Context On-Policy Distillation for Multi-Turn Language Models

Topic · Other

Reinforcement Learning with Robust Rubric Rewards

Topic · Reinforcement Learning

Do Language Models Track Entities Across State Changes?

Topic · Other

Beyond 3D VQAs: Injecting 3D Spatial Priors into Vision-Language Models for Enhanced Geometric Reasoning

Topic · Other

Unifying Temporal and Structural Credit Assignment in LLM-Based Multi-Agent Prompt Optimization

Topic · Agent

BORA: Bridging Offline Reinforcement Learning and Online Residual Adaptation for Real-World Dexterous VLA Models

Topic · Reinforcement Learning

Automating Low-Risk Code Review at Meta: RADAR, Risk Calibration, and Review Efficiency

Topic · Other

HPO: Hysteretic Policy Optimization for Stable and Efficient Training under Sparse-Reward Regime

Topic · Other

What drives performance in molecular MPNNs? An operator-level factorial benchmark

Topic · Other

Token-Level Generalization in LoRA Adapter Backdoors: Attack Characterization and Behavioral Detection

Topic · Other

CalArena: A Large-Scale Post-Hoc Calibration Benchmark

Topic · Other

iLoRA: Bayesian Low-Rank Adaptation with Latent Interaction Graphs for Microbiome Diagnosis

Topic · Other

Dissociative Identity: Language Model Agents Lack Grounding for Reputation Mechanisms

Topic · Agent

On Distributional Reinforcement Learning in Chaotic Dynamical Systems

Topic · Reinforcement Learning

Neural Network Verification using Partial Multi-Neuron Relaxation

Topic · Other

Do Proactive Agents Really Need an LLM to Decide When to Wake and What to Anchor?

Topic · Agent

Overcoming Forgetting in LLM Fine-Tuning with Evolution Strategies

Topic · Other

DAMEL: Dual-Axis Multi-Expert Learning for Class-Imbalanced Learning

Topic · Other

PARCEL: Pool-Anchored Resampling with Conditioned Elastic Queries for Efficient Vision-Language Understanding

Topic · Other