PLAN-S: Bridging Planning with Latent Style Dynamics for Autonomous Driving World Models
PLAN-S: Bridging Planning with Latent Style Dynamics for Autonomous Driving World Models
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Beyond Vector Similarity: A Structural Analysis of Graph-Augmented Retrieval for Industrial Knowledge Graphs
Beyond Vector Similarity: A Structural Analysis of Graph-Augmented Retrieval for Industrial Knowledge Graphs
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Framing, Judging, Steering: An Assessable Competency Model for Teach-ing Students to Reason With Generative AI
Framing, Judging, Steering: An Assessable Competency Model for Teach-ing Students to Reason With Generative AI
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
The Self-Correction Illusion: LLMs Correct Others but Not Themselves
The Self-Correction Illusion: LLMs Correct Others but Not Themselves
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Bidirectional Search for Longest Paths: Case for Front-to-Front Heuristics
Bidirectional Search for Longest Paths: Case for Front-to-Front Heuristics
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Edit-R2: Context-Aware Reinforcement Learning for Multi-Turn Image Editing
Edit-R2: Context-Aware Reinforcement Learning for Multi-Turn Image Editing
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
A Pre-Registered Causal Partition of Self-Consistency Elicitation and Reward Design in RLVR
A Pre-Registered Causal Partition of Self-Consistency Elicitation and Reward Design in RLVR
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
Towards World Models in Biomedical Research
Towards World Models in Biomedical Research
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts
Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Retry Policy Gradients in Continuous Action Spaces
Retry Policy Gradients in Continuous Action Spaces
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
QCFuse: Query-Aware Cache Fusion via Compressed View for Efficient RAG Serving
QCFuse: Query-Aware Cache Fusion via Compressed View for Efficient RAG Serving
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Entropy-Based Evaluation of AI Agents: A Lightweight Framework for Measuring Behavioral Patterns
Entropy-Based Evaluation of AI Agents: A Lightweight Framework for Measuring Behavioral Patterns
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Agentic Molecular Recovery via Molecule-Aware Exploration
Agentic Molecular Recovery via Molecule-Aware Exploration
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Statistical Priors for Implicit Preferences: Decoupling Skill Selection as a Local Harness in Personal Agents
Statistical Priors for Implicit Preferences: Decoupling Skill Selection as a Local Harness in Personal Agents
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents
When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
From Risk Classification to Action Plan Remediation: A Guardrail Feedback Driven Framework for LLM Agents
From Risk Classification to Action Plan Remediation: A Guardrail Feedback Driven Framework for LLM Agents
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Can LLMs Write Correct TLA+ Specifications? Evaluating Natural-Language-to-TLA+ Generation
Can LLMs Write Correct TLA+ Specifications? Evaluating Natural-Language-to-TLA+ Generation
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
TAPO: Tool-Aware Policy Optimization via Credit Transfer for Multimodal Search Agents
TAPO: Tool-Aware Policy Optimization via Credit Transfer for Multimodal Search Agents
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
SubtleMemory: A Benchmark for Fine-Grained Relational Memory Discrimination in Long-Horizon AI Agents
SubtleMemory: A Benchmark for Fine-Grained Relational Memory Discrimination in Long-Horizon AI Agents
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Class-Specific Branch Attention for Mitigating Gradient Interference under Class Imbalance
Class-Specific Branch Attention for Mitigating Gradient Interference under Class Imbalance
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used