ArXiv Intelligence

PLAN-S: Bridging Planning with Latent Style Dynamics for Autonomous Driving World Models

Topic · Agent

Beyond Vector Similarity: A Structural Analysis of Graph-Augmented Retrieval for Industrial Knowledge Graphs

Topic · Other

Framing, Judging, Steering: An Assessable Competency Model for Teaching Students to Reason With Generative AI

Topic · Other

The Self-Correction Illusion: LLMs Correct Others but Not Themselves

Topic · Other

Bidirectional Search for Longest Paths: Case for Front-to-Front Heuristics

Topic · Other

Edit-R2: Context-Aware Reinforcement Learning for Multi-Turn Image Editing

Topic · Reinforcement Learning

A Pre-Registered Causal Partition of Self-Consistency Elicitation and Reward Design in RLVR

Topic · Reinforcement Learning

Towards World Models in Biomedical Research

Topic · Reinforcement Learning

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

Topic · Agent

Retry Policy Gradients in Continuous Action Spaces

Topic · Reinforcement Learning

QCFuse: Query-Aware Cache Fusion via Compressed View for Efficient RAG Serving

Topic · Other

Entropy-Based Evaluation of AI Agents: A Lightweight Framework for Measuring Behavioral Patterns

Topic · Agent

Agentic Molecular Recovery via Molecule-Aware Exploration

Topic · Agent

Statistical Priors for Implicit Preferences: Decoupling Skill Selection as a Local Harness in Personal Agents

Topic · Agent

When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents

Topic · Agent

From Risk Classification to Action Plan Remediation: A Guardrail Feedback Driven Framework for LLM Agents

Topic · Agent

Can LLMs Write Correct TLA+ Specifications? Evaluating Natural-Language-to-TLA+ Generation

Topic · Other

TAPO: Tool-Aware Policy Optimization via Credit Transfer for Multimodal Search Agents

Topic · Agent

SubtleMemory: A Benchmark for Fine-Grained Relational Memory Discrimination in Long-Horizon AI Agents

Topic · Agent

Class-Specific Branch Attention for Mitigating Gradient Interference under Class Imbalance

Topic · Other

2026-06-05 · 280 papers