Research Stream

2026-06-05 · 280 篇

显示 241-260 / 280
筛选与排序 默认折叠,需要时再展开,当前条件会直接显示在右侧。
日期 2026-06-05
清空
快捷日期
更多筛选

Willing but Unable: Separating Refusal from Capability in Code LLMs via Abliteration

Willing but Unable: Separating Refusal from Capability in Code LLMs via Abliteration

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

VASO: Formally Verifiable Self-Evolving Skills for Physical AI Agents

VASO: Formally Verifiable Self-Evolving Skills for Physical AI Agents

Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used

Human oversight of agentic systems in practice: Examining the oversight work, challenges, and heuristics of developers using software agents

Human oversight of agentic systems in practice: Examining the oversight work, challenges, and heuristics of developers using software agents

Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used

Can AI Refute Economic Theory? Evidence from Beyond the Knowledge Cutoff

Can AI Refute Economic Theory? Evidence from Beyond the Knowledge Cutoff

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Pattern Selectivity is Not Task-Causal Structure: A Cross-Architecture Mechanistic Study of Composed-Task Circuits in 1B-Class Language Models

Pattern Selectivity is Not Task-Causal Structure: A Cross-Architecture Mechanistic Study of Composed-Task Circuits in 1B-Class Language Models

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Three-Dimensional Retinal Microvasculature Restoration in OCT Angiography

Three-Dimensional Retinal Microvasculature Restoration in OCT Angiography

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

A Taxonomy of Runtime Faults in Model Context Protocol Servers

A Taxonomy of Runtime Faults in Model Context Protocol Servers

Topic · 机器学习框架
仅有原始 MD
Quick Read
LLM failed, fallback used

A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing

A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

The Invisible Hand of Physics: When Video Diffusion Models Know More Than They Show

The Invisible Hand of Physics: When Video Diffusion Models Know More Than They Show

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Gradient descent at the Edge of Stability: free energy model and kinetic description of the two-layer network

Gradient descent at the Edge of Stability: free energy model and kinetic description of the two-layer network

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

LoRi: Low-Rank Distillation for Implicit Reasoning

LoRi: Low-Rank Distillation for Implicit Reasoning

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Statistically Reliable LLM-Based Ranking Evaluation via Prediction-Powered Inference

Statistically Reliable LLM-Based Ranking Evaluation via Prediction-Powered Inference

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents

Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents

Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used

Do Models Share Safety Representations? Cross-Model Steering for Safe Visual Generation

Do Models Share Safety Representations? Cross-Model Steering for Safe Visual Generation

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Personal AI Agent for Camera Roll VQA

Personal AI Agent for Camera Roll VQA

Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used

Policy-Conditioned Counterfactual Credit for Verifiable Reinforcement Learning of Long-Horizon Language Agents

Policy-Conditioned Counterfactual Credit for Verifiable Reinforcement Learning of Long-Horizon Language Agents

Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used

X-Band UAV-enabled Integrated Sensing and Communications for Vehicular Networks

X-Band UAV-enabled Integrated Sensing and Communications for Vehicular Networks

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

NIV: Neural Axis Variations for Variable Font Generation

NIV: Neural Axis Variations for Variable Font Generation

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

From Attack Simulation to SIEM Rule: Deterministic Detection-as-Code Synthesis with Probe-Level Traceability

From Attack Simulation to SIEM Rule: Deterministic Detection-as-Code Synthesis with Probe-Level Traceability

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Search-Time Contamination in Deep Research Agents: Measuring Performance Inflation in Public Benchmark Evaluation

Search-Time Contamination in Deep Research Agents: Measuring Performance Inflation in Public Benchmark Evaluation

Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
PrevPage 13/14Next