Research Stream

2026-05-29 · 354 篇

显示 1-20 / 354
筛选与排序 默认折叠,需要时再展开,当前条件会直接显示在右侧。
日期 2026-05-29
清空
快捷日期
更多筛选

Physics Is All You Need? A Case Study in Physicist-Supervised AI Development of Scientific Software

Physics Is All You Need? A Case Study in Physicist-Supervised AI Development of Scientific Software

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

SchGen: PCB Schematic Generation with Semantic-Grounded Code Representations

SchGen: PCB Schematic Generation with Semantic-Grounded Code Representations

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Tiny but Trusted: Efficient Vision-Language Reasoning for Time-Series Anomaly Detection

Tiny but Trusted: Efficient Vision-Language Reasoning for Time-Series Anomaly Detection

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Locally Coherent, Globally Incoherent: Bounding Compositional Incoherence in Multi-Component LLM Agents

Locally Coherent, Globally Incoherent: Bounding Compositional Incoherence in Multi-Component LLM Agents

Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used

Demystifying Data Organization for Enhanced LLM Training

Demystifying Data Organization for Enhanced LLM Training

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

MIRA: Mid-training Rubric Anchoring for Source-Aware Data Selection

MIRA: Mid-training Rubric Anchoring for Source-Aware Data Selection

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

ProjectionBench: Evaluating Scientific Hypothesis Generation in LLMs Under Progressive Information Disclosure

ProjectionBench: Evaluating Scientific Hypothesis Generation in LLMs Under Progressive Information Disclosure

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

mcp-proto-okn: Natural-language access to open scientific knowledge graphs through the Model Context Protocol

mcp-proto-okn: Natural-language access to open scientific knowledge graphs through the Model Context Protocol

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

When Should Models Change Their Minds? Contextual Belief Management in Large Language Models

When Should Models Change Their Minds? Contextual Belief Management in Large Language Models

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Persona Conditioning of Brand Recommendations in Retrieval-Augmented Commercial Chat: A Prominence-Stratified Cross-Provider Audit

Persona Conditioning of Brand Recommendations in Retrieval-Augmented Commercial Chat: A Prominence-Stratified Cross-Provider Audit

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Double-Edged Sword or Sharp Tool? Designing and Evaluating Triadic LLM-Teacher Collaboration for K-12 Writing at Scale

Double-Edged Sword or Sharp Tool? Designing and Evaluating Triadic LLM-Teacher Collaboration for K-12 Writing at Scale

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Modularizing Educational LLM-Agency for Fostering Responsible Learning Assistance

Modularizing Educational LLM-Agency for Fostering Responsible Learning Assistance

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

BioRefusalAudit: Auditing Biosecurity Refusal Depth Using General and Domain-Fine-Tuned Sparse Autoencoders

BioRefusalAudit: Auditing Biosecurity Refusal Depth Using General and Domain-Fine-Tuned Sparse Autoencoders

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Meta-Cognitive Memory Policy Optimization for Long-Horizon LLM Agents

Meta-Cognitive Memory Policy Optimization for Long-Horizon LLM Agents

Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used

Temporal Stability and Few-Shot Prompting in Math Task Assessment

Temporal Stability and Few-Shot Prompting in Math Task Assessment

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

Anchorless Diversification for Parallel LLM Ideation

Anchorless Diversification for Parallel LLM Ideation

Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used

AgentSchool: An LLM-Powered Multi-Agent Simulation for Education

AgentSchool: An LLM-Powered Multi-Agent Simulation for Education

Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used

Enhancing Multi-Agent Communication through Attention Steering with Context Relevance

Enhancing Multi-Agent Communication through Attention Steering with Context Relevance

Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used

VLA-Trace: Diagnosing Vision-Language-Action Models through Representation and Behavior Tracing

VLA-Trace: Diagnosing Vision-Language-Action Models through Representation and Behavior Tracing

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used

PokerSkill: LLMs Can Play Expert-Level Poker without Training or Solvers

PokerSkill: LLMs Can Play Expert-Level Poker without Training or Solvers

Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Page 1/18Next