ArXiv Intelligence

Evolve as a Team: Collaborative Self-Evolution for LLM-based Multi-Agent Systems

Topic · Agent

Hista and Numca: Estimate State Value Effectively for LLM Reinforcement Learning

Topic · Reinforcement Learning

Energy-Aware NECO for Single-Pass Pixel-wise Out-of-Distribution Detection in Semantic Segmentation

Topic · Other

A unified deeplearning framework for contrast-phase-specific virtual monochromatic imaging

Topic · Machine Learning Frameworks

Multi-Legal-Bench: Evaluating LLMs on Legal Reasoning Across Jurisdictions, Languages, and Legal Traditions

Topic · Other

The Little Book of Generative AI Foundations: An Intuitive Mathematical Primer

Topic · Other

Teaching Language Models to Check Grounded Claim Factuality with Human Test-Taking Strategies

Topic · Other

Personalized Turn-Level User Conversation Satisfaction Benchmark

Topic · Other

From Prompts to Context: An Ontology-Driven Framework for Human-Generative AI Collaboration

Topic · Machine Learning Frameworks

EviLink: Multi-Path Schema Linking with Uncertainty-Guided Evidence Acquisition for Large-Scale Text-to-SQL

Topic · Other

Opir: Efficient Multi-Task Safety Classification for Toxicity, Jailbreaks, Hate Speech, and Harmful Content

Topic · Other

OccamToken: Efficient VLM Inference with Training-Free and Budget-Adaptive Token Pruning

Topic · Other

The Sample Complexity of Multiclass and Sparse Contextual Bandits

Topic · Other

Predicting Causal Effects from Natural Language Queries using Structured Representations

Topic · Other

Entity-Collision: A Stratified Protocol for Attributing Retrieval Lift in Agent Memory

Topic · Agent

COMET: Concept Space Dissection of the Modality Gap in Audio-Text Multimodal Contrastive Embeddings

Topic · Other

DLM-SWAI: Steering Diffusion Language Models Before They Unmask

Topic · Other

Learning Context-Conditioned Predicate Semantics via Prototype Feedback

Topic · Other

Training Deliberative Monitors for Black-Box Scheming Detection

Topic · Other

Brain-IT-VQA: From Brain Signals to Answers

Topic · Other

2026-05-29 · 354 papers