Evolve as a Team: Collaborative Self-Evolution for LLM-based Multi-Agent Systems
Evolve as a Team: Collaborative Self-Evolution for LLM-based Multi-Agent Systems
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Hista and Numca: Estimate State Value Effectively for LLM Reinforcement Learning
Hista and Numca: Estimate State Value Effectively for LLM Reinforcement Learning
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
Energy-Aware NECO for Single-Pass Pixel-wise Out-of-Distribution Detection in Semantic Segmentation
Energy-Aware NECO for Single-Pass Pixel-wise Out-of-Distribution Detection in Semantic Segmentation
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
A unified deeplearning framework for contrast-phase-specific virtual monochromatic imaging
A unified deeplearning framework for contrast-phase-specific virtual monochromatic imaging
Topic · 机器学习框架
仅有原始 MD
Quick Read
LLM failed, fallback used
Multi-Legal-Bench: Evaluating LLMs on Legal Reasoning Across Jurisdictions, Languages, and Legal Traditions
Multi-Legal-Bench: Evaluating LLMs on Legal Reasoning Across Jurisdictions, Languages, and Legal Traditions
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
The Little Book of Generative AI Foundations: An Intuitive Mathematical Primer
The Little Book of Generative AI Foundations: An Intuitive Mathematical Primer
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Teaching Language Models to Check Grounded Claim Factuality with Human Test-Taking Strategies
Teaching Language Models to Check Grounded Claim Factuality with Human Test-Taking Strategies
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Personalized Turn-Level User Conversation Satisfaction Benchmark
Personalized Turn-Level User Conversation Satisfaction Benchmark
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
From Prompts to Context: An Ontology-Driven Framework for Human-Generative AI Collaboration
From Prompts to Context: An Ontology-Driven Framework for Human-Generative AI Collaboration
Topic · 机器学习框架
仅有原始 MD
Quick Read
LLM failed, fallback used
EviLink: Multi-Path Schema Linking with Uncertainty-Guided Evidence Acquisition for Large-Scale Text-to-SQL
EviLink: Multi-Path Schema Linking with Uncertainty-Guided Evidence Acquisition for Large-Scale Text-to-SQL
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Opir: Efficient Multi-Task Safety Classification for Toxicity, Jailbreaks, Hate Speech, and Harmful Content
Opir: Efficient Multi-Task Safety Classification for Toxicity, Jailbreaks, Hate Speech, and Harmful Content
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
OccamToken: Efficient VLM Inference with Training-Free and Budget-Adaptive Token Pruning
OccamToken: Efficient VLM Inference with Training-Free and Budget-Adaptive Token Pruning
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
The Sample Complexity of Multiclass and Sparse Contextual Bandits
The Sample Complexity of Multiclass and Sparse Contextual Bandits
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Predicting Causal Effects from Natural Language Queries using Structured Representations
Predicting Causal Effects from Natural Language Queries using Structured Representations
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Entity-Collision: A Stratified Protocol for Attributing Retrieval Lift in Agent Memory
Entity-Collision: A Stratified Protocol for Attributing Retrieval Lift in Agent Memory
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
COMET: Concept Space Dissection of the Modality Gap in Audio-Text Multimodal Contrastive Embeddings
COMET: Concept Space Dissection of the Modality Gap in Audio-Text Multimodal Contrastive Embeddings
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
DLM-SWAI: Steering Diffusion Language Models Before They Unmask
DLM-SWAI: Steering Diffusion Language Models Before They Unmask
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Learning Context-Conditioned Predicate Semantics via Prototype Feedback
Learning Context-Conditioned Predicate Semantics via Prototype Feedback
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Training Deliberative Monitors for Black-Box Scheming Detection
Training Deliberative Monitors for Black-Box Scheming Detection
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Brain-IT-VQA: From Brain Signals to Answers
Brain-IT-VQA: From Brain Signals to Answers
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used