Parallax: Parameterized Local Linear Attention for Language Modeling
Parallax: Parameterized Local Linear Attention for Language Modeling
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
CA-AC-MPC: CUDA-Accelerated Actor-Critic Model Predictive Control
CA-AC-MPC: CUDA-Accelerated Actor-Critic Model Predictive Control
Topic · 具身智能
仅有原始 MD
Quick Read
LLM failed, fallback used
Unveiling Multi-regime Patterns in SciML: Distinct Failure Modes and Regime-specific Optimization
Unveiling Multi-regime Patterns in SciML: Distinct Failure Modes and Regime-specific Optimization
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Real-rootedness of the Poincaré polynomials of $\overline{\mathcal M}_{0,n}$: an AI-assisted proof
Real-rootedness of the Poincaré polynomials of $\overline{\mathcal M}_{0,n}$: an AI-assisted proof
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
SafeRx-Agent: A Knowledge-Grounded Multi-Agent Framework for Safe and Explainable Medication Recommendation
SafeRx-Agent: A Knowledge-Grounded Multi-Agent Framework for Safe and Explainable Medication Recommendation
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Toward User Preference Alignment in LLM Recommendation via Explicit Context Feedback
Toward User Preference Alignment in LLM Recommendation via Explicit Context Feedback
Topic · 大模型后训练
仅有原始 MD
Quick Read
LLM failed, fallback used
Multi-Resolution End-to-End Deep Neural Network for Optimizing Latency-Accuracy Tradeoff in Autonomous Driving
Multi-Resolution End-to-End Deep Neural Network for Optimizing Latency-Accuracy Tradeoff in Autonomous Driving
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
When and How Long? The Readout-Mediator Angle in Temporal Reasoning
When and How Long? The Readout-Mediator Angle in Temporal Reasoning
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
A Minimal Bifurcation Model of Load Imbalance in a Softmax Mixture-of-Experts Router
A Minimal Bifurcation Model of Load Imbalance in a Softmax Mixture-of-Experts Router
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
unix-ctf: Procedural Environments for Unix-Competence Reinforcement Learning
unix-ctf: Procedural Environments for Unix-Competence Reinforcement Learning
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
GEO-Bench: Benchmarking Ranking Manipulation in Generative Engine Optimization
GEO-Bench: Benchmarking Ranking Manipulation in Generative Engine Optimization
Topic · 具身智能
仅有原始 MD
Quick Read
LLM failed, fallback used
OISD: On-Policy Internal Self-Distillation of Language Models
OISD: On-Policy Internal Self-Distillation of Language Models
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Same Question, Different Source, Different Answer: Auditing Source-Dependence in Medical Multi-Source RAG
Same Question, Different Source, Different Answer: Auditing Source-Dependence in Medical Multi-Source RAG
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Structured Prompt Optimization Meets Reinforcement Learning for Global and Local Interpretability over Complex Text
Structured Prompt Optimization Meets Reinforcement Learning for Global and Local Interpretability over Complex Text
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
SCDBench: A Benchmark for LLM-Based Smart Contract Decompilers
SCDBench: A Benchmark for LLM-Based Smart Contract Decompilers
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Return-to-Go Is More Than a Number: Q-Guided Alignment for Return-Conditioned Supervised Learning
Return-to-Go Is More Than a Number: Q-Guided Alignment for Return-Conditioned Supervised Learning
Topic · 大模型后训练
仅有原始 MD
Quick Read
LLM failed, fallback used
Label-Free Reinforcement Learning via Cross-Model Entropy
Label-Free Reinforcement Learning via Cross-Model Entropy
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used
LoRe: Adaptive Interaction-Evaluation Routing with Per-Step Interaction Budgets for Iterative Graph Solvers
LoRe: Adaptive Interaction-Evaluation Routing with Per-Step Interaction Budgets for Iterative Graph Solvers
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
FormInv: A Measurement Protocol for Semantic Invariance in Mathematical Reasoning Benchmarks
FormInv: A Measurement Protocol for Semantic Invariance in Mathematical Reasoning Benchmarks
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Measuring Real-World Prompt Injection Attacks in LLM-based Resume Screening
Measuring Real-World Prompt Injection Attacks in LLM-based Resume Screening
Topic · 强化学习
仅有原始 MD
Quick Read
LLM failed, fallback used