Do Physics Foundation Models Learn Generalizable Physics? A Bias-Aware Benchmark Across Physical Regimes and Distribution Shifts
Do Physics Foundation Models Learn Generalizable Physics? A Bias-Aware Benchmark Across Physical Regimes and Distribution Shifts
Topic · 大模型底座
仅有原始 MD
Quick Read
LLM failed, fallback used
LoopFM: Learning frOm HistOrical RePresentations of Foundation Model for Recommendation
LoopFM: Learning frOm HistOrical RePresentations of Foundation Model for Recommendation
Topic · 大模型底座
仅有原始 MD
Quick Read
LLM failed, fallback used
Code-QA-Bench: Separating Code Reasoning from Documentation Memorization in Repository-Level QA
Code-QA-Bench: Separating Code Reasoning from Documentation Memorization in Repository-Level QA
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Causal Label Recovery in Payment Networks
Causal Label Recovery in Payment Networks
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Compute Allocation in Evolutionary Search: From Depth-Breadth to Multi-Armed Bandits
Compute Allocation in Evolutionary Search: From Depth-Breadth to Multi-Armed Bandits
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
KLAS: Using Similarity to Stitch Neural Networks for Improved Accuracy-Efficiency Tradeoffs
KLAS: Using Similarity to Stitch Neural Networks for Improved Accuracy-Efficiency Tradeoffs
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
DynSess: Dynamic Session-Level Evaluation and Optimization Framework for Role-Playing Agents
DynSess: Dynamic Session-Level Evaluation and Optimization Framework for Role-Playing Agents
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Extreme dynamic symmetry enables omnidirectional and multifunctional robots
Extreme dynamic symmetry enables omnidirectional and multifunctional robots
Topic · 具身智能
仅有原始 MD
Quick Read
LLM failed, fallback used
OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources
OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Wait! There's a Way Out: A Decision Mechanism for Forecasting Conversational Derailment
Wait! There's a Way Out: A Decision Mechanism for Forecasting Conversational Derailment
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
BlockBatch: Multi-Scale Consensus Decoding for Efficient Diffusion Language Model Inference
BlockBatch: Multi-Scale Consensus Decoding for Efficient Diffusion Language Model Inference
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Toward Ethical Facial Age Estimation: A Generalized Zero-Shot Benchmark Without Training on Children's Data
Toward Ethical Facial Age Estimation: A Generalized Zero-Shot Benchmark Without Training on Children's Data
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Relevance as a Vulnerability: How Web Retrieval Degrades Safety Alignment in LLM Agents
Relevance as a Vulnerability: How Web Retrieval Degrades Safety Alignment in LLM Agents
Topic · Agent
仅有原始 MD
Quick Read
LLM failed, fallback used
Stochastic Lifting for Generating Trajectories of Stochastic Physical Systems
Stochastic Lifting for Generating Trajectories of Stochastic Physical Systems
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Influence-Guided Symbolic Regression: Scientific Discovery via LLM-Driven Equation Search with Granular Feedback
Influence-Guided Symbolic Regression: Scientific Discovery via LLM-Driven Equation Search with Granular Feedback
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
TIMEGATE: Sustainable Time-Boxed Promotion Gates for Continual ML Adaptation Under Resource Constraints
TIMEGATE: Sustainable Time-Boxed Promotion Gates for Continual ML Adaptation Under Resource Constraints
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Sustainable Metal-Organic Framework Water Harvesters in the Artificial Intelligence Era
Sustainable Metal-Organic Framework Water Harvesters in the Artificial Intelligence Era
Topic · 机器学习框架
仅有原始 MD
Quick Read
LLM failed, fallback used
UA-Legal-Bench: A Benchmark for Evaluating Large Language Models on Ukrainian Legal Reasoning
UA-Legal-Bench: A Benchmark for Evaluating Large Language Models on Ukrainian Legal Reasoning
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Domain-Informed Representation for Evolutionary Sieving in Integral and Module Lattices
Domain-Informed Representation for Evolutionary Sieving in Integral and Module Lattices
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used
Evolutionary Refinement of Generative Graph Topologies: A Hybrid WGAN-GA Approach
Evolutionary Refinement of Generative Graph Topologies: A Hybrid WGAN-GA Approach
Topic · 其他
仅有原始 MD
Quick Read
LLM failed, fallback used