Machine Learning

Authors and titles for recent submissions

[ total of 746 entries: 1-25 | 26-50 | 51-75 | 76-100 | ... | 726-746 ]

Fri, 4 Apr 2025 (showing first 25 of 125 entries)

[1]  arXiv:2504.02827 [pdf, other]
Title: On Vanishing Variance in Transformer Length Generalization
Authors: Ruining Li, Gabrijel Boduljak, Jensen (Jinghao) Zhou
Comments: Project page: this https URL. The first two authors contributed equally to this work.
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[2]  arXiv:2504.02797 [pdf, other]
Title: Spline-based Transformers
Journal-ref: European Conference on Computer Vision (ECCV 2024)
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[3]  arXiv:2504.02781 [pdf, other]
Title: Towards Green AI-Native Networks: Evaluation of Neural Circuit Policy for Estimating Energy Consumption of Base Stations
Comments: 15 pages, 9 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Signal Processing (eess.SP)
[4]  arXiv:2504.02698 [pdf, other]
Title: SCMPPI: Supervised Contrastive Multimodal Framework for Predicting Protein-Protein Interactions
Comments: 19 pages, 11 figures, conference
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
[5]  arXiv:2504.02692 [pdf, other]
Title: GPTQv2: Efficient Finetuning-Free Quantization for Asymmetric Calibration
Subjects: Machine Learning (cs.LG)
[6]  arXiv:2504.02685 [pdf, other]
Title: STOOD-X methodology: using statistical nonparametric test for OOD Detection Large-Scale datasets enhanced with explainability
Comments: 18 pages, 7 Figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (stat.ML)
[7]  arXiv:2504.02667 [pdf, other]
Title: Compositionality Unlocks Deep Interpretable Models
Subjects: Machine Learning (cs.LG)
[8]  arXiv:2504.02666 [pdf, other]
Title: BECAME: BayEsian Continual Learning with Adaptive Model MErging
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[9]  arXiv:2504.02662 [pdf, ps, other]
Title: Integrating Human Knowledge Through Action Masking in Reinforcement Learning for Operations Research
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[10]  arXiv:2504.02658 [pdf, other]
Title: MiLo: Efficient Quantized MoE Inference with Mixture of Low-Rank Compensators
Subjects: Machine Learning (cs.LG)
[11]  arXiv:2504.02646 [pdf, other]
Title: Prompt Optimization with Logged Bandit Data
Comments: Preprint
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (stat.ML)
[12]  arXiv:2504.02644 [pdf, ps, other]
Title: Solving the Paint Shop Problem with Flexible Management of Multi-Lane Buffers Using Reinforcement Learning and Action Masking
Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[13]  arXiv:2504.02639 [pdf, other]
Title: Reservoir Computing: A New Paradigm for Neural Networks
Authors: Felix Grezes
Subjects: Machine Learning (cs.LG)
[14]  arXiv:2504.02630 [pdf, other]
Title: Grammar-based Ordinary Differential Equation Discovery
Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Symbolic Computation (cs.SC)
[15]  arXiv:2504.02620 [pdf, other]
Title: Efficient Model Editing with Task-Localized Sparse Fine-tuning
Comments: Accepted ICLR 2025 - this https URL
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[16]  arXiv:2504.02618 [pdf, other]
Title: Variational Online Mirror Descent for Robust Learning in Schrödinger Bridge
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[17]  arXiv:2504.02607 [pdf, other]
Title: Learning Geometrically-Informed Lyapunov Functions with Deep Diffeomorphic RBF Networks
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Systems and Control (eess.SY)
[18]  arXiv:2504.02606 [pdf, other]
Title: Improving Counterfactual Truthfulness for Molecular Property Prediction through Uncertainty Quantification
Comments: 24 pages, 5 figures, 4 tables, accepted at the 3rd xAI World Conference
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[19]  arXiv:2504.02591 [pdf, other]
Title: State-Space Model Inspired Multiple-Input Multiple-Output Spiking Neurons
Comments: 9 pages, 3 figures, 6 tables, conference - 2025 Neuro Inspired Computational Elements (NICE)
Journal-ref: 2025 Neuro Inspired Computational Elements (NICE)
Subjects: Machine Learning (cs.LG)
[20]  arXiv:2504.02589 [pdf, other]
Title: Knowledge Graph Completion with Mixed Geometry Tensor Factorization
Comments: Accepted to AISTATS 2025
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (stat.ML)
[21]  arXiv:2504.02587 [pdf, other]
Title: Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme
Comments: Code is public and available at: this https URL
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[22]  arXiv:2504.02546 [pdf, other]
Title: GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[23]  arXiv:2504.02544 [pdf, other]
Title: Fourier Sliced-Wasserstein Embedding for Multisets and Measures
Authors: Tal Amir, Nadav Dym
Comments: ICLR 2025 camera-ready. arXiv admin note: substantial text overlap with arXiv:2405.16519
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[24]  arXiv:2504.02543 [pdf, other]
Title: Probabilistic Pontryagin's Maximum Principle for Continuous-Time Model-Based Reinforcement Learning
Comments: 7 pages, 2 figures, 2 tables
Subjects: Machine Learning (cs.LG)
[25]  arXiv:2504.02507 [pdf, other]
Title: ZClip: Adaptive Spike Mitigation for LLM Pre-Training
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)