Machine Learning

Authors and titles for recent submissions

[ total of 746 entries: 1-25 | 26-50 | 51-75 | 76-100 | ... | 726-746 ]
[ showing 25 entries per page: fewer | more | all ]

Fri, 4 Apr 2025 (showing first 25 of 125 entries)

[1] arXiv:2504.02827 [pdf, other]: Title: On Vanishing Variance in Transformer Length Generalization

Authors: Ruining Li, Gabrijel Boduljak, Jensen (Jinghao) Zhou

Comments: Project page: this https URL The first two authors contributed equally to this work

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[2] arXiv:2504.02797 [pdf, other]: Title: Spline-based Transformers

Authors: Prashanth Chandran, Agon Serifi, Markus Gross, Moritz Bächer

Journal-ref: European Conference on Computer Vision (ECCV 2024)

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[3] arXiv:2504.02781 [pdf, other]: Title: Towards Green AI-Native Networks: Evaluation of Neural Circuit Policy for Estimating Energy Consumption of Base Stations

Authors: Selim Ickin, Shruti Bothe, Aman Raparia, Nitin Khanna, Erik Sanders

Comments: 15 pages, 9 figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Signal Processing (eess.SP)
[4] arXiv:2504.02698 [pdf, other]: Title: SCMPPI: Supervised Contrastive Multimodal Framework for Predicting Protein-Protein Interactions

Authors: Shengrui XU, Tianchi Lu, Zikun Wang, Jixiu Zhai, Jingwan Wang

Comments: 19 pages,11 figures,conference

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Quantitative Methods (q-bio.QM)
[5] arXiv:2504.02692 [pdf, other]: Title: GPTQv2: Efficient Finetuning-Free Quantization for Asymmetric Calibration

Authors: Yuhang Li, Ruokai Yin, Donghyun Lee, Shiting Xiao, Priyadarshini Panda

Subjects: Machine Learning (cs.LG)
[6] arXiv:2504.02685 [pdf, other]: Title: STOOD-X methodology: using statistical nonparametric test for OOD Detection Large-Scale datasets enhanced with explainability

Authors: Iván Sevillano-García, Julián Luengo, Francisco Herrera

Comments: 18 pages, 7 Figures

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC); Machine Learning (stat.ML)
[7] arXiv:2504.02667 [pdf, other]: Title: Compositionality Unlocks Deep Interpretable Models

Authors: Thomas Dooms, Ward Gauderis, Geraint A. Wiggins, Jose Oramas

Subjects: Machine Learning (cs.LG)
[8] arXiv:2504.02666 [pdf, other]: Title: BECAME: BayEsian Continual Learning with Adaptive Model MErging

Authors: Mei Li, Yuxiang Lu, Qinyan Dai, Suizhi Huang, Yue Ding, Hongtao Lu

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
[9] arXiv:2504.02662 [pdf, ps, other]: Title: Integrating Human Knowledge Through Action Masking in Reinforcement Learning for Operations Research

Authors: Mirko Stappert, Bernhard Lutz, Niklas Goby, Dirk Neumann

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[10] arXiv:2504.02658 [pdf, other]: Title: MiLo: Efficient Quantized MoE Inference with Mixture of Low-Rank Compensators

Authors: Beichen Huang, Yueming Yuan, Zelei Shao, Minjia Zhang

Subjects: Machine Learning (cs.LG)
[11] arXiv:2504.02646 [pdf, other]: Title: Prompt Optimization with Logged Bandit Data

Authors: Haruka Kiyohara, Daniel Yiming Cao, Yuta Saito, Thorsten Joachims

Comments: Preprint

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (stat.ML)
[12] arXiv:2504.02644 [pdf, ps, other]: Title: Solving the Paint Shop Problem with Flexible Management of Multi-Lane Buffers Using Reinforcement Learning and Action Masking

Authors: Mirko Stappert, Bernhard Lutz, Janis Brammer, Dirk Neumann

Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
[13] arXiv:2504.02639 [pdf, other]: Title: Reservoir Computing: A New Paradigm for Neural Networks

Authors: Felix Grezes

Subjects: Machine Learning (cs.LG)
[14] arXiv:2504.02630 [pdf, other]: Title: Grammar-based Ordinary Differential Equation Discovery

Authors: Karin L. Yu, Eleni Chatzi, Georgios Kissas

Subjects: Machine Learning (cs.LG); Computational Engineering, Finance, and Science (cs.CE); Symbolic Computation (cs.SC)
[15] arXiv:2504.02620 [pdf, other]: Title: Efficient Model Editing with Task-Localized Sparse Fine-tuning

Authors: Leonardo Iurada, Marco Ciccone, Tatiana Tommasi

Comments: Accepted ICLR 2025 - this https URL

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[16] arXiv:2504.02618 [pdf, other]: Title: Variational Online Mirror Descent for Robust Learning in Schrödinger Bridge

Authors: Dong-Sig Han, Jaein Kim, Hee Bin Yoo, Byoung-Tak Zhang

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
[17] arXiv:2504.02607 [pdf, other]: Title: Learning Geometrically-Informed Lyapunov Functions with Deep Diffeomorphic RBF Networks

Authors: Samuel Tesfazgi, Leonhard Sprandl, Sandra Hirche

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Systems and Control (eess.SY)
[18] arXiv:2504.02606 [pdf, other]: Title: Improving Counterfactual Truthfulness for Molecular Property Prediction through Uncertainty Quantification

Authors: Jonas Teufel, Annika Leinweber, Pascal Friederich

Comments: 24 pages, 5 figures, 4 tabels, accepted at the 3rd xAI World Conference

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[19] arXiv:2504.02591 [pdf, other]: Title: State-Space Model Inspired Multiple-Input Multiple-Output Spiking Neurons

Authors: Sanja Karilanova, Subhrakanti Dey, Ayça Özçelikkale

Comments: 9 pages, 3 figures, 6 tables, conference - 2025 Neuro Inspired Computational Elements (NICE)

Journal-ref: 2025 Neuro Inspired Computational Elements (NICE)

Subjects: Machine Learning (cs.LG)
[20] arXiv:2504.02589 [pdf, other]: Title: Knowledge Graph Completion with Mixed Geometry Tensor Factorization

Authors: Viacheslav Yusupov, Maxim Rakhuba, Evgeny Frolov

Comments: Accepted to AISTATS 2025

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (stat.ML)
[21] arXiv:2504.02587 [pdf, other]: Title: Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Authors: Yan Ma, Steffi Chern, Xuyang Shen, Yiran Zhong, Pengfei Liu

Comments: Code is public and available at: this https URL

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
[22] arXiv:2504.02546 [pdf, other]: Title: GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning

Authors: Xiangxiang Chu, Hailang Huang, Xiao Zhang, Fei Wei, Yong Wang

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[23] arXiv:2504.02544 [pdf, other]: Title: Fourier Sliced-Wasserstein Embedding for Multisets and Measures

Authors: Tal Amir, Nadav Dym

Comments: ICLR 2025 camera-ready. arXiv admin note: substantial text overlap with arXiv:2405.16519

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
[24] arXiv:2504.02543 [pdf, other]: Title: Probabilistic Pontryagin's Maximum Principle for Continuous-Time Model-Based Reinforcement Learning

Authors: David Leeftink, Çağatay Yıldız, Steffen Ridderbusch, Max Hinne, Marcel van Gerven

Comments: 7 pages, 2 figures, 2 tables

Subjects: Machine Learning (cs.LG)
[25] arXiv:2504.02507 [pdf, other]: Title: ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Authors: Abhay Kumar, Louis Owen, Nilabhra Roy Chowdhury, Fabian Güra

Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)

[ total of 746 entries: 1-25 | 26-50 | 51-75 | 76-100 | ... | 726-746 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, new, 2504, contact, help (Access key information)

> cs > cs.LG

Machine Learning

Authors and titles for recent submissions

Fri, 4 Apr 2025 (showing first 25 of 125 entries)