


28th EDBT 2025: Barcelona, Spain
- Alkis Simitsis, Bettina Kemme, Anna Queralt, Oscar Romero, Petar Jovanovic:
Proceedings 28th International Conference on Extending Database Technology, EDBT 2025, Barcelona, Spain, March 25-28, 2025. OpenProceedings.org 2024
Volume 1
Research Track
- Fernando de Meer Pardo, Claude Lehmann, Dennis Gehrig, Andrea Nagy, Stefano Nicoli, Branka Hadji Misheva, Martin Braschler, Kurt Stockinger:
GraLMatch: Matching Groups of Entities with Graphs and Language Models. 1-12 - Trung-Hoang Le, Hady W. Lauw:
Selecting Comparative Sets of Reviews Across Multiple Items. 13-24 - Panagiotis Bouros, Theodoros Chondrogiannis, Daniel Kowalski:
Fast Geosocial Reachability Queries. 25-38 - Ioannis Xarchakos, Nick Koudas:
Coping With Data Drift in Online Video Analytics. 39-52 - Qihao Cheng, Da Yan, Tianhao Wu, Lyuheng Yuan, Ji Cheng, Zhongyi Huang, Yang Zhou:
Efficient Enumeration of Large Maximal k-Plexes. 53-65 - Daren Chao, Nick Koudas, Xiaohui Yu, Yueting Chen:
Ensembling Object Detectors for Effective Video Query Processing. 66-79 - Yingjun Dai, Ahmed El-Roby, Elmira Adeeb, Vivek Thaker:
OmniMatch: Overcoming the Cold-Start Problem in Cross-Domain Recommendations using Auxiliary Reviews. 80-91 - Gyanendra Shrestha, Chutian Jiang, Sai Akula, Vivek Yannam, Anna Pyayt, Michael N. Gubanov:
Tabular Embeddings for Tables with Bi-Dimensional Hierarchical Metadata and Nesting. 92-105 - Angela Bonifati, Stefania Dumbrava, Haridimos Kondylakis, Georgia Troullinou, Giannis Vassiliou:
Progressive Querying on Knowledge Graphs. 106-118 - Giorgos Alexiou, George Papastefanatos, Vassilis Stamatopoulos, Georgia Koutrika, Nectarios Koziris:
QueryER: A Framework for Fast Analysis-Aware Deduplication over Dirty Data. 119-131 - Ala Eddine Laouir, Abdessamad Imine:
Private Approximate Query over Horizontal Data Federation. 132-144 - Adeel Aslam, Kaustubh Beedkar, Giovanni Simonini:
SPO-Join: Efficient Stream Inequality Join. 145-157
Experiments & Analyses Track
- Jonathan Fürst, Catherine Kosten, Farhad Nooralahzadeh, Yi Zhang, Kurt Stockinger:
Evaluating the Data Model Robustness of Text-to-SQL Systems Based on Real User Queries. 158-170 - Nikolai Merkel, Daniel Stoll, Ruben Mayer, Hans-Arno Jacobsen:
An Experimental Comparison of Partitioning Strategies for Distributed Graph Neural Network Training. 171-184 - Sana Ebrahimi, Rishi Advani, Abolfazl Asudeh:
Evaluating the Feasibility of Sampling-Based Techniques for Training Multilayer Perceptrons. 185-198 - Anna Mitsopoulou, Georgia Koutrika:
Analysis of Text-to-SQL Benchmarks: Limitations, Challenges and Opportunities. 199-212
Volume 2
Research Track
- Sina Shaham, Gabriel Ghinita, Bhaskar Krishnamachari, Cyrus Shahabi:
Differentially Private Publication of Smart Electricity Grid Data. 213-225 - Naiqing Guan, Kaiwen Chen, Nick Koudas:
DataSculpt: Cost-Efficient Label Function Design via Prompting Large Language Models. 226-232 - Hyunjin Choo, Minho Eom, Gyuri Kim, Young-Gyu Yoon, Kijung Shin:
RASP: Robust Mining of Frequent Temporal Sequential Patterns under Temporal Variations. 233-245 - Goetz Graefe, Marius Kuhrt, Bernhard Seeger:
Modifying an existing sort order with offset-value codes. 246-254 - Arnab Phani, Matthias Boehm:
MEMPHIS: Holistic Lineage-based Reuse and Memory Management for Multi-backend ML Systems. 255-269 - Tavor Lipman, Tova Milo, Amit Somech, Tomer Wolfson, Oz Zafar:
LINX: A Language Driven Generative System for Goal-Oriented Automated Data Exploration. 270-283 - Jacco Johannes Egbert Kiezebrink, Wieger R. Punter, Odysseas Papapetrou, Kevin Verbeek:
Synopses for Summarizing Spatial Data Streams. 284-296 - Jan-Eric Hellenberg, Fabian Mahling, Lukas Laskowski, Felix Naumann, Matteo Paganelli, Fabian Panse:
PRISMA: A Privacy-Preserving Schema Matcher using Functional Dependencies. 297-309 - Panos Vassiliadis, Alexandros Karakasidis:
Time-Related Patterns Of Schema Evolution. 310-323 - Tao Li, Feng Liang, Jinqi Quan, Huang Chuang, Teng Wang, Runhuai Huang, Jie Wu, Xiping Hu:
Taste: Towards Practical Deep Learning-based Approaches for Semantic Type Detection in the Cloud. 324-336
Experiments & Analyses Track
- Angelo Mozzillo, Luca Zecchini, Luca Gagliardelli, Adeel Aslam, Sonia Bergamaschi, Giovanni Simonini:
Evaluation of Dataframe Libraries for Data Preparation on a Single Machine. 337-349 - Felix Neutatz, Marius Lindauer, Ziawasch Abedjan:
How Green is AutoML for Tabular Data? 350-363
Research Track
- Fatemeh Ahmadi, Marc Speckmann, Malte F. Kuhlmann, Ziawasch Abedjan:
MaTElDa: Multi-Table Error Detection. 364-376 - Christina Christodoulakis, Moshe Gabel, Angela Demke Brown:
Metadata Unification in Open Data with Gnomon. 377-383 - Akshay A. Bapat, Saravanan Thirumuruganathan, Nick Koudas:
Pythia: A Neural Model for Data Prefetching. 384-396 - Martin Pekár Christensen, Aristotelis Leventidis, Matteo Lissandrini, Laura Di Rocco, Renée J. Miller, Katja Hose:
Fantastic Tables and Where to Find Them: Table Search in Semantic Data Lakes. 397-410 - Michail Theologitis, Georgios Frangias, Georgios Anestis, Vasilis Samoladas, Antonios Deligiannakis:
Communication-Efficient Distributed Deep Learning via Federated Dynamic Averaging. 411-424
Experiments & Analyses Track
- Ran Wei, Zichen Zhu, Andrew Kryczka, Jay Zhuang, Manos Athanassoulis:
Benchmarking, Analyzing, and Optimizing WA of Partial Compaction in RocksDB. 425-437
Research Track
- Reza Salkhordeh, Felix Martin Schuhknecht, Hossein Asadi, Steffen Eiden, André Brinkmann:
No Time to Halt: In-Situ Analysis for Large-Scale Data Processing via Virtual Snapshotting. 438-450 - Aneesh Raman, Konstantinos Karatsenidis, Shaolin Xie, Matthaios Olma, Subhadeep Sarkar, Manos Athanassoulis:
QuIT your B+-tree for the Quick Insertion Tree. 451-463 - Nikolaos Koutroumanis, Christos Doulkeridis, Akrivi Vlachou:
Parallel Spatial Join Processing with Adaptive Replication. 464-476 - Henning Koehler, Muhammad Farhan, Qing Wang:
Stable Tree Labelling for Accelerating Distance Queries on Dynamic Road Networks. 477-489 - André L. C. Mendonça, Felipe T. Brito, Javam C. Machado:
PEG: Local Differential Privacy for Edge-Labeled Graphs. 490-502 - Andrea Colombo, Teodoro Baldazzi, Luigi Bellomarini, Emanuel Sallinger, Stefano Ceri:
Template-based Explainable Inference over High-Stakes Financial Knowledge Graphs. 503-515
Experiments & Analyses Track
- Adrian Lutsch, Muhammad El-Hindi, Matthias Heinrich, Daniel Ritter, Zsolt István, Carsten Binnig:
Benchmarking Analytical Query Processing in Intel SGXv2. 516-528 - Ralph Peeters, Aaron Steiner, Christian Bizer:
Entity Matching using Large Language Models. 529-541
Volume 3
Research Track
- Sedir Mohammed, Felix Naumann, Hazar Harmouch:
Step-by-Step Data Cleaning Recommendations to Improve ML Prediction Accuracy. 542-554 - Mahesh Dananjaya, Vasilis Gavrielatos, Antonios Katsarakis, Nikos Ntarmos, Vijay Nagarajan:
Fast, Highly Available, and Recoverable Transactions on Disaggregated Data Stores. 555-568 - Stefano Calzavara, Lorenzo Cazzaro, Donald Gera, Salvatore Orlando:
Watermarking Decision Tree Ensembles. 569-575 - Xinglin Du, Peng Tang, Rui Chen, Ning Wang, Chengyu Hu, Shanqing Guo:
Query Rewriting-Based View Generation for Efficient Multi-Relation Multi-Query with Differential Privacy. 576-588 - Wang Yue, Martin Boissier, Manisha Luthra, Tilmann Rabl:
Dema: Efficient Decentralized Aggregation for Non-Decomposable Quantile Functions. 589-595 - Landy Andriamampianina, Franck Ravat, Jiefu Song, Nathalie Vallès-Parlangeau, Yanpei Wang:
Selective Evolving Centrality in Temporal Heterogeneous Graphs. 596-608 - Eugenie Y. Lai, Yuze Lou, Brit Youngmann, Michael J. Cafarella:
Toward Standardized Data Preparation: A Bottom-Up Approach. 609-622 - Tanmay Surve, Romila Pradhan:
Explaining Fairness Violations using Machine Unlearning. 623-635 - Minglang Xie, Jianye Yang, Wenjie Zhang, Shiyu Yang, Xuemin Lin:
Deep Skyline Community Search. 636-648 - Enas Khwaileh, Yannis Velegrakis:
Dataset Discovery using Semantic Matching. 649-660 - Josef Schmeißer, Clemens Lutz, Volker Markl:
Efficiently Indexing Large Data on GPUs with Fast Interconnects. 661-667 - Kasun Amarasinghe, Farhana Choudhury, Jianzhong Qi, James Bailey:
Learned Indexes with Distribution Smoothing via Virtual Points. 668-680 - Parisa Esmaeilian Ghahroudi, Sean Chester, Alex Thomo:
Efficient Multicore Discovery of Small, High-Quality k-Plex Teams in Multi-attributed Networks. 681-693 - Camilla Birch Okkels, Martin Aumüller, Viktor Bello Thomsen, Arthur Zimek:
High-dimensional density-based clustering using locality-sensitive hashing. 694-706 - Tianshu Wang, Xiaoyang Chen, Hongyu Lin, Xianpei Han, Le Sun, Hao Wang, Zhenyu Zeng:
DBCopilot: Natural Language Querying over Massive Databases via Schema Routing. 707-721 - Yu Liu, Qi Luo, Yanwei Zheng, Wenjie Zhang, Xuemin Lin, Dongxiao Yu:
Effective and Efficient Community Search over Large-Scale Hypergraphs. 722-734 - Hafiz Tayyab Rauf, Alex Teodor Bogatu, Norman W. Paton, André Freitas:
Gem: Gaussian Mixture Model Embeddings for Numerical Feature Distributions. 735-747 - Hoa Thi Le, Angela Bonifati, Andrea Mauri:
Graph Consistency Rule Mining with LLMs: an Exploratory Study. 748-754 - Bole Chang, Linxin Xie, Wei Li, Meng Qin, Jianfeng Hou:
Z-Shadow: An Efficient Method for Estimating Bicliques in Massive Graphs Using Füredi's Theorem. 755-768 - Christian Knödler, Naeem Ramzan, Ilia Petrov:
hybridNDP: Dynamic Operation Offloading and Cooperative Query Execution in Smart Storage Settings. 769-782 - Renzo Angles, Angela Bonifati, Roberto García, Domagoj Vrgoc:
Path-based Algebraic Foundations of Graph Query Languages. 783-795 - Christoph Schinninger, Fabian Panse, Constantin Kühne, Lisa Ehrlinger:
Icewafl: A Configurable Data Stream Polluter. 796-802 - Mengying Wang, Hanchao Ma, Yiyang Bian, Yangxin Fan, Yinghui Wu:
Generating Skyline Datasets for Data Science Models. 803-815 - Loredana Caruccio, Stefano Cirillo, Giuseppe Polese, Roberto Stanzione:
An RFD-based approach for concept drift detection in Machine Learning Systems. 816-828 - Otmar Ertl:
ExaLogLog: Space-Efficient and Practical Approximate Distinct Counting up to the Exa-Scale. 829-841 - Nripsuta Ani Saxena, Ronit Mathur, Cyrus Shahabi:
Legally-Compliant Spatial Fairness Framework: Advancing Beyond Spatial Fairness. 842-854 - Nodirbek Korchiev, Akash Pateria, Vodelina Samatova, Sogolsadat Mansouri, Kemafor Anyanwu:
Taming the Beast of User-Programmed Transactions on Blockchains: A Declarative Transaction Approach. 855-866 - Mohamed Maher, Osama Fayez Oun, Mahmoud Saeed Mesmeh, Radwa El Shawi:
FedForecaster: An Automated Federated Learning Approach for Time-series Forecasting. 867-873 - Sijie Dong, Soror Sahri, Themis Palpanas, Qitong Wang:
Automated Data Quality Validation in an End-to-End GNN Framework. 874-880
Experiments & Analyses Track
- Peichen Xie, Zhigao Zheng, Yongluan Zhou, Yang Xiu, Hao Liu, Zhixiang Yang, Yu Zhang, Bo Du:
GPU Architectures in Graph Analytics: A Comparative Experimental Study. 881-893 - Ling Zhang, Shaleen Deep, Joyce Cahoon, Jignesh M. Patel, Anja Gruenheid:
From Feature Selection to Resource Prediction: An Analysis of Commonly Applied Workflows and Techniques. 894-908 - Ananya Rahaman, Anny Zheng, Mostafa Milani, Fei Chiang, Rachel Pottinger:
Evaluating SQL Understanding in Large Language Models. 909-921 - Zeyu Zhang, Paul Groth, Iacer Calixto, Sebastian Schelter:
A Deep Dive Into Cross-Dataset Entity Matching with Large and Small Language Models. 922-934 - Thomas Bodner, Theo Radig, David Justen, Daniel Ritter, Tilmann Rabl:
An Empirical Evaluation of Serverless Cloud Infrastructure for Large-Scale Data Processing. 935-948 - Mark Dodds, Khuzaima Daudjee:
Apache Ignite + Calcite Composable Database System: Experimental Evaluation and Analysis. 949-961
Vision Track
- Sihem Amer-Yahia, Jasmina Bogojeska, Roberta Facchinetti, Valeria Franceschi, Aristides Gionis, Katja Hose, Georgia Koutrika, Roger Kouyos, Matteo Lissandrini, Silviu Maniu, Katsiaryna Mirylenka, Davide Mottin, Themis Palpanas, Mattia Rigotti, Yannis Velegrakis:
Towards Reliable Conversational Data Analytics. 962-969 - Mouna Ammar, Christopher Rost, Riccardo Tommasini, Shubhangi Agarwal, Angela Bonifati, Petra Selmer, Evgeny Kharlamov, Erhard Rahm:
Towards Hybrid Graphs: Unifying Property Graphs and Time Series. 970-977 - Sepehr Sadoughi, Nikolay Yakovets, George Fletcher:
Breaking Down the Data-metadata Barrier for Effective Property Graph Data Management. 978-984 - Koyena Pal, David Bau, Renée J. Miller:
Model Lakes. 985-995
Industrial & Applications Track
- Boge Liu, Chunling Wang, Xiaoshuang Chen, Yu Hao, Zhengyi Yang, Yi Jin, Yixing Yang, Wenke Yang, Wanchuan Zhang, Wenjie Zhang:
PhoebeDB: A Disk-Based RDBMS Kernel for High-Performance and Cost-Effective OLTP. 996-1004 - Andreas Kouvaras, Periklis Mantenoglou, Alexander Artikis:
Generating Activity Definitions with Large Language Models. 1005-1013 - Gerald White, Deep Mistry, Kevin Chhoa, Senjuti Basu Roy, Lingyi Zhang, Adam Bienkowski, Krishna R. Pattipati:
A Computational Framework for Estimating Days of Maintenance Delay of Naval Ships. 1014-1022 - Zhijia Chen, Weiyi Meng, Eduard C. Dragut:
ComCrawler: General Crawling Solution for Article Comments. 1023-1031 - Rakesh Menon, Kun Qian, Liqun Chen, Ishika Joshi, Daniel Pandyan, Shashank Srivastava, Yunyao Li:
FISQL: Enhancing Text-to-SQL Systems with Rich Interactive Feedback. 1032-1038 - Chanuk Lim, Kyong-Ha Lee, Hyun Ji Jeong, Sungsu Lim:
GRAIL: Graph Retrieval-Augmented In-Context Learning for Node Classification in Real-World Textual-Attributed Graphs. 1039-1047 - Liat Antwarg Friedman, Gal Lavee, Bracha Shapira, Dorin Shmaryahu:
Data Completion In E-commerce. 1048-1056 - Ilaria Bordino, Francesco Di Iorio, Andrea Galliani, Alessio Rosatelli, Lorenzo Severini:
UniAsk: AI-powered search for banking knowledge bases. 1057-1065
Demonstration Track
- Mihail Stoian, Alexander van Renen, Jan Kobiolka, Ping-Lin Kuo, Andreas Zimmerer, Josif Grabocka, Andreas Kipf:
Virtual: Compressing Data Lake Files. 1066-1069 - Georgios Grigoropoulos, Alexandros Troupiotis-Kapeliaris, Ilias Chamatidis, Evangelia Filippou, Konstantina Bereta:
Transforming Maritime Safety: Data-driven Applications for the Real-Time Detection and Mitigation of Maritime Incidents. 1070-1073 - Panagiotis Gidarakos, Nikolaos Theologitis, Stavros Maroulis, Loukas Kavouras, Giorgos Giannopoulos, George Papastefanatos:
GLOVES: Global Counterfactual-based Visual Explanations. 1074-1077 - Chiara Forresi, Matteo Francia, Enrico Gallinucci, Matteo Golfarelli:
ASSO: the Automated Schemaless Stream Overseer. 1078-1081 - Joel Rorseth, Parke Godfrey, Lukasz Golab, Divesh Srivastava, Jarek Szlichta:
LADYBUG: an LLM Agent DeBUGger for data-driven applications. 1082-1085 - Evgeny S. Skvortsov, Shayan Mirjafari, Ojaswa Garg, Yilin Xia, Shawn Bowers, Bertram Ludäscher:
LogicLM: Robust Application of Large Language Models with Logic Programming for Data Analytics. 1086-1089 - Mohamed Abdelaal, Samuel Lokadjaja, Arne Kreuz, Harald Schöning:
DataLens: ML-Oriented Interactive Tabular Data Quality Dashboard. 1090-1093 - Haralampos Gavriilidis, Lennart Behme, Christian Munz, Varun Pandey, Volker Markl:
CompoDB: A Demonstration of Modular Data Systems in Practice. 1094-1097 - Anastasiia Avksientieva, Parke Godfrey, Lukasz Golab, Divesh Srivastava, Jarek Szlichta:
REACT: REcourse Analysis with Counterfactuals and Explanation Tables. 1098-1101 - Zesong Zhang, Jianzhong Qi, Xin Cao, Christian S. Jensen:
SemaSK: Answering Semantics-aware Spatial Keyword Queries with Large Language Models. 1102-1105 - Fajrian Yunus, Pratik Karmakar, Pierre Senellart, Talel Abdessalem, Stéphane Bressan:
Using A Probabilistic Database in an Image Retrieval Application. 1106-1109 - Marc Maynou, Sergi Nadal:
Supporting Data Discovery Tasks at Scale with FREYJA. 1110-1113 - Francesco Invernici, Anna Bernasconi, Francesca Curati, Jelena Jakimov, Amirhossein Samavi:
TETYS: Configurable Topic Modeling Exploration for Big Corpora of Text Documents. 1114-1117 - Wenbo Sun, Ziyu Li, Vaishnav Srinidhi, Rihan Hai:
Database is All You Need: Serving LLMs with Relational Queries. 1118-1121 - Justus Henneberg, Felix Schuhknecht:
Do Research, not Data Visualization! How to Create More Consistent Plots for Experimental Research Papers in Less Time. 1122-1125 - Sven Rasmusen, Konstantina Pityanou, Dimitra Papatsaroucha, Sofiane Lagraa, Moussa Ouedraogo, Evangelos K. Markakis:
Secure and Transparent Data Sharing with TrustShare: A GDPR-Compliant Platform. 1126-1129 - Ariane Ziehn, Lily Seidl, Samira Akili, Steffen Zeuch, Volker Markl:
Enabling Complex Event Processing in NebulaStream. 1130-1133 - Antonios Kontaxakis, Dimitris Sacharidis, Alkis Simitsis, Alberto Abelló, Sergi Nadal:
Hyppo: Efficient Discovery and Execution of Data Science Pipelines in Collaborative Environments. 1134-1137 - Pasquale Leonardo Lazzaro, Marialaura Lazzaro, Paolo Missier, Riccardo Torlone:
PROLIT: Supporting the Transparency of Data Preparation Pipelines through Narratives over Data Provenance. 1138-1141 - Moein Shirdel, Joel Rorseth, Parke Godfrey, Lukasz Golab, Divesh Srivastava, Jarek Szlichta:
AprèsCoT: Explaining LLM Answers with Knowledge Graphs and Chain of Thought. 1142-1145 - Thomas Bodner, Tilmann Rabl:
An Interactive Analysis of Serverless Cloud Infrastructure. 1146-1149 - Jáchym Bártík, Alzbeta Srutková, Irena Holubová:
TransforMMer: A Universal Multi-Model Data Generator. 1150-1153 - Andrea Baraldi, Matteo Brucato, Miroslav Dudík, Francesco Guerra, Matteo Interlandi:
FairnessEval: a Framework for Evaluating Fairness of Machine Learning Models. 1154-1157 - Charlotte Felius, Peter Boncz:
VCrypt: Leveraging Vectorized and Compressed Execution for Client-side Encryption. 1158-1161
Tutorial Track
- Vincent T'kindt, Patrick Marcel:
Can Operations Research bring you to the next level? Basics and application. 1162-1165 - Da Yan, Lyuheng Yuan, Akhlaque Ahmad, Saugat Adhikari:
Systems for Scalable Graph Analytics and Machine Learning: Trends and Methods. 1166-1169 - Mohamed-Amine Baazizi, Dario Colazzo, Giorgio Ghelli, Carlo Sartiani, Stefanie Scherzinger:
Everything You Always Wanted to Know About JSON Schema (But Were Afraid to Ask). 1170-1173 - Chuangtao Ma, Yongrui Chen, Tianxing Wu, Arijit Khan, Haofen Wang:
Unifying Large Language Models and Knowledge Graphs for Question Answering: Recent Advances and Opportunities. 1174-1177
