Abstract
We study repeated bilateral trade where an adaptive σ-smooth adversary generates the valuations of sellers and buyers. We provide a complete characterization of the regret regimes for fixed-price mechanisms under different feedback models in the two cases where the learner can post either the same or different prices _to buyers and sellers. We begin by showing that the minimax regret after T rounds is of order √T in the full-feedback scenario. Under partial feedback, any algorithm that has to post the same price to buyers and sellers suffers worst-case linear regret. However, when the learner can post two different prices at each round, we design an algorithm enjoying regret of order T3/4 ignoring log factors. We prove that this rate is optimal by presenting a surprising T3/4 lower bound, which is the main technical contribution of the paper.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 36th Conference on Learning Theory |
| Publisher | MLResearchPress |
| Pages | 1095-1130 |
| Number of pages | 36 |
| Volume | 195 |
| Publication status | Published - 15 Jul 2023 |
| Event | 36th Annual Conference on Learning Theory, COLT 2023 - Bangalore, India Duration: 12 Jul 2023 → 15 Jul 2023 |
Publication series
| Name | Proceedings of Machine Learning Research |
|---|---|
| Volume | 195 |
| ISSN (Print) | 2640-3498 |
Conference
| Conference | 36th Annual Conference on Learning Theory, COLT 2023 |
|---|---|
| Country/Territory | India |
| City | Bangalore |
| Period | 12/07/23 → 15/07/23 |
Bibliographical note
Publisher Copyright:© 2023 N. Cesa-Bianchi, T. Cesari, R. Colomboni, F. Fusco & S. Leonardi.
Keywords
- online learning
- regret minimization
- smoothed analysis
- two-sided markets
Fingerprint
Dive into the research topics of 'Repeated Bilateral Trade Against a Smoothed Adversary'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver