7
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Learning Best Response Strategies for Agents in Ad Exchanges

      Preprint
      ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Ad exchanges are widely used in platforms for online display advertising. Autonomous agents operating in these exchanges must learn policies for interacting profitably with a diverse, continually changing, but unknown market. We consider this problem from the perspective of a publisher, strategically interacting with an advertiser through a posted price mechanism. The learning problem for this agent is made difficult by the fact that information is censored, i.e., the publisher knows if an impression is sold but no other quantitative information. We address this problem using the Harsanyi-Bellman Ad Hoc Coordination (HBA) algorithm, which conceptualises this interaction in terms of a Stochastic Bayesian Game and arrives at optimal actions by best responding with respect to probabilistic beliefs maintained over a candidate set of opponent behaviour profiles. We adapt and apply HBA to the censored information setting of ad exchanges. Also, addressing the case of stochastic opponents, we devise a strategy based on a Kaplan-Meier estimator for opponent modelling. We evaluate the proposed method using simulations wherein we show that HBA-KM achieves substantially better competitive ratio and lower variance of return than baselines, including a Q-learning agent and a UCB-based online learning agent, and comparable to the offline optimal algorithm.

          Related collections

          Most cited references8

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          The sample complexity of revenue maximization

            Bookmark
            • Record: found
            • Abstract: not found
            • Book Chapter: not found

            Ad Exchanges: Research Issues

              Bookmark
              • Record: found
              • Abstract: not found
              • Conference Proceedings: not found

              Regret Minimization for Reserve Prices in Second-Price Auctions

                Bookmark

                Author and article information

                Journal
                10 February 2019
                Article
                10.1007/978-3-030-14174-5_6
                1902.03588
                6791cc1e-0c9f-485a-9260-9b8ff0fbe871

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                EUMAS 2018, LNAI 11450, pp. 1-17, 2019
                cs.GT cs.AI

                Theoretical computer science,Artificial intelligence
                Theoretical computer science, Artificial intelligence

                Comments

                Comment on this article