Reverse Auctions with Multiple Reinforcement Learning Agents *

Author(s) - Bandyopadhyay Subhajyoti, Rees Jackie, Barron John M.

Publication year - 2008

Publication title - Decision Sciences

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.238

H-Index - 108

eISSN - 1540-5915

pISSN - 0011-7315

DOI - 10.1111/j.1540-5915.2008.00181.x

Subject(s) - bidding, common value auction, microeconomics, competition (biology), economics, reverse auction, exploit, computer science, game theory, Nash equilibrium, ecology, computer security, biology

Reverse auctions in business‐to‐business (B2B) exchanges provide numerous benefits to participants. Arguably the most notable benefit is that of lowered prices driven by increased competition in such auctions. The competition between sellers in reverse auctions has been analyzed using a game‐theoretic framework and equilibria have been established for several scenarios. One finding of note is that, in a setting in which sellers can meet total demand with the highest‐bidding seller being able to sell only a fraction of the total capacity, the sellers resort to a mixed‐strategy equilibrium. Although price randomization in industrial bidding is an accepted norm, one might argue that in reality managers do not utilize advanced game theory calculations in placing bids. More likely, managers adopt simple learning strategies. In this situation, it remains an open question as to whether the bid prices converge to the theoretical equilibrium over time. To address this question, we model reverse‐auction bidding behavior by artificial agents as both two‐player and n‐player games in a simulation environment. The agents begin the game with a minimal understanding of the environment but over time analyze wins and losses for use in determining future bids. To test for convergence, the agents explore the price space and exploit prices where profits are higher, given varying cost and capacity scenarios. In the two‐player case, the agents do indeed converge toward the theoretical equilibrium. The n‐player case provides results that reinforce our understanding of the theoretical equilibria. These results are promising enough to further consider the use of artificial learning mechanisms in reverse auctions and other electronic market transactions, especially as more sophisticated mechanisms are developed to tackle real‐life complexities.
We also develop the analytical results when one agent does not behave strategically while the other agent does and show that our simulations for this environment also result in convergence toward the theoretical equilibrium. Because the nature of the best response in the new setting is very different (pure strategy as opposed to mixed), it indicates the robustness of the devised algorithm. The use of artificial agents can also overcome the limitations in rationality demonstrated by human managers. The results thus have interesting implications for designing artificial agents in automating bid responses for large numbers of bids where human intervention might not always be possible.
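
The explore-and-exploit bidding loop described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the learning rule, so everything below is an assumption made for illustration and not the authors' algorithm: an epsilon-greedy rule over a discrete price grid, two sellers with equal cost and capacity, a buyer who fills demand from the lower bid first, and an even split on ties. The names `allocate` and `simulate` and all parameter values are hypothetical.

```python
import random


def allocate(bids, demand, capacity):
    """Split demand between two sellers (assumed rule: lowest bid is served first)."""
    if bids[0] == bids[1]:
        # Assumed tie-break: share demand equally, capped by capacity.
        share = min(capacity, demand / 2)
        return [share, share]
    low = 0 if bids[0] < bids[1] else 1
    qty = [0, 0]
    qty[low] = min(capacity, demand)
    # The higher bidder only serves whatever demand is left over.
    qty[1 - low] = max(0, min(capacity, demand - qty[low]))
    return qty


def simulate(n_rounds=20000, prices=None, cost=0.0, demand=100,
             capacity=70, epsilon=0.1, seed=0):
    """Two epsilon-greedy sellers bid repeatedly and learn average profit per price."""
    rng = random.Random(seed)
    if prices is None:
        prices = [round(0.1 * k, 1) for k in range(1, 11)]  # hypothetical price grid
    n = len(prices)
    avg = [[0.0] * n for _ in range(2)]   # running mean profit per agent and price
    count = [[0] * n for _ in range(2)]   # how often each agent played each price

    for _ in range(n_rounds):
        choice = []
        for a in range(2):
            if rng.random() < epsilon:
                # Explore: try a random price.
                choice.append(rng.randrange(n))
            else:
                # Exploit: pick a price with the best observed mean profit.
                best = max(avg[a])
                candidates = [i for i, v in enumerate(avg[a]) if v == best]
                choice.append(rng.choice(candidates))
        bids = [prices[c] for c in choice]
        qty = allocate(bids, demand, capacity)
        for a in range(2):
            profit = (bids[a] - cost) * qty[a]
            i = choice[a]
            count[a][i] += 1
            # Incremental update of the mean profit for the chosen price.
            avg[a][i] += (profit - avg[a][i]) / count[a][i]
    return prices, avg, count
```

Calling `simulate()` returns the price grid together with each agent's learned mean profits and play counts; comparing the empirical bid frequencies in `count` against a theoretical mixed-strategy distribution is the kind of convergence check the abstract describes. Because the higher bidder still sells the residual demand when `capacity < demand`, neither seller in this sketch is pushed all the way down to the lowest price.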
