Tai C.-SHong J.-HHong D.-YLI-CHEN FU2022-04-252022-04-25202223524677https://www.scopus.com/inward/record.uri?eid=2-s2.0-85120986195&doi=10.1016%2fj.segan.2021.100572&partnerID=40&md5=49c087e4f5520422372af027e42f7bfchttps://scholars.lib.ntu.edu.tw/handle/123456789/607420With the increase in global energy consumption, the demand-side management (DSM) system has grown into an important research topic because of its ability to reduce the total electricity cost and peak-to-average ratio (PAR) by rescheduling loads. Besides, the large amount of sensor data in the home area network (HAN) helps to record the energy demand and achieve the DSM capacity, and the machine learning skills such as reinforcement learning can be applied to solve the DSM problem. However, determining a suitable energy management strategy is complicated because the user behaviors are uncertain. In this study, a real-time multi-agent DSM system based on HAN was proposed to find a suitable control policy for reducing the energy cost in a smart home. This system integrated the Deep Q-Network (DQN) agents that adaptively learn the preference of appliance usage to control different types of appliances and energy storage system. The simulation results show that the proposed DSM system reduced peak value, PAR value, and electricity cost by 28.9%, 20.9%, and 28.6% respectively. This system can also be applied to REDD dataset and achieved 74.9% cost reduction. ? 2021Deep Q NetworkDemand-side managementHome area networkReinforcement learningSmart gridUser behaviorAutomationBehavioral researchCost reductionDeep learningDemand side managementDigital storageElectric power transmission networksElectric utilitiesEnergy managementEnergy utilizationHome networksMulti agent systemsSmart power gridsDeep Q networkElectricity costsEnergy-consumptionGlobal energyPeak to average ratiosQ-learningReal- timeResearch topicsUser behaviorsUser's preferences[SDGs]SDG7A real-time demand-side management system considering user preference with adaptive deep Q learning in home area networkjournal article10.1016/j.segan.2021.1005722-s2.0-85120986195