TY - GEN
T1 - A production technique for a Q-table with an influence map for speeding up Q-learning
AU - Cho, Kyungeun
AU - Sung, Yunsick
AU - Um, Kyhyun
PY - 2007
Y1 - 2007
N2 - Q-learning is a reinforcement learning widely used for automatic learning in the game environment. Before applying Q-learning, the many states of environment that an agent may come in contact with is defined. The weak point of Q-learning is the time it takes to learn these states as states become larger. In this paper, the Q-learning mechanism using an influence map (QIM) is proposed to reduce the time needed for learning. By using an influence map and the learning result, a medium Q-value, which is not yet learnt, will be generated. Generally, when learning is finished, it is difficult to improve the performances. If QIM is used, however, the performance could be improved. Although the Q-table in QIM has been defined with small states, QIM obtains nearly the same learning result.
AB - Q-learning is a reinforcement learning widely used for automatic learning in the game environment. Before applying Q-learning, the many states of environment that an agent may come in contact with is defined. The weak point of Q-learning is the time it takes to learn these states as states become larger. In this paper, the Q-learning mechanism using an influence map (QIM) is proposed to reduce the time needed for learning. By using an influence map and the learning result, a medium Q-value, which is not yet learnt, will be generated. Generally, when learning is finished, it is difficult to improve the performances. If QIM is used, however, the performance could be improved. Although the Q-table in QIM has been defined with small states, QIM obtains nearly the same learning result.
UR - http://www.scopus.com/inward/record.url?scp=50249119081&partnerID=8YFLogxK
U2 - 10.1109/IPC.2007.88
DO - 10.1109/IPC.2007.88
M3 - Conference contribution
AN - SCOPUS:50249119081
SN - 0769530060
SN - 9780769530062
T3 - Proceedings The 2007 International Conference on Intelligent Pervasive Computing, IPC 2007
SP - 72
EP - 75
BT - Proceedings The 2007 International Conference on Intelligent Pervasive Computing, IPC 2007
T2 - 2007 International Conference on Intelligent Pervasive Computing, IPC 2007
Y2 - 11 October 2007 through 13 October 2007
ER -