In this paper, we propose a novel method of managing a semidynamic cluster through the use of a reinforcement learning. We derive some concepts from reinforcement learning that could be suitable for cooperative networks. We also verify the performance of proposed algorithm by means of a simulation, in which we examined spectral efficiency and convergence properties. The proposed algorithm represents a considerable improvement for edge users in particular. In addition, we analyze the complexity of the clustering schemes. Our proposed algorithm is effective in the environment where there is a limited computational resource.