About Me

Hi, There

一份简要的简历

LinkedIn | GitHub | dockers | Mail me

Ci Song / 宋辞

Well, it is now more obvious that

  • the most difficult part of RL is to design the components, somehow, the algorithm part is not that important as I used to think of

  • through the work on the rubik's cube applying RL, I found that Q-learning is absolutely the first option you should try when the situation you are dealing with is limited. Q-learning is easy, fast, and reasonable for explanation