What is it?
In Probability and Reinforcement Learning, a Markov Chain represents dynamic stochastic processes which changes with time, which assumes the Markov Property. Markov Chains care about the states of an environment, instead of performing actions (this is the job for the Markov Decision Process.
How does it work?
Given a set of states , which the transition between states are non-deterministic and stochastic, the probability of transitioning from current state to is:
Given also : a vector of probabilities of occurrence of each state , where and; a matrix of transition probabilities (probabilities of change from a state to another) and ;
One can model a Markov Chain similar to the one in the image above.