sometimes we face gpu crush on cluster environment, especially when multiple job requested at a same time. we need to handle this problem.