Distributed wide-and-deep example with the tf.contrib.learn API gets stuck on k8s
I am new to distributed TensorFlow. I tried to run the distributed wide-and-deep example on a one-node k8s cluster, but the worker tasks all get stuck at
Running it on localhost and in plain Docker works fine.
Here is my code. https://github.com/zhoudongyan/wide-and-deep
- docker version: 17.03.1-ce
- k8s version: v1.6.3
- tensorflow version: 1.1.0, python3
- os: ubuntu 14.04 64bit
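For context, a distributed tf.contrib.learn job normally reads its cluster layout from the `TF_CONFIG` environment variable in each container. The sketch below is not taken from the linked repository; it only illustrates the shape of that variable. The helper `build_tf_config`, the service names, and the port are all hypothetical, and the job layout (one ps, one master, two workers) is an assumption.

```python
import json
import os


def build_tf_config(job_name, task_index, cluster):
    """Build the JSON string that tf.contrib.learn's RunConfig reads from TF_CONFIG.

    `cluster` maps a job name to its list of "host:port" addresses.
    """
    if job_name not in cluster:
        raise ValueError("unknown job name: %r" % job_name)
    if not 0 <= task_index < len(cluster[job_name]):
        raise ValueError("task index %d out of range for job %r" % (task_index, job_name))
    return json.dumps({
        "cluster": cluster,
        "task": {"type": job_name, "index": task_index},
        # "cloud" selects the distributed code path; the default is "local".
        "environment": "cloud",
    })


# Hypothetical k8s service names and port, one address per task.
CLUSTER = {
    "ps": ["wide-deep-ps-0:2222"],
    "master": ["wide-deep-master-0:2222"],
    "worker": ["wide-deep-worker-0:2222", "wide-deep-worker-1:2222"],
}

if __name__ == "__main__":
    # Each pod would set its own job name and index before building the estimator.
    os.environ["TF_CONFIG"] = build_tf_config("worker", 0, CLUSTER)
    print(os.environ["TF_CONFIG"])
```

In a setup like this, every pod needs the same `cluster` dictionary and only the `task` entry differs per pod. Each address also has to resolve from inside the other pods.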
Does anyone know how to run it correctly? Thanks a lot!