Best way to implement Spark + AWS + Caffe/CUDA?

I have an application with an already-trained caffemodel file, and I need to deploy it to a Spark cluster on AWS because of the GPU computation it requires (20K patches per image). From my research, the best approach seems to be to use Spark to create an AWS cluster whose nodes run a Docker image or an Amazon AMI that installs the project dependencies automatically. Once everything is installed, the job can run on the cluster through Spark. What I am wondering is how to do this from start to finish.

I have seen several guides and have taken online courses on Spark (BerkeleyX, Udemy) and Docker (Udemy). However, almost all of the material I have found shows how to implement the simplest possible application, with few or no heavy software dependencies (CUDA drivers, cuDNN, Caffe, DIGITS). I have deployed Spark clusters on AWS and run simple examples that had no dependencies, but I have found little to no information on running an application that requires even a small dependency such as numpy.

Does anyone have experience with such an implementation who can point me in the right direction or offer some help or suggestions?
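To make the target workload concrete, here is a minimal sketch of the per-partition scoring pattern such a job might use, so that the model is loaded once per partition rather than once per patch. Everything named here is an assumption for illustration: `make_partition_scorer`, `load_model`, and the `(key, patch)` row layout are hypothetical, and the stand-in model just averages numbers so that the sketch runs without Spark, Caffe, or a GPU.

```python
from typing import Callable, Iterable, Iterator, List, Sequence, Tuple

Patch = Sequence[float]                       # placeholder for one image patch
Model = Callable[[List[Patch]], List[float]]  # batch of patches -> one score each


def make_partition_scorer(load_model: Callable[[], Model], batch_size: int = 256):
    """Build a function that could be passed to RDD.mapPartitions.

    load_model is hypothetical: on a real GPU worker it might call
    caffe.set_mode_gpu() and wrap caffe.Net(prototxt, caffemodel, caffe.TEST).
    """

    def score_partition(rows: Iterable[Tuple[str, Patch]]) -> Iterator[Tuple[str, float]]:
        model = None
        keys: List[str] = []
        batch: List[Patch] = []
        for key, patch in rows:
            if model is None:
                # Load lazily, once per partition, and only if it has rows.
                model = load_model()
            keys.append(key)
            batch.append(patch)
            if len(batch) == batch_size:
                yield from zip(keys, model(batch))
                keys, batch = [], []
        if batch:
            # Flush the final, possibly smaller, batch.
            yield from zip(keys, model(batch))

    return score_partition


def load_fake_model() -> Model:
    """Local stand-in for the real network: scores a patch by its mean."""
    return lambda patches: [sum(p) / len(p) for p in patches]


if __name__ == "__main__":
    rows = [("img0/patch%d" % i, [float(i), float(i + 2)]) for i in range(5)]
    scorer = make_partition_scorer(load_fake_model, batch_size=2)
    print(list(scorer(rows)))
```

With PySpark, the same function would presumably be used as `sc.parallelize(rows).mapPartitions(scorer)`, which is exactly the step that requires Caffe, CUDA, and numpy to be present on every worker.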

Here are some things I have looked into:
bitfusion AMI:

My question is how to implement a small sample application from start to end, with the Spark cluster created automatically while the needed dependencies are installed through either Docker or an AMI from above.

    Platform: Ubuntu 14.04
    Language: Python
    Dependencies: CUDA 7.5, caffe-nv, libcudnn4, NVIDIA Graphics Driver (346-352)
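Whichever route installs these dependencies (Docker image or AMI), one way to confirm that a node actually has them is a small preflight check. The sketch below uses only the standard library; the function names `check_worker` and `missing` are hypothetical, and the default module and binary names (`numpy`, `caffe`, `nvidia-smi`) are assumptions based on the dependency list above.

```python
import importlib.util
import shutil
import socket
from typing import Dict, Iterable, List


def check_worker(modules: Iterable[str] = ("numpy", "caffe"),
                 binaries: Iterable[str] = ("nvidia-smi",)) -> Dict[str, object]:
    """Report which Python modules and executables this machine can see."""
    report: Dict[str, object] = {"host": socket.gethostname()}
    for name in modules:
        # find_spec returns None for a missing top-level module, without importing it.
        report["module:" + name] = importlib.util.find_spec(name) is not None
    for name in binaries:
        # shutil.which returns None when the executable is not on PATH.
        report["binary:" + name] = shutil.which(name) is not None
    return report


def missing(report: Dict[str, object]) -> List[str]:
    """List the checks that failed, ignoring non-boolean fields such as host."""
    return sorted(key for key, ok in report.items() if ok is False)


if __name__ == "__main__":
    result = check_worker()
    print(result["host"], "is missing:", missing(result))
```

On a cluster, something like `sc.parallelize(range(n), n).map(lambda _: check_worker()).collect()` could gather these reports, although Spark does not guarantee that the tasks land on every worker, so the result is a spot check rather than a full inventory.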
