Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others


tensorflow2.0 - Dockerfile sagemaker training failed

I want to train a machine learning model on SageMaker. I created a repository for my code image in ECR, built the image with the sudo docker build -t command, and tagged the image as tensorflow, but the model is not trained because the container does not start.

I made a folder named Docker, put train.py and the Dockerfile in it, and built the image with the sudo docker build Docker command.

I don't understand what I need to do. Can anybody help me? All the details are below.

Docker images: [screenshot not available]

Container list: [screenshot not available]

Dockerfile: [screenshot not available]

I got the following error: AlgorithmError: CannotStartContainerError. Please make sure the container can be run with 'docker run train'. Please refer SageMaker documentation for details. It is possible that the Dockerfile's entrypoint is not properly defined, or missing permissions.
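The Dockerfile screenshot is not available here, so the actual cause cannot be confirmed. SageMaker starts a training container by running docker run <image> train, so the image needs either an ENTRYPOINT that accepts the train argument or an executable named train on the PATH. The following is a minimal sketch in Python, assuming the training script is copied into the image as a file named train; the helper name check_train_entrypoint is hypothetical and only checks the two causes the error message hints at (missing execute permission and a script that cannot be run directly).

```python
import os
import stat
import sys


def check_train_entrypoint(path):
    """Return a list of likely reasons `docker run <image> train` would fail.

    Assumption: `path` is the local copy of the script that the Dockerfile
    copies into the image under the name `train`.
    """
    if not os.path.isfile(path):
        return [f"{path} does not exist"]

    problems = []

    # The file must carry an execute bit, otherwise the container
    # cannot start it directly.
    mode = os.stat(path).st_mode
    if not mode & (stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH):
        problems.append("file is not executable (run chmod +x)")

    # Without a shebang the kernel does not know which interpreter to use.
    with open(path, "rb") as f:
        first_line = f.readline()
    if not first_line.startswith(b"#!"):
        problems.append("missing shebang line such as #!/usr/bin/env python")

    return problems


if __name__ == "__main__":
    target = sys.argv[1] if len(sys.argv) > 1 else "train"
    for problem in check_train_entrypoint(target):
        print(problem)
```

If the script is kept as train.py instead, an alternative is to define an ENTRYPOINT in the Dockerfile that invokes the interpreter on train.py; which option fits depends on the Dockerfile that is not shown here.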

It would be very helpful if anybody could help or guide me.

question from:https://stackoverflow.com/questions/65913401/dockerfile-sagemaker-training-failed


1 Reply

Waiting for answers
