Skip to content

Conversation

@johnwhumphreys
Copy link
Contributor

Summary:
This diff adds a working Dockerfile. It has been tested w/ the TorchX kubernetes integration on AWS multi-node.

It has an overrideable base image for the nightly from pytorch.

The below test plan shows how to log into a devgpu on the cloud, build it, upload it to the ECR registry, test it, then build with an override and test again.

A good next step is to automate this with GitHub actions. I feel that building it based on the nightlies is a good move and we should not build this every commit / we should not redo all the build logic separately here as it would be complex and redundant.

Differential Revision: D89013574

@meta-cla meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Dec 12, 2025
@meta-codesync
Copy link

meta-codesync bot commented Dec 12, 2025

@johnwhumphreys has exported this pull request. If you are a Meta employee, you can view the originating Diff in D89013574.

…-pytorch#2131)

Summary:

This diff adds a working Dockerfile.  It has been tested w/ the TorchX kubernetes integration on AWS multi-node.

It has an overrideable base image for the nightly from pytorch.

The below test plan shows how to log into a devgpu on the cloud, build it, upload it to the ECR registry, test it, then build with an override and test again.

A good next step is to automate this with GitHub actions.  I feel that building it based on the nightlies is a good move and we should not build this every commit / we should not redo all the build logic separately here as it would be complex and redundant.

Reviewed By: thedavekwon

Differential Revision: D89013574
@meta-codesync
Copy link

meta-codesync bot commented Dec 13, 2025

This pull request has been merged in 7a9ef93.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot. fb-exported Merged meta-exported

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants