Skip to content

fahadaz/TensorflowTrainingDemo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 

Repository files navigation

Tensorflow Distributed Training Demo

We successfully tested distributed (multi GPU) TensorFlow training within Snowflake Container Runtime 🎉

Snowflake's CR doesn’t come with Tensorflow, but it can easily be installed with pip. Our example highlights CR's open-source connectivity with preconfigured Ray and GPU infrastructure..

The differentiators for this work were the Snowflake DataConnector and Ray. Ray is an open-source framework for distributed computing and provides shared memory such as multi-GPU. DataConnector efficiently connects Snowflake data to Ray. We hope this example is helpful for others.

We adapted the example of the

This opens up exciting possibilities for training large-scale TF models directly within Snowflake CR. I am incredibly thankful to Garrett Frere for his collaboration and help in making this possible. If you have any feedback to improve this solution, feel free to share it with us.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors