Implementation of model parallelism in TensorFlow

I'm currently working on a system with two GPUs, each with 12 GB of memory, and I want to implement model parallelism across them to train large models. I have looked all over the internet, SO, the TensorFlow documentation, etc. I was able to find explanations of model parallelism and its results, but nowhere did I find a small tutorial or small code snippets on how to implement it in TensorFlow. I mean, we have to exchange activations after every layer, right? So how do we do that? Is there a specific or cleaner way of implementing model parallelism in TensorFlow? It would be very helpful if you could suggest a place where I can learn to implement it, or simple code like MNIST training on multiple GPUs using MODEL PARALLELISM.

Note: I have done data parallelism as in the CIFAR-10 multi-GPU tutorial, but I haven't found any implementation of model parallelism.

asked Feb 06 '17 by krish567


People also ask

Do I need mesh TensorFlow for data-parallel training?

If you just want data-parallel training (batch-splitting), then you do not need Mesh TensorFlow, though Mesh TensorFlow can do it. The most common reason for more sophisticated parallel computation is that the parameters of the model do not fit on one device, e.g. a 5-billion-parameter language model.
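For contrast, plain batch-splitting in TensorFlow (without Mesh TensorFlow) can be sketched as below; the single-layer model and MNIST-like shapes are hypothetical:

import tensorflow as tf

# Minimal sketch of data parallelism (batch-splitting), assuming
# hypothetical shapes: one shared copy of the parameters, and each
# GPU processes half of the batch.
x = tf.placeholder(tf.float32, [128, 784])
w = tf.Variable(tf.random_normal([784, 10]))   # shared weights
x0, x1 = tf.split(x, 2, axis=0)                # split the batch in two
with tf.device("/gpu:0"):
    y0 = tf.matmul(x0, w)
with tf.device("/gpu:1"):
    y1 = tf.matmul(x1, w)
y = tf.concat([y0, y1], axis=0)                # reassemble per-example outputs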

How does GPU tensor parallelism work?

In tensor parallelism, each GPU processes only a slice of a tensor and aggregates the full tensor only for operations that require the whole thing. This explanation uses concepts and diagrams from the Megatron-LM paper: Efficient Large-Scale Language Model Training on GPU Clusters.
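As a concrete illustration, a single matmul y = x @ W can be split column-wise across two GPUs; the shapes below are hypothetical:

import tensorflow as tf

# Sketch of column-wise tensor parallelism for y = x @ W: W's columns
# are split across the two GPUs, and the full output tensor is only
# aggregated at the end.
x = tf.placeholder(tf.float32, [128, 4])
with tf.device("/gpu:0"):
    w0 = tf.Variable(tf.random_normal([4, 3]))  # left half of W's columns
    y0 = tf.matmul(x, w0)
with tf.device("/gpu:1"):
    w1 = tf.Variable(tf.random_normal([4, 3]))  # right half of W's columns
    y1 = tf.matmul(x, w1)
y = tf.concat([y0, y1], axis=1)                 # aggregate the full tensor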

What are the applications of parallelism in machine learning?

In modern machine learning, the various approaches to parallelism are used to:

- fit very large models onto limited hardware - e.g. t5-11b is 45GB in just model params
- significantly speed up training - finish training that would take a year in hours

What is the significance of model-wise parallelism?

It is similar to tensor model parallelism or naive layer-wise model parallelism. The significance of this framework is that it takes resources like (1) GPU/TPU/CPU, (2) RAM/DRAM, and (3) fast intra-connect vs. slow inter-connect, and it automatically optimizes over them, algorithmically deciding which parallelization to use where.


1 Answer

Here's an example. The model has some parts on GPU 0, some parts on GPU 1, and some parts on the CPU, so this is three-way model parallelism.

with tf.device("/gpu:0"):
    a = tf.Variable(tf.ones(()))
    a = tf.square(a)
with tf.device("/gpu:1"):
    b = tf.Variable(tf.ones(()))
    b = tf.square(b)
with tf.device("/cpu:0"):
    loss = a+b
opt = tf.train.GradientDescentOptimizer(learning_rate=0.1)
train_op = opt.minimize(loss)

sess = tf.Session()
sess.run(tf.global_variables_initializer())
for i in range(10):
    loss0, _ = sess.run([loss, train_op])
    print("loss", loss0)
answered Oct 20 '22 by Yaroslav Bulatov