I'm trying to parallelize an algorithm I have. This is a sketch of how I would write it in C++: <pre class="prettyprint"><code>void thread_func(std::vector<int>& results, int threadid) { results[threadid] = threadid; } std::vector<int> foo() { std::vector<int> results(4); for(int i = 0; i < 4; i++) { spawn_thread(thread_func, results, i); } join_threads(); return results; } </code></pre> The point here is that each thread has a reference to a shared, mutable object that it does not own. It seems like this is difficult to do in Rust. Should I try to cobble it together in terms of (and I'm guessing here) <code>Mutex</code>, <code>Cell</code> and <code>&mut</code>, or is there a better pattern I should follow?

The proper way is to use <code>Arc<Mutex<...>></code> or, for example, <code>Arc<RWLock<...>></code>. <code>Arc</code> is a shared ownership-based concurrency-safe pointer to immutable data, and <code>Mutex</code>/<code>RWLock</code> introduce synchronized internal mutability. Your code then would look like this: <pre class="prettyprint"><code>use std::sync::{Arc, Mutex}; use std::thread; fn thread_func(results: Arc<Mutex<Vec<i32>>>, thread_id: i32) { let mut results = results.lock().unwrap(); results[thread_id as usize] = thread_id; } fn foo() -> Arc<Mutex<Vec<i32>>> { let results = Arc::new(Mutex::new(vec![0; 4])); let guards: Vec<_> = (0..4).map(|i| { let results = results.clone(); thread::spawn(move || thread_func(results, i)) }).collect(); for guard in guards { guard.join(); } results } </code></pre> This unfortunately requires you to return <code>Arc<Mutex<Vec<i32>>></code> from the function because there is no way to "unwrap" the value. An alternative is to clone the vector before returning. However, using a crate like scoped_threadpool (whose approach could only be recently made sound; something like it will probably make into the standard library instead of the now deprecated <code>thread::scoped()</code> function, which is unsafe) it can be done in a much nicer way: <pre class="prettyprint"><code>extern crate scoped_threadpool; use scoped_threadpool::Pool; fn thread_func(result: &mut i32, thread_id: i32) { *result = thread_id; } fn foo() -> Vec<i32> { let results = vec![0; 4]; let mut pool = Pool::new(4); pool.scoped(|scope| { for (i, e) in results.iter_mut().enumerate() { scope.execute(move || thread_func(e, i as i32)); } }); results } </code></pre> If your <code>thread_func</code> needs to access the whole vector, however, you can't get away without synchronization, so you would need a <code>Mutex</code>, and you would still get the unwrapping problem: <pre class="prettyprint"><code>extern crate scoped_threadpool; use std::sync::Mutex; use scoped_threadpool::Pool; fn thread_func(results: &Mutex<Vec<u32>>, thread_id: i32) { let mut results = results.lock().unwrap(); result[thread_id as usize] = thread_id; } fn foo() -> Vec<i32> { let results = Mutex::new(vec![0; 4]); let mut pool = Pool::new(4); pool.scoped(|scope| { for i in 0..4 { scope.execute(move || thread_func(&results, i)); } }); results.lock().unwrap().clone() } </code></pre> But at least you don't need any <code>Arc</code>s here. Also <code>execute()</code> method is <code>unsafe</code> if you use stable compiler because it does not have a corresponding fix to make it safe. It is safe on all compiler versions greater than 1.4.0, according to its build script.

Thread-safe mutable non-owning pointer in Rust?

Tags:

multithreading

rust

I'm trying to parallelize an algorithm I have. This is a sketch of how I would write it in C++:

void thread_func(std::vector<int>& results, int threadid) {
   results[threadid] = threadid;
}

std::vector<int> foo() {
  std::vector<int> results(4);

  for(int i = 0; i < 4; i++)
  {
     spawn_thread(thread_func, results, i);
  }

  join_threads();

  return results;
}

The point here is that each thread has a reference to a shared, mutable object that it does not own. It seems like this is difficult to do in Rust. Should I try to cobble it together in terms of (and I'm guessing here) Mutex, Cell and &mut, or is there a better pattern I should follow?

376

asked Aug 27 '15 12:08

anjruu

1 Answers

The proper way is to use Arc<Mutex<...>> or, for example, Arc<RWLock<...>>. Arc is a shared ownership-based concurrency-safe pointer to immutable data, and Mutex/RWLock introduce synchronized internal mutability. Your code then would look like this:

use std::sync::{Arc, Mutex};
use std::thread;

fn thread_func(results: Arc<Mutex<Vec<i32>>>, thread_id: i32) {
    let mut results = results.lock().unwrap();
    results[thread_id as usize] = thread_id;
}

fn foo() -> Arc<Mutex<Vec<i32>>> {
    let results = Arc::new(Mutex::new(vec![0; 4]));

    let guards: Vec<_> = (0..4).map(|i| {
        let results = results.clone();
        thread::spawn(move || thread_func(results, i))
    }).collect();

    for guard in guards {
        guard.join();
    }

    results
}

This unfortunately requires you to return Arc<Mutex<Vec<i32>>> from the function because there is no way to "unwrap" the value. An alternative is to clone the vector before returning.

However, using a crate like scoped_threadpool (whose approach could only be recently made sound; something like it will probably make into the standard library instead of the now deprecated thread::scoped() function, which is unsafe) it can be done in a much nicer way:

extern crate scoped_threadpool;

use scoped_threadpool::Pool;

fn thread_func(result: &mut i32, thread_id: i32) {
    *result = thread_id;
}

fn foo() -> Vec<i32> {
    let results = vec![0; 4];
    let mut pool = Pool::new(4);

    pool.scoped(|scope| {
        for (i, e) in results.iter_mut().enumerate() {
            scope.execute(move || thread_func(e, i as i32));
        }
    });

    results
}

If your thread_func needs to access the whole vector, however, you can't get away without synchronization, so you would need a Mutex, and you would still get the unwrapping problem:

extern crate scoped_threadpool;

use std::sync::Mutex;

use scoped_threadpool::Pool;

fn thread_func(results: &Mutex<Vec<u32>>, thread_id: i32) {
    let mut results = results.lock().unwrap();
    result[thread_id as usize] = thread_id;
}

fn foo() -> Vec<i32> {
    let results = Mutex::new(vec![0; 4]);
    let mut pool = Pool::new(4);

    pool.scoped(|scope| {
        for i in 0..4 {
            scope.execute(move || thread_func(&results, i));
        }
    });

    results.lock().unwrap().clone()
}

But at least you don't need any Arcs here. Also execute() method is unsafe if you use stable compiler because it does not have a corresponding fix to make it safe. It is safe on all compiler versions greater than 1.4.0, according to its build script.

answered Oct 01 '22 17:10

Vladimir Matveev

Related questions
                            
                                What is the Use and need of thread local
                            
                                Under what conditions will writes to non-volatile variables be unseen by other threads? Can I force such conditions for experimental purposes?
                            
                                Lifetime of std::thread arguments
                            
                                Sharing scope across awaits
                            
                                Phonegap "['Media'] Plugin should use a background thread."
                            
                                Crash on [NSKeyedArchiver archivedDataWithRootObject:self.data]
                            
                                An attempt to create atomic reference counting is failing with deadlock. Is this the right approach?
                            
                                Stopping a python thread running an Infinite Loop
                            
                                Alternate to Dataflow BroadcastBlock with guaranteed delivery
                            
                                Flask and/or Tornado - handling time consuming call to external webservice
                            
                                Returning from a task without blocking UI thread
                            
                                How to avoid busy spinning in Java
                            
                                Extending threading.Timer for returning value from function gives TypeError
                            
                                use FutureTask for concurrency
                            
                                Ensuring that current thread holds a lock on a C++11 mutex
                            
                                Java thread start time
                            
                                Why does this code catch block not execute?
                            
                                How can I ensure Task.Delay is more accurate?
                            
                                Java : How to return intermediate results from a Thread
                            
                                Why do I get a CoreData multithreading violation when trying to access a property of a fetched object?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With