 

How can I perform parallel asynchronous HTTP GET requests with reqwest?

The async example is useful, but being new to Rust and Tokio, I am struggling to work out how to do N requests at once, using URLs from a vector, and creating an iterator of the response HTML for each URL as a string.

How could this be done?

asked Jun 26 '18 by user964375


1 Answer

Concurrent requests

As of reqwest 0.10:

```rust
use futures::{stream, StreamExt}; // 0.3.5
use reqwest::Client; // 0.10.6
use tokio; // 0.2.21, features = ["macros"]

const CONCURRENT_REQUESTS: usize = 2;

#[tokio::main]
async fn main() {
    let client = Client::new();

    let urls = vec!["https://api.ipify.org"; 2];

    let bodies = stream::iter(urls)
        .map(|url| {
            let client = &client;
            async move {
                let resp = client.get(url).send().await?;
                resp.bytes().await
            }
        })
        .buffer_unordered(CONCURRENT_REQUESTS);

    bodies
        .for_each(|b| async {
            match b {
                Ok(b) => println!("Got {} bytes", b.len()),
                Err(e) => eprintln!("Got an error: {}", e),
            }
        })
        .await;
}
```

stream::iter(urls) 

stream::iter

Take a collection of strings and convert it into a Stream.

.map(|url| { 

StreamExt::map

Run a function on every element in the stream, transforming each element into a new type — here, each URL becomes a future that will fetch it.

let client = &client; async move { 

Take an explicit reference to the Client and move the reference (not the original Client) into an anonymous asynchronous block.

let resp = client.get(url).send().await?; 

Start an asynchronous GET request using the Client's connection pool and wait for the request.

resp.bytes().await 

Request and wait for the bytes of the response.

.buffer_unordered(N); 

StreamExt::buffer_unordered

Convert a stream of futures into a stream of those futures' values, executing the futures concurrently.

```rust
bodies
    .for_each(|b| {
        async {
            match b {
                Ok(b) => println!("Got {} bytes", b.len()),
                Err(e) => eprintln!("Got an error: {}", e),
            }
        }
    })
    .await;
```

StreamExt::for_each

Convert the stream back into a single future, printing out the amount of data received along the way, then wait for the future to complete.

See also:

  • Join futures with limited concurrency
  • How to merge iterator of streams?
  • How do I synchronously return a value calculated in an asynchronous Future in stable Rust?
  • What is the difference between `then`, `and_then` and `or_else` in Rust futures?

Without bounded execution

If you wanted to, you could also convert an iterator into an iterator of futures and use future::join_all:

```rust
use futures::future; // 0.3.4
use reqwest::Client; // 0.10.1
use tokio; // 0.2.11

#[tokio::main]
async fn main() {
    let client = Client::new();

    let urls = vec!["https://api.ipify.org"; 2];

    let bodies = future::join_all(urls.into_iter().map(|url| {
        let client = &client;
        async move {
            let resp = client.get(url).send().await?;
            resp.bytes().await
        }
    }))
    .await;

    for b in bodies {
        match b {
            Ok(b) => println!("Got {} bytes", b.len()),
            Err(e) => eprintln!("Got an error: {}", e),
        }
    }
}
```

I'd encourage using the first example as you usually want to limit the concurrency, which buffer and buffer_unordered help with.

Parallel requests

Concurrent requests are generally good enough, but there are times when you need parallel requests. In that case, you need to spawn a task.

```rust
use futures::{stream, StreamExt}; // 0.3.8
use reqwest::Client; // 0.10.9
use tokio; // 0.2.24, features = ["macros"]

const PARALLEL_REQUESTS: usize = 2;

#[tokio::main]
async fn main() {
    let urls = vec!["https://api.ipify.org"; 2];

    let client = Client::new();

    let bodies = stream::iter(urls)
        .map(|url| {
            let client = client.clone();
            tokio::spawn(async move {
                let resp = client.get(url).send().await?;
                resp.bytes().await
            })
        })
        .buffer_unordered(PARALLEL_REQUESTS);

    bodies
        .for_each(|b| async {
            match b {
                Ok(Ok(b)) => println!("Got {} bytes", b.len()),
                Ok(Err(e)) => eprintln!("Got a reqwest::Error: {}", e),
                Err(e) => eprintln!("Got a tokio::JoinError: {}", e),
            }
        })
        .await;
}
```

The primary differences are:

  • We use tokio::spawn to perform work in separate tasks.
  • We have to give each task its own reqwest::Client. As recommended, we clone a shared client to make use of the connection pool.
  • There's an additional error case when the task cannot be joined.
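Cloning the `Client` is cheap because `reqwest::Client` holds its connection pool behind an `Arc` internally, so a clone is just a new handle to the same pool. A sketch of the same pattern with plain `Arc` and OS threads (`sum_across_threads` and the data are my illustration):

```rust
use std::sync::Arc;
use std::thread;

// Like a reqwest::Client clone, an Arc clone is a cheap reference-count
// bump: every handle shares the same underlying allocation (pool).
fn sum_across_threads(shared: Arc<Vec<u32>>) -> u32 {
    let handles: Vec<_> = (0..2)
        .map(|_| {
            let shared = Arc::clone(&shared); // cheap handle, not a deep copy
            thread::spawn(move || shared.iter().sum::<u32>())
        })
        .collect();
    handles.into_iter().map(|h| h.join().unwrap()).sum()
}

fn main() {
    let pool = Arc::new(vec![1, 2, 3]);
    println!("{}", sum_across_threads(pool));
}
```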

See also:

  • What is the difference between concurrent programming and parallel programming?
  • What is the difference between concurrency and parallelism?
  • What is the difference between concurrency, parallelism and asynchronous methods?
answered Oct 12 '22 by Shepmaster