Find a string starting from given index

Tags:

rust

What is the correct way to find a substring when the search should not start at index 0?

I have this code:

fn SplitFile(reader: BufReader<File>) {
  for line in reader.lines() {
    let mut l = line.unwrap();
    // l contains "06:31:53.012   index0:2015-01-06 00:00:13.084
    ...

I need to find the third : and parse the date after it. I have no idea how to do it, because find doesn't take any parameter like begin - see https://doc.rust-lang.org/std/string/struct.String.html#method.find.

(I know I can use a regex. I already have that working, but I'd like to compare the performance and see whether parsing by hand might be quicker than using a regex.)


asked Jul 07 '15 21:07

stej

2 Answers

There is a much simpler solution to this problem, in my opinion: use the .splitn() method. This method splits a string on a given pattern into at most n parts, so the last part keeps any remaining separators. For example:

let s = "ab:bc:cd:de:ef".to_string();
println!("{:?}", s.splitn(3, ':').collect::<Vec<_>>());
// ^ prints ["ab", "bc", "cd:de:ef"]

In your case, you need to split the line into 4 parts separated by ':' and take the 4th one (index 3, since nth counts from 0):

// assuming the line is correctly formatted
let date = l.splitn(4, ':').nth(3).unwrap();

If you don't want to use unwrap (the line might not be correctly formatted):

if let Some(date) = l.splitn(4, ':').nth(3) {
    // parse the date and time
}
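To make the snippets above concrete, here is a minimal, self-contained sketch that runs splitn on the sample line from the question. The helper name timestamp_after_third_colon is made up for illustration and is not part of the original code.

```rust
/// Returns everything after the third ':' in `line`, or `None` if the line
/// has fewer than three colons.
/// (Hypothetical helper name, introduced only for this example.)
fn timestamp_after_third_colon(line: &str) -> Option<&str> {
    // splitn(4, ':') yields at most 4 pieces; the 4th piece keeps any
    // colons that appear later in the line (such as those in the time).
    line.splitn(4, ':').nth(3)
}

fn main() {
    // Sample line taken from the question.
    let line = "06:31:53.012   index0:2015-01-06 00:00:13.084";
    match timestamp_after_third_colon(line) {
        Some(ts) => println!("timestamp: {}", ts),
        None => println!("line doesn't contain a timestamp"),
    }
}
```

Because nth(3) returns None when fewer than four pieces exist, a malformed line can be handled without panicking.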


answered Sep 30 '22 05:09

faiface

You are right, there doesn't appear to be any trivial way of skipping several matches when searching a string. You can do it by hand though.

fn split_file(reader: BufReader<File>) {
    for line in reader.lines() {
        let mut l = &line.as_ref().unwrap()[..]; // get a slice
        for _ in 0..3 {
            if let Some(idx) = l.find(":") {
                l = &l[idx+1..]
            } else {
                panic!("the line didn't have enough colons"); // you probably shouldn't panic
            }
        }
        // l now contains the date
        ...
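If you really want something like a begin parameter, the loop above can be wrapped in small helpers. The following is only a sketch: find_from and nth_match are hypothetical names, not standard library functions, and they are built from str::get, str::find and str::match_indices.

```rust
/// Like `str::find`, but starts searching at byte offset `start`.
/// The returned index is relative to the whole string.
/// (Hypothetical helper, not part of the standard library.)
fn find_from(s: &str, start: usize, pat: char) -> Option<usize> {
    // `get` returns None instead of panicking when `start` is past the end
    // of the string or not on a UTF-8 character boundary.
    s.get(start..)?.find(pat).map(|i| i + start)
}

/// Byte index of the n-th occurrence of `pat` (n counts from 1).
/// (Hypothetical helper, not part of the standard library.)
fn nth_match(s: &str, pat: char, n: usize) -> Option<usize> {
    // n == 0 has no meaning here, so checked_sub turns it into None.
    s.match_indices(pat).nth(n.checked_sub(1)?).map(|(i, _)| i)
}

fn main() {
    let line = "06:31:53.012   index0:2015-01-06 00:00:13.084";

    // Search for the next colon starting after the first one.
    println!("{:?}", find_from(line, 3, ':'));

    // Jump straight to the third colon and slice off everything after it.
    if let Some(idx) = nth_match(line, ':', 3) {
        println!("{}", &line[idx + 1..]);
    }
}
```

Slicing with idx + 1 is safe here because ':' is a single byte in UTF-8; a multi-byte separator would need idx + pat.len_utf8() instead.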

Update:

As faiface points out in the other answer, you can do this a bit more cleanly with splitn():

fn split_file(reader: BufReader<File>) {
    for line in reader.lines() {
        let l = line.unwrap();
        // nth(3) is None when the line has fewer than three colons;
        // last() would always return Some, so the else branch would never run
        if let Some(datetime) = l.splitn(4, ':').nth(3) {
            // datetime now contains the timestamp string
            ...
        } else {
            panic!("line doesn't contain a timestamp");
        }
    }
}

You should go upvote his answer.

answered Sep 30 '22 06:09

Lily Ballard
