I have a string and I need to scan for every occurrence of "foo" and read all the text following it until a second <code>"</code>. <s>Since Rust does not have a <code>contains</code> function for strings</s>, I need to iterate by characters scanning for it. How would I do this? Edit: Rust's <code>&str</code> has a <code>contains()</code> and <code>find()</code> method.

<blockquote> I need to iterate by characters scanning for it. </blockquote> The <code>.chars()</code> method returns an iterator over characters in a string. e.g. <pre class="prettyprint"><code>for c in my_str.chars() { // do something with `c` } for (i, c) in my_str.chars().enumerate() { // do something with character `c` and index `i` } </code></pre> If you are interested in the byte offsets of each char, you can use <code>char_indices</code>. Look into <code>.peekable()</code>, and use <code>peek()</code> for looking ahead. It's wrapped like this because it supports UTF-8 codepoints instead of being a simple vector of characters. You could also create a vector of <code>char</code>s and work on it from there, but that's more time and space intensive: <pre class="prettyprint"><code>let my_chars: Vec<_> = mystr.chars().collect(); </code></pre>

How do you iterate over a string by character

Tags:

iterator

string

rust

I have a string and I need to scan for every occurrence of "foo" and read all the text following it until a second ". ~~Since Rust does not have a contains function for strings~~, I need to iterate by characters scanning for it. How would I do this?

Edit: Rust's &str has a contains() and find() method.

218

asked Mar 01 '14 18:03

user2171584

2 Answers

I need to iterate by characters scanning for it.

The .chars() method returns an iterator over characters in a string. e.g.

for c in my_str.chars() {      // do something with `c` }  for (i, c) in my_str.chars().enumerate() {     // do something with character `c` and index `i` }

If you are interested in the byte offsets of each char, you can use char_indices.

Look into .peekable(), and use peek() for looking ahead. It's wrapped like this because it supports UTF-8 codepoints instead of being a simple vector of characters.

You could also create a vector of chars and work on it from there, but that's more time and space intensive:

let my_chars: Vec<_> = mystr.chars().collect();

199

answered Oct 14 '22 02:10

centaurian_slug

The concept of a "character" is very ambiguous and can mean many different things depending on the type of data you are working with. The most obvious answer is the chars method. However, this does not work as advertised. What looks like a single "character" to you may actually be made up of multiple Unicode code points, which can lead to unexpected results:

"a̐".chars() // => ['a', '\u{310}']

For actual string processing, you want to work with graphemes. A grapheme consists of one or more unicode code points represented as a string slice. These map better to the human perception of "characters". To create an iterator of graphemes, you can use the unicode-segmentation crate:

use unicode_segmentation::UnicodeSegmentation;  for grapheme in my_str.graphemes(true) {     // ... }

If you are working with raw ASCII then none of the above applies to you, and you can simply use the bytes iterator:

for byte in my_str.bytes() {     // ... }

Although, if you are working with ASCII then arguably you shouldn't be using String/&str at all and instead use Vec<u8>/&[u8] or the ascii crate.

answered Oct 14 '22 03:10

Ibraheem Ahmed

Related questions
                            
                                How do you convert a byte array to a hexadecimal string in C?
                            
                                Format string, integer with leading zeros
                            
                                Unrecognized escape sequence for path string containing backslashes
                            
                                Regex for converting CamelCase to camel_case in java
                            
                                Regular Expression Match to test for a valid year
                            
                                Remove empty strings from array while keeping record Without Loop?
                            
                                String concatenation vs. string substitution in Python
                            
                                Scala check if element is present in a list
                            
                                Calling PHP functions within HEREDOC strings
                            
                                How can I get the string representation of a struct?
                            
                                Joining multiple strings if they are not empty in Python
                            
                                How can I remove the ANSI escape sequences from a string in python
                            
                                How to convert an Int to Hex String in Swift
                            
                                How to check if a string starts with another string in C?
                            
                                The most sophisticated way for creating comma-separated Strings from a Collection/Array/List?
                            
                                Java - Create a new String instance with specified length and filled with specific character. Best solution? [duplicate]
                            
                                How to compare Unicode characters that "look alike"?
                            
                                Tetris-ing an array
                            
                                Python regex - r prefix
                            
                                What is lexicographical order?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With