What is the best way to get the first letter from a string in Java, returned as a string of length 1?

Tags:

java

string

Assume the following:

String example      = "something";
String firstLetter  = "";

Are there differences to be aware of with the following ways of assigning firstLetter that could impact performance; which would be best, and why?

firstLetter = String.valueOf(example.charAt(0));
firstLetter = Character.toString(example.charAt(0));
firstLetter = example.substring(0, 1);

The first letter is returned as a String because this runs in Hadoop, where a string is required to set a Text value. firstLetter will be output as a key from a map() method, for example:

public class FirstLetterMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
    String line = new String();
    Text firstLetter = new Text();
    IntWritable wordLength = new IntWritable();

    @Override
    public void map(LongWritable key, Text value, Context context)
            throws IOException, InterruptedException {

        line = value.toString();

        for (String word : line.split("\\W+")){
            if (word.length() > 0) {

                // ---------------------------------------------
                // firstLetter assignment
                firstLetter.set(String.valueOf(word.charAt(0)).toLowerCase());
                // ---------------------------------------------

                wordLength.set(word.length());
                context.write(firstLetter, wordLength);
            }
        }
    }
}

asked Aug 13 '13 05:08

Adrian Torrie

4 Answers

Performance-wise, substring(0, 1) is better, as the following timings show:

    String example = "something";
    String firstLetter  = "";

    long l=System.nanoTime();
    firstLetter = String.valueOf(example.charAt(0));
    System.out.println("String.valueOf: "+ (System.nanoTime()-l));

    l=System.nanoTime();
    firstLetter = Character.toString(example.charAt(0));
    System.out.println("Character.toString: "+ (System.nanoTime()-l));

    l=System.nanoTime();
    firstLetter = example.substring(0, 1);
    System.out.println("substring: "+ (System.nanoTime()-l));

Output:

String.valueOf: 38553
Character.toString: 30451
substring: 8660

answered Oct 14 '22 02:10

Ankur Lathi

Long story short, it probably doesn't matter. Use whichever you think looks nicest.

Longer answer, using Oracle's Java 7 JDK specifically, since this isn't defined in the JLS:

String.valueOf or Character.toString work the same way, so use whichever you feel looks nicer. In fact, Character.toString simply calls String.valueOf (source).
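As a quick sanity check, here is a minimal sketch showing that the three expressions from the question produce equal one-character strings. The class name FirstLetterDemo and its helper methods are made up for illustration; they are not part of any library.

```java
// Hypothetical demo class: the name and helpers are illustrative only.
class FirstLetterDemo {

    // Builds a new one-character String from the first char.
    static String viaValueOf(String s) {
        return String.valueOf(s.charAt(0));
    }

    // Same result as viaValueOf, spelled through Character.
    static String viaCharacterToString(String s) {
        return Character.toString(s.charAt(0));
    }

    // Takes the half-open range [0, 1) of the string.
    static String viaSubstring(String s) {
        return s.substring(0, 1);
    }

    public static void main(String[] args) {
        String example = "something";
        String a = viaValueOf(example);
        String b = viaCharacterToString(example);
        String c = viaSubstring(example);
        // All three are equal by value: each is the string "s".
        System.out.println(a + " " + b + " " + c);
        System.out.println(a.equals(b) && b.equals(c));
    }
}
```

All three throw an IndexOutOfBoundsException on an empty string, so the `word.length() > 0` guard in the question's mapper is needed whichever one you pick.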

So the question is whether you should use one of those or String.substring. Here again it doesn't matter much. In Oracle JDK builds before 7u6, String.substring reuses the original string's char[] and so allocates one object fewer than String.valueOf. This also prevents the original string's char[] from being GC'ed until the one-character string is available for GC (which can be a memory leak), but in your example, they'll both be available for GC after each iteration, so that doesn't matter. The allocation you save also doesn't matter: a char[1] is cheap to allocate, and short-lived objects (as the one-char string will be) are cheap to GC, too. From 7u6 onward, substring copies the characters into a new array, so neither the saved allocation nor the retention applies.

If you have a large enough data set that the differences among the three are even measurable, substring will probably give a slight edge. Like, really slight. But that "if... measurable" contains the real key to this answer: why don't you just try all three and measure which one is fastest?
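If you do want to measure, a rough sketch along these lines may be a starting point. It assumes Java 8 or later (for lambdas), and the class FirstLetterTiming and its averageNanos method are hypothetical names invented here. It runs a warm-up pass before timing and consumes every result, which makes it less likely that the JIT skews the numbers, but a proper harness such as JMH is still more trustworthy.

```java
import java.util.function.UnaryOperator;

// Hypothetical timing helper: a rough sketch, not a rigorous benchmark.
class FirstLetterTiming {

    // Returns the average wall-clock nanoseconds per call of op on input.
    static double averageNanos(UnaryOperator<String> op, String input, int iterations) {
        long sink = 0; // accumulate result lengths so the calls are not trivially dead code

        // Warm-up pass: gives the JIT a chance to compile the code under test.
        for (int i = 0; i < iterations; i++) {
            sink += op.apply(input).length();
        }

        // Timed pass.
        long start = System.nanoTime();
        for (int i = 0; i < iterations; i++) {
            sink += op.apply(input).length();
        }
        long elapsed = System.nanoTime() - start;

        // Use the sink so the accumulated value is observable.
        if (sink < 0) {
            throw new IllegalStateException("unexpected sink value: " + sink);
        }
        return (double) elapsed / iterations;
    }

    public static void main(String[] args) {
        String example = "something";
        int n = 1_000_000;
        System.out.println("String.valueOf:     "
                + averageNanos(s -> String.valueOf(s.charAt(0)), example, n) + " ns/op");
        System.out.println("Character.toString: "
                + averageNanos(s -> Character.toString(s.charAt(0)), example, n) + " ns/op");
        System.out.println("substring:          "
                + averageNanos(s -> s.substring(0, 1), example, n) + " ns/op");
    }
}
```

The numbers will vary by JVM version, hardware, and run, so treat any single run as indicative at best.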

answered Oct 14 '22 02:10

yshavit

String whole = "something";
String first = whole.substring(0, 1);
System.out.println(first);

answered Oct 14 '22 01:10

MIk.13

import org.openjdk.jmh.annotations.Benchmark;
import org.openjdk.jmh.annotations.BenchmarkMode;
import org.openjdk.jmh.annotations.Fork;
import org.openjdk.jmh.annotations.Measurement;
import org.openjdk.jmh.annotations.Mode;
import org.openjdk.jmh.annotations.OutputTimeUnit;
import org.openjdk.jmh.annotations.Scope;
import org.openjdk.jmh.annotations.Setup;
import org.openjdk.jmh.annotations.State;
import org.openjdk.jmh.annotations.Warmup;

import java.util.concurrent.TimeUnit;

@State(Scope.Thread)
@BenchmarkMode(Mode.AverageTime)
@OutputTimeUnit(TimeUnit.NANOSECONDS)
@Warmup(iterations = 5, time = 1)
@Fork(value = 1)
@Measurement(iterations = 5, time = 1)
public class StringFirstCharBenchmark {

    private String source;

    @Setup
    public void init() {
        source = "MALE";
    }

    @Benchmark
    public String substring() {
        return source.substring(0, 1);
    }

    @Benchmark
    public String indexOf() {
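        // Note: indexOf(0) searches for the character with code 0 and returns -1 for "MALE",
        // so this measures String.valueOf(int) on -1; charAt(0) would return the first character.
        return String.valueOf(source.indexOf(0));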
    }
}

Results:

+----------------------------------------------------------------------+
| Benchmark                           Mode  Cnt   Score   Error  Units |
+----------------------------------------------------------------------+
| StringFirstCharBenchmark.indexOf    avgt    5  23.777 ± 5.788  ns/op |
| StringFirstCharBenchmark.substring  avgt    5  11.305 ± 1.411  ns/op |
+----------------------------------------------------------------------+

answered Oct 14 '22 01:10

Nikita
