(# ﾟДﾟ) is a 5-letter-word. But in iOS, [@"(# ﾟДﾟ)" length] is 7. <ol> <li>Why?</li> <li>I'm using <code><UITextInput></code> to modify the text in a <code>UITextField</code> or <code>UITextView</code>. When I make a UITextRange of 5 character length, it can just cover the (# ﾟДﾟ) . So, why this (# ﾟДﾟ) looks like a 5-character-word in <code>UITextField</code> and <code>UITextView</code>, but looks like a 7-character-word in NSString??? </li> <li>How can I get the correct length of a string in this case?</li> </ol>

Both <code>ﾟ</code> and <code>Дﾟ</code> are represented by a character sequence of two Unicode characters (even when they are visually presented as one). <code>-[NSString length]</code> reports the number of Unicode chars: <blockquote> The number returned includes the individual characters of composed character sequences, so you cannot use this method to determine if a string will be visible when printed or how long it will appear. </blockquote> If you want to see the byte representation: <pre class="prettyprint"><code>#import <Foundation/Foundation.h> NSString* describeUnicodeCharacters(NSString* str) { NSMutableString* codePoints = [NSMutableString string]; for(NSUInteger i = 0; i < [str length]; ++i){ long ch = (long)[str characterAtIndex:i]; [codePoints appendFormat:@"%0.4lX ", ch]; } return codePoints; } int main(int argc, char *argv[]) { @autoreleasepool { NSString *s = @" ﾟДﾟ"; NSLog(@"%ld unicode chars. bytes: %@", [s length], describeUnicodeCharacters(s)); } } </code></pre> The output is: <code>4 unicode chars. bytes: 0020 FF9F 0414 FF9F</code>. 2) and 3): what NJones said.

(# ﾟДﾟ) is a 5-letter-word. But in iOS, [@"(# ﾟДﾟ)" length] is 7. Why?

2 Answers

1) As many in the comments have already stated, Your string is made of 5 composed character sequences (or character clusters if you prefer). When broken down by unichars as NSString’s length method does you will get a 7 which is the number of unichars it takes to represent your string in memory.

2) Apparently the UITextField and UITextView are handling the strings in a unichar savvy way. Good news, so can you. See #3.

3) You can get the number of composed character sequences by using some of the NSString API which properly deals with composed character sequences. A quick example I baked up, very quickly, is a small NSString category:

Click to copy

@implementation NSString (ComposedCharacterSequences_helper)
-(NSUInteger)numberOfComposedCharacterSequences{
    __block NSUInteger count = 0;
    [self enumerateSubstringsInRange:NSMakeRange(0, self.length)
                             options:NSStringEnumerationByComposedCharacterSequences
                          usingBlock:^(NSString *substring, NSRange substringRange, NSRange enclosingRange, BOOL *stop){
                              NSLog(@"%@",substring); // Just for fun
                              count++;
                          }];
    return count;
}
@end

Again this is quick code; but it should get you started. And if you use it like so:

Click to copy

NSString *string = @"(# ﾟДﾟ)";
NSLog(@"string length %i", string.length);
NSLog(@"composed character count %i", [string numberOfComposedCharacterSequences]);

You will see that you get the desired result.

For an in-depth explanation of the NSString API check out the WWDC 2012 Session 215 Video "Text and Linguistic Analysis"

143

answered Oct 20 '22 14:10

NJones

Both ﾟ and Дﾟ are represented by a character sequence of two Unicode characters (even when they are visually presented as one). -[NSString length] reports the number of Unicode chars:

The number returned includes the individual characters of composed character sequences, so you cannot use this method to determine if a string will be visible when printed or how long it will appear.

If you want to see the byte representation:

Click to copy

#import <Foundation/Foundation.h>

NSString* describeUnicodeCharacters(NSString* str)
{
    NSMutableString* codePoints = [NSMutableString string];
    for(NSUInteger i = 0; i < [str length]; ++i){
        long ch = (long)[str characterAtIndex:i];
        [codePoints appendFormat:@"%0.4lX ", ch];
    }
    return codePoints;
}


int main(int argc, char *argv[]) {
    @autoreleasepool {
        NSString *s = @" ﾟДﾟ";
        NSLog(@"%ld unicode chars. bytes: %@", 
            [s length], describeUnicodeCharacters(s));
    }
}

The output is: 4 unicode chars. bytes: 0020 FF9F 0414 FF9F.

2) and 3): what NJones said.

answered Oct 20 '22 16:10

Jano

Related questions
                            
                                GKTurnBasedParticipant information
                            
                                How to add UILongPressGestureRecognizer to a UITextField?
                            
                                Show NSData as binary in a NSString
                            
                                Trying to Implement Delegate Inheritance
                            
                                Change NavigationBar color (background color)
                            
                                Using iPhone serial connection (pins 12 and 13)
                            
                                How to implement mute functionality in a PJSIP call on iOS
                            
                                How to set navigationcontroller push animation to "No" while using storyboard
                            
                                Fade out end of text label strings that don't fit (instead of truncate)
                            
                                Map Annotations in iOS 6 don't stay rotated when user pans the map
                            
                                How does Marmalade SDK cross compiler work?
                            
                                iphone - UIGestureRecognizer prevents UITableView from scrolling in Xcode 4.5
                            
                                is Store Kit framework working on i OS 6 Simulator?
                            
                                Can an IOS in-app Non-consumable purchase be modified after it is available?
                            
                                page width in bootstrap responsive design on iphone
                            
                                Get XML with id's of all uploaded videos on youtube channel
                            
                                Facebook iOS SDK Logout
                            
                                How to detect all available Wifi networks and connect to one of them in an iOS app
                            
                                Trying to run code upon a return from a segue
                            
                                Why isn't my iPhone 5 showing 60+ fps even on a completely bare UITableView scroll?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

(# ﾟДﾟ) is a 5-letter-word. But in iOS, [@"(# ﾟДﾟ)" length] is 7. Why?

Tags:

ios

iphone

nsstring

uitextinput

YuAo

People also ask

2 Answers

NJones

Jano

Recent Activity

Donate For Us