Linguistic tagger incorrectly tagging as 'OtherWord'

I've been using NSLinguisticTagger with sentences and have been encountering a strange issue with sentences such as 'I am hungry' or 'I am drunk'. Whilst one would expect 'I' to be tagged as a pronoun, 'am' as a verb and 'hungry' as an adjective, they are not. Rather they are all tagged as OtherWord.

Is there something I'm doing incorrectly?

NSString *input = @"I am hungry";
NSLinguisticTaggerOptions options = NSLinguisticTaggerOmitWhitespace;
NSLinguisticTagger *tagger = [[NSLinguisticTagger alloc] initWithTagSchemes:[NSLinguisticTagger availableTagSchemesForLanguage:@"en"] options:options];
tagger.string = input;

[tagger enumerateTagsInRange:NSMakeRange(0, input.length) scheme:NSLinguisticTagSchemeNameTypeOrLexicalClass options:options usingBlock:^(NSString *tag, NSRange tokenRange, NSRange sentenceRange, BOOL *stop) {
    NSString *token = [input substringWithRange:tokenRange];
    NSString *lemma = [tagger tagAtIndex:tokenRange.location
                                  scheme:NSLinguisticTagSchemeLemma
                              tokenRange:NULL
                           sentenceRange:NULL];
    NSLog(@"%@ (%@) : %@\n", token, lemma, tag);
}];

And the output is:

I ((null)) : OtherWord
am ((null)) : OtherWord
hungry ((null)) : OtherWord

asked Mar 27 '15 22:03

Joshua

1 Answer

After quite some time in chat we found the issue:

The sentence does not contain enough information to determine its language.

To fix this you can either:

Add a sample sentence in your language of choice after your actual sentence. That gives the tagger more text to work with and should get your preferred language detected.

Tell the tagger which language to use by adding the line

[tagger setOrthography:[NSOrthography orthographyWithDominantScript:@"Latn" languageMap:@{@"Latn" : @[@"en"]}] range:NSMakeRange(0, input.length)];

before the enumerate call. That way you explicitly tell the tagger which language the text is in, in this case English (en) written in the Latin script (Latn).

If you don't know the language for sure, it may be useful to apply either of these methods only as a fallback when the words get tagged as OtherWord, which indicates that the language could not be detected.
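
A rough sketch of that fallback, in case it helps: tag once with automatic language detection, and only if every word comes back as OtherWord, tag again with the orthography pinned to English. MakeTagger and AllWordsAreOtherWord are hypothetical helper names made up for this example, not Foundation API. The sketch builds a fresh tagger for the second pass as a conservative choice, on the assumption that an already-used tagger might not re-tag after setOrthography:range:.

```objc
#import <Foundation/Foundation.h>

// Hypothetical helper (not part of Foundation): builds a tagger for `input`,
// optionally pinning the language with an explicit orthography.
static NSLinguisticTagger *MakeTagger(NSString *input, NSOrthography *orthography) {
    NSArray *schemes = [NSLinguisticTagger availableTagSchemesForLanguage:@"en"];
    NSLinguisticTagger *tagger = [[NSLinguisticTagger alloc] initWithTagSchemes:schemes options:0];
    tagger.string = input;
    if (orthography != nil) {
        [tagger setOrthography:orthography range:NSMakeRange(0, input.length)];
    }
    return tagger;
}

// Hypothetical helper: YES if at least one word was found and every word
// was tagged OtherWord. Whitespace and punctuation are skipped for this check.
static BOOL AllWordsAreOtherWord(NSLinguisticTagger *tagger, NSString *input) {
    NSLinguisticTaggerOptions options = NSLinguisticTaggerOmitWhitespace | NSLinguisticTaggerOmitPunctuation;
    __block BOOL sawWord = NO;
    __block BOOL allOther = YES;
    [tagger enumerateTagsInRange:NSMakeRange(0, input.length)
                          scheme:NSLinguisticTagSchemeNameTypeOrLexicalClass
                         options:options
                      usingBlock:^(NSString *tag, NSRange tokenRange, NSRange sentenceRange, BOOL *stop) {
        sawWord = YES;
        // Any other tag (or a missing tag) means detection did not fail outright.
        if (![tag isEqualToString:NSLinguisticTagOtherWord]) {
            allOther = NO;
            *stop = YES;
        }
    }];
    return sawWord && allOther;
}

int main(void) {
    @autoreleasepool {
        NSString *input = @"I am hungry";

        // First pass: let the tagger detect the language on its own.
        NSLinguisticTagger *tagger = MakeTagger(input, nil);

        // Fallback: only force English if nothing could be classified.
        if (AllWordsAreOtherWord(tagger, input)) {
            NSOrthography *english = [NSOrthography orthographyWithDominantScript:@"Latn"
                                                                      languageMap:@{@"Latn" : @[@"en"]}];
            tagger = MakeTagger(input, english);
        }

        [tagger enumerateTagsInRange:NSMakeRange(0, input.length)
                              scheme:NSLinguisticTagSchemeNameTypeOrLexicalClass
                             options:NSLinguisticTaggerOmitWhitespace
                          usingBlock:^(NSString *tag, NSRange tokenRange, NSRange sentenceRange, BOOL *stop) {
            NSLog(@"%@ : %@", [input substringWithRange:tokenRange], tag);
        }];
    }
    return 0;
}
```

Forcing the orthography only after a failed first pass keeps automatic detection working for longer or non-English input, where pinning "en" up front could produce wrong tags.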

answered Sep 21 '22 15:09

luk2302

Tags:

ios

objective-c

cocoa

nlp

linguistics
