This is my problem: My language (Portuguese) uses ISO-8859-1 char encoding! When I want access a character from a string like 'coração' (heart) I use: <pre class="prettyprint"><code>mb_internal_encoding('ISO-8859-1'); $str = "coração"; $len = mb_strlen($str,'UTF-8'); for($i=0;$i<$len;++$i) echo mb_substr($str, $i, 1, 'UTF-8')." "; </code></pre> This produces: <pre class="prettyprint"> c o r a ç ã o </pre> This works fine... But my issue is if the use of mb_substr function is not fast as simple string normal access! But I want a simple way to do this.... like in normal string character access: echo $str[$pos].... It is possible?

Try: <pre class="prettyprint"><code>preg_match_all( "/./u", $str, $ar_chars ); print_r( $ar_chars ); </code></pre>

There are simple way to get a character from multibyte string in PHP?

Tags:

multibyte

This is my problem: My language (Portuguese) uses ISO-8859-1 char encoding! When I want access a character from a string like 'coração' (heart) I use:

mb_internal_encoding('ISO-8859-1');
$str = "coração";

$len = mb_strlen($str,'UTF-8');

for($i=0;$i<$len;++$i)
    echo mb_substr($str, $i, 1, 'UTF-8')."<br/>";

This produces:

c
o
r
a
ç
ã
o

This works fine... But my issue is if the use of mb_substr function is not fast as simple string normal access! But I want a simple way to do this.... like in normal string character access: echo $str[$pos].... It is possible?

770

asked Apr 28 '12 05:04

Lucas Batistussi

2 Answers

mb_substr function is not fast as [...] like in normal string character access: echo $str[$pos].... It is possible?

No.

The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)
Premature optimization

The multibyte functions have to check every character to determine how many bytes (1 to 4 in UTF-8) it occupies. There you immediately have the reason why character indexing ($a[n]) won't work: you don't know what byte(s) you need to get the n th character before you've read all characters before that one.

To speed things up a bit, you can look at the answers here: How to iterate UTF-8 string in PHP?

However, since you use ISO 8859-1 or Latin-1, you don't have to use the mb_ functions at all, since in that encoding all characters are encoded in one byte.

170

answered Oct 07 '22 10:10

CodeCaster

Try:

preg_match_all( "/./u", $str, $ar_chars );
print_r( $ar_chars );

answered Oct 07 '22 09:10

tty01

Related questions
                            
                                Android RESTful Web application using Zend Framework
                            
                                how to Reusing a cUrl context after doing a PUT request in PHP?
                            
                                Delimited txt file / strings issue
                            
                                Posting to Facebook Graph Api is slow
                            
                                export chain with openssl_pkcs12_export in PHP
                            
                                PHP imagecreatefromjpeg works, so why doesn't png/bmp/gif work?
                            
                                ThreadPool of CLI Processes
                            
                                Browser doesn't follow redirect from an AJAX response (PHP-generated response is using CAS authentication)
                            
                                How to add breadcrumb?
                            
                                NOW() for DATETIME InnoDB Transaction guaranteed?
                            
                                Deserialize xml to object with Symfony2
                            
                                How to check if the same ID get the same grade for different subjects?
                            
                                What is wrong with my .ctags file?
                            
                                Wordpress in CodeIgniter
                            
                                What happens when connections to MongoDB are not closed?
                            
                                What is the equivalent of PHP's InfiniteIterator in .NET?
                            
                                Create thumbnail image from video in server in php
                            
                                Mandrill giving invalid app key error
                            
                                How do i get out of the habit of procedural programming and into object oriented programming?
                            
                                Removing items from an array if they exist in another array [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

There are simple way to get a character from multibyte string in PHP?

Tags:

string

php

encoding