Reading csv file with Japanese characters into R

Tags:

I am struggling to have R read in a csv file which has some of its columns in standard English characters, some numerical and some fields in Japanese characters.Here is how the data looks like:

category,desc,otherdesc,volume
UPC - 31401 Age Itameabura,かどや製油　純白ごま油,OIL_OTHERS_SML_ECO,83.0
UPC - 31401 Age Itameabura,オレインリッチ,OIL_OTHERS_MED,137.0
UPC - 31401 Age Itameabura,ＴＶキャノーラ油,OIL_CANOLA_OTHERS_LRG,3026.0

Keeping the R's language setting as English, the japanese characters are converted into some gibberish. When I change the language setting in R to Japanese, Sys.setlocale("LC_CTYPE", "japanese"), I see the file is not read in at all. R gives an error saying:

Error in make.names(col.names, unique = TRUE) : invalid multibyte string at 'ｻcategory'

I have no clue what's wrong with my csv file or the header names. Can you guide me as to how can I go about reading this csv file into R so that everything is displayed just as they do in the csv file?

Thanks! Vish

503

asked Oct 18 '13 17:10

user2895779

1 Answers

For japanese the below works for me:

df <- read.csv("your_file.csv", fileEncoding="cp932")

165

answered Oct 26 '22 16:10

MarKo9

Related questions
                            
                                ggplot: Multiple Lines for one Color/class
                            
                                Different random number generation between OS
                            
                                How to write map reduce in R?
                            
                                Useful book(s) on learning Object Oriented Programming in R? [closed]
                            
                                Number of Observations by Day in R
                            
                                Curved vector graphics using paths
                            
                                Using auto.arima on xts objects
                            
                                How to add javascript in the head of a HTML knitr document?
                            
                                R memory issue with memory.limit()
                            
                                R and snow on amazon EC2 using starcluster
                            
                                How to change .Rprofile location in RStudio
                            
                                Storing a large but low-rank matrix efficiently
                            
                                Run R/Rook as a web server on startup
                            
                                Any pitfalls to using programmatically constructed formulas?
                            
                                R remove duplicate elements in character vector, not duplicate rows
                            
                                Importing *cell-formatting* information from excel file into R
                            
                                Manually set coefficient for new factor level when predicting
                            
                                Use Rcartogram on a SpatialPolygonsDataFrame object
                            
                                enet() works but not when run via caret::train()
                            
                                R ggmap: Why can I create rectangular maps using the filename attribute, but not use them in a plot?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Reading csv file with Japanese characters into R

Tags:

r

csv

locale

multibyte

user2895779

People also ask

1 Answers

MarKo9

Recent Activity

Donate For Us