
R: Read abbreviated month from a date that is not in English

Tags:

date

r

as.date

I have a text file that includes dates, and I want to convert it to a data table.

Dates such as 03-FEB-2011 can be converted with

data$fecha <- as.Date(data$textDate, format = "%d-%b-%Y")

The problem is that the column is in Spanish, so I don't get Jan but Ene, or Aug but Ago. How can I change the locale so the %b abbreviation works for Spanish? Is there any other way to achieve this?

Maria Velasco asked Aug 17 '15 18:08


2 Answers

Following up on my earlier comment, here is a complete and tested answer. As I said, you have to set your locale to the one that matches your data (in this case, Spanish).

The code that allows you to do that is the following:

Sys.setlocale(locale="es_ES.UTF-8")

You can see the complete list of available locales with system("locale -a", intern = TRUE) (I am not sure whether this works well on Windows systems).

Here is an example:

x <- c("03-Ago-2011", "21-Ene-2012")
as.Date(x, format = "%d-%b-%Y")
[1] "2011-08-03" "2012-01-21"
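
If you would rather not change the locale for the whole session, one option is to switch only LC_TIME for the duration of the call and restore it afterwards. This is a minimal sketch: parse_with_locale is a hypothetical helper name (not part of base R), and it assumes the requested locale is installed on the system.

```r
# Hypothetical helper: parse dates under a temporary LC_TIME locale.
# Assumes `locale` names a locale that is installed on this system.
parse_with_locale <- function(x, format, locale) {
  old <- Sys.getlocale("LC_TIME")
  # Restore the previous time locale when the function exits,
  # whether it succeeds or fails.
  on.exit(Sys.setlocale("LC_TIME", old), add = TRUE)

  # Sys.setlocale() returns "" (with a warning) when the request
  # cannot be honored, so turn that into an explicit error.
  res <- suppressWarnings(Sys.setlocale("LC_TIME", locale))
  if (identical(res, "")) {
    stop("locale not available on this system: ", locale)
  }

  as.Date(x, format = format)
}
```

On a system where es_ES.UTF-8 is installed, parse_with_locale(c("03-Ago-2011", "21-Ene-2012"), "%d-%b-%Y", "es_ES.UTF-8") should give the same result as the example above while leaving the session locale unchanged.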
SabDeM answered Sep 24 '22 05:09


If you can't add locales to your OS,

> Sys.setlocale(locale = "es")
[1] ""
Warning message:
In Sys.setlocale(locale = "es") :
OS reports request to set locale to "es" cannot be honored

the readr package has ways to specify and even create locales:

> library(readr) 
> parse_date("31 DICIEMBRE 2011","%d %B %Y",locale=locale("es"))
[1] "2011-12-31"
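
If neither installing a locale nor adding a package is an option, the month abbreviations can also be translated by hand in base R. The following is only a sketch under stated assumptions: the names meses and parse_spanish_date are hypothetical, the input is assumed to be day-month-year separated by hyphens, and the months are assumed to use the three-letter abbreviations listed below.

```r
# Spanish month abbreviations in calendar order
# (assumed three-letter forms; adjust if your data differs).
meses <- c("ene", "feb", "mar", "abr", "may", "jun",
           "jul", "ago", "sep", "oct", "nov", "dic")

# Hypothetical helper: parse "dd-mmm-yyyy" strings with Spanish
# month abbreviations, independent of the session locale.
parse_spanish_date <- function(x) {
  parts <- strsplit(tolower(x), "-", fixed = TRUE)
  day   <- vapply(parts, `[`, character(1), 1)
  mon   <- vapply(parts, `[`, character(1), 2)
  year  <- vapply(parts, `[`, character(1), 3)

  # Month number is the position in `meses`; unknown names become NA.
  month <- match(mon, meses)

  # Rebuild a purely numeric date string, which needs no locale.
  as.Date(sprintf("%s-%02d-%s", year, month, day), format = "%Y-%m-%d")
}

parse_spanish_date(c("03-Ago-2011", "21-Ene-2012"))
# [1] "2011-08-03" "2012-01-21"
```

Because the month is converted to a number before as.Date() is called, the result does not depend on which locales the OS provides. Strings with an unrecognized month come back as NA.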
Juan Zuluaga answered Sep 23 '22 05:09