Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

`write': "\xCF" from ASCII-8BIT to UTF-8 (Encoding::UndefinedConversionError) while writing to file from url

I am getting error:

write': "\xCF" from ASCII-8BIT to UTF-8 (Encoding::UndefinedConversionError)

from line:

open(uri) {|url_file| tempfile.write(url_file.read)}

relevant code is:

require 'tempfile'
require 'open-uri'
require 'uri'
..
uri = URI.parse(@download_link)
tempfile = Tempfile.create(file_name)
open(uri) {|url_file| tempfile.write(url_file.read)}`
..

It runs completely fine if I run it like ruby lib/file.rb, but gives error when I run it in rails environment: rails runner lib/file.rb.

Most questions with this error refer to gem installation scenarios. My guess that I have to include/update some gems, but have no idea which.

like image 565
Vadim Avatar asked Jul 14 '17 17:07

Vadim


3 Answers

This should solve the problem.

data = URI.parse(@download_link).read
tempfile = Tempfile.create(file_name)
tempfile.binmode # This will help deal encoding problem with download files from the internet
tempfile.write(data)

binmode is binary mode

like image 170
newdark-it Avatar answered Oct 17 '22 10:10

newdark-it


Use force_encoding:

open(uri) {|url_file| tempfile.write(url_file.read.force_encoding("UTF-8"))
like image 43
Michael Malov Avatar answered Oct 17 '22 10:10

Michael Malov


Accepted answer is fine, but I think it is worth mentioning that You can also set encoding when creating/opening Tempfile, for example:

Tempfile.new("file.pdf", encoding: 'ascii-8bit') # or 'utf-8'
like image 15
Jakub Kopyś Avatar answered Oct 17 '22 09:10

Jakub Kopyś