I run a Python script that generates a string and then executes a shell script using that string. I want to check the encoding of that string from the Linux shell, but without writing the string to a file (disk operations are slow). Is it possible to check the encoding of a string in Linux (Ubuntu) using only RAM? Something like:
check-encoding 'My string with random encoding'
Checking the encoding in Python is too slow as well.
Check your file encoding: to see a file's current encoding, use the command below, replacing <filename> with the desired file. Convert your file encoding: once you know your file's encoding, you can convert the source file to a new one with the desired encoding.
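A minimal sketch of both steps, using file to detect and iconv to convert (the encodings and the .utf8 output name are examples):
:~ $ file -i <filename>
<filename>: text/plain; charset=iso-8859-1
:~ $ iconv -f ISO-8859-1 -t UTF-8 <filename> > <filename>.utf8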
Is it possible to force the shell (bash or sh) to detect the correct script encoding (something similar to the Python or Ruby encoding cookie)? The solution should aim for portability, so it is not necessary to stick with bash. EDIT: maybe I've found a possible solution using a recursive script call:
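One way such a recursive call can look, as a hypothetical sketch (the locale name en_US.UTF-8 is an assumption): the script re-executes itself once under a UTF-8 locale whenever the current one is not UTF-8.
#!/bin/sh
# Hypothetical sketch: force a UTF-8 locale by re-executing the script.
case "${LC_ALL:-${LC_CTYPE:-$LANG}}" in
  *[Uu][Tt][Ff]-8* | *[Uu][Tt][Ff]8*) ;;    # already UTF-8, carry on
  *) LC_ALL=en_US.UTF-8 exec "$0" "$@" ;;   # assumed available locale
esac
echo "interpreting with LC_ALL=${LC_ALL:-unset}"
This stays within POSIX sh, which fits the portability requirement.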
If you're talking about XML files (ISO-8859-1), the XML declaration inside them specifies the encoding: <?xml version="1.0" encoding="ISO-8859-1" ?>. So you can use regular expressions (e.g., with Perl) to check every file for such a specification. More information can be found here: How to Determine Text File Encoding.
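A sketch of such a check as a Perl one-liner (file.xml is a placeholder; declarations that use single quotes would need a slightly broader pattern):
:~ $ perl -ne 'print "$1\n" and last if /<\?xml[^>]*encoding="([^"]+)"/;' file.xml
ISO-8859-1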
For a more exhaustive check, you can test against every encoding PHP supports via mb_list_encodings(), and you can list all files in a directory and its subdirectories together with their detected encodings. Remember that a naive loop over filenames breaks on paths containing spaces unless you change your current Bash session's word splitting (IFS).
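In plain shell, a sketch of that directory-wide listing (the file names shown are examples); using find's -exec sidesteps the whitespace problem entirely, since no word splitting happens on the paths:
:~ $ find . -type f -exec file -i {} +
./notes.txt:     text/plain; charset=us-ascii
./docs/тест.txt: text/plain; charset=utf-8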
Try the file utility. You can pass any string to file by piping it from echo and giving - as the file argument (many commands accept a hyphen (-) in place of a filename to indicate that input should come from stdin rather than a file):
:~ $ echo "test" | file -i -
/dev/stdin: text/plain; charset=us-ascii
:~ $ echo "тест" | file -i -
/dev/stdin: text/plain; charset=utf-8
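Note that echo appends a trailing newline to the string; if you want to check exactly the bytes of the string as generated, printf avoids it (a minor variation on the same idea):
:~ $ printf '%s' "тест" | file -i -
/dev/stdin: text/plain; charset=utf-8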
With a pipe to sed:
:~ $ echo "тест" | file -i - | sed 's/.*charset=\(.*\)/\1/'
utf-8
Or to awk (you can of course combine them):
:~ $ echo "тест" | file -i - | awk '{ print $3 }'
charset=utf-8
You can also use the Python chardet module. chardet comes with a command-line script, chardetect, which reports the encodings of one or more files. Just install it with:
pip install chardet
and use it with a pipe from echo:
:~ $ echo "тест" | chardetect
<stdin>: utf-8 with confidence 0.938125
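Since the string is generated in Python in the first place, you can also call chardet's detect() function directly and skip the extra process; a sketch that prints just the detected encoding:
:~ $ echo "тест" | python3 -c 'import sys, chardet; print(chardet.detect(sys.stdin.buffer.read())["encoding"])'
utf-8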