How to strip color codes used by mIRC users?

Tags: python, irc

I'm writing an IRC bot in Python using irclib, and I'm trying to log the messages on certain channels.
The issue is that some mIRC users and some bots write using color codes.
Any idea how I could strip those parts and leave only the plain ASCII text of the message?

700

asked Jun 09 '09 14:06

daniels

4 Answers

Regular expressions are your cleanest bet in my opinion. If you haven't used them before, a regular expression tutorial is a good place to start. For the full details, see the documentation for Python's re module.

import re
regex = re.compile(r"\x03(?:\d{1,2}(?:,\d{1,2})?)?", re.UNICODE)

The regex searches for ^C (which is \x03 in ASCII; you can confirm this by evaluating chr(3) in the Python interpreter), then optionally matches one or two digits [0-9], optionally followed by a comma and another one or two digits.

(?: ... ) is a non-capturing group: it says not to store what was matched inside the parentheses (as we don't need to backreference it). ? means to match 0 or 1 times, and {n,m} means to match n to m repetitions of the preceding item. Finally, \d means to match [0-9].

The rest can be decoded using the resources mentioned above.

>>> regex.sub("", "blabla \x035,12to be colored text and background\x03 blabla")
'blabla to be colored text and background blabla'

chaos' solution is similar, but it may consume more than two digits and will not remove any loose ^C characters that may be left over (such as the one that closes the colour command).
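As a rough illustration of those two points, the sketch below wraps the pattern in a small helper. The name strip_colors is hypothetical and not part of irclib, and the sample strings are made up for the demonstration.

```python
import re

# Same pattern as above, written as a raw string so that \d reaches
# the regex engine unchanged.
COLOR_RE = re.compile(r"\x03(?:\d{1,2}(?:,\d{1,2})?)?")


def strip_colors(message):
    """Remove ^C color codes (with optional fg[,bg] numbers) from message."""
    return COLOR_RE.sub("", message)


# At most two digits are consumed, so digits that belong to the text survive.
print(repr(strip_colors("\x0304123 apples")))  # '123 apples'
# A bare ^C that closes a colored run is removed as well.
print(repr(strip_colors("red\x03 plain")))     # 'red plain'
# A comma that is not followed by a digit is left in the text.
print(repr(strip_colors("\x034,hello")))       # ',hello'
```

Compiling the pattern once and reusing it keeps the per-message cost low, which matters for a bot that logs every line.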

140

answered Sep 30 '22 09:09

Smerity

The second-rated and following suggestions are defective: they look for digits after some other control character rather than after the color code character.

I have combined and improved on all the posts, with the following results:

the reverse character is removed
color codes are removed without leaving digits in the text

Solution:

import re
regex = re.compile(r"\x1f|\x02|\x12|\x0f|\x16|\x03(?:\d{1,2}(?:,\d{1,2})?)?", re.UNICODE)
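A minimal sketch of how this combined pattern behaves on a message that mixes several formatting codes. The helper name strip_formatting and the sample message are assumptions made for illustration. The re.UNICODE flag is left out here because it is already the default for str patterns in Python 3.

```python
import re

# The combined pattern: single-character formatting codes, plus ^C with
# its optional foreground[,background] numbers as the last alternative.
FORMAT_RE = re.compile(r"\x1f|\x02|\x12|\x0f|\x16|\x03(?:\d{1,2}(?:,\d{1,2})?)?")


def strip_formatting(message):
    """Remove the formatting and color codes matched by FORMAT_RE."""
    return FORMAT_RE.sub("", message)


sample = "\x02bold\x02 \x1funderlined\x1f \x0304,01red on black\x0f 42 left alone"
print(strip_formatting(sample))
# bold underlined red on black 42 left alone
```

Digits that are not directly attached to a ^C, such as the 42 above, are not touched.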

answered Sep 30 '22 09:09

frederik

As I found this question useful, I figured I'd contribute.

I added a couple of things to the regex:

regex = re.compile(r"\x1f|\x02|\x03|\x16|\x0f(?:\d{1,2}(?:,\d{1,2})?)?", re.UNICODE)

\x16 removes the "reverse" character. \x0f removes the "reset formatting" character.
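One thing worth checking with a pattern like this is how the alternation binds: the optional digit group belongs only to the alternative it directly follows. The sketch below compares the pattern as posted with a reordered variant. The variable names and the reordering are assumptions for illustration, not part of the original answer.

```python
import re

# Pattern as posted: the optional digit group is attached to \x0f,
# the last alternative, so \x03 is matched on its own.
posted = re.compile(r"\x1f|\x02|\x03|\x16|\x0f(?:\d{1,2}(?:,\d{1,2})?)?")

# Reordered variant: \x03 is moved last so the digit group belongs to it.
reordered = re.compile(r"\x1f|\x02|\x16|\x0f|\x03(?:\d{1,2}(?:,\d{1,2})?)?")

text = "\x035,12colored\x03 plain"
print(repr(posted.sub("", text)))     # '5,12colored plain'
print(repr(reordered.sub("", text)))  # 'colored plain'
```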

answered Sep 30 '22 09:09

Xorlev

AutoDl-irssi had a very good one written in Perl; here it is in Python:

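import re

def stripMircColorCodes(line):
    # Remove color codes that carry both foreground and background numbers
    line = re.sub(r"\x03\d\d?,\d\d?", "", line)
    # Remove color codes that carry only a foreground number
    line = re.sub(r"\x03\d\d?", "", line)
    # Remove all remaining control characters, including a bare ^C
    line = re.sub(r"[\x01-\x1F]", "", line)
    return line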
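The last substitution above removes every control character in the range \x01-\x1F, which includes tab (\x09). If tabs should survive in the log, a variant along these lines might work. The function name and the decision to keep tabs are assumptions, not something taken from AutoDl-irssi.

```python
import re

# ^C with optional foreground[,background] numbers.
COLOR_RE = re.compile(r"\x03(?:\d{1,2}(?:,\d{1,2})?)?")
# Control characters \x01-\x1f, with tab (\x09) excluded from the range.
CONTROL_RE = re.compile(r"[\x01-\x08\x0a-\x1f]")


def strip_codes_keep_tabs(line):
    """Hypothetical variant: strip color and control codes but keep tabs."""
    line = COLOR_RE.sub("", line)
    return CONTROL_RE.sub("", line)


print(repr(strip_codes_keep_tabs("\x0304,01alert\x0f\tdone\x02!")))
# 'alert\tdone!'
```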

answered Sep 30 '22 08:09

sparks
