Let's say I have a column of strings like this:
Hour
0045
2322
And I want it to become like this:
Hour
00:45
23:22
The goal is to then turn it into a timestamp. How would I go about it?
By using the PySpark SQL function regexp_replace() you can replace a matched substring in a column with another string. regexp_replace() uses Java regular expressions for matching; if the pattern does not match, the value is returned unchanged. A typical use is replacing the street abbreviation Rd with Road in an address column, as sketched below.
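A minimal sketch of that kind of replacement (the address data here is invented purely for illustration):

from pyspark.sql import SparkSession
from pyspark.sql.functions import col, regexp_replace

spark = SparkSession.builder.getOrCreate()

# Toy address data, only to illustrate regexp_replace()
df_addr = spark.createDataFrame([("14 Main Rd",), ("22 Lake Rd",)], ["address"])

# Replace the substring "Rd" with "Road" in the address column
df_addr.withColumn("address", regexp_replace(col("address"), "Rd", "Road")).show()
# +------------+
# |     address|
# +------------+
# |14 Main Road|
# |22 Lake Road|
# +------------+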
If you would like to add a prefix or suffix to multiple column names in a PySpark DataFrame, you can use a for loop with .withColumnRenamed(), iterating over the DataFrame's columns and renaming each one, as in the sketch below.
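A hedged sketch that prefixes every column name with raw_ (the sdf DataFrame here is a stand-in for your own):

from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Toy DataFrame standing in for your own sdf
sdf = spark.createDataFrame([(1, "a")], ["id", "value"])

# Rename every column by adding a "raw_" prefix
for c in sdf.columns:
    sdf = sdf.withColumnRenamed(c, f"raw_{c}")

print(sdf.columns)  # ['raw_id', 'raw_value']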
PySpark SQL also provides the split() function to convert a delimiter-separated string into an array column (StringType to ArrayType) on a DataFrame. It splits the string column on a delimiter such as a space, comma, or pipe, for example:
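A small sketch of split() on a comma-delimited column (the sample data is made up for illustration):

from pyspark.sql import SparkSession
from pyspark.sql.functions import col, split

spark = SparkSession.builder.getOrCreate()

# Comma-separated string column
df_csv = spark.createDataFrame([("a,b,c",)], ["letters"])

# split() turns the StringType column into an ArrayType column
df_csv.withColumn("letters_arr", split(col("letters"), ",")).show(truncate=False)
# +-------+-----------+
# |letters|letters_arr|
# +-------+-----------+
# |a,b,c  |[a, b, c]  |
# +-------+-----------+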
You can use regexp_replace:

from pyspark.sql.functions import col, regexp_replace

df.withColumn("Hour", regexp_replace(col("Hour"), r"(\d{2})(\d{2})", "$1:$2")).show()
+-----+
| Hour|
+-----+
|00:45|
|23:22|
+-----+
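Since the end goal is a timestamp, one way to finish (a sketch, assuming the values are times of day with no date component, so the date part defaults to 1970-01-01 depending on your Spark version's parsing behavior) is to parse the reformatted column with to_timestamp:

from pyspark.sql.functions import col, regexp_replace, to_timestamp

# df is the DataFrame from the question, with the "Hour" column as "HHmm" strings
df = (
    df.withColumn("Hour", regexp_replace(col("Hour"), r"(\d{2})(\d{2})", "$1:$2"))
      .withColumn("Hour_ts", to_timestamp(col("Hour"), "HH:mm"))
)
df.show()
# +-----+-------------------+
# | Hour|            Hour_ts|
# +-----+-------------------+
# |00:45|1970-01-01 00:45:00|
# |23:22|1970-01-01 23:22:00|
# +-----+-------------------+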