How can I pretty print a data frame in Zeppelin/Spark/Scala?

I am using Spark 2 and Scala 2.11 in a Zeppelin 0.7 notebook. I have a dataframe that I can print like this:

dfLemma.select("text", "lemma").show(20,false)

and the output looks like:

+---------------------------------------------------------------------------------------------------------------------------+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------+
|text                                                                                                                       |lemma                                                                                                                                                                  |
+---------------------------------------------------------------------------------------------------------------------------+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------+
|RT @Dope_Promo: When you and your crew beat your high scores on FUGLY FROG 😍🔥 https://time.com/Sxp3Onz1w8                    |[rt, @dope_promo, :, when, you, and, you, crew, beat, you, high, score, on, FUGLY, FROG, https://time.com/sxp3onz1w8]                                                      |
|RT @axolROSE: Did yall just call Kermit the frog a lizard?  https://time.com/wDAEAEr1Ay                                        |[rt, @axolrose, :, do, yall, just, call, Kermit, the, frog, a, lizard, ?, https://time.com/wdaeaer1ay]                                                                     |

I am trying to make the output nicer in Zeppelin, by:

val printcols = dfLemma.select("text", "lemma")
println("%table " + printcols)

which gives this output:

printcols: org.apache.spark.sql.DataFrame = [text: string, lemma: array<string>]

and a new blank Zeppelin paragraph headed

[text: string, lemma: array]

Is there a way of getting the dataframe to show as a nicely formatted table? TIA!

asked Jul 06 '17 10:07

schoon

1 Answer

In Zeppelin you can use z.show(df) to show a pretty table. Here's an example:

val df = Seq(
  (1,1,1),
  (2,2,2),
  (3,3,3)
).toDF("first_column", "second_column", "third_column")

z.show(df)
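As for why the println("%table " + printcols) attempt in the question only showed the schema: concatenating a DataFrame to a string calls its toString, which returns the schema text rather than the rows. Zeppelin's %table display expects a header line followed by one line per row, with cells separated by tabs. If you want to build that text yourself instead of calling z.show, a rough sketch could look like the following. TableText and toTable are hypothetical names made up for this illustration, not part of Zeppelin or Spark, and the helper is plain Scala so it can be tried without a cluster.

```scala
// Hypothetical helper (not part of Zeppelin or Spark): builds the text that
// Zeppelin's %table display system expects, i.e. a header line followed by
// one line per row, with cells separated by tabs.
object TableText {
  // Tabs and newlines inside a cell would break the layout,
  // so collapse them into a single space. Nulls are shown as "null".
  private def clean(cell: Any): String = {
    val text = if (cell == null) "null" else cell.toString
    text.replaceAll("[\\t\\r\\n]+", " ")
  }

  def toTable(header: Seq[String], rows: Seq[Seq[Any]]): String = {
    val headerLine = header.map(clean).mkString("\t")
    val rowLines = rows.map(_.map(clean).mkString("\t"))
    "%table " + (headerLine +: rowLines).mkString("\n")
  }
}

// In a notebook with Spark available, the inputs could come from the
// DataFrame in the question (assumption: only the first 20 rows are wanted):
//   val printcols = dfLemma.select("text", "lemma")
//   println(TableText.toTable(printcols.columns, printcols.take(20).map(_.toSeq)))
```

Note that an array column such as lemma is rendered with its own toString in this sketch, so z.show(df) is still the simpler option when it is available.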

answered Sep 22 '22 16:09

Daniel de Paula
