By day I write C#. Everything I do goes through Microsoft code analysis and static analysis tools, so my C# has a very regular structure and layout. Obviously, I write code with a certain style. It is partially, because I have no choice (it won't compile if I miss out a space before that comma), but it's also nice to have regular looking code, knowing where to look for things, etc.
At the weekends I'm getting into Scala. Looking at the Scala API and Lift web framework source, I can't obviously see any standardised style. One thing that jumps out at me, for example, is the lack of a separate file per class. The lack of consistency with brackets and braces is another example.
I understand that there are probably a few reasons driving this: firstly, with open source (or hobby) code making sure that an obvious method isn't completely documented is less of a priority. Secondly, things like case classes cut down 20-line class declarations into a single line. Thirdly, C# is a much 'flatter' language: unless it's a complex LINQ statement, the numbers of nested parens, braces and brackets isn't that deep. In Scala, things tend to get a little nested.
Do regular Scala users have a specific style that they stick to? Am I just being stupid putting a one-line case-class in its own file for the sake of [alien] convention? Any tips?
We mostly follow Java's and Scala's standard naming conventions. Classes, traits, objects should follow Java class convention, i.e. PascalCase style. Packages should follow Java package naming conventions, i.e. all-lowercase ASCII letters. Methods/functions should be named in camelCase style.
=> is the "function arrow". It is used both in function type signatures as well as anonymous function terms. () => Unit is a shorthand for Function0[Unit] , which is the type of functions which take no arguments and return nothing useful (like void in other languages).
Naming Conventions:- Scala uses “camel case” naming. That is, each word is capitalized, except possibly the first word. Scala prefer to not to use the Underscore in names because Scala has different definition of underscore.
Scala functions are first class values. Difference between Scala Functions & Methods: Function is a object which can be stored in a variable. But a method always belongs to a class which has a name, signature bytecode etc. Basically, you can say a method is a function which is a member of some object.
Having multiple classes and objects inside a single file is considered good form in Scala, as long as the classes are tightly related.
While not necessary, the type returned by a method -- a named function declared on a trait, class or object -- is expected to be declared for non-private methods. Spaces are expected after :
, but not before it.
// methods declared on a class, trait or object def length: Int = ... def multiply(other: Foo): Foo = ... def hypotenuse(a: Double, b: Double): Double = { // function inside a method, so effectively private def square(x: Double) = x * x math.sqrt(square(a) + square(b)) }
Spaces are expected between keywords and parenthesis, but not between a method name and the following parenthesis, in dot notation. For operator notation, there doesn't seem to be an accepted style regarding parenthesis -- or when to use that notation, for that matter, but spaces are expected around non-alphanumeric methods in such notation.
// keywords if (foo) ... // dot notation foo.doSomething(bar) // operator notation foo doSomething bar foo + bar
Exceptionally, when concatenating strings with +
, the recommended style is not to use spaces around it. For example:
// concatenate strings println("Name: "+person.name+"\tAge: "+person.age)
Declarations that can be one-liners are expected to be one-liners, unless the nesting isn't obvious.
// one-liners lazy val foo = calculateFoo def square(x: Int) = x * x
Methods that do not expect parameters, and do not have side effects, are supposed to be used without parenthesis, except for Java methods, which are expected to be used with parenthesis. Parameter-less methods with side effects are supposed to be used with parenthesis.
// without side-effects val x = foo.length val y = bar.coefficient // with side-effects foo.reverse()
Declarations which contains a single expression are expected not to be enclosed inside curly braces unless other syntactic considerations make that impossible. Enclosing an expression within parenthesis to enable multi-line expressions is accepted, but I have seen little use of that.
// single-line expression def sum(list: List[Int]): Int = if (!list.isEmpty) list reduceLeft (_ + _) else 0 // multi-line expression val sum = ( getItems reduceLeft (_ + _) )
In for-comprehensions, keeping generators and conditions vertically aligned seems to be an accepted style. As for yield
, I have seen it both aligned with for
and indented.
// for-comprehensions val squares = for (x <- numbers) yield x * x // Curly brackets-style identation val cells = for { x <- columns y <- rows if x != y } yield Cell(x, y) // Parameter-style identation val cells = for (x <- columns; y <- rows; if x != y) yield Cell(x, y)
It's also accepted style to vertically align parameters of a class declaration.
Speaking of indentation, two-spaces is the accepted convention.
Curly braces are expected to start on the same line of the declaration, and end vertically aligned with that line by itself.
// another example def factorial(n: Int): Int = { def fact(n: Int, acc: Int): Int = n match { case 0 => acc case x => fact(x - 1, x * acc) } fact(n, 1) }
For procedures -- functions whose return type is Unit
--, the expected style was supposed to be to leave out the type of the method and the equal sign:
// procedures def complain { println("Oh, no!") }
Some people think this style is error prone, however, as a missed equal sign will change a function returning something other than Unit
into a procedure.
Identifiers are written in camel case (eg: identifiersHaveHumps
), like in Java. For names of fields, method parameters, local variables and functions, start with a lower case letter. For classes,traits and types, start with an upper case letter.
Departing from Java convention are constant names. In Scala, the practice is to use standard camel case starting with an upper case letter. For example Pi
and not PI
, XOffset and not X_OFFSET
. This rule is usually followed by any singleton. Having a constants and singletons be represented that way has a practical consequence, for case matches:
import scala.Math.Pi val pi = Pi // this identifier will be shadowed by the identifier in the function below def isPi(n: Double): Boolean = n match { case Pi => println("I got a true Pi."); true case pi => println("I got "+pi+" and bounded it to an identifier named pi."); false }
Package names are written beginning with a lower case letter. This is particularly helpful when distinguishing in an import statement what is a package and what is not. In the previous example, Math
is not a package (it's a singleton), as it begins with an upper case letter.
Using the underline character -- _
-- is not recommended, as that character has many special meanings in Scala. These rules for identifiers can be found on pages 141 and 142 of Programming in Scala, by Odersky, Spoon & Venners.
Right now, I can't recall other situations, but feel free to ask for clarification on specific points. Some of these rules were explicitly stated, others are more of a community consensus. I tried to leave out my own preferences, but I may have failed.
More importantly, perhaps, there just isn't really much of an unified convention. One reason for that may be that Scala is attracting people from many very different backgrounds, such as functional language mavens, Java programmers and web 2.0 enthusiasts.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With