Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

What is the best programming language for operationalizing research questions with large data sets? [closed]

Tags:

java

sql

vba

I have completed my graduate public policy program but it was not at all tech heavy - some economics and econometrics but not requiring any CS knowledge. A good portion of the research jobs in DC require a basic level of programming knowledge. Mostly they want people who can perform advanced search and retrieval functions with large datasets and save stuff in different formats within their servers. And, they want STATA/stats knowledge, which I have some of.

My question is this: where is the best place to start learning some programming to get to this level? For instance, is Java, SQL, VBA or something else the best thing and most useful for these purposes? And, how much math do I need to write and run simple requests?

Thanks

like image 843
Kevin Avatar asked Nov 24 '10 22:11

Kevin


People also ask

Which of the language is preferred for high volume data analysis?

Very popular in finance and academia, R is a perfect language for data manipulation, processing and visualization, as well as statistical computing and machine learning. Like Python, R has a large community of users and a vast collection of specialized libraries for data analysis.

Which language is used for manipulating data in a big data environment?

“Java is probably the best language to learn for big data for a number of reasons; MapReduce, HDFS, Storm, Kafka, Spark, Apache Beam and Scala (are all part of the JVM (Java Virtual Machine) ecosystem.


2 Answers

My name is Alvaro. I worked as a senior bioinformatician on huge gene databases. Studied Bioinformatics at Harvard.

The script language you need for that is Perl.

Then you need a full understanding of SQL. All of that you can find it on the web.

If you get to be advanced you can also use R Programming language for Statistics. Check the web about the R Project. And also MathLab.

But not all at once!

Forget about Java or VBA for those purposes.

good luck

like image 184
Alvaro Avatar answered Sep 20 '22 12:09

Alvaro


For statistics and database querying/manipulation I would start with SQL.

like image 32
NoAlias Avatar answered Sep 21 '22 12:09

NoAlias