For data scientists, the big money is in open source - TechRepublic
▻http://www.techrepublic.com/blog/big-data-analytics/data-scientists-can-find-big-money-in-open-source/#ftag=RSS56d97e7
As O’Reilly’s 2013 Data Science Salary Survey suggests, “ the field of big data has ushered in the arrival of new, complex tools that relatively few people understand or have even heard of. ” Knowing those tools is what yields such outsized salaries.
But which tools a data scientist masters turns out to have a large, material impact on her earning power.
The top data tool by far is SQL, which isn’t surprising: data analysis has been around long before we gave it a sexy “data science” label, and accessing data through SQL has long been the standard for data analysis. This isn’t changing overnight.
But once we move beyond SQL, it’s telling just how much of the most widely used Big Data tools are open source: R, Python, Hadoop and more. More interestingly, however, is the bifurcation between what O’Reilly calls “the Hadoop group”(orange) and the “SQL/Excel group” (blue):
Et dans le bloc Hadoop, les data scientists ont tendance à connaître plus d’outils, donc à être mieux payés. En plus, avec Hadoop, ils peuvent évoluer vers des données vraiment grosses.