Storage I – The Data Storage Saga

Ten years ago no one cared about data. Now data scientist is the “Sexiest job of the 21st century”. How did we get here? What came before Hadoop, NoSQL and all the other big data buzzwords of the last decade? 1st Gen: Hierarchical Databases “The introduction of the term database coincided with the availability…

Big Data V – Learning The Lingo

50% of any job is just being able to talk the talk. What does that mean for data science? Firstly, recall that data science is an umbrella term housing several different disciplines: Statistics, Computer Science, Data Storage, Machine Learning/Artificial Intelligence, and Business Analytics/Business Intelligence. 1. Data Science Key Terms With the recent hype around data science,…

Big Data IV – Unicorn Or Safety In Numbers?

“…The CEO calmly responds, ‘I want GOD! I want a rockstar programmer who has… built a … big data platform, and has started a company!’ Dmitri respectfully responds back: ‘I wish you the best of luck finding that person.’” That’s the punchline of an entertaining and eye-opening 2015 article by Harlan Harris, “Analyzing…

Big Data III – Degree or Not Degree?

US comedian JP Sears weighs in [1] on the usefulness of tertiary education in the modern world: given rising student loans and an oversupply of graduates, does a degree still provide a competitive edge in the digital age of democratised information (like Wikipedia), search engines (like Google & YouTube), and MOOCs (Massive Open Online Courses)? He’s…

Big Data II – That’s Not A Tool

Python vs Java? R vs SAS? Hadoop vs Spark vs Flink? Which programming languages and software packages do I really need for data science? There are so many competing to do the same thing; is one of each enough or should I learn a variety? Following on from our last post investigating data scientist skills…

Big Data I – Tools of The Trade

So who’s employing data scientists in 2018 and what are they after? Let’s analyse 10 US, 10 UK and 10 Australian data science jobs from indeed.com and tally the desired data science skills and tools (software). 1. Data Science Skills 2. Data Science Tools 3. Years Experience (Scatter Plots) Discussion Some of the categorizations…

Life, The Universe & Everything: 42 or 4He2?

In 1979, Douglas Adams published ‘The Hitchhiker’s Guide to the Galaxy’, a science fiction novel featuring a supercomputer named Deep Thought. Deep Thought’s purpose was to answer the ultimate question of life, the universe and everything. After 7.5 million years, it had completed its data mining and machine learning mission and arrived at the answer of…