TA 6. How far have we come?

(TA6.08) Data Scientists: What are they?


DP World Hall October 24, 2018 10:45 am - 12:00 pm

Bookmark and Share

Seth Dobrin
Abed Benaichouche
Jorge Sanz Moraleda
Jose Pablo Redondo
Tom Smith


The "data scientist" -- a new breed of quantitative professionals with the training to make discoveries in the world of data and subsequently lead disruptions of entire industries. Data scientists have the skills to analyses data, technical skills to solve complex scientific problems and the curiosity to explore what problems needs to be solved. They are part mathematician, part statistician, part computer scientist, part opportunity spotter. The title has been around for only a few years but this group has become enormously influential. What is the future of this profession? What do job growth in sector look like? Will the disruptors be disrupted? And what impact do they truly have in society -- is it more hype than consequence.

Many questions around would Data Scientist replaced the current statistician jobs. The answer will be discussed in this session, as data scientist role is an branch of the statistician, the role require other skills includes the use of advance Technologies and machine learning. There are unique hard skills require for the job include data mining, machine learning "AI development" and competencies to integrate structured and unstructured data, the other competencies should be around the usage of big data platform and structured query language tools and statistical computing language.

Some also will be confused with other similar roles like data analysts and data engineers which sometimes will have similar education background and skills. Data scientist tend to drive the goals in different way by generating questions themselves instead of using analytical tools only to be able to determine how data can be used to achieve certain goals. There are tons of data collected every minutes and much of times, it is not used or neglected but now adays, studies by Data Scientist shows different dimension and meaningful information and insights from the data.