The 10 pioneering data scientists listed here were identified as top data scientists in our previous article, "data science equation", based on their LinkedIn profiles. For each pioneer, we computed the number of endorsements for each of the top four data science related skills: analytics, big data, data mining, and machine learning. These skills were identified in our previous article as the ones most strongly linked to data science. We then normalized the counts so that each skill is expressed as a share between 0 and 1 and, for each individual, the shares across the four skills sum to 1 (100%). Because every individual is now measured on the same scale, regardless of how many endorsements they received in total, our classification problem becomes easier.
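The normalization step described above can be sketched in a few lines of Python. This is only an illustration: the function name `normalize_endorsements` and the counts in `example_counts` are hypothetical and are not the actual endorsement figures.

```python
# The four skills considered in the article.
SKILLS = ("analytics", "big data", "data mining", "machine learning")


def normalize_endorsements(counts):
    """Turn raw endorsement counts into shares that sum to 1 for one person."""
    total = sum(counts[skill] for skill in SKILLS)
    if total == 0:
        raise ValueError("cannot normalize a profile with no endorsements")
    return {skill: counts[skill] / total for skill in SKILLS}


# Hypothetical counts for one individual (made up for illustration only).
example_counts = {
    "analytics": 40,
    "big data": 10,
    "data mining": 30,
    "machine learning": 20,
}

shares = normalize_endorsements(example_counts)
for skill, share in shares.items():
    print(f"{skill}: {share:.2f}")
```

Each share lies between 0 and 1, and the four shares for one individual add up to 1, so two people with very different endorsement totals can be compared directly.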
Note that, across these individuals, the correlation between machine learning and analytics is strongly negative (-0.82). Likewise, the correlation between big data and data mining is strongly negative (-0.80). All other cross-skill correlations are negligible.
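Cross-skill correlations of this kind could be computed along the following lines. This is a sketch that assumes the Pearson correlation coefficient was used, which the article does not state. The helper names `pearson` and `skill_correlations` and the three profiles in `example_profiles` are hypothetical, so the printed values will not match the figures quoted above.

```python
from itertools import combinations
from math import sqrt

SKILLS = ("analytics", "big data", "data mining", "machine learning")


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


def skill_correlations(profiles):
    """Correlation for every pair of skills, computed across individuals."""
    columns = {skill: [p[skill] for p in profiles] for skill in SKILLS}
    return {
        (a, b): pearson(columns[a], columns[b])
        for a, b in combinations(SKILLS, 2)
    }


# Hypothetical normalized profiles (each row sums to 1); not the real data.
example_profiles = [
    {"analytics": 0.40, "big data": 0.10, "data mining": 0.30, "machine learning": 0.20},
    {"analytics": 0.15, "big data": 0.35, "data mining": 0.10, "machine learning": 0.40},
    {"analytics": 0.30, "big data": 0.20, "data mining": 0.25, "machine learning": 0.25},
]

correlations = skill_correlations(example_profiles)
for (a, b), r in correlations.items():
    print(f"{a} vs {b}: {r:+.2f}")
```

Four skills give six distinct pairs. When reading such a table, keep in mind that shares constrained to sum to 1 cannot all rise together, so some negative correlation between skills is expected from the normalization alone.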