Data Scientist

Mathematics and Statistics
    Linear Algebra
        Vectors
        Matrices
        Matrix Operations
        Matrix Decomposition
    Probability and Statistics
        Probability Theory
        Statistical Inference
        Hypothesis Testing
        Regression Analysis
    Calculus
        Differentiation
        Integration
        Optimization

Programming and Data Manipulation
    Programming Languages
        Python
        R
        SQL
    Data Manipulation
        Data Cleaning
        Data Transformation
        Data Aggregation
        Data Wrangling
    Database Management
        Relational Databases
        SQL Querying
        Indexing and Optimization

Machine Learning
    Supervised Learning
        Linear Regression
        Logistic Regression
        Decision Trees
        Random Forests
    Unsupervised Learning
        Clustering
        Dimensionality Reduction
        Association Rule Learning
        Recommender Systems
    Deep Learning
        Neural Networks
        Convolutional Neural Networks
        Recurrent Neural Networks
        Generative Adversarial Networks

Data Visualization and Communication
    Data Visualization
        Data Exploration
        Charts and Plots
        Interactive Visualizations
        Dashboarding
    Communication Skills
        Data Storytelling
        Presentation Skills
        Data Reporting
        Stakeholder Engagement

Big Data and Distributed Systems
    Hadoop
        HDFS
        MapReduce
        Hive
        Pig
    Spark
        Spark Core
        Spark SQL
        Spark Streaming
        Spark MLlib
        Spark GraphX