Bridging Statistics and Data Science
With the recent big data revolution, enterprises ranging from FORTUNE 500 to startups across the US are using Data Science to bring valuable business insight from all the data collected. Statisticians are great data scientist candidates, but there are relatively few data scientists with statistics education background. This course aims to bridge the gap between Statistics and Data Science. Data science is a combination of science and art with data as the foundation. We will cover both the science part and the art part (such as data science project flow, general pitfalls in data science projects, and soft skills to communicate with business stakeholders effectively). The course will be hands-on and we will use the Databricks community edition cloud platform and R-Studio to illustrate programming, big data platform usage (such as Spark) and standard machine learning algorithms.
Dr. Ming Li(Amazon)
Dr. Hui Lin(DowDuPont)
Dr. Hui Lin is currently a Data Scientist at DowDuPont. She is the leader in the company at applying advanced data science to enhance Marketing and Sales Effectiveness. She has been providing statistical leadership for a broad range of predictive analytics and market research analysis since 2013. She is a co-founder of the Central Iowa R User Group, blogger of scientistcafe.com and 2018 Program Chair for the ASA Section on Statistics in Marketing. She enjoys making analytics accessible to a broad audience and teaches tutorials and workshops for practitioners on data science. She holds M.S. and Ph.D. degrees in statistics from Iowa State University, and a B.S. in mathematical statistics from Beijing Normal University.