Data Science

Posted by: 赵伟翔

Published: 2025-03-28

Data Science (48 hours/3 credits)

 

Course Description

This course introduces the fundamental concepts and methods of modern data science. Students will learn a set of tools for handling real, messy data and how to choose an appropriate method for a given problem. Topics include data structures; regression models, including lasso regression, ridge regression, and spline regression for non-linear relationships; classification models, including logistic regression, linear discriminant analysis, support vector machines, and random forests; and unsupervised learning methods such as principal component analysis, k-means clustering, and hierarchical clustering. Hands-on computer lab work is carried out mainly in R.