Fall 2020 Course Descriptions
COMP 152-01 Statstical Bioinformatics in R
The analysis and interpretation of multivariate biological data sets requires an understanding of statistical and computational methods and tools. This course will introduce students to underlying concepts and specific tools for working with several types of high-dimensional biological data using the R programming language. Topics include probabilistic distributions, statistical modeling, hypothesis testing, data visualization and cleaning, supervised and semi-supervised learning, network and graph representations of biomedical data. Applications include RNA-sequencing, metagenomics, phylogenetics, and disease informatics. Learning will be supported through regular computational homework assignments and a final project including computational and written components and a project presentation.
Prerequisite: Comp 15 or equivalent, or graduate standing (with permission of instructor); some prior experience in R.