BIOSTAT821

Download as PDF

Software Tools for Data Science

Biostat & Bioinformatic Dept AHCG - Allied Health Graduate

Subject

BIOSTAT

Catalog Number

821

Title

Software Tools for Data Science

Course Description

A data scientist needs to master several different tools to obtain, process, analyze, visualize and interpret large biomedical data sets such as electronic health records, medical images, and genomic sequences. It is also critical that the data scientist masters the best practices associated with using these tools, so the results are robust and reproducible. The course covers foundational tools that will allow students to assemble a data science toolkit, including the Unix shell, text editors, regular expressions, relational and NoSQL databases, and the Python programming language for data munging, visualization and machine learning. Best practices that students will learn include the Findable, Accessible, Interoperable and Reusable (FAIR) practices for data stewardship, as well as reproducible analysis with literate programming, version control and containerization. Credits: 3

Grading Basis

ABCDF Grading

Consent (Permission Number)

Instructor Consent Required

Min Units

3

Max Units

3

Lecture