Skip to content

This workshop is designed to give beginners a solid foundation in Python programming for data analysis and visualization, with a specific focus on applications in cancer biology

License

Notifications You must be signed in to change notification settings

Sanduni-P/IntroPython-Bio

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Introduction

This introduction to Python workshop will provide beginners with experience loading, exploring, and visualising biological data using the pandas and matplotlib libraries. The example data used is clinical and gene expression data from the METABRIC breast cancer dataset, allowing participants to work with realistic biomedical data structures and learn how to generate meaningful summaries and plots.

Instructor

Sanduni Rajapaksa

Learning Objectives

By the end of this training, participants will be able to:

  • Load tabular data into Python using pandas
  • Calculate basic statistics such as mean and median
  • Filter and slice data based on clinical features
  • Produce simple plots such as scatter plots and boxplots using matplotlib
  • Modify the appearance of plots

Data

The Metabric study characterized the genomic mutations and gene expression profiles for 2509 primary breast tumours. In addition to the gene expression data generated using microarrays, genome-wide copy number profiles were obtained using SNP microarrays. Targeted sequencing was performed for 2509 primary breast tumours, along with 548 matched normals, using a panel of 173 of the most frequently mutated breast cancer genes as part of the Metabric study.

References:

Both the clinical data and the gene expression values were downloaded from cBioPortal.

We excluded observations for patient tumor samples lacking expression data, resulting in a data set with fewer rows.

Credits and Acknowledgements

This content was adapted from the following course materials:


About

This workshop is designed to give beginners a solid foundation in Python programming for data analysis and visualization, with a specific focus on applications in cancer biology

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published