Introduction

This introductory Python workshop gives beginners hands-on experience loading, exploring, and visualising biological data with the pandas and matplotlib libraries. The example data are clinical and gene expression data from the METABRIC breast cancer dataset, so participants work with realistic biomedical data structures and learn how to generate meaningful summaries and plots.

Instructor

Sanduni Rajapaksa

Learning Objectives

By the end of this training, participants will be able to:

Load tabular data into Python using pandas
Calculate basic statistics such as mean and median
Filter and slice data based on clinical features
Produce simple plots such as scatter plots and boxplots using matplotlib
Modify the appearance of plots
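
As a rough preview of these objectives, here is a minimal sketch that loads a small table, summarises it, filters it, and plots it. The column names (`age_at_diagnosis`, `er_status`, `tumour_size`, `gene_a_expression`) and all values are invented for illustration; they are assumptions and not the actual workshop data or file layout.

```python
import io

import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical stand-in for the workshop file: these column names and
# values are invented for illustration and are NOT taken from METABRIC.
csv_text = """patient_id,age_at_diagnosis,er_status,tumour_size,gene_a_expression
P1,45.0,Positive,22.0,9.1
P2,61.5,Negative,30.0,6.4
P3,53.2,Positive,18.0,8.7
P4,70.1,Negative,41.0,6.0
P5,49.8,Positive,25.0,9.5
"""

# Load tabular data (pd.read_csv also accepts a file path or URL).
df = pd.read_csv(io.StringIO(csv_text))

# Basic statistics on single columns.
mean_age = df["age_at_diagnosis"].mean()
median_size = df["tumour_size"].median()

# Filter rows on a clinical feature, then slice a subset of columns.
er_positive = df[df["er_status"] == "Positive"]
subset = er_positive.loc[:, ["patient_id", "gene_a_expression"]]

# Collect expression values per ER status for the boxplot.
labels, groups = [], []
for status, group in df.groupby("er_status"):
    labels.append(status)
    groups.append(group["gene_a_expression"].to_numpy())

# Scatter plot and boxplot side by side, with a few appearance tweaks.
fig, (ax_scatter, ax_box) = plt.subplots(1, 2, figsize=(9, 4))

ax_scatter.scatter(df["age_at_diagnosis"], df["tumour_size"], color="teal")
ax_scatter.set_xlabel("Age at diagnosis (years)")
ax_scatter.set_ylabel("Tumour size")
ax_scatter.set_title("Tumour size vs age")

ax_box.boxplot(groups)
ax_box.set_xticks(range(1, len(labels) + 1))
ax_box.set_xticklabels(labels)
ax_box.set_xlabel("ER status")
ax_box.set_ylabel("Expression (hypothetical gene)")
ax_box.set_title("Expression by ER status")

fig.tight_layout()
```

In an interactive session, calling `plt.show()` afterwards (or running the code in a notebook) displays the figure; `fig.savefig("plots.png")` would write it to a file instead.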

Data

The METABRIC study characterised the genomic mutations and gene expression profiles of 2509 primary breast tumours. Gene expression was measured using microarrays, and genome-wide copy number profiles were obtained using SNP microarrays. As part of the study, targeted sequencing was performed on the 2509 primary breast tumours, along with 548 matched normals, using a panel of 173 of the most frequently mutated breast cancer genes.

References:

Both the clinical data and the gene expression values were downloaded from cBioPortal.

We excluded observations for patient tumour samples that lack expression data, so the resulting dataset has fewer rows than the original download.
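
One way such a filtering step could look in pandas is sketched below. This is an illustration under stated assumptions, not necessarily how the workshop data was prepared: the table names, the `patient_id` key, and the `gene_a` column are all hypothetical.

```python
import pandas as pd

# Hypothetical clinical table: one row per patient.
clinical = pd.DataFrame(
    {
        "patient_id": ["P1", "P2", "P3", "P4"],
        "er_status": ["Positive", "Negative", "Positive", "Negative"],
    }
)

# Hypothetical expression table: P2 is absent and P4 has a missing value.
expression = pd.DataFrame(
    {
        "patient_id": ["P1", "P3", "P4"],
        "gene_a": [9.1, 8.7, float("nan")],
    }
)

# Left join keeps every clinical row; patients without a match get NaN.
merged = clinical.merge(expression, on="patient_id", how="left")

# Drop the rows that have no expression value.
has_expression = merged.dropna(subset=["gene_a"])

# Count how many rows were removed by the filter.
n_dropped = len(merged) - len(has_expression)
```

Comparing `len(merged)` with `len(has_expression)` makes it easy to confirm how many samples were removed.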

Credits and Acknowledgements

This content was adapted from the following course materials:

R for Data Science book
OHI Data Science Training
Data Carpentry
WEHI tidyr coursebook by Brendan R. E. Ansell
content developed by Maria Doyle.
