Preparing to run a gene expression meta-analysis of lung cancer subtypes
2022-05-18
Source:vignettes/prepare_example.Rmd
prepare_example.Rmd
This vignette describes how to prepare the data to run a gene expression meta-analysis of lung cancer subtypes according to the method of Hughey and Butte (2015). You will need an internet connection to download the expression data and the custom cdf packages.
After completing the steps in this vignette, you can move on to run the meta-analysis described at vignette('run_example')
.
The meta-analysis will use data from the following four studies:
Instructions
-
Install the required custom CDF packages by running the following command in R.
metapredict::installCustomCdfPackages( c('hgu95av2hsentrezgcdf', 'hgu133plus2hsentrezgcdf'))
-
Download each of the following files (we’re only downloading a subset of the Bhattacharjee data):
Create a folder called
metapredict_example
. Insidemetapredict_example
, create a folder calledexpression_data
. Insideexpression_data
, create a folder calledBhattacharjee
.Unzip the files for the Bhattacharjee dataset and move all the .CEL files to the
Bhattacharjee
folder.Move
GSE11969_series_matrix.txt.gz
andGSE29016_series_matrix.txt.gz
to theexpression_data
folder.Unzip
GSE30219_RAW.tar
, rename the resulting folderGSE30219
, and move it to theexpression_data
folder.