RDS
files available from
Datasets
folders:Differential_expression_deseq2_pc3.rds
: dataset
including gene expression results of PC3 cell line replicatesDifferential_expression_deseq2_vcap.rds
: dataset
including gene expression results of VCAP cell line replicatesHow many genes are in each table?
padj ≤ 0.01
and |log2FoldChange| ≥ 1
.
Create a new column diff
based on the sign of
log2FoldChange
:
log2FoldChange > 0
= up
log2FoldChange ≤ 0
= down
How many differentially expressed genes are there per cell line model? How many are up-regulated and down-regulated?
Summarize the results in a table.
Venn diagram
to show the intersection of
differentially expressed genes between VCAP and PC3.UpSet plot
to visualize the intersection of
the following gene sets:log2FoldChange
values from both PC3 and VCAP experiments. How many such genes are
there?Now, create a scatter plot, using:
log2FoldChange
values from both datasetsbaseMean
as the size of the points.Evaluate the correlation between log2FoldChange
values.
log2FoldChange
values from both PC3 and VCAP experiments. How many such genes are
there?Now, create a scatter plot, using:
log2FoldChange
values from both datasetsbaseMean
as the size of the points.Evaluate the correlation between log2FoldChange
values.
log2FoldChange
values:log2FoldChange > 0 and < 2
log2FoldChange between 2 and 5
log2FoldChange > 5
log2FoldChange < 0 and > -2
log2FoldChange between -2 and -5
log2FoldChange < -5
all
, by concatenating the
differentially expressed gene dataframes from PC3 and VCAP.Before merging, add a new column to each dataframe to indicate the
cell model (PC3
or VCAP
) for proper
distinction.
log2FoldChange
column of the all
dataframe:de_category
.de_category
.Now, assess whether gene length impacts the category of differentially expressed genes.
Create a TxDb object from the GTF file:
File: gencode.v28lift37.basic.annotation.gtf.gz
(located in the Datasets
folder).
Extract genes that are present in the all
dataframe.
Compute gene length
for these genes.
Add the gene length
as a new column in the
all
dataframe.
de_category
.Add Wilcoxon test results to assess whether there is a statistically
significant difference in gene length between PC3 and VCAP within each
de_category
.