Estimated time: 15 Minutes
Motivation
Customizing plots can help us see patterns in the data or make the
claim(s) based on the data represented clearer.
Instructions
- Work independently in the main room, posting any questions that
arise to Slack.
- Recommendations for writing your own code:
- Read function documentation
- Test out ideas - it’s okay to make mistakes and generate errors
- Use a search engine to look up errors or recommended solutions using
keywords
- We’ll review possible solutions after time is up as a group.
Exercise
Try making the following changes to the pca_plot, starting with
the “most popular” request and moving on to other customizations if you
have time:
- Add a title and subtitle to the plot
- Update the color palette to be color-blind friendly
- Add labels to show which samples correspond to which points
- Use shape instead of color to indicate groups on the PCA plot.
- Challenge: Change the legend title to “Iron Status”.
Example
Here is a copy of the code we just tested together to 1) pull the
underlying data from the PCA function and 2) change the theme of our PCA
plot to black and white.
pcaData <- plotPCA(rld, intgroup=c("condition"), returnData=TRUE)
# message printed by plotPCA(): using ntop=500 top features by variance
percentVar <- round(100 * attr(pcaData, "percentVar")) # store PC axes (% variance)
# create custom plot object
PCACustom <- ggplot(pcaData, aes(PC1, PC2, color=condition)) +
  geom_point(size=3) +
  coord_fixed() +
  theme_bw()
# add percentVar labels to *displayed plot*
PCACustom +
  xlab(paste0("PC1: ",percentVar[1],"% variance")) +
  ylab(paste0("PC2: ",percentVar[2],"% variance"))
# add percentVar labels to *stored plot object*
PCACustom2 <- PCACustom +
  xlab(paste0("PC1: ",percentVar[1],"% variance")) +
  ylab(paste0("PC2: ",percentVar[2],"% variance"))
Details & finding help
Add a title and subtitle to the ggplot plot
- Hint: use the labs() function or search for examples
- Remember that unless a change to a plot is assigned to an
object, the change will be displayed but will not be stored
for later reference or output to file
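The difference between displaying and storing a change can be sketched with a small stand-in data set. The data frame `toy` and the object names `p` and `p_titled` are hypothetical and used only for illustration; they are not part of the workshop data.

```r
library(ggplot2)

# Toy stand-in for pcaData (hypothetical values, not workshop data)
toy <- data.frame(PC1 = c(-2, -1, 1, 2),
                  PC2 = c(1, -1, 1, -1),
                  condition = c("a", "a", "b", "b"))
p <- ggplot(toy, aes(PC1, PC2, color = condition)) +
  geom_point(size = 3)

# Displayed only: the title is drawn, but `p` itself is unchanged
p + labs(title = "Displayed only")
is.null(p$labels$title)

# Stored: assign the modified plot to a new object to keep the change
p_titled <- p + labs(title = "Stored title")
p_titled$labels$title
```

Only `p_titled` carries the title forward, so it is the object to use for any later customization or when writing the plot to file.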
Possible solution
Example of possible solution:
?labs
# display
PCACustom2 +
  labs(title = "Iron Supplemented Mice", subtitle = "PCA of top 500 genes")
# save to new object
PCACustom3 <- PCACustom2 +
  labs(title = "Iron Supplemented Mice", subtitle = "PCA of top 500 genes")
Add labels to show which samples correspond to which points
Possible solution
Example of possible solution:
library(ggrepel) # provides geom_text_repel(); skip if already loaded
?geom_text_repel
# display
PCACustom2 +
  geom_text_repel(aes(label = name),
                  point.padding = 0.5,
                  box.padding = 0.5)
# save to new object
PCACustom4 <- PCACustom2 +
  geom_text_repel(aes(label = name),
                  point.padding = 0.5,
                  box.padding = 0.5)
Make our color palette more color-blind friendly (with RColorBrewer)
Possible solution
Example of possible solution:
library(RColorBrewer) # provides display.brewer.all(); skip if already loaded
# look at pre-made color palettes from RColorBrewer
display.brewer.all(colorblindFriendly = TRUE)
# use RColorBrewer palette
PCACustom2 +
  scale_colour_brewer(palette = "Set2")
# OR
# customize using manual color palette
# The R Cookbook palette with grey:
cbPalette <- c("#999999", "#E69F00", "#56B4E9", "#009E73", "#F0E442", "#0072B2", "#D55E00", "#CC79A7")
# To use for line and point colors, add manual color scaling with custom palette
# display
PCACustom2 +
  scale_colour_manual(values=cbPalette[2:3])
# save to new object
PCACustom5 <- PCACustom2 +
  scale_colour_manual(values=cbPalette[2:3])
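When colors are supplied by position, as with cbPalette[2:3], they are assigned in the order of the factor levels. A minimal sketch of an alternative is to name the vector so that each group is tied to a specific color. The level names "minus" and "plus" and the data frame `toy` are assumptions made for illustration; substitute the actual levels of `condition` in pcaData.

```r
library(ggplot2)

cbPalette <- c("#999999", "#E69F00", "#56B4E9", "#009E73",
               "#F0E442", "#0072B2", "#D55E00", "#CC79A7")

# Toy stand-in for pcaData; "minus"/"plus" are hypothetical level names
toy <- data.frame(PC1 = c(-2, -1, 1, 2),
                  PC2 = c(1, -1, 1, -1),
                  condition = c("minus", "minus", "plus", "plus"))

# Named vector: each level is matched to its color by name, not by position
group_colors <- c(minus = cbPalette[2], plus = cbPalette[3])

p_named <- ggplot(toy, aes(PC1, PC2, color = condition)) +
  geom_point(size = 3) +
  scale_colour_manual(values = group_colors)
p_named
```

With named values, reordering the factor levels later does not silently swap which group gets which color.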
Use shape instead of color to indicate groups on the PCA plot.
Possible solution
Example of possible solution:
# generate new aesthetic mapping (with default shapes selected)
ggplot(pcaData, aes(PC1, PC2, shape=condition)) +
  geom_point(size=3) +
  coord_fixed() +
  theme_bw() +
  xlab(paste0("PC1: ",percentVar[1],"% variance")) +
  ylab(paste0("PC2: ",percentVar[2],"% variance"))
# generate new aesthetic mapping (with manually selected shapes)
ggplot(pcaData, aes(PC1, PC2, shape=condition)) +
  geom_point(size=3) +
  scale_shape_manual(values = c(1, 4)) +
  coord_fixed() +
  theme_bw() +
  xlab(paste0("PC1: ",percentVar[1],"% variance")) +
  ylab(paste0("PC2: ",percentVar[2],"% variance"))
# create custom plot object with manual shapes
PCACustom6 <- ggplot(pcaData, aes(PC1, PC2, shape=condition)) +
  geom_point(size=3) +
  scale_shape_manual(values = c(1, 4)) +
  coord_fixed() +
  theme_bw() +
  xlab(paste0("PC1: ",percentVar[1],"% variance")) +
  ylab(paste0("PC2: ",percentVar[2],"% variance"))
Challenge: Change the legend title to “Iron Status”
- Hint: you can do this with the labs() function too,
using the corresponding aesthetic mapping (e.g. “color”).
- This help thread with examples may also be useful
Possible solution
Example of possible solution:
# customize label for colour mapping
PCACustom2 +
  guides(colour=guide_legend(title="Iron supplementation status"))
# alternatively specify label for aesthetic mapping
PCACustom2 +
  labs(colour="Iron supplementation status")
# store custom plot as new object
PCACustom7 <- PCACustom2 +
  labs(colour="Iron supplementation status")
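The legend title is tied to the aesthetic that creates the legend, so a plot that maps groups to shape rather than color needs the title set on `shape`. A minimal sketch, assuming a toy stand-in for pcaData (the data frame `toy`, its level names, and the object `p_shape` are hypothetical):

```r
library(ggplot2)

# Toy stand-in for pcaData (hypothetical values and level names)
toy <- data.frame(PC1 = c(-2, -1, 1, 2),
                  PC2 = c(1, -1, 1, -1),
                  condition = c("minus", "minus", "plus", "plus"))

# Groups are mapped to shape, so the legend title is set with labs(shape = ...)
p_shape <- ggplot(toy, aes(PC1, PC2, shape = condition)) +
  geom_point(size = 3) +
  labs(shape = "Iron Status")
p_shape
```

Setting `labs(colour = ...)` on this plot would leave the legend title unchanged, because no colour mapping exists to label.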
Saving the result
If time permits, consider how you might save your favorite new PCA
plot (with an informative file name). Hint: Consider how we saved our
initial PCA plot in the previous module with ggsave().
Solution
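Here are examples of some possible approaches (both write to the same file):
# Option 1: open a PDF graphics device, draw the plot, then close the device
pdf(file = file.path('outputs', 'figures', 'PCA_rlog_Titled.pdf'), width = 6, height = 6)
PCACustom3
dev.off()
# Option 2: use ggsave() with the stored plot object
ggsave(
  filename = file.path('outputs', 'figures', 'PCA_rlog_Titled.pdf'),
  plot = PCACustom3,
  width = 6, height = 6, units = 'in')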
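Both approaches above expect the output folder to exist already. A minimal sketch of creating the folder first and then confirming the file was written is shown here; the stand-in plot `toy_plot`, the file name `PCA_toy_example.pdf`, and the use of tempdir() are assumptions made so the example runs anywhere without touching the project folders.

```r
library(ggplot2)

# Hypothetical stand-in plot; in the workshop this would be e.g. PCACustom3
toy <- data.frame(PC1 = c(-2, -1, 1, 2), PC2 = c(1, -1, 1, -1))
toy_plot <- ggplot(toy, aes(PC1, PC2)) + geom_point(size = 3)

# Create the output folder if it is missing (no warning if it already exists)
fig_dir <- file.path(tempdir(), "outputs", "figures")  # tempdir() only for this sketch
dir.create(fig_dir, recursive = TRUE, showWarnings = FALSE)

# Save the stored plot object and confirm the file exists
out_file <- file.path(fig_dir, "PCA_toy_example.pdf")
ggsave(filename = out_file, plot = toy_plot, width = 6, height = 6, units = "in")
file.exists(out_file)
```

In the workshop project, replacing `fig_dir` with file.path('outputs', 'figures') would apply the same check to the real output location.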
