res <- results(dds, name="condition_tumor_vs_normal")
summary(res)
```
Box-and-Whisker Plot:depicts the distribution of gene expression values across distinct sample groups (here, normal and cancer).The boxes show the interquartile range (IQR), while the centre line inside the box reflects the median expression value. The whiskers extend from the margins of the boxes and represent the variability of the data. Outliers are points that are not within the whiskers. The figure allows us to examine the expression distributions of the two groups and discover any variations in their central tendencies and spread
```{r}
# Adjustment to overcome NA-error --> exclude rows containing NA's
Each point on the PCA plot represents a sample, and the color of the points indicates which experimental groups the samples(normal and cancer) belong to.
```{r}
library(DESeq2)
library(PCAtools)
vst <- assay(vst(dds))
p <- pca(vst, removeVar = 0.1)
screeplot(p, axisLabSize = 18, titleLabSize = 22)
```
The scree plot depicts the proportion of variation explained by each principle component, allowing us to assess the importance of each PC in capturing the overall variability in the dataset. This figure is useful for evaluating the dimensionality of the data and estimating how many principal components are required to represent a substantial percentage of the dataset's variability.
A biplot is a type of scatter plot that combines the sample scores (positions of the samples in the PCA space) and the variable loadings (contributions of the variables to each principal component) on the same plot.The distance between the sample points on the biplot illustrates how similar or distinct the gene expression patterns of the samples are.