Generate a fraction_design table for EstimateFractions.
create_fraction_design.RdA fraction_design table denotes what populations of labeled/unlabeled RNA are present in your data.
A fraction_design table as one column for each mutation type (e.g., TC) present
in your cB file, and one column named "present". Each entry is either TRUE or
FALSE. The rows include all possible combinations of TRUE and FALSE for all
mutation types columns. A value of TRUE in a mutation type column represents
a population of reads that have high amounts (on average) of that mutation type.
For example, if your fraction_design table has mutation type columns "TC" and
"GA", the row with TC == TRUE and GA == FALSE represents a population of reads
with high T-to-C mutation content and low G-to-A mutation content. In other words,
these are reads from RNA synthesized in the presence of s4U but not s6G. If such
a population exists in your data, the "present" column for that row should have a
value of TRUE.
Arguments
- mutrate_populations
Character vector of the set of mutational populations present in your data. For example, s4U fed data with standard nucleotide recoding chemistry (e.g., TimeLapse, SLAM, TUC, AMUC, etc.) would have a
mutrate_populationsof c("TC"). Dual labeling experiments with s4U and s6G feeds would have amutrate_populationsof c("TC", "GA").
Value
A fraction_design table that assumes that every possible combination of
mutational populations listed in mutrate_populations are present in your data.
The present column can be modified if this assumption is incorrect. This default
is chosen as it will in theory work for all analyses, it may just be unnecessarily
inefficient and estimate the abundance of populations that don't exist.