FastQDesign

The goal of FastQDesign is to guide the investigator in designing a single-cell RNA sequencing(scRNA-seq) experiment. We aim to shift the focus back to raw FastQ reads other than the Unique Molecular Identifier (UMI) matrix when considering the scRNA-seq experiment. The paper is now open-access, PMID: 40175506.

Installation

You can install the development version of FastQDesign from GitHub with:

# install.packages("devtools")
devtools::install_github("yuw444/FastQDesign")

fastF

fastF is a submodel of the FastQDesign framework, it is written in C for efficiency and broad compatibility. It helps generate pseudo-design datasets from the FastQ reference.

Prepare the Reference and Subsample

This is a basic example which shows you how to prepare the subsamples and reference for the comparison:

library(FastQDesign)
library(Seurat)

aibm <- readRDS("~/Documents/Research/FastQDesign/reference_list.rds")[[1]]

ref_list <- SamplePrep(
  aibm,
  condition = "orig.ident",
  cell_3d_embedding = TRUE,
  interactive = TRUE,
  min.pct = 0.2,
  logfc.threshold = 0.3,
  return.thresh = 0.05,
  verbose = TRUE
)

fastq_ds <- readRDS("~/Documents/Research/FastQDesign/bam_downsample_list.rds")[[1]]

ds_list <- SamplePrep(
  fastq_ds,
  n_clusters = 4,
  condition = "orig.ident",
  cell_3d_embedding = TRUE,
  root_cells_ref = Cells(aibm)[aibm$root_cells],
  min.pct = 0.2,
  logfc.threshold = 0.3,
  return.thresh = 0.05,
  verbose = TRUE
)

match_list <- SampleMatch(cb03_list, ds_list)

In the above example, aibm is the reference Seurat object, fastq_ds is generated with the FastQ downsample by fastF.

Generate One Dot in the Similarity Surface

## the similiarity from one subsample

ARI <- mclust::adjustedRandIndex(match_list$seurat_cluster.x, match_list$seurat_cluster.y)

JaccardCluster <- FastQDesign:::JaccardIndex(match_list$genes_cluster$ref, match_list$genes_cluster$ds)
JaccardCondition <- FastQDesign:::JaccardIndex(match_list$genes_condition$ref, match_list$genes_condition$ds)

Kendall <- cor(match_list$cluster_match$pseudotime.x, match_list$cluster_match$pseudotime.y, method = "kendall")

similarity <- mean(c(ARI, JaccardCluster, JaccardCondition, Kendall))

Design

df_power_paper <- read.csv("~/Documents/Research/FastQDesign/AIBM_power.csv")

budget <- 7500
power_threshold <- 0.8
flow_capacities <- c(10^7, 5 * 10^7, 2 * 10^8)
flow_costs <- c(1000, 2000, 3000)
library_costs <- 5000

library(dplyr)
library(ggplot2)
rst_design <- FastQDesign(
  df_power = df_power_paper %>% dplyr::select(n_cells, expected_reads_per_cell, power_fastq),
  budget = budget,
  power_threshold = power_threshold,
  flow_capacities = flow_capacities,
  flow_costs = flow_costs,
  library_costs = library_costs
)

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
R		R
data		data
man		man
tests		tests
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
DESCRIPTION		DESCRIPTION
FastQDesign.Rproj		FastQDesign.Rproj
LICENSE		LICENSE
LICENSE.md		LICENSE.md
NAMESPACE		NAMESPACE
NEWS.md		NEWS.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Uh oh!

Repository files navigation

FastQDesign

Installation

fastF

Prepare the Reference and Subsample

Generate One Dot in the Similarity Surface

Design

About

Licenses found

Uh oh!

Releases 1

Packages

Languages

License

Licenses found

yuw444/FastQDesign

Folders and files

Latest commit

History

Repository files navigation

FastQDesign

Installation

fastF

Prepare the Reference and Subsample

Generate One Dot in the Similarity Surface

Design

About

Topics

Resources

License

Licenses found

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages