assemble
CIRCexplorer2 assemble carries out de novo assembly for circular RNA based on the RABT method.
Usage and option summary
Usage:
CIRCexplorer2 assemble [options] -r REF -m TOPHAT [-o OUT]
Options:
-h --help Show help message.
-v --version Show version.
-r REF --ref=REF Gene annotation file.
-m TOPHAT --tophat=TOPHAT TopHat mapping folder.
-o OUT --output=OUT Output directory. [default: assemble]
-p THREAD --thread=THREAD Running threads. [default: 10]
--bb Convert assembly results to BigBed.
--chrom-size=CHROM_SIZE Chrom size file for converting to BigBed.
--remove-rRNA Ignore rRNA during assembling (only for human hg19).
--max-bundle-frags=FRAGMENTS Cufflinks --max-bundle-frags option.
Notes about options
- CIRCexplorer2 assemble searches for the chrom size file in the TopHat2 result folder; you can also specify its path using --chrom-size. If you used CIRCexplorer2 align to align the RNA-seq data, the chrom size file already exists in the TopHat2 result folder.
- Assembly for rRNA is very time-consuming. If you set the --remove-rRNA option, assembly for rRNA is skipped. Note that this option is only suitable for hg19. If the assembly step is still very slow, you can set --max-bundle-frags to a small number. Please see the Cufflinks protocol for more details about the --max-bundle-frags option.
- If you set the --bb option, a BigBed file of the assembled transcripts is created.
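The options above can be combined into a single call. The following is a minimal sketch, not part of CIRCexplorer2 itself, that builds the argument list from the flags documented in this section. The helper name build_assemble_command and the example file names (hg19_ref_all.txt, tophat, chrom.size) are assumptions made for illustration, as is the value passed to --max-bundle-frags.

```python
import shlex


def build_assemble_command(ref, tophat, out=None, threads=None, bigbed=False,
                           chrom_size=None, remove_rrna=False,
                           max_bundle_frags=None):
    """Return the argument list for one CIRCexplorer2 assemble run.

    Hypothetical helper: only flags listed in the option summary are used.
    """
    # -r and -m are the two required arguments in the usage line.
    cmd = ["CIRCexplorer2", "assemble", "-r", str(ref), "-m", str(tophat)]
    if out is not None:
        cmd += ["-o", str(out)]          # documented default: assemble
    if threads is not None:
        cmd += ["-p", str(threads)]      # documented default: 10
    if bigbed:
        cmd.append("--bb")               # also write a BigBed file
    if chrom_size is not None:
        cmd.append(f"--chrom-size={chrom_size}")
    if remove_rrna:
        cmd.append("--remove-rRNA")      # only suitable for hg19
    if max_bundle_frags is not None:
        cmd.append(f"--max-bundle-frags={max_bundle_frags}")
    return cmd


if __name__ == "__main__":
    # File names and the fragment limit below are placeholders.
    cmd = build_assemble_command(
        "hg19_ref_all.txt", "tophat",
        bigbed=True, chrom_size="chrom.size",
        remove_rrna=True, max_bundle_frags=100000,
    )
    # Print the command instead of running it; to execute it, pass the
    # list to subprocess.run(cmd, check=True).
    print(shlex.join(cmd))
```

Building the command as a list rather than a single string avoids shell quoting problems when paths contain spaces.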
Input
CIRCexplorer2 assemble needs a gene annotation file and a TopHat folder containing p(A)- or ribo- RNA-seq mapping results. The gene annotation file should be in the format of Gene Predictions and RefSeq Genes with Gene Names. See Annotate for more details.
Output
CIRCexplorer2 assemble creates an output folder named assemble by default. The transcripts_ref.txt file is used for alternative splicing analysis of circular RNAs, and it is in the same format as refFlat.
assemble
├── cufflinks.log
├── filtered_junction.gtf
├── filtered_junction.txt
├── genes.fpkm_tracking
├── isoforms.fpkm_tracking
├── skipped.gtf
├── transcripts.gtf
├── transcripts.txt
├── transcripts_ref.txt
├── transcripts_ref.bed
├── transcripts_ref_sorted.bed
└── transcripts_ref_sorted.bb
- filtered_junction.gtf and filtered_junction.txt: Filtered gene annotation files.
- genes.fpkm_tracking, isoforms.fpkm_tracking, skipped.gtf and transcripts.gtf: Cufflinks result files.
- transcripts_ref.txt, transcripts_ref.bed and transcripts_ref_sorted.bed: Circular RNA transcript files.
- transcripts_ref_sorted.bb: BigBed file of circular RNA transcripts.
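Because transcripts_ref.txt is described as refFlat-formatted, it can be read with a few lines of Python. The sketch below assumes the standard 11-column, tab-separated UCSC refFlat layout (gene name, transcript name, chrom, strand, txStart, txEnd, cdsStart, cdsEnd, exonCount, exonStarts, exonEnds). The names RefFlatRecord and parse_refflat are hypothetical, and the sample line is invented for the demonstration rather than taken from real output.

```python
import io
from typing import Iterator, List, NamedTuple, TextIO


class RefFlatRecord(NamedTuple):
    gene_name: str
    name: str
    chrom: str
    strand: str
    tx_start: int
    tx_end: int
    cds_start: int
    cds_end: int
    exon_count: int
    exon_starts: List[int]
    exon_ends: List[int]


def _coords(field: str) -> List[int]:
    # refFlat coordinate lists are comma-separated with a trailing comma.
    return [int(x) for x in field.split(",") if x]


def parse_refflat(handle: TextIO) -> Iterator[RefFlatRecord]:
    """Yield one record per non-empty, non-comment line."""
    for line in handle:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) < 11:
            raise ValueError(f"expected 11 columns, got {len(fields)}")
        starts, ends = _coords(fields[9]), _coords(fields[10])
        count = int(fields[8])
        if len(starts) != count or len(ends) != count:
            raise ValueError("exon count does not match coordinate lists")
        yield RefFlatRecord(
            fields[0], fields[1], fields[2], fields[3],
            int(fields[4]), int(fields[5]), int(fields[6]), int(fields[7]),
            count, starts, ends,
        )


if __name__ == "__main__":
    # Made-up example line; in practice open assemble/transcripts_ref.txt.
    sample = "GENE1\ttx1\tchr1\t+\t100\t500\t100\t500\t2\t100,300,\t200,500,\n"
    for rec in parse_refflat(io.StringIO(sample)):
        lengths = [e - s for s, e in zip(rec.exon_starts, rec.exon_ends)]
        print(rec.name, rec.chrom, rec.strand, lengths)
```

If the real file carries extra columns, the parser ignores everything after the eleventh field; if it carries fewer, it raises an error rather than guessing.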