该问题已被锁定!
2
关注
2093
浏览

如何提取比对上某一个基因或者某一段序列的所有reads

为什么被折叠? 0 个回复被折叠
孟浩巍 超级管理员 用户来自于: 北京市
2018-10-05 13:41
samtools view 命令的使用说明如下: [code]Usage: samtools view [options] || [region ...] Options: -b output BAM -C output CRAM (requires -T) -1 use fast BAM compression (implies -b) -u uncompressed BAM output (implies -b) -h include header in SAM output -H print SAM header only (no alignments) -c print only the count of matching records -o FILE output file name [stdout] -U FILE output reads not selected by filters to FILE [null] -t FILE FILE listing reference names and lengths (see long help) [null] -L FILE only include reads overlapping this BED FILE [null] -r STR only include reads in read group STR [null] -R FILE only include reads with read group listed in FILE [null] -q INT only include reads with mapping quality >= INT [0] -l STR only include reads in library STR [null] -m INT only include reads with number of CIGAR operations consuming query sequence >= INT [0] -f INT only include reads with all of the FLAGs in INT present [0] -F INT only include reads with none of the FLAGS in INT present [0] -G INT only EXCLUDE reads with all of the FLAGs in INT present [0] -s FLOAT subsample reads (given INT.FRAC option value, 0.FRAC is the fraction of templates/read pairs to keep; INT part sets seed) -M use the multi-region iterator (increases the speed, removes duplicates and outputs the reads as they are ordered in the file) -x STR read tag to strip (repeatable) [null] -B collapse the backward CIGAR operation -? print long help, including note about region specification -S ignored (input format is auto-detected) --input-fmt-option OPT[=VAL] Specify a single input file format option in the form of OPTION or OPTION=VALUE -O, --output-fmt FORMAT[,OPT[=VAL]]... Specify output format (SAM, BAM, CRAM) --output-fmt-option OPT[=VAL] Specify a single output file format option in the form of OPTION or OPTION=VALUE -T, --reference FILE Reference sequence FASTA FILE [null] -@, --threads INT Number of additional threads to use [0][/code]其中,-L参数可以帮助你完成你的要求,你只需要把你需要的区间存成BED格式就好。  

关于作者

Czc 注册会员

这家伙很懒,还没有设置简介

问题动态

发布时间
2018-10-04 10:32
更新时间
2018-10-05 13:41
关注人数
2 人关注

相关问题

RNA.fold_compound(seq)如何改进提升
植物基因组组装过程中如何去除质体序列
请问如何使用R语言绘制散点图
如何获得cas9蛋白的高效低效的sgRNA
如何处理fastqc报告中duplication level报错的问题
大咖们好!如何像群主一样优秀 自带关注
统计学,如何从入门到放弃?
VCONTACT2的结果文件genome_by_genome_overview如何进行统计分类
ciriRNA表达量如何计算?(更)
linux系统下,R语言,安装软件包install.packages("units"),出现如下问题该如何解决?

推荐内容

unicycler混合拼接结果是否还需要纠错
信号通路特征基因集的查询?用于作ssGSEA分析。
infercnv运行报错
Protein-Protein Docking 膜蛋白受体研究
linux下使用convert出现报错,可能是什么原因?如何解决?
GAPIT包FarmCPU和Blink模型进行GWAS分析报错
去除批次效应选定靶标细胞群后如何进行亚群分析?
群体结构矫正
如何按一个列表对基因型文件进行过滤,剔除不需要的样本?
使用Tracking Tumor ImmunoPhenotype(TIP)网站分析TCGA的BLCA_tpm数据
All Rights Reserved Powered BY WeCenter V4.1.0 © 2025