How do I extract all reads aligned to a particular gene or sequence region?

孟浩巍 (super administrator), user from Beijing
2018-10-05 13:41
The usage message for the samtools view command is as follows:
[code]Usage: samtools view [options] <in.bam>|<in.sam>|<in.cram> [region ...]

Options:
  -b       output BAM
  -C       output CRAM (requires -T)
  -1       use fast BAM compression (implies -b)
  -u       uncompressed BAM output (implies -b)
  -h       include header in SAM output
  -H       print SAM header only (no alignments)
  -c       print only the count of matching records
  -o FILE  output file name [stdout]
  -U FILE  output reads not selected by filters to FILE [null]
  -t FILE  FILE listing reference names and lengths (see long help) [null]
  -L FILE  only include reads overlapping this BED FILE [null]
  -r STR   only include reads in read group STR [null]
  -R FILE  only include reads with read group listed in FILE [null]
  -q INT   only include reads with mapping quality >= INT [0]
  -l STR   only include reads in library STR [null]
  -m INT   only include reads with number of CIGAR operations consuming
           query sequence >= INT [0]
  -f INT   only include reads with all of the FLAGs in INT present [0]
  -F INT   only include reads with none of the FLAGS in INT present [0]
  -G INT   only EXCLUDE reads with all of the FLAGs in INT present [0]
  -s FLOAT subsample reads (given INT.FRAC option value, 0.FRAC is the
           fraction of templates/read pairs to keep; INT part sets seed)
  -M       use the multi-region iterator (increases the speed, removes
           duplicates and outputs the reads as they are ordered in the file)
  -x STR   read tag to strip (repeatable) [null]
  -B       collapse the backward CIGAR operation
  -?       print long help, including note about region specification
  -S       ignored (input format is auto-detected)
      --input-fmt-option OPT[=VAL]
               Specify a single input file format option in the form
               of OPTION or OPTION=VALUE
  -O, --output-fmt FORMAT[,OPT[=VAL]]...
               Specify output format (SAM, BAM, CRAM)
      --output-fmt-option OPT[=VAL]
               Specify a single output file format option in the form
               of OPTION or OPTION=VALUE
  -T, --reference FILE
               Reference sequence FASTA FILE [null]
  -@, --threads INT
               Number of additional threads to use [0][/code]
Of these, the -L option does what you are asking for: save the regions you are interested in as a BED file and pass that file with -L.
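For illustration, here is a minimal sketch of the -L approach. The file names (sample.bam, regions.bed, gene_reads.bam) and the coordinates are hypothetical placeholders, not taken from the original answer; substitute your own gene's coordinates. Note that BED uses 0-based, end-exclusive intervals, so a gene annotated at chr1:11869-14409 (1-based) becomes the BED line chr1 11868 14409.

```shell
#!/bin/sh
# Hypothetical region: replace with your gene's chromosome, start and end.
# BED is tab-separated with a 0-based start and an exclusive end.
printf 'chr1\t11868\t14409\n' > regions.bed

# The samtools calls only run if samtools is installed and the
# hypothetical input file sample.bam is present.
if command -v samtools >/dev/null 2>&1 && [ -f sample.bam ]; then
    # Keep only reads overlapping the BED intervals and write them as BAM.
    samtools view -b -L regions.bed -o gene_reads.bam sample.bam

    # Count how many records were extracted.
    samtools view -c gene_reads.bam
fi
```

If the BAM is coordinate-sorted and indexed, the [region ...] argument shown in the usage message is an alternative for a single interval, for example `samtools view -b -o gene_reads.bam sample.bam chr1:11869-14409`; region strings use 1-based coordinates.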

About the author

Czc, registered member

Question activity

Posted: 2018-10-04 10:32
Updated: 2018-10-05 13:41