版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
“HostName:”IP192.168.118.204”[lirj@big[lirj@big~]$pwd 当前位置更换到我们事先已经为您建好的计算下面。我们在光标所在的位置键入(例如:你的用户名是lirj:[lirj@big[lirj@big~]$ BSMAP软件的地址: 文章在BMCBioinformatics2009, BSMAP[lirj@big[lirj@biglirj]$ls 接下来关于BSMAP软件的运算练习将在这个 ./[lirj@bigBSMAP]$ls readlengthuptoallowupto20mismatchessupportpairendmapsupportparallelmapqueryfilebforpairenddata,FASTA/FASTQ/BAMformat.IftheinputisinBAMformat,itshouldbethesameasthefilespecifiedby"-a"outputalignmentfile,iffilenamehas.samsuffix,theoutputwillbeinSAMformat,ifthefilenamehas.bamsuffix,theoutputfilebeinsortedBAMfile,andafilename.baiindexfilewillbegenerated,forotherfilenamesuffixtheoutputisinBSPoutputalignmentfileforunpairedreadsinpairendmap,onlyusedforBSPformatoutput.IftheoutputformatisspecifiedinBAM/SAMformat,thisoptionwillbeignored,allalignmentswillbewritentooneBAM/SAMoutputfilespecifiedbythe"-o"seedsize,default=16,min=8,[seed_size*seed_seg_num+3<=read_length]longerseedsizeisfaster,~1.5timesfasterwitheachadditionalntmaxnumberofmismatchesallowedonaread,default=2,maxmaxnumberofequalbesthitstocount,smallerwillbeinitialquality,[Illuminauses'@',SangerInstituteuses'!'],filterlow-qualityreadscontaining>nNs,numberofprocessorstouse,maxinsertionsizeforpairendmap,mininsertionsizeforpairendmap,indexinterval(1~16),default=4,meaningthereferencegenomewillbeevery4bp,largerindexintervalneedslessmemory,andslightlymapsensitivity.(~0.5%3-endadaptersequence,default=none,requiresatleast4ntmatched,mismatchincludethereferencesequencesastheXR:Z:<string>fieldinSAMdefault=donotstartfromthenthreadorreadpair,default:endatthenthreadorreadpair,default:setrestrictionenzymedigestionsiteandactivateRRBSmapmode,readsmustbemappedtodigestionsitesthedigestionsitemustbepalindromic,digestionpositionismarkedby'-',forexample:'-DC-CGG'(MspI)randomeseedinselectingmultiplehits.default:0(seedsetfromsystemsingleend在 [lirj@big[lirj@bigBSMAP]$bsmap-areads1.fastq-dchr22.fa-o-w100-s14-v3[lirj@big[lirj@bigBSMAP]$psPIDPIDTTYTIME3399pts/300:00:007820pts/300:00:037826pts/300:00:0010000readsfinished.5secs10000readsfinished.5secs20000readsfinished.5secsTotalnumberofalignedreads:20000(1e+02%)FinishedatThuSep Totaltimeconsumed: 5secs bsmap-areads1.fastq-dchr22.fa-o_bsmap.bsp-w100-14-v[lirj@big[lirj@bigBSMAP]$ls可以看到,文件夹里多了一个文件map_bsmap.bsp,这就是运行BSMAP将[lirj@big[lirj@bigBSMAP]$lessS _bsmap.bsp屏幕上将显示输出文件的内容1read2mappedread3qualityofthequery4UM:uniquemapOF:overmapNM:nomapQC:lowquality56maplocation(1based,5'-endcoordinatesoftheregionontheWatsonstrandof7++:forwardstrandofWatsonofreference+-:reversestrandofWatsonofreference-+:forwardstrandofCrickofreference--:reversestrandofCrickofreference8insertionsizeforpair-endmap,0meanssingle-endunpairedmap9Wastonreferencesequenceatthemapnumberofmismatchesofcurrent#hitsof0mismatchto#hitsofmax_mismatches,by在光标处键入“q”,回到 在 [lirj@big[lirj@bigBSMAP]$bsmap-areads1.fastq-dchr22.fa-o-w100-s14-v3BSMAPBSMAPStart Thu 118:42:29Loadin1dbseqs,totalsize bp.2secspassedCreateseedtable.5secsmaxmismatches:startfromread#1maxmulti-hits:maxNs:seedsize: basequalitychar:maxfragemtQuery:reads1.fastqReference:chr22.faOutput:map 0000readsfinished.5secspassed20000readsfinished.5secsFinishedatThu 118:42:34 5 bsmap-areads1.fastq-dchr22.fa-o_bsmap.sam-w100-14-v[lirj@big[lirj@bigBSMAP]$ls可以看到,文件夹里多了一个文件map_bsmap.sam,这就是运行BSMAP将[lirj@bigBSMAP]$[lirj@bigBSMAP]$lessS _bsmap.sam1QuerypairNAMEifpaired;orQueryNAMEif2UM:0x0,MA:0x100,OF:0x100,NM:0x4,QC:formaponBSCorBSWC:FLAG=FLAG+0x10forpair-endmap:ifit'sthefirstreadinpair,ifit'sthesecondreadinpair,FLAG=FLAG+0x80ifmapsarepaired,FLAG=FLAG+0x2ifmateisunmapped,ifmateismappedonBSCorBSWC,3Referencesequence41-basedleftmostPOSition/coordinateoftheclipped5MAP6extendedCIGAR7MateReferencesequenceNaMe;“=”ifthesameas81-basedleftmostMatePOSitionoftheclipped9inferredInsertqueryqueryXR:Z:<referencesequence>从map位置开始的waston参考序列在 [lirj@big[lirj@bigBSMAP]$bsmap-areads1.fastq-dchr22.fa-o-w100-s14-v3BSMAPBSMAPStart Thu 118:44:15Loadin1dbseqs,totalCreateseedtable.4secsbp.1secsmaxmismatches:startfromread#1maxmulti-hits:maxNs:seedsize: basequalitychar:maxfragemtQuery:reads1.fastqReference:chr22.faOutput:map 10000readsfinished.4secspassed20000readsfinished.4secsFinishedatThuSep Totaltimeconsumed:4secsConvertingSAMtoBAM...[samopen]SAMheaderispresent:1sequences.SortingBAM...IndexingBAM bsmap-areads1.fastq-dchr22.fa-o_bsmap.bam-w100-14-v[lirj@big[lirj@bigBSMAP]$ls这就是运行BSMAP将重亚硫酸盐处理后的短序列文件比对到组后的[lirj@big[lirj@bigBSMAP]$samtoolsview _bsmap.bam|lessS可以节约空间,可用samtools工具查看。pairend在 [lirj@big[lirj@bigBSMAP]$bsmap-areads1.fastq–breads2.fastq-dchr22.fa-_bsmap_pairend.bsp-2_bsmap_unpair.bsp-w100-s14-v3single180single180bsmap-areads1.fastq-breads2.fastq-dchr22.fa-_bsmap_pairend.bsp-2_bsmap_unpair.bsp-w100-s14-v[lirj@big[lirj@bigBSMAP]$lsmap_bsmap_unpair.bsp,这就是运行BSMAP将重亚硫酸盐处理后的pairend短序列文件比对到组后的生成文件。那么我们现在看看比对结果,我们 _bsmap_pairend.bsp屏幕上将显示输出文件的内容在 [lirj@big[lirj@bigBSMAP]$bsmap-areads1.fastq–breads2.fastq-dchr22.fa-o _bsmap_pairend.sam-w100-s14-v3&single180single180bsmap-areads1.fastq-breads2.fastq-dchr22.fa-_bsmap_pairend.sam-w100-s14-v[lirj@big[lirj@bigBSMAP]$lsBSMAP将重亚硫酸盐处理后的pairend短序列文件比对到组后的生成[lirj@big[lirj@bigBSMAP]$lessS _bsmap_pairend.sam屏幕上将显示输出文件的内容在 [lirj@big[lirj@bigBSMAP]$bsmap-areads1.fastq–breads2.fastq-dchr22.fa-o _bsmap_pairend.bam-w100-s14-v3&single180single180bsmap-areads1.fastq-breads2.fastq-dchr22.fa-_bsmap_pairend.bam-w100-s14-v[lirj@big[lirj@bigBSMAP]$ls[lirj@big[lirj@bigBSMAP]$samtoolsview _bsmap_pairend.bam|lessS屏幕上将显示输出文件的内容RMAP软件的地址:文章在 Bioinformatics2009,25:2841 接下来我们练下 到BSMAP的上一级 RMAP[lirj@biglirj]$ls[lirj@biglirj]$ls RMAP ./[lirj@bigRMAP]$ls就会看到这一 下面有三个文件,分别是chr22.fa,reads1.fastq -o,-Nameofoutputfile(default:-c,-FASTAfileordircontaining-S,-numberofwidthof-m,-umallowed-M,-max-umallowedmapsfora-Q,-usequalityscores(inputmustbeprintmorerun-f,-fasterseeds(sensitiveto2[lirj@big[lirj@bigRMAP]$reads1.fastq& -c 12在此上的起始位34reads56比对到了参考序列的哪一股(正或负[lirj@big[lirj@bigRMAP]$ -c mapped_bs_locations2.bedBSSeeker软件的地址:Seeker/BS文章 BMCBioinformatics2010,接下来我们练下BSSeeker的使用方法。目前我们还在RMAP [lirj@bigRMAP]$[lirj@bigRMAP]$cd..到RMAP的上一级 [lirj@big[lirj@biglirj]$ BS_Seeker[lirj@biglirj]$ls [lirj@big[lirj@biglirj]$ / ./命令:pythonPreprocessing_genome.pyshowthishelpmessageandInputyourreferencegenomefilename(fastaReadscontainingtags? PathtoBowtie[~/bowtie-Add">log_Preprocessing_genome.txt"attheendofcommandstosavetheIDassignmentstothereferencesequences[lirj@big[lirj@bigBS_Seeker]$ chr22.fa log_Preprocessing_genome.txt&1465900:00:001821200:00:021821500:00:00 Done-f [lirj@big[lirj@bigBS_Seeker]$ls窗口显示 reference_genome第二步,比对组-h,--showthishelpmessageandInputyourreadfile(supportinformat:Solexaseq,sequences,illuminafastq,qseq)Readscontainingtags?Y/N ifreadshavetags:-fFWtag-rRCtagThelastcyclenumberofyourreadtobemappedPathtoBowtie[~/bowtie-PathtoReferencegenomelibraryNumberofmismatches(0,1,2,3)[lirj@big[lirj@bigBS_Seeker]$ - N-e36- -o[lirj@big[lirj@bigBS_Seeker]$ls我们会发现,我们多了两个文件:log_myoutput.txt myoutput.txt1ReadID(fromthefirst4columnsinSolexaseqfile,oraserialnumberoftheoriginalinput)2NumberofmismatchesbetweenthegenomicseqandtheBSreadlistincolumns6and7.ThebisulfiteconvertedsitesbetweenreadTstogenomicCsarenotincluded.3Thestrandwhichthereadmaybefrom(+FW,+RC,-RC,-4Thecoordinateofthemappedposition:thefirst4digits(ID)indicatethechromosome,the"+"or"-"indicatethemappedstrand.Thelast10digitsarethe1-based,5'-endcoordinateofthemappedgenomicsequenceontheWatsonstrand.5Thegenomicsequenceofthemappedregionplus+2and-26BSreadsequencesfrom5'to3':ifthereadsareuniquelymappedastheywereFWreads,theoriginalreadsareshown.IfthereadsareuniquelymappedastheywereRCreads,theirre
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 2025年度红十字应急救护知识培训课件
- 2024年度甲方租赁乙方云计算资源用于数据处理的合同
- 2024年度装修工程安全文明施工合同
- 2024年度储罐用高性能焊接材料供应合同
- 《就业指导案例》课件
- 2024年度股权投资合同投资金额与用途
- 《复合材料板簧》课件
- 2024年度融资租赁合同标的租赁物、租金与还款协议
- 人教版小学语文二年级下册全册教
- 《胸腔穿刺术》课件
- 2024年广东省公需课《百县千镇万村高质量发展工程与城乡区域协调发展》考试答案
- 2024学年初中坚韧不拔坚守信仰班会教学设计
- 2024年北京市中考英语试卷真题(含答案)
- 2024-2030年中国稀土永磁电机行业市场发展分析及前景趋势与投资风险研究报告
- 欠工人工资用车抵押合同
- 2024壬二酸科学祛痘消费者报告-质润x美丽修行-202406
- 章丘铁锅运营方案
- 2024年武汉城投集团公开招聘【151人】高频考题难、易错点模拟试题(共500题)附带答案详解
- 小升初小学语文总复习:关联词语、修改病句、修辞、标点符号、积累与运用
- 使用单位特种设备安全风险管控清单
- 对辊破碎机使用说明书
评论
0/150
提交评论