第23期培训课件双击桌面上的图标putty exe后显示窗口_第1页
第23期培训课件双击桌面上的图标putty exe后显示窗口_第2页
第23期培训课件双击桌面上的图标putty exe后显示窗口_第3页
第23期培训课件双击桌面上的图标putty exe后显示窗口_第4页
第23期培训课件双击桌面上的图标putty exe后显示窗口_第5页
已阅读5页,还剩25页未读 继续免费阅读

下载本文档

版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领

文档简介

“HostName:”IP192.168.118.204”[lirj@big[lirj@big~]$pwd 当前位置更换到我们事先已经为您建好的计算下面。我们在光标所在的位置键入(例如:你的用户名是lirj:[lirj@big[lirj@big~]$ BSMAP软件的地址: 文章在BMCBioinformatics2009, BSMAP[lirj@big[lirj@biglirj]$ls 接下来关于BSMAP软件的运算练习将在这个 ./[lirj@bigBSMAP]$ls readlengthuptoallowupto20mismatchessupportpairendmapsupportparallelmapqueryfilebforpairenddata,FASTA/FASTQ/BAMformat.IftheinputisinBAMformat,itshouldbethesameasthefilespecifiedby"-a"outputalignmentfile,iffilenamehas.samsuffix,theoutputwillbeinSAMformat,ifthefilenamehas.bamsuffix,theoutputfilebeinsortedBAMfile,andafilename.baiindexfilewillbegenerated,forotherfilenamesuffixtheoutputisinBSPoutputalignmentfileforunpairedreadsinpairendmap,onlyusedforBSPformatoutput.IftheoutputformatisspecifiedinBAM/SAMformat,thisoptionwillbeignored,allalignmentswillbewritentooneBAM/SAMoutputfilespecifiedbythe"-o"seedsize,default=16,min=8,[seed_size*seed_seg_num+3<=read_length]longerseedsizeisfaster,~1.5timesfasterwitheachadditionalntmaxnumberofmismatchesallowedonaread,default=2,maxmaxnumberofequalbesthitstocount,smallerwillbeinitialquality,[Illuminauses'@',SangerInstituteuses'!'],filterlow-qualityreadscontaining>nNs,numberofprocessorstouse,maxinsertionsizeforpairendmap,mininsertionsizeforpairendmap,indexinterval(1~16),default=4,meaningthereferencegenomewillbeevery4bp,largerindexintervalneedslessmemory,andslightlymapsensitivity.(~0.5%3-endadaptersequence,default=none,requiresatleast4ntmatched,mismatchincludethereferencesequencesastheXR:Z:<string>fieldinSAMdefault=donotstartfromthenthreadorreadpair,default:endatthenthreadorreadpair,default:setrestrictionenzymedigestionsiteandactivateRRBSmapmode,readsmustbemappedtodigestionsitesthedigestionsitemustbepalindromic,digestionpositionismarkedby'-',forexample:'-DC-CGG'(MspI)randomeseedinselectingmultiplehits.default:0(seedsetfromsystemsingleend在 [lirj@big[lirj@bigBSMAP]$bsmap-areads1.fastq-dchr22.fa-o-w100-s14-v3[lirj@big[lirj@bigBSMAP]$psPIDPIDTTYTIME3399pts/300:00:007820pts/300:00:037826pts/300:00:0010000readsfinished.5secs10000readsfinished.5secs20000readsfinished.5secsTotalnumberofalignedreads:20000(1e+02%)FinishedatThuSep Totaltimeconsumed: 5secs bsmap-areads1.fastq-dchr22.fa-o_bsmap.bsp-w100-14-v[lirj@big[lirj@bigBSMAP]$ls可以看到,文件夹里多了一个文件map_bsmap.bsp,这就是运行BSMAP将[lirj@big[lirj@bigBSMAP]$lessS _bsmap.bsp屏幕上将显示输出文件的内容1read2mappedread3qualityofthequery4UM:uniquemapOF:overmapNM:nomapQC:lowquality56maplocation(1based,5'-endcoordinatesoftheregionontheWatsonstrandof7++:forwardstrandofWatsonofreference+-:reversestrandofWatsonofreference-+:forwardstrandofCrickofreference--:reversestrandofCrickofreference8insertionsizeforpair-endmap,0meanssingle-endunpairedmap9Wastonreferencesequenceatthemapnumberofmismatchesofcurrent#hitsof0mismatchto#hitsofmax_mismatches,by在光标处键入“q”,回到 在 [lirj@big[lirj@bigBSMAP]$bsmap-areads1.fastq-dchr22.fa-o-w100-s14-v3BSMAPBSMAPStart Thu 118:42:29Loadin1dbseqs,totalsize bp.2secspassedCreateseedtable.5secsmaxmismatches:startfromread#1maxmulti-hits:maxNs:seedsize: basequalitychar:maxfragemtQuery:reads1.fastqReference:chr22.faOutput:map 0000readsfinished.5secspassed20000readsfinished.5secsFinishedatThu 118:42:34 5 bsmap-areads1.fastq-dchr22.fa-o_bsmap.sam-w100-14-v[lirj@big[lirj@bigBSMAP]$ls可以看到,文件夹里多了一个文件map_bsmap.sam,这就是运行BSMAP将[lirj@bigBSMAP]$[lirj@bigBSMAP]$lessS _bsmap.sam1QuerypairNAMEifpaired;orQueryNAMEif2UM:0x0,MA:0x100,OF:0x100,NM:0x4,QC:formaponBSCorBSWC:FLAG=FLAG+0x10forpair-endmap:ifit'sthefirstreadinpair,ifit'sthesecondreadinpair,FLAG=FLAG+0x80ifmapsarepaired,FLAG=FLAG+0x2ifmateisunmapped,ifmateismappedonBSCorBSWC,3Referencesequence41-basedleftmostPOSition/coordinateoftheclipped5MAP6extendedCIGAR7MateReferencesequenceNaMe;“=”ifthesameas81-basedleftmostMatePOSitionoftheclipped9inferredInsertqueryqueryXR:Z:<referencesequence>从map位置开始的waston参考序列在 [lirj@big[lirj@bigBSMAP]$bsmap-areads1.fastq-dchr22.fa-o-w100-s14-v3BSMAPBSMAPStart Thu 118:44:15Loadin1dbseqs,totalCreateseedtable.4secsbp.1secsmaxmismatches:startfromread#1maxmulti-hits:maxNs:seedsize: basequalitychar:maxfragemtQuery:reads1.fastqReference:chr22.faOutput:map 10000readsfinished.4secspassed20000readsfinished.4secsFinishedatThuSep Totaltimeconsumed:4secsConvertingSAMtoBAM...[samopen]SAMheaderispresent:1sequences.SortingBAM...IndexingBAM bsmap-areads1.fastq-dchr22.fa-o_bsmap.bam-w100-14-v[lirj@big[lirj@bigBSMAP]$ls这就是运行BSMAP将重亚硫酸盐处理后的短序列文件比对到组后的[lirj@big[lirj@bigBSMAP]$samtoolsview _bsmap.bam|lessS可以节约空间,可用samtools工具查看。pairend在 [lirj@big[lirj@bigBSMAP]$bsmap-areads1.fastq–breads2.fastq-dchr22.fa-_bsmap_pairend.bsp-2_bsmap_unpair.bsp-w100-s14-v3single180single180bsmap-areads1.fastq-breads2.fastq-dchr22.fa-_bsmap_pairend.bsp-2_bsmap_unpair.bsp-w100-s14-v[lirj@big[lirj@bigBSMAP]$lsmap_bsmap_unpair.bsp,这就是运行BSMAP将重亚硫酸盐处理后的pairend短序列文件比对到组后的生成文件。那么我们现在看看比对结果,我们 _bsmap_pairend.bsp屏幕上将显示输出文件的内容在 [lirj@big[lirj@bigBSMAP]$bsmap-areads1.fastq–breads2.fastq-dchr22.fa-o _bsmap_pairend.sam-w100-s14-v3&single180single180bsmap-areads1.fastq-breads2.fastq-dchr22.fa-_bsmap_pairend.sam-w100-s14-v[lirj@big[lirj@bigBSMAP]$lsBSMAP将重亚硫酸盐处理后的pairend短序列文件比对到组后的生成[lirj@big[lirj@bigBSMAP]$lessS _bsmap_pairend.sam屏幕上将显示输出文件的内容在 [lirj@big[lirj@bigBSMAP]$bsmap-areads1.fastq–breads2.fastq-dchr22.fa-o _bsmap_pairend.bam-w100-s14-v3&single180single180bsmap-areads1.fastq-breads2.fastq-dchr22.fa-_bsmap_pairend.bam-w100-s14-v[lirj@big[lirj@bigBSMAP]$ls[lirj@big[lirj@bigBSMAP]$samtoolsview _bsmap_pairend.bam|lessS屏幕上将显示输出文件的内容RMAP软件的地址:文章在 Bioinformatics2009,25:2841 接下来我们练下 到BSMAP的上一级 RMAP[lirj@biglirj]$ls[lirj@biglirj]$ls RMAP ./[lirj@bigRMAP]$ls就会看到这一 下面有三个文件,分别是chr22.fa,reads1.fastq -o,-Nameofoutputfile(default:-c,-FASTAfileordircontaining-S,-numberofwidthof-m,-umallowed-M,-max-umallowedmapsfora-Q,-usequalityscores(inputmustbeprintmorerun-f,-fasterseeds(sensitiveto2[lirj@big[lirj@bigRMAP]$reads1.fastq& -c 12在此上的起始位34reads56比对到了参考序列的哪一股(正或负[lirj@big[lirj@bigRMAP]$ -c mapped_bs_locations2.bedBSSeeker软件的地址:Seeker/BS文章 BMCBioinformatics2010,接下来我们练下BSSeeker的使用方法。目前我们还在RMAP [lirj@bigRMAP]$[lirj@bigRMAP]$cd..到RMAP的上一级 [lirj@big[lirj@biglirj]$ BS_Seeker[lirj@biglirj]$ls [lirj@big[lirj@biglirj]$ / ./命令:pythonPreprocessing_genome.pyshowthishelpmessageandInputyourreferencegenomefilename(fastaReadscontainingtags? PathtoBowtie[~/bowtie-Add">log_Preprocessing_genome.txt"attheendofcommandstosavetheIDassignmentstothereferencesequences[lirj@big[lirj@bigBS_Seeker]$ chr22.fa log_Preprocessing_genome.txt&1465900:00:001821200:00:021821500:00:00 Done-f [lirj@big[lirj@bigBS_Seeker]$ls窗口显示 reference_genome第二步,比对组-h,--showthishelpmessageandInputyourreadfile(supportinformat:Solexaseq,sequences,illuminafastq,qseq)Readscontainingtags?Y/N ifreadshavetags:-fFWtag-rRCtagThelastcyclenumberofyourreadtobemappedPathtoBowtie[~/bowtie-PathtoReferencegenomelibraryNumberofmismatches(0,1,2,3)[lirj@big[lirj@bigBS_Seeker]$ - N-e36- -o[lirj@big[lirj@bigBS_Seeker]$ls我们会发现,我们多了两个文件:log_myoutput.txt myoutput.txt1ReadID(fromthefirst4columnsinSolexaseqfile,oraserialnumberoftheoriginalinput)2NumberofmismatchesbetweenthegenomicseqandtheBSreadlistincolumns6and7.ThebisulfiteconvertedsitesbetweenreadTstogenomicCsarenotincluded.3Thestrandwhichthereadmaybefrom(+FW,+RC,-RC,-4Thecoordinateofthemappedposition:thefirst4digits(ID)indicatethechromosome,the"+"or"-"indicatethemappedstrand.Thelast10digitsarethe1-based,5'-endcoordinateofthemappedgenomicsequenceontheWatsonstrand.5Thegenomicsequenceofthemappedregionplus+2and-26BSreadsequencesfrom5'to3':ifthereadsareuniquelymappedastheywereFWreads,theoriginalreadsareshown.IfthereadsareuniquelymappedastheywereRCreads,theirre

温馨提示

  • 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
  • 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
  • 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
  • 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
  • 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
  • 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
  • 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

评论

0/150

提交评论