版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
IntroductiontoDatabases
DanielaPuiuApplicationsSpecialistCenterfortheStudyofBiologicalComplexity,VCUdpuiu@804-827-0952qq群发GeneralConcepts DatabasedefinitionOrganizedcollectionoflogicallyrelateddataDataKnownfactsTypes:text,graphics,images,sound,videosDatabasemanagementsystem(DBMS)SoftwarepackagefordefiningandmanagingadatabaseDatabaseExamples ClassrosterHospitalpatientsLiterature(publishedarticlesinacertainfield)GenomicinformationProteinstructureTaxonomySinglenucleotidepolymorphismExample:MicrobialDatabase Organism:NameAccessionnumberGenomesizeGC%ReleasedateGenomecenterSequenceGene(proteincodingregions):NameAccessionnumberOrganismLocationonthechromosome(start,end)StrandSizeProductSequenceDataabouttheproteincodingregionsinthemicrobialgenomessequencedsofar.DatabaseModelsFlatfiles ‘60Hierarchical ‘60Network ‘70Relational ‘80Objectoriented ‘90Objectrelational ‘90Webenabled ‘90DatabaseTypes(cont.)TypeTypicalnumberofusersTypicalarchitectureTypicalsizePersonal1Desktop/Laptop/PDAMBWorkgroup5-25Client/server:2tierMB-GBDepartment25-100Client/server:3tierGBEnterprise>100Client/server:distributedGB-TBInternet>1000Websever&applicationservers MB-GBFlatFilesCharacteristics:DataisstoredasrecordsinregularfilesRecordsusuallyhaveasimplestructureandfixednumberoffieldsForfastaccessmaysupportindexingoffieldsintherecordsNomechanismsforrelatingdatabetweenfilesOneneedsspecialprogramsinordertoaccessandmanipulatethedataFlatFilesExampleMicrobialdatabase:Genbankformat:EscherichiacoliK12StreptococcuspneumoniaeR6…Fastaformat:multiplefilesEscherichiacoliK12:genome,genes,genepositionsStreptococcuspneumoniaeR6:genome,genes,genepositions…Datamanipulation:Sequenceextraction,searchIndexingFormatconversion…RelationalDatabaseCharacteristics:Dataisorganizedintotables:rows&columnsEachrowrepresentsaninstanceofanentityEachcolumnrepresentsanattributeofanentityMetadatadescribeseachtablecolumnRelationshipsbetweenentitiesarerepresentedbyvaluesstoredinthecolumnsofthecorrespondingtables(keys)AccessiblethroughStandardQueryLanguage(SQL)Enterprisedatamodel GraphicalrepresentationofthehighlevelentitiesExample:MicrobialdatabaseeachorganismhasmultiplecorrespondinggenesOne:ManyrelationOrganismGene1mMetadataDatathatdescribesthepropertiesorcharacteristicsofotherdataDoesnotincludesampledataAllowsdatabasedesignersanduserstounderstandthemeaningofthedataMetadata&DataTableNameTypeMaxLengthDescriptionNameAlphanumeric100OrganismnameSizeInteger10Genomelength(bases)GcFloat5PercentGCAccessionAlphanumeric10AccessionnumberReleaseDate8ReleasedateCenterAlphanumeric100GenomecenternameSequenceAlphanumericVariableSequenceOrganismNameSizeGcAccessionReleaseCenterSequenceEscherichiacoliK124,640,00050NC_00091309/05/1997Univ.WisconsinAGCTTTTCATT…StreptococcuspneumoniaeR62,040,00040NC_00309809/07/2001EliLillyandCompanyTTGAAAGAAAA……Metadata&DataTable(cont.)NameTypeMaxLengthDescriptionNameAlphanumeric100GenenameAccessionAlphanumeric10GeneaccessionnumberOAccesionAlphanumeric10OrganismaccessionnumberStartInteger10GenestartEndInteger10GeneendStrandCharacter1GenestrandProductAlphanumeric1000GeneannotationSequenceAlphanumericVariableGenesequenceGeneNameAccessionOAccessionStartEndStrandProductSequencethrL16127995NC_000913190255+theoperonleaderpeptideMKRI…thrA16127996NC_0009133372799+homoserinedehydrogenaseIMRVL…transposase_A15902058NC_0030982020720554+transposaseMWYN…Relationships UsedtoconnecttablesField(s)thathavethesamevalueintherelatedtablesOrganism.Accession=Gene.OAccessionOrganism.AccessionUniquePrimarykeyGene.OAccessionNotuniqueSecondarykeySQLANSI(AmericanNationalStandardsInstitute)standardcomputerlanguageforaccessingandmanipulatingdatabasesystems.SQLstatementsareusedtoretrieveandupdatedatainadatabase.Includes:DataManipulationLanguage(DML)DataDefinitionLanguage(DDL)DataManipulationLanguageSyntaxforexecutingqueries,updating,inserting,anddeletingrecords.SELECT-extractsdatafromoneormoretableINSERTINTO-insertsnewdataintoatableUPDATE-updatesdatainatableDELETEFROM-deletesdatafromatableDMLExampleSelectallEscherichiacoliK12geneswhichareinthe1MB-2MBregionofthechromosome: SELECT* FROMOrganism,Gene WHERE Organism.Name=“EscherichiacoliK12”AND Organism.Accession=Gene.OAccessionAND Gene.Start>=1,000,000AND Gene.End<=2,000,000DMLExample(cont.)INSERTINTOGene(Name,Accession,OAccession,Start,End,Strand,Sequence)VALUES(“thrL”,16127995,”NC_000913”,190,255,’+’,”throperonleaderpeptide”,“MKRI…”)UPDATEGeneSETStart=160WHEREAccession=”NC_000913”DELETEFROMGeneWHEREAccession=”NC_000913”DataDefinitionLanguageSyntaxforcreating,editing,deleting:DatabasesTablesViewsIndexesConstraintsUsersPrivilegesDDLExamplesCREATEDATABASEMicrobial;CREATETABLEOrganism( Namevarchar(100) Sizeint(10) Gc decimal(5) Accessionvarchar(10) Releasedate(8) Centervarchar(100));ALTERTABLEOrganismADDSequencevarchar;DROPTABLEOrganism;DBMSSoftwarepackagefordefiningandmanagingadatabase.Examples:Proprietary:MSAccess,MSSQLServer,DB2,Oracle,SybaseOpensource:MySql,PostgreSQLDBMSAdvantages Program-dataindependenceMinimaldataredundancyImproveddataconsistency&quality
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 2025经城关集资建房合同
- 食堂租赁与管理承包合同20253篇
- 二零二五年度电子设备承包装卸安全合同4篇
- 2025年生物降解材料生产与应用合同范本4篇
- 二零二五年度打印机设备融资租赁合同8篇
- 2025版公共场所消防安全隐患排查整改合同2篇
- 二零二四年度新能源项目增资入股协议书范本合同2篇
- 建筑工程资料承包合同
- 窗帘工程合同
- 2025版代理记账及企业财务报表编制与审核合同4篇
- 中医诊疗方案肾病科
- 2025年安庆港华燃气限公司招聘工作人员14人高频重点提升(共500题)附带答案详解
- 人教版(2025新版)七年级下册数学第七章 相交线与平行线 单元测试卷(含答案)
- GB/T 44351-2024退化林修复技术规程
- 从跨文化交际的角度解析中西方酒文化(合集5篇)xiexiebang.com
- 中药饮片培训课件
- 医院护理培训课件:《早产儿姿势管理与摆位》
- 空气自动站仪器运营维护项目操作说明以及简单故障处理
- 2022年12月Python-一级等级考试真题(附答案-解析)
- T-CHSA 020-2023 上颌骨缺损手术功能修复重建的专家共识
- Hypermesh lsdyna转动副连接课件完整版
评论
0/150
提交评论