版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
IntroductiontoDatabases
DanielaPuiuApplicationsSpecialistCenterfortheStudyofBiologicalComplexity,VCUdpuiu@804-827-0952qq群发GeneralConcepts DatabasedefinitionOrganizedcollectionoflogicallyrelateddataDataKnownfactsTypes:text,graphics,images,sound,videosDatabasemanagementsystem(DBMS)SoftwarepackagefordefiningandmanagingadatabaseDatabaseExamples ClassrosterHospitalpatientsLiterature(publishedarticlesinacertainfield)GenomicinformationProteinstructureTaxonomySinglenucleotidepolymorphismExample:MicrobialDatabase Organism:NameAccessionnumberGenomesizeGC%ReleasedateGenomecenterSequenceGene(proteincodingregions):NameAccessionnumberOrganismLocationonthechromosome(start,end)StrandSizeProductSequenceDataabouttheproteincodingregionsinthemicrobialgenomessequencedsofar.DatabaseModelsFlatfiles ‘60Hierarchical ‘60Network ‘70Relational ‘80Objectoriented ‘90Objectrelational ‘90Webenabled ‘90DatabaseTypes(cont.)TypeTypicalnumberofusersTypicalarchitectureTypicalsizePersonal1Desktop/Laptop/PDAMBWorkgroup5-25Client/server:2tierMB-GBDepartment25-100Client/server:3tierGBEnterprise>100Client/server:distributedGB-TBInternet>1000Websever&applicationservers MB-GBFlatFilesCharacteristics:DataisstoredasrecordsinregularfilesRecordsusuallyhaveasimplestructureandfixednumberoffieldsForfastaccessmaysupportindexingoffieldsintherecordsNomechanismsforrelatingdatabetweenfilesOneneedsspecialprogramsinordertoaccessandmanipulatethedataFlatFilesExampleMicrobialdatabase:Genbankformat:EscherichiacoliK12StreptococcuspneumoniaeR6…Fastaformat:multiplefilesEscherichiacoliK12:genome,genes,genepositionsStreptococcuspneumoniaeR6:genome,genes,genepositions…Datamanipulation:Sequenceextraction,searchIndexingFormatconversion…RelationalDatabaseCharacteristics:Dataisorganizedintotables:rows&columnsEachrowrepresentsaninstanceofanentityEachcolumnrepresentsanattributeofanentityMetadatadescribeseachtablecolumnRelationshipsbetweenentitiesarerepresentedbyvaluesstoredinthecolumnsofthecorrespondingtables(keys)AccessiblethroughStandardQueryLanguage(SQL)Enterprisedatamodel GraphicalrepresentationofthehighlevelentitiesExample:MicrobialdatabaseeachorganismhasmultiplecorrespondinggenesOne:ManyrelationOrganismGene1mMetadataDatathatdescribesthepropertiesorcharacteristicsofotherdataDoesnotincludesampledataAllowsdatabasedesignersanduserstounderstandthemeaningofthedataMetadata&DataTableNameTypeMaxLengthDescriptionNameAlphanumeric100OrganismnameSizeInteger10Genomelength(bases)GcFloat5PercentGCAccessionAlphanumeric10AccessionnumberReleaseDate8ReleasedateCenterAlphanumeric100GenomecenternameSequenceAlphanumericVariableSequenceOrganismNameSizeGcAccessionReleaseCenterSequenceEscherichiacoliK124,640,00050NC_00091309/05/1997Univ.WisconsinAGCTTTTCATT…StreptococcuspneumoniaeR62,040,00040NC_00309809/07/2001EliLillyandCompanyTTGAAAGAAAA……Metadata&DataTable(cont.)NameTypeMaxLengthDescriptionNameAlphanumeric100GenenameAccessionAlphanumeric10GeneaccessionnumberOAccesionAlphanumeric10OrganismaccessionnumberStartInteger10GenestartEndInteger10GeneendStrandCharacter1GenestrandProductAlphanumeric1000GeneannotationSequenceAlphanumericVariableGenesequenceGeneNameAccessionOAccessionStartEndStrandProductSequencethrL16127995NC_000913190255+theoperonleaderpeptideMKRI…thrA16127996NC_0009133372799+homoserinedehydrogenaseIMRVL…transposase_A15902058NC_0030982020720554+transposaseMWYN…Relationships UsedtoconnecttablesField(s)thathavethesamevalueintherelatedtablesOrganism.Accession=Gene.OAccessionOrganism.AccessionUniquePrimarykeyGene.OAccessionNotuniqueSecondarykeySQLANSI(AmericanNationalStandardsInstitute)standardcomputerlanguageforaccessingandmanipulatingdatabasesystems.SQLstatementsareusedtoretrieveandupdatedatainadatabase.Includes:DataManipulationLanguage(DML)DataDefinitionLanguage(DDL)DataManipulationLanguageSyntaxforexecutingqueries,updating,inserting,anddeletingrecords.SELECT-extractsdatafromoneormoretableINSERTINTO-insertsnewdataintoatableUPDATE-updatesdatainatableDELETEFROM-deletesdatafromatableDMLExampleSelectallEscherichiacoliK12geneswhichareinthe1MB-2MBregionofthechromosome: SELECT* FROMOrganism,Gene WHERE Organism.Name=“EscherichiacoliK12”AND Organism.Accession=Gene.OAccessionAND Gene.Start>=1,000,000AND Gene.End<=2,000,000DMLExample(cont.)INSERTINTOGene(Name,Accession,OAccession,Start,End,Strand,Sequence)VALUES(“thrL”,16127995,”NC_000913”,190,255,’+’,”throperonleaderpeptide”,“MKRI…”)UPDATEGeneSETStart=160WHEREAccession=”NC_000913”DELETEFROMGeneWHEREAccession=”NC_000913”DataDefinitionLanguageSyntaxforcreating,editing,deleting:DatabasesTablesViewsIndexesConstraintsUsersPrivilegesDDLExamplesCREATEDATABASEMicrobial;CREATETABLEOrganism( Namevarchar(100) Sizeint(10) Gc decimal(5) Accessionvarchar(10) Releasedate(8) Centervarchar(100));ALTERTABLEOrganismADDSequencevarchar;DROPTABLEOrganism;DBMSSoftwarepackagefordefiningandmanagingadatabase.Examples:Proprietary:MSAccess,MSSQLServer,DB2,Oracle,SybaseOpensource:MySql,PostgreSQLDBMSAdvantages Program-dataindependenceMinimaldataredundancyImproveddataconsistency&quality
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 江苏省南京市2023-2024学年高一上学期期末学情调研测试历史试卷(解析版)
- 标志设计知到智慧树章节测试课后答案2024年秋甘肃政法大学
- 高端红薯购销合同范例
- 节会采购合同范例
- 四川工程职业技术学院《矿图及CAD基础》2023-2024学年第一学期期末试卷
- 船舶设备维修合同范例
- 四川电影电视学院《债权法专题》2023-2024学年第一学期期末试卷
- 木制托盘定制合同范例
- 投资公司借款合同范例
- 快递委托经营合同范例
- DB11T 2081-2023 道路工程混凝土结构表层渗透防护技术规范
- 贵州省贵阳市2023-2024学年高一上学期期末考试 物理 含解析
- 我的教育故事
- 山东省青岛市2023-2024学年高一年级上册1月期末选科测试 生物 含解析
- 电工技术(第3版)表格式教案教学详案设计
- 中学教职工安全知识测试练习试题
- 2024年青岛市技师学院招考聘用48人高频500题难、易错点模拟试题附带答案详解
- 2024商业地产策划定位和规划设计合同书模板
- 玉溪大红山铁矿二期北采区采矿施工组织设计
- 2024新教科版四年级上册科学知识点总结精简版
- 中西文化鉴赏智慧树知到答案2024年郑州大学
评论
0/150
提交评论