Metadata Extraction Tool

Installation Guide

Version: 3.4.

Table of Contents

What is the Metadata Extraction Tool
Source Directory Structure
Binary Directory Structure
Installing from Source
Installing from Binary
Configuring your Environment
Running the Tool
Troubleshooting
What is the Metadata Extraction Tool

The Metadata Extraction Tool was built by Sytec Resources for the National Library of New Zealand Te Puna Mātauranga o Aotearoa (National Library) to process digital master files and extract metadata about those files. Metadata is descriptive information about an object, not the object itself. For an image, for example, the metadata would include its size, colours and resolution. An output metadata file can take one of two formats:

Native form. An XML (Extensible Markup Language) file based on a DTD (Document Type Definition) that reflects all information available from the digital master.

National Library Preservation Metadata Data Dictionary (nlnz-presmet.xsd). This is the primary format.

For more information about either of these file formats, see the Solution Architecture or Software Architecture documents for this extraction tool. Note: the Proof of Concept output types demta.dtd and pmeta.dtd have been deprecated; they are not supported in the production tool.

Source Directory Structure

The source directory structure is as follows:

Directory           Description
BASE                Contains the build.xml and license files. Also the destination for the distributable ZIP files.
BASE/dist           The build directory for assembling the distributables. This directory is deleted and recreated each time the build script is run.
BASE/docs           Contains the main documentation for the application.
BASE/docs/apidocs   Contains the Javadocs for the application.
BASE/legal          Contains the license and notice files for all the libraries distributed with the Metadata Extraction Tool.
BASE/lib            Contains the libraries that the Metadata Extraction Tool depends on.
BASE/src            Root directory for all source elements.
BASE/src/java       Root directory for the Java source code.
BASE/src/help       Root directory for the online HTML help guide.
BASE/src/images     Root directory for the images used in the GUI.
BASE/src/scripts    Root directory for the batch files and shell scripts used to run the application.
BASE/src/xml        Root directory for the XML configuration files, DTDs and XSLT files.
BASE/target         The destination directory for the javac ANT task. This directory is deleted and recreated each time the ANT script is run.

Binary Directory Structure

The binary distributable's directories are described below:

Directory                 Description
BASE                      Holds the configuration file, the Metadata Extraction Tool license file and the batch/shell scripts.
BASE/adapters             Holds the full set of adapters.
BASE/help_files           Contains the online help files.
BASE/installed-adapters   Contains the JAR files for all of the installed adapters.
BASE/legal                Contains the license and notice files for all the libraries distributed with the Metadata Extractor.
BASE/lib                  Contains the libraries that the Metadata Extractor depends on.
BASE/xml                  Root directory for the XML configuration files, DTDs and XSLT files.

Installing from Source

The Metadata Extraction Tool is built from source using ANT (Apache Ant), which can be downloaded from the Apache Ant project website. The build file has been tested against version 1.6.1.

With ANT on your path, change into the root directory of the Metadata Extraction Tool and run ant. The default target will clean the directories, compile the code, and produce the binary and source distributables.

To regenerate the JavaDocs, run ant javadoc.

When ANT has finished, a binary distributable will be found in

BASE\metadata-bin-2-0.zip

Once a binary distributable has been built, you can install from binary as described below.
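The build steps above can be wrapped in a small shell function. This is only a sketch: the function name build_met and its argument (the root of the unpacked source tree) are illustrative assumptions, not part of the tool. It checks that the ant command and build.xml are present before running the default target.

```shell
# Sketch only: build_met is a hypothetical helper, not shipped with the tool.
# Usage: build_met /path/to/source/root
build_met() {
    src_dir="$1"

    # ANT must be reachable through PATH for the build to start.
    if ! command -v ant >/dev/null 2>&1; then
        echo "ant not found on PATH" >&2
        return 1
    fi

    # The build file lives in the root (BASE) of the source tree.
    if [ ! -f "$src_dir/build.xml" ]; then
        echo "no build.xml in $src_dir" >&2
        return 1
    fi

    # Run the default target (clean, compile, package) from the source root.
    (cd "$src_dir" && ant)
}
```

Calling the function with a directory that has no build.xml returns a non-zero status without invoking ANT.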

Installing from Binary

Unzip the ZIP file to a desired location. It is strongly recommended to choose a directory name that does not contain spaces.
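As a minimal sketch of the recommendation above, the following shell function checks a candidate install path for spaces before unzipping. The function name is_space_free and the example path in the comment are assumptions made for illustration.

```shell
# Sketch only: is_space_free is a hypothetical helper.
# Returns 0 when the given path contains no space characters, 1 otherwise.
is_space_free() {
    case "$1" in
        *" "*) return 1 ;;
        *)     return 0 ;;
    esac
}

# Example use (the target path is an assumption):
#   target=/opt/metadata
#   is_space_free "$target" && unzip metadata-bin-2-0.zip -d "$target"
```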

Configuring your Environment

Configuration of the tool is automatic, assuming the following:

For Windows, Java is in the path.

For Linux, the JAVA_HOME environment variable is set.

If these are true, or if you are unsure, just run the metadata.bat or metadata.sh script.

The Windows scripts assume that Java is in your path and can be found without specifying its exact location. If the metadata.bat or extract.bat scripts fail to run, you may need to edit them and provide an explicit path for your Java installation.

The Linux scripts require the JAVA_HOME variable to be set. If it is not set, you can add it to the metadata.sh and extract.sh scripts as follows, being sure to replace the path with the appropriate path for your Java installation.

JAVA_HOME=/usr/java/jdk1.5.0

export JAVA_HOME

Both versions attempt to guess the installation directory and to configure the initial config.xml file without manual intervention. If this fails, you will be asked to edit the scripts and set the METAHOME variable.
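Before running the Linux scripts, it can help to confirm that JAVA_HOME points at a usable Java installation. The function below is a sketch under that assumption; the name check_java_home and its return codes are illustrative and not part of the shipped scripts.

```shell
# Sketch only: check_java_home is a hypothetical helper.
# Returns 0 if JAVA_HOME is set and contains an executable bin/java,
# 1 if JAVA_HOME is unset or empty, 2 if bin/java is missing or not executable.
check_java_home() {
    if [ -z "${JAVA_HOME:-}" ]; then
        echo "JAVA_HOME is not set" >&2
        return 1
    fi
    if [ ! -x "$JAVA_HOME/bin/java" ]; then
        echo "no executable java under $JAVA_HOME/bin" >&2
        return 2
    fi
    return 0
}
```

A return code of 1 corresponds to the case where the variable still needs to be added to the scripts; a return code of 2 suggests the path needs correcting.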

Running the Tool

To run the Metadata Extraction Tool, change into the BASE directory and run metadata.bat or metadata.sh.

If you wish to run the tool manually, or embed it in another application, you must set the classpath to contain:

All JARs in the BASE/lib directory.

The BASE directory itself; this is where the config.xml file is located.

Once the classpath is configured correctly, you can run the tool using:

$JAVA_HOME/bin/java t.natlib.meta.ui.Main (Linux)

%JAVA_HOME%\bin\java t.natlib.meta.ui.Main (Windows)
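On Linux, the classpath described above could be assembled along the following lines. This is a sketch, not the logic of the shipped metadata.sh: the function name build_classpath is an assumption, and it takes the BASE directory as its only argument.

```shell
# Sketch only: build_classpath is a hypothetical helper.
# Prints a colon-separated classpath made of the BASE directory itself
# (where config.xml lives) followed by every JAR in BASE/lib.
build_classpath() {
    base="$1"
    classpath="$base"
    for jar in "$base"/lib/*.jar; do
        # Skip the unexpanded pattern when no JAR files are present.
        [ -e "$jar" ] || continue
        classpath="$classpath:$jar"
    done
    printf '%s\n' "$classpath"
}
```

The result can then be passed to the Java launcher with its -cp option, followed by the main class named in the command above.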

Troubleshooting

The following table lists commonly encountered issues and their resolutions. The most common issues involve the directory locations specified in the configuration file. Following the instructions in the Configuring your Environment section above should avoid most of them.

If you get an error during harvesting, use the Log Viewer to get additional information about the error, or consult the Output.log file.

Symptom: On startup, you see the message:
    The system cannot find the path specified.
or
    -bash: java: command not found
Description: The JAVA_HOME variable is not set correctly.
Solution: Set the JAVA_HOME variable as per the instructions in the Configuring your Environment section of this document.

Symptom: On startup, you see errors such as:
    LOG:1000, Adapter class t.natlib.adapter.bmp.BitmapAdapter not found
    java.lang.ClassNotFoundException: t.natlib.adapter.bmp.BitmapAdapter
Description: The jar-location in the config.xml file is not set correctly.
Solution: Edit config.xml in the base directory and ensure that the jar-location URL attribute points at a valid directory.

Symptom: On startup, you see an error such as:
    java.io.FileNotFoundException:
    METADATA_BASE\logs\nlnz_Jan302007_171007.log
Description: The log-dir element in config.xml points to a directory that does not exist.
Solution: Edit config.xml in the base directory and ensure that the dir attribute of the log-dir element is set to an existing directory. Note that there are two occurrences of the log-dir element in the default configuration: one at the top of the configuration file, and one in the profile section towards the bottom.

Symptom: During harvest, you see an error such as:
    java.io.FileNotFoundException: METADATA_BASE\harvested\…\filename.xml
    at java.io.FileOutputStream.open(Native Method)
    at java.io.FileOutputStream.<init>(FileOutputStream.java:179)
    at java.io.FileOutputStream.<init>(FileOutputStream.java:131)
    at t.natlib.meta.harv
