版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
1、机器视觉综述第1页,共73页。Todays TalkWhat is Computer Vision?Why Study Computer Vision?How Vision is Used Now?Overview of Computer Vision AlgorithmChallenges of Computer VisionQuestions2第2页,共73页。What is computer vision?Terminator 23Terminator 5第3页,共73页。Every picture tells a story4Goal of computer vision is to
2、write computer programs that can interpret images第4页,共73页。Can computers match (or beat) human vision?5第5页,共73页。What is Computer Vision?Automatic understanding of images and videoComputing properties of the 3D world from visual data (measurement) 6第6页,共73页。 1. Vision for measurementReal-time stereoSt
3、ructure from motionNASA Mars RoverPollefeys et al.Multi-view stereo forcommunity photo collectionsGoesele et al.Slide credit: L. Lazebnik7第7页,共73页。What is Computer Vision?Automatic understanding of images and videoComputing properties of the 3D world from visual data (measurement)Algorithms and repr
4、esentations to allow a machine to recognize objects, people, scenes, and activities. (perception and interpretation) 8第8页,共73页。2. Vision for perception, interpretationskywaterFerris wheelamusement parkCedar Point12 Etreetreetreecarouseldeckpeople waiting in linerideriderideumbrellaspedestriansmaxair
5、benchtreeLake Eriepeople sitting on rideObjectsActivitiesScenesLocationsText / writingFacesGesturesMotionsEmotionsThe Wicked Twister9第9页,共73页。What is Computer Vision?Automatic understanding of images and videoComputing properties of the 3D world from visual data (measurement)Algorithms and represent
6、ations to allow a machine to recognize objects, people, scenes, and activities. (perception and interpretation)Algorithms to mine, search, and interact with visual data (search and organization) 10第10页,共73页。3. Vision for search and organization11第11页,共73页。Components of a computer vision systemLighti
7、ngSceneCameraComputer Scene InterpretationSrinivasa Narasimhans slide12第12页,共73页。Computer vision vs human visionWhat we seeWhat a computer sees13第13页,共73页。Vision is really hardVision is an amazing feat of natural intelligenceVisual cortex occupies about 50% of brainMore human brain devoted to vision
8、 than anything elseIs that a queen or a bishop?14第14页,共73页。Vision is multidisciplinary From wikiComputer GraphicsHCI15第15页,共73页。Why computer vision mattersSafetyHealthSecurityComfortAccessFun16第16页,共73页。A little story about Computer VisionIn 1966, Marvin Minsky at MIT asked his undergraduate student
9、 Gerald Jay Sussman to “spend the summer linking a camera to acomputer and getting the computer to describe what it saw”. We now know that the problem is slightly more difficult than that. (Szeliski 2009, Computer Vision)17第17页,共73页。Ridiculously brief history of computer vision1966: Minsky assigns c
10、omputer vision as an undergraduate summer project1960s: interpretation of synthetic worlds1970s: some progress on interpreting selected images1980s: ANNs come and go; shift toward geometry and increased mathematical rigor1990s: face recognition; statistical analysis in vogue2000s: broader recognitio
11、n; large annotated datasets available; video processing starts2030s: robot uprising?Guzman 68Ohta Kanade 78Turk and Pentland 91第18页,共73页。19第19页,共73页。 Why study computer vision?Millions of images being captured all the timeLots of useful applicationsThe next slides show the current state of the artSo
12、urce: S. Lazebnik第20页,共73页。 Flickr1 billion2 billion3 billion4 billion5 billion6 billion第21页,共73页。 Other photo sharing sites10 billion20 billion50 billion30 billion40 billion第22页,共73页。 and growingFlickr: 1.7 million photos / dayFacebook: 100 million photos / dayYouTube: 35 hours of video every minut
13、e 57 billion photos will be taken (US) in 2010/windows_live/b/windowslive/archive/2010/04/09/what-to-do-with-57-billion-photos.aspx(as of November 2010)(compare with 17 billion negatives exposed in 1996)(as of February 2010)第23页,共73页。How vision is used nowExamples of state-of-the-art24第24页,共73页。1. O
14、ptical character recognition (OCR)Digit recognition, AT&T labs/yann/Technology to convert scanned docs to textIf you have a scanner, it probably came with OCR softwareLicense plate readers/wiki/Automatic_number_plate_recognition25第25页,共73页。2. Face detectionMany new digital cameras now detect facesCa
15、non, Sony, Fuji, 26第26页,共73页。3. Smile detectionSony Cyber-shot T70 Digital Still Camera 27第27页,共73页。4. 3D from thousands of imagesBuilding Rome in a Day: Agarwal et al. 200928The old city of Dubrovnik, 4,619 images, 3,485,717 points第28页,共73页。5. Object recognition (in supermarkets)LaneHawk by Evoluti
16、onRobotics“A smart camera is flush-mounted in the checkout lane, continuously watching for items. When an item is detected and recognized, the cashier verifies the quantity of items that were found under the basket, and continues to close the transaction. The item can remain under the basket, and wi
17、th LaneHawk, you are assured to get paid for it “29第29页,共73页。6. Vision-based biometrics“How the Afghan Girl was Identified by Her Iris Patterns” National Geographic30第30页,共73页。7. ForensicsSource: Nayar and Nishino, “Eyes for Relighting”第31页,共73页。Source: Nayar and Nishino, “Eyes for Relighting”第32页,共
18、73页。Source: Nayar and Nishino, “Eyes for Relighting”第33页,共73页。8. Login without a passwordFingerprint scanners on many new laptops, other devicesFace recognition systems now beginning to appear more widely/34第34页,共73页。9. Object recognition (in mobile phones)Point & Find, NokiaGoogle Goggles35第35页,共73
19、页。10. Vision in spaceVision systems (JPL) used for several tasksPanorama stitching3D terrain modelingObstacle detection, position trackingFor more, read “Computer Vision on Mars” by Matthies et al.NASAS Mars Exploration Rover Spirit captured this westward view from atop a low plateau where Spirit sp
20、ent the closing months of 2007. 36第36页,共73页。11. Industrial robotsVision-guided robots position nut runners on wheels37第37页,共73页。12. Mobile robots/NASAs Mars Spirit Rover/wiki/Spirit_roverSaxena et al. 2008STAIR at Stanford38第38页,共73页。13. Medical imagingImage guided surgeryGrimson et al., MIT3D imagi
21、ngMRI, CT39第39页,共73页。14. Digital cosmetics40第40页,共73页。15. InpaintingBertalmio et al. SIGGRAPH 0041第41页,共73页。16. DebluringFergus et al. SIGGRAPH 0642第42页,共73页。17. SportsSportvision first down lineNice explanation on /video.html43第43页,共73页。18. Smart carsMobileyeVision systems currently in high-end BMW
22、, GM, Volvo models By 2010: 70% of car manufacturers.44第44页,共73页。19. Google carsOct 9, 2010.Google Cars Drive Themselves, in Traffic.The New York Times. John MarkoffJune 24, 2011. Nevada state law paves the way for driverless cars.Financial Post. Christine DobbyAug 9, 2011, Human error blamed after
23、Googles driverless car sparks five-vehicle crash.The Star(Toronto)45第45页,共73页。20. Interactive Games: KinectObject Recognition: /watch?feature=iv&v=fQ59dXOo63oMario: /watch?v=8CTJL5lUjHg3D: /watch?v=7QrnwoO1-8ARobot: /watch?v=w8BmgtMKFbY46第46页,共73页。The Matrix movies, ESC Entertainment, XYZRGB, NRC21.
24、 Special effects: shape capture47第47页,共73页。Pirates of the Carribean, Industrial Light and Magic22. Special effects: motion capture48第48页,共73页。Computer Vision and Nearby FieldsComputer Graphics: Models to ImagesComp. Photography: Images to ImagesComputer Vision: Images to Models49第49页,共73页。Overview o
25、f Computer Vision Algorithm50So what do humans care about?第50页,共73页。Verification: is that a bus?slide by Fei Fei, Fergus & Torralba 51第51页,共73页。Detection: are there cars?slide by Fei Fei, Fergus & Torralba 52第52页,共73页。Identification: is that a picture of Mao?slide by Fei Fei, Fergus & Torralba 53第53
26、页,共73页。Object categorizationskybuildingflagwallbannerbuscarsbusfacestreet lampslide by Fei Fei, Fergus & Torralba 54第54页,共73页。Scene and context categorization outdoor city traffic slide by Fei Fei, Fergus & Torralba 55第55页,共73页。Rough 3D layout, depth ordering56第56页,共73页。Overview of Computer Vision A
27、lgorithmImage formationFeatures Grouping & fittingMulti-view geometryRecognition & learningMotion & tracking57第57页,共73页。1. Image formationHow does light in 3d world project to form 2d images?58第58页,共73页。2. Features and filtersTransforming and describing images; textures, colors, edges59第59页,共73页。3.
28、Grouping & fittingfig from Shi et alClustering, segmentation, fitting; what parts belong together?60第60页,共73页。4. Multiple viewsHartley and ZissermanMulti-view geometry, matching, invariant features, stereo visionFei-Fei Li61第61页,共73页。5. Recognition and learningRecognizing objects and categories, learning techniques62第62页,共73页。6. Motion and trackingTracking objects, video analysis, low level motion,
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- GB/T 12347-2025钢丝绳疲劳试验方法
- 2025年关于为淄博市检察机关公开招聘聘用制书记员的备考题库带答案详解
- 2026年医疗信息安全管理合同
- 2025年兴业银行济南分行社会招聘备考题库带答案详解
- 惠州市惠城区卫生健康局2025年公开选聘医疗卫生事业单位领导备考题库及完整答案详解一套
- 2025年永康市科学技术局工作人员招聘备考题库及完整答案详解一套
- 2025年首都医科大学附属北京朝阳医院石景山医院派遣合同制职工招聘备考题库及1套参考答案详解
- 2025年招商银行佛山分行社会招聘备考题库及参考答案详解一套
- 2025年医保系统年终工作总结
- 2026年高邮市卫健系统事业单位公开招聘高层次人才备考题库及一套答案详解
- 林地除草合同范本
- 云南高中体育会考试题及答案
- 2025广东惠州市城市建设投资集团有限公司社会招聘9人备考笔试试题及答案解析
- 2025湖北武汉市公安局蔡甸区分局第二批招聘警务辅助人员43人考试笔试参考题库及答案解析
- 军事地形学图课件
- 新生儿一例个案护理
- 2025年沈阳辅警招聘考试真题及一套参考答案详解
- 花中四君子课件
- QC成果-提高组合幕墙铝单板安装一次施工合格率(诏安县总医院扩建项目QC小组)
- 设备维护保养方案及设备更新改造计划
- 国网安全技术培训课件
评论
0/150
提交评论