Welcome to ImageLab
Research at ImageLab covers topics of Computer Vision, Pattern Recognition & Machine Learning, and Multimedia applied to optical images and videos as well as data from different sensors. In addition, ImageLab is currently involved in research activities on vision-based human-computer interaction.
The main research projects concern: Video surveillance, Machine vision and robot vision, Medical imaging, Human-centered Multimedia, Content-based retrieval, People detection and tracking, Human behaviour understanding, Egocentric vision and Embedded sensors.
Two papers accepted @IV 2017 · 16 Mar 2017
- Learning Where to Attend Like a Human Driver (A. Palazzi, F. Solera, S. Calderara, S. Alletto, R. Cucchiara)
- Embedded Recurrent Network for Head Pose Estimation in Car (G. Borghi, R. Gasparini, R. Vezzani, R. Cucchiara)
have been accepted in IEEE Intelligent Vehicles Symposium (IV 2017).
Two papers accepted @CVPR · 09 Mar 2017
- POSEidon: Face-from-Depth for Driver Pose Estimation (G. Borghi, M. Venturelli, R. Vezzani, R. Cucchiara)
- Hierarchical Boundary-Aware Neural Encoder for Video Captioning (L. Baraldi, C. Grana, R. Cucchiara)
have been accepted in CVPR 2017, that will take place at the Hawaii Convention Center from July 21 to July 26, 2017 in Honolulu, Hawaii.
In the meanwhile the e-version will be available, please take a look at the preliminar versions of the papers.
Seminario · 01 Mar 2017
Machine Vision in the deep learning era · 01 Feb 2017
Ingegneria dei Sistemi Intelligenti: Vision, Deep Learning e Cognitive Computing · 25 Jan 2017
Second edition of Master MuMeT on Visual Computing and Multimedia Technologies is starting! · 25 Jan 2017
MuMeT 2017 is an international Master of second level of the University of Modena and Reggio Emilia, co-sponsored by Italian MIUR (Ministero della Istruzione, Università e Ricerca) in the National Technology Clusters (project CittaEducante). MUMET II level Master wants to create new professional figures with advanced knowledge on the emerging technologies in visual computing, vision and machine learning, and intelligent multimedia systems.
The Master homepage can be found here.
Il fattore umano nelle auto a guida autonoma. Attenzione umana ed intelligenza artificiale · 18 Jan 2017
Invited talk "human behavior understanding in automotive and around" · 07 Nov 2016
Invited talk "Visual Intelligence For Human And Machine" · 28 Oct 2016
Multi-Camera Tracking: following people in large camera networks · 14 Oct 2016
Dr. Ergys Ristani is visiting ImageLab and he will give a talk on Multi-Camera Tracking on October, Tuesday 18th at 4pm in FA-0A
Multi-Camera Tracking is a canonical problem in computer vision which tries to determine "who is where" at all time across a large camera network. Typical applications include city-wide surveillance and airport/station monitoring.
Dr. Ergys Ristani is a PhD candidate at Duke University, under the supervision of Prof. Carlo Tomasi. His research interest include Multi-Target Tracking, Multi-Camera Tracking, and the estimation and tracking of motion boundaries.
To download the seminar flyer, please see: http://imagelab.ing.unimore.it/imagelab/uploadedFiles/seminario_ristani_18_10_2016.pdf.
Spotting prejudice team @Imagelab press release · 03 Oct 2016
The Prejudice team of unimore composed by Imagelab memebers Andrea Palazzi, Simone Calderara and Rita Cucchiara has gained the attention of the scientific community with their recent work about "Spotting prejudice with nonverbal behaviours" presented at UBICOMP 2016.
Check out our "Prejudice dataset" here.
Attached a Press release of all articles that talk about our work on the web and in printed papers:
https://www.newscientist.com/article/mg23130933-200-camera-spots-your-hidden-prejudices-from-your-body-language/ http://www.dailymail.co.uk/sciencetech/article-3813425/Can-computer-tell-RACIST-Algorithm-detect-hidden-prejudice-person-s-body-language.html http://www.repubblica.it/tecnologia/2016/09/29/news/studio_razzismo_ecco_l_algoritmo_italiano_per_smascherare_i_pregiudizi_anche_inconsapevoli-148760996/ http://myinforms.com/en-gb/a/41993012-camera-spots-your-hidden-prejudices-from-your-body-language/ http://anglenews.com/can-a-computer-tell-if-youre-racist-algorithm-can-detect-hidden-prejudice-from-a-persons-body-language/ http://tech.fanpage.it/l-algoritmo-italiano-che-svela-razzismo-e-pregiudizi/ https://www.tomshw.it/l-algoritmo-italiano-per-smascherare-razzismo-e-pregiudizi-80376 https://www.bcrmagazine.it/201633983/razzismo-ecco-lalgoritmo-italiano-per-smascherare-i-pregiudizi-anche-inconsapevoli.php http://23on.com/razzismo-ecco-lalgoritmo-italiano-per-smascherare-i-pregiudizi-anche-inconsapevoli/ http://www.modicanews.com/can-a-computer-tell-if-youre-racist-algorithm-can-detect-hidden-prejudice-from-a-persons-body-language/ http://huntnews.in/p/detail/2821262581067916?uc_param_str=dnfrpfbivesscpgimibtbmntnijblauputoggdnw&pos=1475155800173&channel=technology&chncat=category_english http://ebalsblog.blogspot.it/2016/09/can-computer-tell-if-youre-racist-new.html http://www.looooker.com/archives/34774 http://newsfeeds.pegitboard.com/news/can-a-computer-tell-if-you-re-racist-algorithm-can-detect-hidden-prejudice-from-a-person-s-body-language?uid=24563
New Scientist features ImageLab research about recognition of non-verbal behaviours · 29 Sep 2016
The international science magazine New Scientist, based in UK, talks about the ImageLab research about recognition of non-verbal behaviours in one of its cover articles. The link to the article for the online version of the science magazine can be found here: https://www.newscientist.com/article/mg23130933-200-camera-spots-your-hidden-prejudices-from-your-body-language/.
ImageLab algorithm in OpenCV · 26 Sep 2016
OpenCV accepted our Connected Components Labeling algorithm into their official source code. From the next release, our contribution will be the official one, recognizing its superior performance.
OpenCV (Open Source Computer Vision Library) is an open source computer vision and machine learning software library. OpenCV was built to provide a common infrastructure for computer vision applications and to accelerate the use of machine perception in the commercial products. The library has more than 2500 optimized algorithms, which includes a comprehensive set of both classic and state-of-the-art computer vision and machine learning algorithms. These algorithms can be used to detect and recognize faces, identify objects, classify human actions in videos, track camera movements, track moving objects, extract 3D models of objects, produce 3D point clouds from stereo cameras, stitch images together to produce a high resolution image of an entire scene, find similar images from an image database, remove red eyes from images taken using flash, follow eye movements, recognize scenery and establish markers to overlay it with augmented reality, etc. OpenCV has more than 47 thousand people of user community and estimated number of downloads exceeding 7 million. The library is used extensively in companies, research groups and by governmental bodies.
Facebook AI Research Partnership · 29 Aug 2016
Facebook has selected Imagelab as one of the 15 world-class research labs in Europe to receive a GPU-based server as part of the Facebook AI Research Partnership. Our application has been selected from a large pool of universities and research institutes working on important and innovative research. Facebook is committed to building strong research partnerships with institutions throughout Europe, and ensuring that Facebook's partners have the necessary hardware and tools to continue tackling some of the most important challenges in the disciplines of artificial intelligence and machine learning. Thanks to Lorenzo Baraldi, Costantino Grana and Simone Calderara!
Read more on the Facebook Research blog.
Report of Dagstuhl Seminar - Eyewear Computing · 19 Jul 2016
Invited talk alla scuola vismac 16 giugno 2016 · 16 Jun 2016
First International Workshop on Egocentric Perception, Interaction and Computing · 24 May 2016
ImageLab is organising the First International Workshop on Egocentric Perception, Interaction and Computing (EPIC@ECCV16) that will be held on the October 8-10, 2016, in conjunction with the European Conference on Computer Vision, Amsterdam, The Netherlands. Its goal is to give an overview of the recent technologies and system solutions, create a forum to exchange ideas and address challenges emerging in this field.
Talk by Prof. Tal Hassner · 17 May 2016
Prof. Tal Hassner (The Open University of Israel) is visiting Imagelab and is giving a talk on
"Faces, deep learning and the pursuit of training data"
Tuesday May 17, 2016 - 02:00 p.m. - Aula P 0.2 ex FA-0A
Abstract: The abilities of machines to detect and recognize faces improved remarkably over the last few years. This progress can at least partially be explained by the sizes of the training sets used to train deep learning models: huge numbers of face images downloaded and manually labeled. It is not clear, however, if the formidable task of collecting and labeling so many images is truly necessary. I will discuss the problems of data collection and describe a number of effective techniques for maximizing deep learning capabilities when collecting additional data is not an option. Importantly, though this talk will focus on face processing related tasks, these techniques can be applied in other image understanding problems where obtaining enough labeled examples for training deep learning systems is hard.
Nuovi sistemi intelligenti - Il Deep Learning per l'impresa del futuro · 16 Apr 2016
Il 16 maggio 2016 presso l'Auditorium del Tecnopolo di Modena si terrà una giornata dedicata al Deep Learning per l'impresa del futuro. Nel corso dell’iniziativa verranno presentate soluzioni, piattaforme software ed infrastrutture utilizzabili per portare il deep learning e le tecniche avanzate di apprendimento automatico nell’impresa, con testimonianze internazionali, accademiche e industriali. Seguirà una tavola rotonda sul loro utilizzo nell'industria e nell'impresa del futuro.
Per informazioni e registrazione: Pagina dedicata all'evento
Startcup regionale e PNI nazionale 2016 · 12 Apr 2016
Il 26 aprile si aprirà il bando per la Startcup Regionale, la competizione per favorire la nascita delle nuove imprese ad alto contenuto innovativo. Si rivolge a persone singole o gruppi interessati a sviluppare idee imprenditoriali innovative e/o ad alto grado di conoscenza. Le migliori idee imprenditoriali parteciperanno al Premio Nazionale Innovazione PNI Italia che si terrà a Modena ospitato da UNIMORE a dicembre 2016.
Per informazioni e contatti: http://www.unimore.it/evidenza/pni2016.html
Talk at "R&D: da Giovani Idee, Grandi Progetti" · 08 Apr 2016
In the context of the talk serie "R&D: da Giovani Idee, Grandi Progetti", Imagelab and Samsung will give a joint talk on 14/04/2016. Ing. Alletto Stefano will present the latest research efforts in the field of wearable computing, internet of things and object recognition. For further information please refer to the talk organizers' website: www.ideelab.it
Rita Cucchiara's interview at TGR TV News · 16 Mar 2016
Prof. Rita Cucchiara explains to the journalists of TV news TGR some of the last results obtained in the project Città Educante, in the context of exploting the new technologies to favour the acceptance of diversity in the primary schools.
The video can be found here.
Rita Cucchiara's talk at Area Chiar CVPR Workshop Vancouver 29 Feb 2015 · 01 Mar 2016
Hardware Grants from Nvidia and Cineca · 10 Feb 2016
Imagelab has received two important hardware grants:
The NVIDIA Hardware Grant, with the donation of one Tesla K40 GPU.
The Italian Supercomputing Resource Allocation (ISCRA) Grant from CINECA, which gives Imagelab access to the Galileo HPC Platform, containg 16 Tesla K80 GPUs.
The granted hardware will be mainly employed in the research project "Deep Learning in videos: concept detection and temporal video segmentation", as well as for other Imagelab activities.
Computer Vision Foundation: Advisory board · 04 Feb 2016
The Computer Vision Foundation appointed the new Advisory Board, chaired by Rene Vidal, composed by Anthony Hoogs, David Forsyth, Dimitri Metaxas,
Josef Sivic ,Kyoung Mu Lee ,Kyros Kutulakos ,Martial Hebert ,Rita Cucchiara ,Ruzena Bajcsy, Stan Sclaroff, Sven Dickinson, and Philip Torr.
Seminario Prof. Antonino Mazzeo · 02 Dec 2015
"Metodi, ambienti IDE, architetture e tecnologie per la progettazione dei sistemi digitali embedded".
Abstract: Il seminario intende focalizzarsi, con riferimento a specifici domini applicativi quali quelli dell’automotive dell’aerospazio e del ferroviario, sulla progettazione dei sistemi digitali embedded ad elevata complessità e assoggettati a vincoli di sicurezza, affidabilità, real-time e time to market e, in particolare, sugli aspetti delle differenti soluzioni oggi consentite dalle correnti tecnologie VLSI inquadrate in vari contesti architetturali dedicati (sistemi dedicati, general purpose e misti realizzati su FPGA, SoC e MPSoC ) e sulle metodologie di sviluppo a supporto.
Sono disponibili i lucidi del seminario al seguente indirizzo
Vision for cultural heritage · 25 Nov 2015
Mercoledì 25 e Giovedì 26 Novembre ImageLab/Softech-ICT ospiterà l'evento "Vision for cultural heritage", nell'ambito del progetto Dicet.
Il programma e il materiale dell'evento sono disponibili al seguente link: Vision for cultural heritage
Lectures by Prof. Nadia Magnenat Thalmann · 25 Sep 2015
Tutorial at CAIP 2015 · 01 Sep 2015
Costantino Grana and Giuseppe Serra gave a tutorial at the International Conference on Computer Analysis of Images and Patterns in Malta on The Bag of Visual Words model and recent advancements in image classification. Tutorial slides
International Workshop on Wearable and Ego-vision Systems for Augmented Experience (WEsAX) · 03 Jul 2015
ImageLab is organising the first International Workshop on Wearable and Ego-vision Systems for Augmented Experience (WEsAX) that will be held on the July 3, 2015, in conjunction with the IEEE International Conference on Multimedia and Expo (ICME), Turin, Italy. The goal of the first International Workshop on Wearable and Ego-vision Systems for Augmented Experience (WEsAX) is to give an overview of the recent technologies and system solutions, create a forum to exchange ideas and address challenges emerging in this field.
Lecture by Ing. Alessio Bazzica · 22 May 2015
Ing Alessio Bazzica is visiting Imagelab and is going to present a lecture on
Discovering the Music Information Retrieval Field
(Friday May 22, 2015 - 9:00am - Aula FA-1D )
Alessio Bazzica received the M.S. degree in Computer Engineering from the University of Florence (Italy) in 2012 and is now a 3nd year PhD candidate at the Multimedia Computing (MMC) Group in Delft (The Netherlands). He is interested in multimedia information retrieval and he is currently working on multimodal and crossmodal approaches for music information retrieval.
Attivazione master di II livello - MUMET · 08 May 2015
Al via il nuovo Master MUMET. Il master vuole creare nuove figure professionali (MULTIMEDIA DATA SCIENTIST), con competenze avanzate sulle tecnologie informatiche multimediali e sul visualcomputing, competenze attualmente molto rare ed assai richieste per i sistemi, servizi ed applicazioni nelle smartcities and communities.
Workshop tematico - "New Ideas for Education in Smart Communities" · 20 Apr 2015
Il progetto Città Educante ha lo scopo di ripensare radicalmente l'ambiente di apprendimento, attraverso l'applicazione delle più avanzate tecnologie informatiche. Il workshop, ad invito e a partecipazione libera, ha come obiettivo lo scambio di idee e la disseminazione delle attività svolte dai partners durante il primo anno di progetto.
L'evento che si inserisce tra la manifestazioni per i 25 anni di Ingegneria a Modena si svolgerà lunedì 20 aprile 2015, ore 10:00, presso l'aula P 0.5 (FA-0-D), del Dipartimento di Ingegneria "Enzo Ferrari".
Best Paper award @ AVSS2015 · 08 Jan 2015
Group Detection Paper accepted @TPAMI · 08 Jan 2015
Special Issue on: Wearable and Ego-vision Systems for Augmented Experience · 08 Jan 2015
The rapid progress in the development of low-level component technologies such as wearable cameras, wearable sensors, wearable displays and wearable computers is making it possible to augment everyday living. Wearable and egocentric vision systems can be exploited to analyze multi-modal data types (e.g. video, audio, motion) and to support understanding human interactions with the world (including gesture recognition, action recognition, social interaction recognition). Based on the processing of such data, wearable systems can be used to enhance our capabilities and augment our perception. State-of-the-art techniques for wearable sensing can support assistive technologies and advanced perception. This special issue intends to highlight research in support for human performance through egocentric sensing .
Best paper at INTETAIN 2015 · 08 Jan 2015
The paper “ Wearable Vision for Retrieving Architectural Details in Augmented Tourist Experiences ” by Stefano Alletto, Davide Abati, Giuseppe Serra and Rita Cucchiara was awarded the best paper award at INTETAIN in Turin. In this paper we propose an egocentric vision system to enhance tourists’ cultural heritage experience. Exploiting a wearable board and a glass-mounted camera, the visitor can retrieve architectural details of the historical building he is observing and receive related multimedia contents. To obtain an effective retrieval procedure we propose a visual descriptor based on the covariance of local features. Differently than the common Bag of Words approaches our feature vector does not rely on a generated visual vocabulary, removing the dependence from a specific dataset and obtaining a reduction of the computational cost. 3D modeling is used to achieve a precise visitor’s localization that allows browsing visible relevant details that the user may otherwise miss. Experimental results conducted on a publicly available cultural heritage dataset show that the proposed feature descriptor outperforms Bag of Words techniques.
Seminario Nvidia · 17 Nov 2014
Lunedì 17 Novembre 2014 dalle 14.00 alle 16.15 presso l'Aula P0.4 (ex FA-0C) - dipartimento DIEF - si terrà un seminario sulle architetture parallele GPU Nvidia.
14:00-15:00: Edmondo Orlotti, Business Development NVIDIA "Tecnologie per il calcolo parallelo e la visualizzazione, dal cloud al mobile"
15:00-16:15: Carlo Nardone, Solution Architect NVIDIA, "Gli strumenti di sviluppo per la programmazione in ambiente GPU". Locandina
Slides: Parte 1, Parte 2, Parte 3.
Eighth edition of the ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC 2014) · 04 Nov 2014
Invited talk at the ECCV workshop · 08 Jan 2014
Prof. Cucchiara is presenting a talk on "Computer Vision for interactive experiences with art and artistic documents" at the ECCV workshop "Where Computer Vision Meets Art - VISART2014.
2nd International Workshop on Multimedia for Cultural Heritage · 09 Sep 2013
The 2nd International Workshop on Multimedia for Cultural Heritage, will be held on the 9-10th of September 2013, in conjunction with the 17th International Conference on Image Analysis and Processing (ICIAP), Naples, Italy, aims to be a profitable informal working day to discuss together hot topics in multimedia applied to cultural heritage.
Public website: http://imagelab.ing.unimore.it/MM4CH2013/
S5 - Second Short Spring School in Surveillance (Modena, May 7-9 2013) · 07 May 2013
This short 3-day school is meant to provide a self-contained comprehensive introduction to modern video surveillance methods and techniques, with a good balance between theory and practical applications.
This school is open to researchers, PhD and undergraduate students, scholars in the field of surveillance and security, and is conceived also for technicians from both industries and public entities.
Public website: http://imagelab.ing.unimore.it/s5/
3dPes · 12 Jan 2013
3DPeS (3D People Surveillance Dataset) is a new surveillance dataset, designed mainly for people re-identification in multi camera systems with non-overlapped field of views, but also applicable to many other tasks, such as people detection, tracking, action analysis and trajectory analysis.
Differently from other re-identification datasets here data for the complete processing chain are available: the camera setting and the 3D environment reconstruction, the hundreds of recorded videos, the camera calibration parameters, the identity of the hundreds of people, detected more than one time by different point of view.
Dataset web site: 3DPeS
S4 - Short Spring School in Surveillance · 17 May 2011
This short 3-day school is meant to provide a self-contained comprehensive introduction to modern video surveillance methods and techniques, with a good balance between theory and practical applications.
This school is open to PhD and undergraduate students, scholars in the field of surveillance and security, and is conceived also for technicians from both industries and public entities.
Public website: http://imagelab.ing.unimore.it/s4/
Multimedia in Forensics - MiFor2009 · 23 Oct 2009
With the proliferation of multimedia data on the web, surveillance cameras in cities, and mobile phones in everyday life we see an enormous growth in multimedia data that needs to be analyzed by forensic investigators. The sheer volume of such datasets makes manual inspection of all data impossible. Tools are needed to support the investigator in their quest for relevant clues and evidence and in their strive towards preventing crime.
The multimedia community has developed new solutions for management of large collections of video footage, images, audio and other multimedia content, knowledge extraction and categorization, pattern recognition, indexing and retrieval, searching, browsing and visualization, and modeling and simulation in various domains. Due to the inherent uncertainty and complexity of forensic data, applying those techniques to forensic data is not straightforward.
The time is ripe to tailor these results for forensics. Multimedia in forensics is the workshop aims at joining the topics and their communities.
This workshop aims at addressing the multimedia toolbox supporting the forensic process from the prevention of crime, capturing and annotation of the crime scene, the investigation of the data in the lab, up to the presentation of the results in court. It is a first attempt in bringing multimedia tools in to this exciting application field. Target audience are researchers working on innovative technology, representatives from companies developing tools, and forensic investigators in various disciplines
Presente e Futuro dei sistemi di Videosorveglianza per la sicurezza urbana · 20 Feb 2009
Il Comune di Modena e il Dipartimento di Ingegneria dell'Informazione dell'Università di Modena e Reggio Emilia organizzano il convegno "Presente e Futuro della Videosorveglianza per la sicurezza urbana" il giorno 20 Febbraio 2009 presso la sala della Fondazione Biagi.
L'argomento è la videosorveglianza e il trattamento dei dati video sia in tempo reale, per la prevenzione e la salvaguardia del cittadini, sia a posteriori per l'analisi forense e il supporto nelle fasi processuali.
Iciap 2007 · 12 Sep 2007
VSSN 2006 · 27 Oct 2006
Following a successful tradition, this will be the fourth edition of the VSSN Workshop to be held in conjunction with ACM Conference of Multimedia 2006 at Santa Barbara, California, October 22-28, 2006.
Workshop web site: http://imagelab.ing.unimore.it/vssn06
VSSN 2005 · 11 Nov 2005
Rita Cucchiara was program chair of VSSN 2005, the 3rd ACM International Workshop on Video Surveillance & Sensor Networks in conjunction with ACM Multimedia 2005.
Event web site: http://imagelab.ing.unimore.it/vssn05/