Conference Program

Keynote 1

Jean-Marc ODOBEZ

Idiap research institute and EPFL, Switzerland

Deep learning and personalized models for robust perception in human-robot interactions

In the future, more and more systems and robots will have to co-exist with humans in dynamic environments. In order for them to understand the social scene, to pro-actively interact autonomously and naturally with people or groups of people in function of their physical accessibility or social availability, and in general to monitor interactions, it is crucial to endow them with efficient multi-modal sensing and situated perception capabilities.
The aim of our research is to develop such sensing methodologies fusing the different streams of information (audio, RGB-Depth, robot and task state) to detect and track the perceived state and activity of the different people along several dimensions: physical states (location, trajectory, pose, speaking status), social category (gender, age group), non-verbal cues related to body language like gestures or attentional cues.
In this talk, I will present different works around this topic revolving around sound source and human voice localization, 3D head pose tracking under 360 degrees, as well as gaze and attention modeling. Our main goals were to investigate the use of computer vision and deep neural network architectures for the different sensing tasks, including relying on synthetic data for obtaining sufficient amount of training data, or building personalized models through online learning or adaptation, potentially taking advantage of priors on social interactions to obtain weak labels for model adaptation.

Home Page

10:00 AM

Coffe break

at the venue

10:30 AM

Oral session 1

Multimedia Analysis and Representation

Session chair: Benoit Huet, EURECOM

Semantic Signatures for Urban Visual Localization
Li Weng, Bahman Soheilian and Valérie Gouet-Brunet

Learning Prototypes for Visual Relationship Detection
François Plesse, Alexandru Ginsca, Bertrand Delezoide and Françoise Prêteux

Diffusion by Exchanges in HB-Graphs: Highlighting Complex Relationships
Xavier Ouvrard, Jean-Marie Le Goff and Stéphane Marchand-Maillet

A New Image Segmentation Approach Based on the Louvain Algorithm
Thanh-Khoa Nguyen, Mickaël Coustaty and Jean-Loup Guillaume

Bark Identification using Improved Statistical Radial Binary Patterns
Safia Boudra, Itheri Yahiaoui and Ali Behloul

12:15 PM

Lunch

at the venue

02:00 PM

Special session 1 on Analysis of Multimedia Data for Medicine and Health

Session chair: Klaus Schoeffmann, Klagenfurt University

Cardio-Pulmonary Resuscitation (CPR) Scene Retrieval from Medical Simulation Videos using Local Binary Patterns Over Three Orthogonal Planes
Anju Panicker Madhusoodhanan Sathik, Hichem Frigui and Aaron Calhoun

MRI Brain Tumor Segmentation Combining Saliency and Convolutional Network Features
Petra Takács and Andrea Manno-Kovacs

Towards Independent Stress Detection: a Dependent Model using Facial Action Units
Carla Viegas, Shing-Hon Lau, Roy Maxion and Alexander Hauptmann

Automatic Detection of Patient with Respiratory Diseases using Lung Sound Analysis
Gaetan Chambres, Pierre Hanna and Myriam Desainte-Catherine

Activity Recognition from Wearable Cameras
Panagiotis Giannakeris, Konstantinos Avgerinakis, Stefanos Vrochidis and Ioannis Kompatsiaris

03:45 PM

Coffe break

at the venue

04:00 PM

Oral session 2

Multimodal data Retrieval

Session chair: Jenny Benois-Pineau, LABRI

SSKSRIF: Scale and Rotation Invariant Features based on Spatial Space of Keypoints for Camera-Based Information Spotting
Quoc Bao Dang, Mickaël Coustaty, Muhammad Muzzamil Luqman, Jean-Marc Ogier and Cao De Tran

Embedding Images and Sentences in a Common Space with a Recurrent Capsule Network
Danny Francis, Benoit Huet and Bernard Merialdo

Generating Visual Clouds from Multiplex Networks for TV News Archive Query Visualization
Haolin Ren, Benjamin Renoust, Marie-Luce Viaud, Guy Melançon and Shin'Ichi Satoh

Visual Sentiment Analysis Based on Objective Text Description of Images
Alessandro Ortis, Giovanni Maria Farinella, Giovanni Torrisi and Sebastiano Battiato

Annotating Images with Drawings through Gameplay for Drawing based Image Retrieval
Zhan Xu and Guoping Qiu

06:00 PM -07:30 PM

Welcome reception

Icebreaking cocktail at the venue with Live music

Wednesday, September 5

08:15 AM

Registrations

08:45 AM

Day opening

09:00 AM

Keynote 2

Hervé GLOTIN

CNRS LIS Toulon

Content based multichannel bioacoustic indexing & autoencoder for 3D whale survey
in collaboration with Phd students Marion Poupard and Ferrari Maxence, CNRS LIS Toulon and with Seaproven SA

In this talk, we first recall the main objectives of massive bioacoustic indexing.
We will present usual and more advanced stereophonic detection of Whales from long term submarine observatory.
We demonstrate algorithms working on bottom mounted hydrophone antenna or fixed under mobile autonomous vehicles to computed detailed 3D tracks of the dive of the whale during hours.
We then propose a new paradigm towards deep learning of multichannel bioacoustic regresentation from raw audio. We thus take advantages of stereophony (or multiphony)
to elaborate new objective loss function for transient detection, denoising and accurate estimation Time Delay of Arrival (TDOA).
We train autoencoder on stereo transients from whale recordings, and show how to improve TDOA estimation taking advantages of estimator from learned limited development of a transfer function.
We then discuss perspectives towards embedded deep learning method in autonomous vehicle to index biodiversity of the abyss.

Home Page

10:00 AM

Coffe break

at the venue

10:30 AM

Oral session 3

Saliency and quality of multimedia contents

Session chair: Andrea Manno-Kovacs, SZTAKI

Saliency Weighted Convolutional Features for Instance Search
Eva Mohedano, Kevin McGuinness, Xavier Giro-I-Nieto and Noel O'Connor

Introduction of Explicit Visual Saliency in Training of Deep CNNs: Application to Architectural Styles Classification
Abraham Montoya Obeso, Jenny Benois-Pineau, Mireya S. Garcia Vazquez and Alejandro Alvaro Ramirez Acosta

Interestingness for Immersive Content: Prediction and Use
Gwenaelle Marquant, Claire-Hélène Demarty, Christel Chamaret, Joel Sirot and Louis Chevallier

Refining and Evaluating a Video Sharpness Metric
Stefanie Wechtitsch, Werner Bailer and Lucas Paletta

UNNA: A Unified Neural Network for Aesthetic Assessment
Larbi Abdenebaoui, Benjamin Meyer, Albert Bruns and Susanne Boll

12:15 PM

Lunch

at the venue

02:00 PM

Oral session: Best CBMI 2018 papers

Session chair: Alan HANJALIC, TU Delft

Latent Feature Combination for Multi-Context Music Recommendation
Martin Pichl and Eva Zangerle

Detection and Localization of Partial Audio Matches
Milica Maksimović, Patrick Aichroth and Luca Cuccovillo

Mastering Occlusions by Using Intelligent Facial Frameworks Based on the Propagation of Movement
Delphine Poux, Benjamin Allaert, José Mennesson, Nacim Ihaddadene, Ioan Marius Bilasco and Chaabane Djeraba

Deep Multimodal Features for Movie Genre and Interestingness Prediction
Olfa Ben Ahmed and Benoit Huet

03:45 PM

Coffe break

at the venue

04:00 PM

Posters and demos

Ridiculously Fast Shot Boundary Detection with Fully Convolutional Neural Networks
Michael Gygli

CAD-SE: Discovering CAD Models on the Web
Christian Brosch, Andreas Seifert and Thomas Köllmer

DeCP-Live: A Web-Interface for DeCP, a Distributed High-Throughput CBIR System
Gylfi Þór Guðmundsson, Christian Andreas Jacobsen, Hilmar Tryggvason and Björn Þór Jónsson

Musical Gesture Recognition Using Machine Learning and Audio Descriptors
Paul Best, Jean Bresson and Diemo Schwarz

Dynamic Index Finger Gesture Video Dataset for Mobile Interaction
Cagan Arslan, Ioan Marius Bilasco and Jean Martinet

05:30 PM -06:00 PM

-

06:00 PM -07:30 PM

Social Event

Boat trip on l’Espérance in the Bay of La Rochelle and around the Ré Island.
The boat l’Espérance is located in the old port of La Rochelle, at 10 minutes walk from the conference venue (see map below).

05:45 PM -07:00 PM

Open to the Public (in French) - at the Faculty of Arts, Languages, Arts and Humanities (FLASH), amphi 400.

Dominique Cardon

Sciences Po Paris/Médialab

Enjeux socio-politiques de l’IA (Socio-political issues of AI)

Profitant des réussites spectaculaires des techniques de deep learning, les promesses de l’Intelligence artificielle (IA) sont revenues occuper le débat public. Dans cette conférence, on propose de retracer quelques aspects de l’histoire de l’Intelligence artificielle afin de comprendre les défis présents. Le « troisième printemps de l’IA » se caractérise, en effet, par le retour du paradigme connexionniste qui avait été marginalisé lors du « second printemps » de l’Intelligence artificielle dans les années quatre-vingt. La nouvelle vague de promesses de l’IA a pris forme grâce aux opportunités nouvelles offertes par les données massives et aux nouvelles capacités de calcul des ordinateurs. Mais, elle propose surtout une autre manière de rendre les machines « intelligentes ». Il ne s’agit plus de demander aux calculateurs de raisonner, mais bien d’apprendre des données et de former des modèles de prédiction à partir des données. Ce changement de paradigme de l’IA a de nombreuses conséquences : il propose une représentation différentes de la société qui s’appuie sur le comportement des individus plutôt que sur leurs catégories d’appartenance, il ouvre des possibilités d’automatisation du travail, il présente des opportunités nouvelles mais aussi des risques de biais dans les calculs. Cette conférence propose d’examiner les défis éthiques et politiques que pose les développements de l’IA.

Home Page

07:30 PM - 10:30 PM

Gala Dinner

Gala Dinner aboard l’Espérance boat.
L’Espérance is located in the old port of La Rochelle, at 10 minutes walk from the conference venue (see map below).

Thursday, September 6

08:15 AM

Registrations

08:30 AM

Special Session 2 on Indexing, Retrieval, Annotation and Mining in Earth Observation (IR4EO)

BigEarth-Accurate and Scalable Processing of Big Data in Earth Observation
Begüm Demir

Urban Land Use Classification Based on Aerial and Ground Images
Rui Cao and Guoping Qiu

Overcoming Missing and Incomplete Modalities with Generative Adversarial Networks for Building Footprint Segmentation
Benjamin Bischke, Patrick Helber, Florian Koenig, Damian Borth and Andreas Dengel

A Study on the Sensitivity-Based Discriminative Hyperspectral Image Content Representation
Olfa Ben Ahmed, Thierry Urruty, Noel Richard and Christine Fernandez

Active Learning to Assist Annotation of Aerial Images in Environmental Surveys
Mathieu Laroze, Romain Dambreville, Chloé Friguet, Ewa Kijak and Sébastien Lefèvre

Session chairs: Sébastien Lefèvre, Université de Bretagne Sud and Josiane Mothe, IRIT Toulouse

10:15 AM
Coffee break

at the venue

10:30 AM
Oral session 4

Deep Learning for Multimedia Indexing and Analysis

Coupled Ensembles of Neural Networks
Anuvabh Dutt, Denis Pellerin and Georges Quénot

Temporal Compression and Fast Matching of Hand-crafted and Deep Features of Video Segments
Werner Bailer and Stefanie Wechtitsch

Sport Action Recognition with Siamese Spatio-Temporal CNNs: Application to Table Tennis
Pierre-Etienne Martin, Jenny Benois-Pineau, Renaud Peteri and Julien Morlier

Deep 3D Flow Features for Human Action Recognition
Athanasios Psaltis, Georgios Papadopoulos and Petros Daras

Action Tube Extraction based 3D-CNN for RGB-D Action Recognition
Zineng Xu, Veronica Vilaplana and Josep Ramon Morros

Session chair: Georges Quénot, LIG-CNRS

12:15 PM

Closing remarks

Click to download PDF version of CBMI2018 program

CBMI 2018 program at a glance

CBMI 2018
Conference Program
	Tuesday, September 4th	Wednesday, September 5th		Thrusday, September 6th
08:15-08:30	Registrations	Registrations		Registrations/Day opening
08:30-08:45				Special session 2 Indexing, Retrieval, Annotation And Mining in Earth Observation (IR4EO)
08:45-09:00	Welcome	Day opening
09:00-10:00	Keynote 1 Deep learning and personalized models For robust perception in human-robot interactions by Jean-Marc ODOBEZ (EPFL)	Keynote 2 Content based multichannel bioacoustic indexing & autoencoder for 3D whale survey by Hervé GLOTIN (Univ. Toulon, IUF)
10:00-10:15	Coffee break (at the venue)	Coffee break (at the venue)
10:15-10:30				Coffee break (at the venue)
10:30-12:15	Oral session 1 Multimedia Analysis and Representation	Oral session 3 Saliency and quality of multimedia contents		Oral session 4 Deep Learning for Multimedia Indexing and Analysis
12:15-13:00	Lunch (at the venue)	Lunch (at the venue)		Closing remarks
13:00-14:00
14:00-15:45	Special session 1 Analysis of Multimedia Data for Medicine and Health	Oral session Best CBMI 2018 papers
15:45-16:00	Coffee break (at the venue)	Coffee break (at the venue)
16:00-17:30	Oral session 2 Multimodal data Retrieval	Posters and Demos
17:30-17:45
17:45-18:00			Open to the Public (in french) Socio-political issues of AI by Dominique Cardon (Sciences Po Paris)
18:00-19:00	Welcome reception	Boat trip in the Bay of La Rochelle and around the Ré Island
19:00-19:30
19:30-22:30		Gala Dinner aboard l’Espérance boat in the port of La Rochelle