Bibliography
Major publications by the team in recent years
-
1X. Alameda-Pineda, R. Horaud.
A Geometric Approach to Sound Source Localization from Time-Delay Estimates, in: IEEE Transactions on Audio, Speech and Language Processing, June 2014, vol. 22, no 6, pp. 1082-1095. [ DOI : 10.1109/TASLP.2014.2317989 ]
https://hal.inria.fr/hal-00975293 -
2X. Alameda-Pineda, R. Horaud.
Vision-Guided Robot Hearing, in: International Journal of Robotics Research, April 2015, vol. 34, no 4-5, pp. 437-456. [ DOI : 10.1177/0278364914548050 ]
https://hal.inria.fr/hal-00990766 -
3N. Andreff, B. Espiau, R. Horaud.
Visual Servoing from Lines, in: International Journal of Robotics Research, 2002, vol. 21, no 8, pp. 679–700.
http://hal.inria.fr/hal-00520167 -
4S. Ba, X. Alameda-Pineda, A. Xompero, R. Horaud.
An On-line Variational Bayesian Model for Multi-Person Tracking from Cluttered Scenes, in: Computer Vision and Image Understanding, December 2016, vol. 153, pp. 64–76. [ DOI : 10.1016/j.cviu.2016.07.006 ]
https://hal.inria.fr/hal-01349763 -
5F. Cuzzolin, D. Mateus, R. Horaud.
Robust Temporally Coherent Laplacian Protrusion Segmentation of 3D Articulated Bodies, in: International Journal of Computer Vision, March 2015, vol. 112, no 1, pp. 43-70. [ DOI : 10.1007/s11263-014-0754-0 ]
https://hal.archives-ouvertes.fr/hal-01053737 -
6A. Deleforge, F. Forbes, R. Horaud.
Acoustic Space Learning for Sound-Source Separation and Localization on Binaural Manifolds, in: International Journal of Neural Systems, February 2015, vol. 25, no 1, 21 p. [ DOI : 10.1142/S0129065714400036 ]
https://hal.inria.fr/hal-00960796 -
7A. Deleforge, F. Forbes, R. Horaud.
High-Dimensional Regression with Gaussian Mixtures and Partially-Latent Response Variables, in: Statistics and Computing, September 2015, vol. 25, no 5, pp. 893-911. [ DOI : 10.1007/s11222-014-9461-5 ]
https://hal.inria.fr/hal-00863468 -
8A. Deleforge, R. Horaud, Y. Y. Schechner, L. Girin.
Co-Localization of Audio Sources in Images Using Binaural Features and Locally-Linear Regression, in: IEEE Transactions on Audio, Speech and Language Processing, April 2015, vol. 23, no 4, pp. 718-731. [ DOI : 10.1109/TASLP.2015.2405475 ]
https://hal.inria.fr/hal-01112834 -
9G. Evangelidis, M. Hansard, R. Horaud.
Fusion of Range and Stereo Data for High-Resolution Scene-Modeling, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, November 2015, vol. 37, no 11, pp. 2178 - 2192. [ DOI : 10.1109/TPAMI.2015.2400465 ]
https://hal.archives-ouvertes.fr/hal-01110031 -
10I. D. Gebru, X. Alameda-Pineda, F. Forbes, R. Horaud.
EM Algorithms for Weighted-Data Clustering with Application to Audio-Visual Scene Analysis, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, December 2016, vol. 38, no 12, pp. 2402 - 2415. [ DOI : 10.1109/TPAMI.2016.2522425 ]
https://hal.inria.fr/hal-01261374 -
11M. Hansard, G. Evangelidis, Q. Pelorson, R. Horaud.
Cross-Calibration of Time-of-flight and Colour Cameras, in: Computer Vision and Image Understanding, April 2015, vol. 134, pp. 105-115. [ DOI : 10.1016/j.cviu.2014.09.001 ]
https://hal.inria.fr/hal-01059891 -
12M. Hansard, R. Horaud, M. Amat, G. Evangelidis.
Automatic Detection of Calibration Grids in Time-of-Flight Images, in: Computer Vision and Image Understanding, April 2014, vol. 121, pp. 108-118. [ DOI : 10.1016/j.cviu.2014.01.007 ]
https://hal.inria.fr/hal-00936333 -
13M. Hansard, R. Horaud.
Cyclopean geometry of binocular vision, in: Journal of the Optical Society of America A, September 2008, vol. 25, no 9, pp. 2357-2369. [ DOI : 10.1364/JOSAA.25.002357 ]
http://hal.inria.fr/inria-00435548 -
14M. Hansard, R. Horaud.
Cyclorotation Models for Eyes and Cameras, in: IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, March 2010, vol. 40, no 1, pp. 151-161. [ DOI : 10.1109/TSMCB.2009.2024211 ]
http://hal.inria.fr/inria-00435549 -
15M. Hansard, R. Horaud.
A Differential Model of the Complex Cell, in: Neural Computation, September 2011, vol. 23, no 9, pp. 2324-2357. [ DOI : 10.1162/NECO_a_00163 ]
http://hal.inria.fr/inria-00590266 -
16M. Hansard, S. Lee, O. Choi, R. Horaud.
Time of Flight Cameras: Principles, Methods, and Applications, Springer Briefs in Computer Science, Springer, October 2012, 95 p.
http://hal.inria.fr/hal-00725654 -
17R. Horaud, G. Csurka, D. Demirdjian.
Stereo Calibration from Rigid Motions, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, December 2000, vol. 22, no 12, pp. 1446–1452. [ DOI : 10.1109/34.895977 ]
http://hal.inria.fr/inria-00590127 -
18R. Horaud, F. Forbes, M. Yguel, G. Dewaele, J. Zhang.
Rigid and Articulated Point Registration with Expectation Conditional Maximization, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, March 2011, vol. 33, no 3, pp. 587-602. [ DOI : 10.1109/TPAMI.2010.94 ]
http://hal.inria.fr/inria-00590265 -
19R. Horaud, M. Niskanen, G. Dewaele, E. Boyer.
Human Motion Tracking by Registering an Articulated Surface to 3-D Points and Normals, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, January 2009, vol. 31, no 1, pp. 158-163. [ DOI : 10.1109/TPAMI.2008.108 ]
http://hal.inria.fr/inria-00446898 -
20V. Khalidov, F. Forbes, R. Horaud.
Conjugate Mixture Models for Clustering Multimodal Data, in: Neural Computation, February 2011, vol. 23, no 2, pp. 517-557. [ DOI : 10.1162/NECO_a_00074 ]
http://hal.inria.fr/inria-00590267 -
21D. Knossow, R. Ronfard, R. Horaud.
Human Motion Tracking with a Kinematic Parameterization of Extremal Contours, in: International Journal of Computer Vision, September 2008, vol. 79, no 3, pp. 247-269. [ DOI : 10.1007/s11263-007-0116-2 ]
http://hal.inria.fr/inria-00590247 -
22D. Kounades-Bastian, L. Girin, X. Alameda-Pineda, S. Gannot, R. Horaud.
A Variational EM Algorithm for the Separation of Time-Varying Convolutive Audio Mixtures, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, August 2016, vol. 24, no 8, pp. 1408-1423. [ DOI : 10.1109/TASLP.2016.2554286 ]
https://hal.inria.fr/hal-01301762 -
23X. Li, L. Girin, R. Horaud, S. Gannot.
Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, November 2016, vol. 24, no 11, pp. 2171 - 2186. [ DOI : 10.1109/TASLP.2016.2598319 ]
https://hal.inria.fr/hal-01349691 -
24M. Sapienza, M. Hansard, R. Horaud.
Real-time Visuomotor Update of an Active Binocular Head, in: Autonomous Robots, January 2013, vol. 34, no 1, pp. 33-45. [ DOI : 10.1007/s10514-012-9311-2 ]
http://hal.inria.fr/hal-00768615 -
25A. Zaharescu, E. Boyer, R. Horaud.
Topology-Adaptive Mesh Deformation for Surface Evolution, Morphing, and Multi-View Reconstruction, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, April 2011, vol. 33, no 4, pp. 823-837. [ DOI : 10.1109/TPAMI.2010.116 ]
http://hal.inria.fr/inria-00590271 -
26A. Zaharescu, E. Boyer, R. Horaud.
Keypoints and Local Descriptors of Scalar Functions on 2D Manifolds, in: International Journal of Computer Vision, October 2012, vol. 100, no 1, pp. 78-98. [ DOI : 10.1007/s11263-012-0528-5 ]
http://hal.inria.fr/hal-00699620 -
27A. Zaharescu, R. Horaud.
Robust Factorization Methods Using A Gaussian/Uniform Mixture Model, in: International Journal of Computer Vision, March 2009, vol. 81, no 3, pp. 240-258. [ DOI : 10.1007/s11263-008-0169-x ]
http://hal.inria.fr/inria-00446987
Doctoral Dissertations and Habilitation Theses
-
28V. Drouard.
From Images and Sounds to Face Localization and Tracking, Université Grenoble Alpes, December 2017.
https://hal.inria.fr/tel-01667740 -
29D. Kounades-Bastian.
Some Contributions to Audio Source Separation and Diarisation of Multichannel Convolutive Mixtures, Université Grenoble - Alpes, February 2017.
https://hal.inria.fr/tel-01543101
Articles in International Peer-Reviewed Journals
-
30V. Drouard, R. Horaud, A. Deleforge, S. Ba, G. Evangelidis.
Robust Head-Pose Estimation Based on Partially-Latent Mixture of Linear Regressions, in: IEEE Transactions on Image Processing, March 2017, vol. 26, no 3, pp. 1428 - 1440, https://arxiv.org/abs/1603.09732. [ DOI : 10.1109/TIP.2017.2654165 ]
https://hal.inria.fr/hal-01413406 -
31G. Evangelidis, R. Horaud.
Joint Alignment of Multiple Point Sets with Batch and Incremental Expectation-Maximization, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, June 2017, vol. XX, https://arxiv.org/abs/1609.01466 - 14 pages, 12 figures, 5 tables. [ DOI : 10.1109/TPAMI.2017.2717829 ]
https://hal.inria.fr/hal-01413414 -
32D. Fabre, T. Hueber, L. Girin, X. Alameda-Pineda, P. Badin.
Automatic animation of an articulatory tongue model from ultrasound images of the vocal tract, in: Speech Communication, October 2017, vol. 93, pp. 63 - 75. [ DOI : 10.1016/j.specom.2017.08.002 ]
https://hal.archives-ouvertes.fr/hal-01578315 -
33I. Gebru, S. Ba, X. Li, R. Horaud.
Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, January 2017, vol. 39, https://arxiv.org/abs/1603.09725 - 14 pages. [ DOI : 10.1109/TPAMI.2017.2648793 ]
https://hal.inria.fr/hal-01413403 -
34L. Girin, T. Hueber, X. Alameda-Pineda.
Extending the Cascaded Gaussian Mixture Regression Framework for Cross-Speaker Acoustic-Articulatory Mapping, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, March 2017, vol. 25, no 3, pp. 662-673. [ DOI : 10.1109/TASLP.2017.2651398 ]
https://hal.archives-ouvertes.fr/hal-01485540 -
35X. Li, L. Girin, R. Horaud, S. Gannot.
Multiple-Speaker Localization Based on Direct-Path Features and Likelihood Maximization with Spatial Sparsity Regularization, in: IEEE/ACM Transactions on Audio, Speech and Language Processing, October 2017, vol. 25, no 10, pp. 1997 - 2012, https://arxiv.org/abs/1611.01172 - 16 pages, 4 figures, 4 tables. [ DOI : 10.1109/TASLP.2017.2740001 ]
https://hal.inria.fr/hal-01413417 -
36B. Massé, S. Ba, R. Horaud.
Tracking Gaze and Visual Focus of Attention of People Involved in Social Interaction, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, December 2017, vol. PP, no 99, pp. 1-15, https://arxiv.org/abs/1703.04727. [ DOI : 10.1109/TPAMI.2017.2782819 ]
https://hal.inria.fr/hal-01511414
International Conferences with Proceedings
-
37X. Alameda-Pineda, A. Pilzer, D. Xu, N. Sebe, E. Ricci.
Viraliency: Pooling Local Virality, in: IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, Hawaii, United States, July 2017.
https://hal.inria.fr/hal-01558137 -
39Y. Ban, L. Girin, X. Alameda-Pineda, R. Horaud.
Exploiting the Complementarity of Audio and Visual Data in Multi-Speaker Tracking, in: ICCV Workshop on Computer Vision for Audio-Visual Media, Venezia, Italy, October 2017.
https://hal.inria.fr/hal-01577965 -
40V. Drouard, S. Ba, R. Horaud.
Switching Linear Inverse-Regression Model for Tracking Head Pose, in: IEEE Winter Conference on Applications of Computer Vision, Santa Rosa, CA, United States, March 2017. [ DOI : 10.1109/WACV.2017.142 ]
https://hal.inria.fr/hal-01430727 -
42L. Girin, R. Badeau.
On the Use of Latent Mixing Filters in Audio Source Separation, in: 13th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA 2017), Grenoble, France, Proc. 13th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA 2017), Springer, February 2017, vol. 10169, pp. 225-235. [ DOI : 10.1007/978-3-319-53547-0_22 ]
https://hal.archives-ouvertes.fr/hal-01400965 -
43L. Girin, T. Hueber, X. Alameda-Pineda.
Adaptation of a Gaussian Mixture Regressor to a New Input Distribution: Extending the C-GMR Framework, in: LVA ICA 2017- International Conference on Latent Variable Analysis and Signal Separation, Grenoble, France, February 2017.
https://hal.inria.fr/hal-01646098 -
44D. Kounades-Bastian, L. Girin, X. Alameda-Pineda, S. Gannot, R. Horaud.
An EM Algorithm for Joint Source Separation and Diarisation of Multichannel Convolutive Speech Mixtures, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, United States, March 2017.
https://hal.inria.fr/hal-01430761 -
45D. Kounades-Bastian, L. Girin, X. Alameda-Pineda, R. Horaud, S. Gannot.
Exploiting the Intermittency of Speech for Joint Separation and Diarization, in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, United States, October 2017.
https://hal.inria.fr/hal-01568813 -
46S. Lathuilière, G. Evangelidis, R. Horaud.
Recognition of Group Activities in Videos Based on Single- and Two-Person Descriptors, in: IEEE Winter Conference on Applications of Computer Vision, Santa Rosa, CA, United States, March 2017. [ DOI : 10.1109/WACV.2017.31 ]
https://hal.inria.fr/hal-01430732 -
47S. Lathuilière, R. Juge, P. Mesejo, R. Muñoz-Salinas, R. Horaud.
Deep Mixture of Linear Inverse Regressions Applied to Head-Pose Estimation, in: IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, Hawaii, United States, IEEE Computer Society, July 2017.
https://hal.inria.fr/hal-01504847 -
48X. Li, L. Girin, R. Horaud.
An EM Algorithm for Audio Source Separation Based on the Convolutive Transfer Function, in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, United States, October 2017.
https://hal.inria.fr/hal-01568818 -
49X. Li, L. Girin, R. Horaud.
Audio Source Separation Based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, United States, March 2017.
https://hal.inria.fr/hal-01430754 -
50H. Löllmann, A. Moore, P. Naylor, B. Rafaely, R. Horaud, A. Mazel, W. Kellermann.
Microphone Array Signal Processing for Robot Audition, in: IEEE Workshop on Hands-free Speech Communication and Microphone Arrays, San Francisco, United States, IEEE Signal Processing Society, March 2017. [ DOI : 10.1109/HSCMA.2017.7895560 ]
https://hal.inria.fr/hal-01485322 -
51D. Xu, W. Ouyang, X. Alameda-Pineda, E. Ricci, X. Wang, N. Sebe.
Learning Deep Structured Multi-Scale Features using Attention-Gated CRFs for Contour Prediction, in: Advances in Neural Information Processing Systems, Long Beach, United States, December 2017.
https://hal.inria.fr/hal-01646112
Other Publications
-
52S. Lathuilière, B. Massé, P. Mesejo, R. Horaud.
Neural Network Reinforcement Learning for Audio-Visual Gaze Control in Human-Robot Interaction, November 2017, https://arxiv.org/abs/1711.06834 - 14 pages.
https://hal.inria.fr/hal-01643775 -
53X. Li, S. Gannot, R. Horaud.
Blind MultiChannel Identification and Equalization for Dereverberation and Noise Reduction based on Convolutive Transfer Function, November 2017, https://arxiv.org/abs/1706.03652 - 13 pages, 5 figures, 5 tables.
https://hal.inria.fr/hal-01568835 -
54X. Li, L. Girin, S. Gannot, R. Horaud.
Multichannel Source Separation and Speech Enhancement Using the Convolutive Transfer Function, November 2017, https://arxiv.org/abs/1711.07911 - 13 pages, 5 figures.
https://hal.inria.fr/hal-01645749 -
55R. T. Marriott, A. Pashevich, R. Horaud.
Plane-extraction from depth-data using a Gaussian mixture regression model, December 2017, https://arxiv.org/abs/1710.01925 - 10 pages, 2 figures, 1 table.
https://hal.inria.fr/hal-01663984 -
56K. Tombre, L. Quan, R. Horaud, P. Gros, C. Schmid, P. Sturm.
In Memoriam Roger Mohr, Société Informatique de France, September 2017, pp. 91-98, Article qui rappelle la carrière scientifique de Roger Mohr.
https://hal.inria.fr/hal-01598085