EN FR
EN FR


Bibliography

Publications of the year

Doctoral Dissertations and Habilitation Theses

  • 1B. Bramas.

    Optimization and parallelization of the boundary element method for the wave equation in time domain, Université de Bordeaux, February 2016.

    https://tel.archives-ouvertes.fr/tel-01306571
  • 2J. M. Couteyen Carpaye.

    Contributions to the parallelization and the scalability of the FLUSEPA code, Université de Bordeaux, September 2016.

    https://tel.archives-ouvertes.fr/tel-01399952
  • 3P. Maria.

    Load Balancing for Parallel Coupled Simulations, Université de Bordeaux, 2016.

Articles in International Peer-Reviewed Journals

  • 4E. Agullo, P. R. Amestoy, A. Buttari, A. Guermouche, J.-Y. L'Excellent, F.-H. Rouet.

    Robust memory-aware mappings for parallel multifrontal factorizations, in: SIAM Journal on Scientific Computing, July 2016, vol. 38, no 3, 23 p.

    https://hal.inria.fr/hal-01334113
  • 5E. Agullo, B. Bramas, O. Coulaud, E. Darve, M. Messner, T. Takahashi.

    Task-based FMM for heterogeneous architectures, in: Concurrency and Computation: Practice and Experience, June 2016, vol. 28, no 9. [ DOI : 10.1002/cpe.3723 ]

    https://hal.inria.fr/hal-01359458
  • 6E. Agullo, A. Buttari, A. Guermouche, F. Lopez.

    Implementing multifrontal sparse solvers for multicore architectures with Sequential Task Flow runtime systems, in: ACM Transactions on Mathematical Software, July 2016. [ DOI : 10.1145/0000000.0000000 ]

    https://hal.inria.fr/hal-01333645
  • 7E. Agullo, L. Giraud, A. Guermouche, J. Roman, M. Zounon.

    Numerical recovery strategies for parallel resilient Krylov linear solvers, in: Numerical Linear Algebra with Applications, May 2016.

    https://hal.inria.fr/hal-01323192
  • 8E. Agullo, L. Giraud, P. Salas, M. Zounon.

    Interpolation-restart strategies for resilient eigensolvers, in: SIAM Journal on Scientific Computing, 2016, vol. 38, no 5, pp. C560-C583. [ DOI : 10.1137/15M1042115 ]

    https://hal.inria.fr/hal-01347793
  • 9R. Garnier, M. Odunlami, V. Le Bris, D. Bégué, I. Baraille, O. Coulaud.

    Adaptive vibrational configuration interaction (A-VCI): a posteriori error estimation to efficiently compute anharmonic IR spectra, in: Journal of Chemical Physics, May 2016, vol. 144, no 20.

    https://hal.inria.fr/hal-01310708

Invited Conferences

  • 10E. Agullo, S. Cools, L. Giraud, A. Moreau, P. Salas, W. Vanroose, E. F. Yetkin, M. Zounon.

    Hard faults and soft errors: possible numerical remedies in linear algebra solvers, in: VecPar - International meeting on High Performance Computing for Computational science, Porto, Portugal, June 2016.

    https://hal.inria.fr/hal-01334675

International Conferences with Proceedings

  • 11E. Agullo, O. Beaumont, L. Eyraud-Dubois, S. Kumar.

    Are Static Schedules so Bad ? A Case Study on Cholesky Factorization, in: IEEE International Parallel & Distributed Processing Symposium (IPDPS 2016), Chicago, IL, United States, IEEE, May 2016.

    https://hal.inria.fr/hal-01223573
  • 12E. Agullo, G. Bosilca, A. Buttari, A. Guermouche, F. Lopez.

    Exploiting a Parametrized Task Graph model for the parallelization of a sparse direct multifrontal solver, in: Euro-Par 2016: Parallel Processing Workshops, Grenoble, France, August 2016.

    https://hal.archives-ouvertes.fr/hal-01337748
  • 13E. Agullo, L. Giraud, A. Guermouche, S. Nakov, J. Roman.

    Task-based Conjugate Gradient: from multi-GPU towards heterogeneous architectures, in: HeteroPar'2016 worshop of Euro-Par, Grenoble, France, August 2016.

    https://hal.inria.fr/hal-01334734
  • 14E. Agullo, L. Giraud, S. Nakov.

    Task-based sparse hybrid linear solver for distributed memory heterogeneous architectures, in: HeteroPar'2016 worshop of Euro-Par, Grenoble, France, August 2016.

    https://hal.inria.fr/hal-01334738
  • 15T. Cojean, A. Guermouche, A. Hugo, R. Namyst, P.-A. Wacrenier.

    Resource aggregation for task-based Cholesky Factorization on top of heterogeneous machines, in: HeteroPar'2016 worshop of Euro-Par, Grenoble, France, August 2016.

    https://hal.inria.fr/hal-01181135
  • 16T. Cojean, A. Guermouche, A.-E. Hugo, R. Namyst, P.-A. Wacrenier.

    Resource aggregation in task-based applications over accelerator-based multicore machines, in: HeteroPar'2016 worshop of Euro-Par, Grenoble, France, August 2016.

    https://hal.inria.fr/hal-01355385
  • 17J. M. Couteyen Carpaye, J. Roman, P. Brenner.

    Towards an efficient Task-based Parallelization over a Runtime System of an Explicit Finite-Volume CFD Code with Adaptive Time Stepping, in: International Parallel and Distributed Processing Symposium, Chicago, IL, United States, PDSEC'2016 workshop of IPDPS, May 2016, 10 p. [ DOI : 10.1109/IPDPSW.2016.125 ]

    https://hal.inria.fr/hal-01324331
  • 18V. Garcia Pinto, L. Stanisic, A. Legrand, L. Mello Schnorr, S. Thibault, V. Danjean.

    Analyzing Dynamic Task-Based Applications on Hybrid Platforms: An Agile Scripting Approach, in: 3rd Workshop on Visual Performance Analysis (VPA), Salt Lake City, United States, November 2016, Held in conjunction with SC16.

    https://hal.inria.fr/hal-01353962
  • 19M. Predari, A. Esnard.

    A k-way Greedy Graph Partitioning with Initial Fixed Vertices for Parallel Applications, in: 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, heraklion, Greece, Parallel, Distributed, and Network-Based Processing (PDP 2016), February 2016, 8 p.

    https://hal.inria.fr/hal-01277392

Conferences without Proceedings

  • 20E. Agullo, S. Cools, L. Giraud, W. Vanroose, E. F. Yetkin.

    Soft errors in PCG: detection and correction, in: SIAM Conference on Parallel Processing for Scientific Computing (SIAM PP 2016), Paris, France, April 2016.

    https://hal.inria.fr/hal-01301240
  • 21E. Agullo, L. Giraud, S. Nakov.

    Combining Software Pipelining with Numerical Pipelining in the Conjugate Gradient Algorithm, in: SIAM Conference on Parallel Processing for Scientific Computing (SIAM PP 2016), Paris, France, April 2016.

    https://hal.inria.fr/hal-01301237
  • 22E. Agullo, L. Giraud, P. Salas, M. Zounon.

    Numerical fault tolerant strategies for resilient parallel eigensolvers, in: IMA Conference on Numerical Linear Algebra and Optimization, Birmingham, United Kingdom, September 2016.

    https://hal.inria.fr/hal-01334631
  • 23E. Agullo, M. Kuhn, S. Lanteri, L. Moya.

    High order scalable HDG method fro frequency-domain electromagnetics, in: Icosahom 2016 - International Conference on Spectral and High Order Methods, Rio de Janeiro, Brazil, June 2016.

    https://hal.inria.fr/hal-01404669
  • 24O. Beaumont, T. Cojean, L. Eyraud-Dubois, A. Guermouche, S. Kumar.

    Scheduling of Linear Algebra Kernels on Multiple Heterogeneous Resources, in: International Conference on High Performance Computing, Data, and Analytics (HiPC 2016), Hyderabad, India, December 2016.

    https://hal.inria.fr/hal-01361992
  • 25P. Blanchard, O. Coulaud, A. Etcheverry, L. Dupuy, E. Darve.

    An Efficient Interpolation Based FMM for Dislocation Dynamics Simulations: Based on uniform grids and FFT acceleration, in: Platform for Advanced Scientific Computing, Lausanne, Switzerland, USI and CSCS and EPFL, June 2016.

    https://hal.archives-ouvertes.fr/hal-01334842
  • 26L. Boillot, C. Rossignon, G. Bosilca, E. Agullo, H. Calandra, H. Barucq, J. Diaz.

    Handling clusters with a task-based runtime system: application to Geophysics, in: Rice - Oil & Gas HPC Workshop, HOUSTON, United States, March 2016.

    https://hal.inria.fr/hal-01303373
  • 27L. Boillot, C. Rossignon, G. Bosilca, E. Agullo, H. Calandra.

    Optimizing numerical simulations of elastodynamic wave propagation thanks to task-based parallel programming, in: SIAM Conference on Parallel Processing for Scientific Computing (SIAM PP 2016), Paris, France, April 2016.

    https://hal.inria.fr/hal-01303379
  • 28M. Faverge, G. Pichon, P. Ramet.

    Exploiting Kepler architecture in sparse direct solver with runtime systems, in: 9th International Workshop on Parallel Matrix Algorithms and Applications (PMAA'2016), Bordeaux, France, July 2016.

    https://hal.inria.fr/hal-01421372
  • 29Y.-F. Jing, E. Agullo, B. Carpentieri, L. Giraud, T.-Z. Huang.

    Two New Block Krylov Methods for Linear Systems with Multiple Right-hand Sides, in: IMA Conference on Numerical Linear Algebra and Optimization, Birmingham, United Kingdom, September 2016.

    https://hal.inria.fr/hal-01334648
  • 30G. Pichon, E. Darve, M. Faverge, P. Ramet, J. Roman.

    Exploiting H-Matrices in Sparse Direct Solvers, in: SIAM Conference on Parallel Processing for Scientific Computing (SIAM PP 2016), Paris, France, April 2016.

    https://hal.inria.fr/hal-01251812
  • 31G. Pichon, E. Darve, M. Faverge, P. Ramet, J. Roman.

    On the use of low rank approximations for sparse direct solvers, in: SIAM Annual Meeting (AN'16), Boston, United States, July 2016.

    https://hal.inria.fr/hal-01421376
  • 32G. Pichon, E. Darve, M. Faverge, P. Ramet, J. Roman.

    Sparse Supernodal Solver Using Hierarchical Compression, in: Workshop on Fast Direct Solvers, Purdue, United States, November 2016.

    https://hal.inria.fr/hal-01421368
  • 33G. Pichon, E. Darve, M. Faverge, P. Ramet, J. Roman.

    Sparse Supernodal Solver Using Hierarchical Compression over Runtime System, in: SIAM Conference on Computation Science and Engineering (CSE'17), Atlanta, United States, February 2017.

    https://hal.inria.fr/hal-01421379
  • 34G. Pichon, M. Faverge, P. Ramet.

    Exploiting Modern Manycore Architecture in Sparse Direct Solver with Runtime Systems, in: SIAM Conference on Computation Science and Engineering (CSE'17), Atlanta, United States, February 2017.

    https://hal.inria.fr/hal-01421383
  • 35G. Pichon, M. Faverge, P. Ramet, J. Roman.

    Impact of Blocking Strategies for Sparse Direct Solvers on Top of Generic Runtimes, in: SIAM Conference on Parallel Processing for Scientific Computing (SIAM PP 2016), Paris, France, April 2016.

    https://hal.inria.fr/hal-01251808
  • 36G. Pichon, M. Faverge, P. Ramet, J. Roman.

    Impact of Blocking Strategies for Sparse Direct Solvers on Top of Generic Runtimes, in: SIAM Conference on Computation Science and Engineering (CSE'17), Atlanta, United States, February 2017.

    https://hal.inria.fr/hal-01421384
  • 37L. Poirel, E. Agullo, L. Giraud.

    Coarse Grid Correction for Algebraic Domain Decomposition Solvers, in: ECCOMAS Congress 2016, Hersonissos, Greece, June 2016.

    https://hal.inria.fr/hal-01355534

Internal Reports

  • 38E. Agullo, O. Aumage, B. Bramas, O. Coulaud, S. Pitoiset.

    Bridging the gap between OpenMP 4.0 and native runtime systems for the fast multipole method, Inria, March 2016, no RR-8953, 49 p.

    https://hal.inria.fr/hal-01372022
  • 39E. Agullo, O. Aumage, M. Faverge, N. Furmento, F. Pruvost, M. Sergent, S. Thibault.

    Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model, Inria Bordeaux Sud-Ouest ; Bordeaux INP ; CNRS ; Université de Bordeaux ; CEA, June 2016, no RR-8927, 27 p.

    https://hal.inria.fr/hal-01332774
  • 40E. Agullo, B. Bramas, O. Coulaud, M. Khannouz, L. Stanisic.

    Task-based fast multipole method for clusters of multicore processors, Inria Bordeaux Sud-Ouest, October 2016, no RR-8970, 15 p.

    https://hal.inria.fr/hal-01387482
  • 41E. Agullo, E. Darve, L. Giraud, Y. Harness.

    Nearly optimal fast preconditioning of symmetric positive definite matrices, Inria Bordeaux Sud-Ouest, November 2016, no RR-8984, 34 p.

    https://hal.inria.fr/hal-01403480
  • 42E. Agullo, L. Giraud, A. Guermouche, S. Nakov, J. Roman.

    Task-based Conjugate Gradient: from multi-GPU towards heterogeneous architectures, Inria, May 2016, no RR-8912.

    https://hal.inria.fr/hal-01316982
  • 43E. Agullo, L. Giraud, S. Nakov.

    Task-based hybrid linear solver for distributed memory heterogeneous architectures, Inria Bordeaux Sud-Ouest, May 2016, no RR-8913.

    https://hal.inria.fr/hal-01316783
  • 44E. Agullo, L. Giraud, S. Nakov, J. Roman.

    Hierarchical hybrid sparse linear solver for multicore platforms, Inria Bordeaux, October 2016, no RR-8960, 25 p.

    https://hal.inria.fr/hal-01379227
  • 45E. Agullo, L. Giraud, L. Poirel.

    Robust coarse spaces for Abstract Schwarz preconditioners via generalized eigenproblems, Inria Bordeaux, November 2016, no RR-8978.

    https://hal.inria.fr/hal-01399203
  • 46S. Cools, E. F. Yetkin, E. Agullo, L. Giraud, W. Vanroose.

    Analysis of rounding error accumulation in Conjugate Gradients to improve the maximal attainable accuracy of pipelined CG, Inria Bordeaux Sud-Ouest, January 2016, no RR-8849.

    https://hal.inria.fr/hal-01262716
  • 47Y. Dudouit, L. Giraud, F. Millot, S. Pernet.

    Interior penalty discontinuous Galerkin method for coupled elasto-acoustic media, Inria Bordeaux Sud-Ouest, December 2016, no RR-8986.

    https://hal.inria.fr/hal-01406158
  • 48M. Faverge, J. Langou, Y. Robert, J. Dongarra.

    Bidiagonalization with Parallel Tiled Algorithms, Inria, October 2016, no RR-8969.

    https://hal.inria.fr/hal-01389232
  • 49G. Pichon, M. Faverge, P. Ramet, J. Roman.

    Reordering strategy for blocking optimization in sparse linear solvers, Inria Bordeaux Sud-Ouest ; LaBRI - Laboratoire Bordelais de Recherche en Informatique ; Bordeaux INP ; Université de Bordeaux, February 2016, no RR-8860, 26 p.

    https://hal.inria.fr/hal-01276746
  • 50D. Sukkari, H. Ltaief, M. Faverge, D. Keyes.

    Asynchronous Task-Based Polar Decomposition on Manycore Architectures, KAUST, October 2016.

    https://hal.inria.fr/hal-01387575

Other Publications

  • 51P. BLANCHARD, O. Coulaud, E. Darve, A. Franc.

    FMR: Fast randomized algorithms for covariance matrix computations, June 2016, Platform for Advanced Scientific Computing (PASC), Poster.

    https://hal.archives-ouvertes.fr/hal-01334747
  • 52O. Beaumont, L. Eyraud-Dubois, S. Kumar.

    Approximation Proofs of a Fast and Efficient List Scheduling Algorithm for Task-Based Runtime Systems on Multicores and GPUs, October 2016, working paper or preprint.

    https://hal.inria.fr/hal-01386174
  • 53T. Cojean, A. Guermouche, A. A. Hugo, R. A. Namyst, P.-A. Wacrenier.

    Resource aggregation for task-based Cholesky Factorization on top of modern architectures, November 2016, This paper is submitted for review to the Parallel Computing special issue for HCW and HeteroPar 16 workshops.

    https://hal.inria.fr/hal-01409965
  • 54M. Predari, A. Esnard.

    Graph partitioning techniques for load balancing of coupled simulations, October 2016, SIAM Workshop on Combinatorial Scientific Computing , Poster.

    https://hal.archives-ouvertes.fr/hal-01399392