Bibliography
Publications of the year
Doctoral Dissertations and Habilitation Theses
-
1L. Marchal.
Memory and data aware scheduling, École Normale Supérieure de Lyon, March 2018, Habilitation à diriger des recherches.
https://hal.inria.fr/tel-01934712 -
2G. Moreau.
On the Solution Phase of Direct Solvers for Sparse Linear Systems with Multiple Sparse Right-Hand Sides, ENS Lyon ; Université de Lyon, December 2018.
https://hal.archives-ouvertes.fr/tel-01959367 -
3L. Pottier.
Co-scheduling for large-scale applications: memory and resilience, Université de Lyon, September 2018.
https://tel.archives-ouvertes.fr/tel-01892395 -
4B. Simon.
Scheduling task graphs on modern computing platforms, Université de Lyon, July 2018.
https://tel.archives-ouvertes.fr/tel-01843558
Articles in International Peer-Reviewed Journals
-
5P. R. Amestoy, A. Buttari, J.-Y. L'Excellent, T. Mary.
Performance and Scalability of the Block Low-Rank Multifrontal Factorization on Multicore Architectures, in: ACM Transactions on Mathematical Software, 2018.
https://hal.inria.fr/hal-01955766 -
6P. Amestoy, J.-Y. L'Excellent, G. Moreau.
On exploiting sparsity of multiple right-hand sides in sparse direct solvers, in: SIAM Journal on Scientific Computing, 2018, pp. 1-19.
https://hal.inria.fr/hal-01955659 -
7G. Aupy, A. Benoit, S. Dai, L. Pottier, P. Raghavan, Y. Robert, M. Shantharam.
Co-scheduling Amdahl applications on cache-partitioned systems, in: International Journal of High Performance Computing Applications, 2018, vol. 32, no 1, pp. 123-138.
https://hal.inria.fr/hal-01968422 -
8A. Benoit, L. Lefèvre, A.-C. Orgerie, I. Raïs.
Reducing the energy consumption of large scale computing systems through combined shutdown policies with multiple constraints, in: International Journal of High Performance Computing Applications, January 2018, vol. 32, no 1, pp. 176-188. [ DOI : 10.1177/1094342017714530 ]
https://hal.inria.fr/hal-01557025 -
9H. Casanova, J. Herrmann, Y. Robert.
Computing the expected makespan of task graphs in the presence of silent errors, in: Parallel Computing, July 2018, vol. 75, pp. 41-60.
https://hal.inria.fr/hal-01968433 -
10F. Dufossé, K. Kaya, I. Panagiotas, B. Uçar.
Further notes on Birkhoff-von Neumann decomposition of doubly stochastic matrices, in: Linear Algebra and its Applications, 2018, vol. 554, pp. 68–78. [ DOI : 10.1016/j.laa.2018.05.017 ]
https://hal.inria.fr/hal-01586245 -
11L. Han, L.-C. Canon, H. Casanova, Y. Robert, F. Vivien.
Checkpointing Workflows for Fail-Stop Errors, in: IEEE Transactions on Computers, February 2018, vol. 67, no 8, 16 p. [ DOI : 10.1109/TC.2018.2801300 ]
https://hal.inria.fr/hal-01701611 -
12O. Kaya, Y. Robert.
Computing Dense Tensor Decompositions with Optimal Dimension Trees, in: Algorithmica, 2018.
https://hal.inria.fr/hal-01974471 -
13O. Kaya, B. Uçar.
Parallel Candecomp/Parafac Decomposition of Sparse Tensors Using Dimension Trees, in: SIAM Journal on Scientific Computing, 2018, vol. 40, no 1, pp. C99 - C130. [ DOI : 10.1137/16M1102744 ]
https://hal.inria.fr/hal-01397464 -
14E. Kayaaslan, C. Aykanat, B. Uçar.
1.5D Parallel Sparse Matrix-Vector Multiply, in: SIAM Journal on Scientific Computing, January 2018, vol. 40, no 1, pp. C25 - C46. [ DOI : 10.1137/16M1105591 ]
https://hal.inria.fr/hal-01897555 -
15E. Kayaaslan, T. Lambert, L. Marchal, B. Uçar.
Scheduling series-parallel task graphs to minimize peak memory, in: Theoretical Computer Science, January 2018, vol. 707, pp. 1-23. [ DOI : 10.1016/j.tcs.2017.09.037 ]
https://hal.inria.fr/hal-01891937 -
16L. Marchal, B. Simon, O. Sinnen, F. Vivien.
Malleable task-graph scheduling with a practical speed-up model, in: IEEE Transactions on Parallel and Distributed Systems, June 2018, vol. 29, no 6, pp. 1357-1370. [ DOI : 10.1109/TPDS.2018.2793886 ]
https://hal.inria.fr/hal-01687189
International Conferences with Proceedings
-
17G. Aupy, A. Benoit, B. Goglin, L. Pottier, Y. Robert.
Co-scheduling HPC workloads on cache-partitioned CMP platforms, in: IEEE Cluster 2018, Belfast, United Kingdom, Proceedings of the 20th IEEE Cluster Conference, September 2018, pp. 335-345.
https://hal.inria.fr/hal-01874154 -
18G. Aupy, A. Gainaru, V. Honoré, P. Raghavan, Y. Robert, H. Sun.
Reservation Strategies for Stochastic Jobs, in: IPDPS 2019 - 33rd IEEE International Parallel and Distributed Processing Symposium, Rio de Janeiro, Brazil, May 2019.
https://hal.inria.fr/hal-01968419 -
19O. Beaumont, T. Lambert, L. Marchal, B. Thomas.
Data-Locality Aware Dynamic Schedulers for Independent Tasks with Replicated Inputs, in: IPDPSW 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, Vancouver, Canada, IEEE, May 2018, pp. 1-8. [ DOI : 10.1109/IPDPSW.2018.00187 ]
https://hal.inria.fr/hal-01878977 -
20A. Benoit, A. Cavelan, F. Ciorba, V. Le Fèvre, Y. Robert.
Combining Checkpointing and Replication for Reliable Execution of Linear Workflows, in: APDCM'18 workshop, in conjunction with IPDPS'18, Vancouver, Canada, May 2018.
https://hal.inria.fr/hal-01963655 -
21A. Benoit, S. Perarnau, L. Pottier, Y. Robert.
A performance model to execute workflows on high-bandwidth-memory architectures, in: ICPP 2018 - 47th International Conference on Parallel Processing, Eugene, OR, United States, ACM, August 2018, pp. 1-10. [ DOI : 10.1145/3225058.3225110 ]
https://hal.inria.fr/hal-01798726 -
22Y. Caniou, E. Caron, A. Kong Win Chang, Y. Robert.
Budget-aware scheduling algorithms for scientific workflows with stochastic task weights on heterogeneous IaaS Cloud platforms, in: IPDPSW 2018 - IEEE International Parallel and Distributed Processing Symposium Workshops, Vancouver, Canada, IEEE, May 2018, pp. 15-26. [ DOI : 10.1109/IPDPSW.2018.00014 ]
https://hal.inria.fr/hal-01808831 -
23L.-C. Canon, A. Kong Win Chang, Y. Robert, F. Vivien.
Scheduling independent stochastic tasks under deadline and budget constraints, in: SBAC-PAD 2018 - 30th International Symposium on Computer Architecture and High Performance Computing, Lyon, France, September 2018, pp. 1-8.
https://hal.inria.fr/hal-01868727 -
24L.-C. Canon, L. Marchal, B. Simon, F. Vivien.
Online Scheduling of Task Graphs on Hybrid Platforms, in: Euro-Par 2018 - 24th International European Conference On Parallel And Distributed Computing, Turin, Italy, August 2018, pp. 1-14.
https://hal.inria.fr/hal-01828301 -
25F. Dufossé, K. Kaya, I. Panagiotas, B. Uçar.
Approximation algorithms for maximum matchings in undirected graphs, in: CSC 2018 - SIAM Workshop on Combinatorial Scientific Computing, Bergen, Norway, Proceedings of the Seventh SIAM Workshop on Combinatorial Scientific Computing, SIAM, June 2018, pp. 56-65. [ DOI : 10.1137/1.9781611975215.6 ]
https://hal.archives-ouvertes.fr/hal-01740403 -
26C. Gou, A. Benoit, M. Chen, L. Marchal, T. Wei.
Reliability-aware energy optimization for throughput-constrained applications on MPSoC, in: ICPADS - 24th International Conference on Parallel and Distributed Systems, Sentosa, Singapore, IEEE, December 2018, pp. 1-10.
https://hal.inria.fr/hal-01929927 -
27C. Gou, A. Benoit, L. Marchal.
Memory-aware tree partitioning on homogeneous platforms, in: PDP 2018 - 26th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, Cambridge, United Kingdom, March 2018, pp. 321-324. [ DOI : 10.1109/PDP2018.2018.00056 ]
https://hal.inria.fr/hal-01892022 -
28L. Han, V. Le Fèvre, L.-C. Canon, Y. Robert, F. Vivien.
A Generic Approach to Scheduling and Checkpointing Workflows, in: ICPP 2018 - 47th International Conference on Parallel Processing, Eugene, OR, United States, ACM, August 2018, pp. 1-10. [ DOI : 10.1145/3225058.3225145 ]
https://hal.inria.fr/hal-01798627 -
30V. Le Fèvre, G. Bosilca, A. Bouteiller, T. Hérault, A. Hori, Y. Robert, J. Dongarra.
Do Moldable Applications Perform Better on Failure-Prone HPC Platforms?, in: Resilience - EuroPar workshop, Torino, Italy, 2018, pp. 787-799.
https://hal.inria.fr/hal-01968448 -
31L. Marchal, H. Nagy, B. Simon, F. Vivien.
Parallel scheduling of DAGs under memory constraints, in: IPDPS 2018 - 32nd IEEE International Parallel and Distributed Processing Symposium, Vancouver, Canada, IEEE, May 2018, pp. 1-10. [ DOI : 10.1109/IPDPS.2018.00030 ]
https://hal.inria.fr/hal-01828312 -
32I. Raïs, M. Boutigny, L. Lefèvre, A.-C. Orgerie, A. Benoit.
Building the Table of Energy and Power Leverages for Energy Efficient Large Scale Systems, in: HPCS: International Conference on High Performance Computing & Simulation, Orléans, France, July 2018, pp. 284-291. [ DOI : 10.1109/HPCS.2018.00056 ]
https://hal.archives-ouvertes.fr/hal-01845970 -
33I. Raïs, L. Lefèvre, A.-C. Orgerie, A. Benoit.
Exploiting the Table of Energy and Power Leverages, in: ICA3PP 2018 - 18th International Conference on Algorithms and Architectures for Parallel Processing, Guangzhou, China, November 2018, pp. 1-10.
https://hal.archives-ouvertes.fr/hal-01927829 -
34A. Yasar, B. Uçar, U. V. Catalyurek.
SINA: A Scalable Iterative Network Aligner, in: 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), Barcelona, Spain, August 2018.
https://hal.inria.fr/hal-01918744
Scientific Books (or Scientific Book chapters)
-
35G. Aupy, Y. Robert.
Scheduling for Fault-Tolerance: An Introduction, in: Topics in parallel and distributed computing: Enhancing the Undergraduate Curriculum: Performance, Concurrency, and Programming on Modern Platforms, Springer International Publishing, September 2018, pp. 143-170.
https://hal.inria.fr/hal-01968454
Internal Reports
-
36P. Amestoy, A. Buttari, J.-Y. L'Excellent, T. Mary.
Bridging the gap between flat and hierarchical low-rank matrix formats: the multilevel BLR format, University of Manchester, April 2018.
https://hal.archives-ouvertes.fr/hal-01774642 -
37P. R. Amestoy, S. de la Kethulle De Ryhove, J.-Y. L'Excellent, G. Moreau, D. V. Shantsev.
Efficient use of sparsity by direct solvers applied to 3D controlled-source EM problems, Inria Grenoble Rhône-Alpes ; LIP - ENS Lyon, November 2018, no RR-9220, 26 p.
https://hal.inria.fr/hal-01912713 -
38G. Aupy, A. Benoit, B. Goglin, L. Pottier, Y. Robert.
Co-scheduling HPC workloads on cache-partitioned CMP platforms, Inria, February 2018, no RR-9154.
https://hal.inria.fr/hal-01719728 -
39G. Aupy, A. Gainaru, V. Honoré, P. Raghavan, Y. Robert, H. Sun.
Reservation Strategies for Stochastic Jobs (Extended Version), Inria & Labri, Univ. Bordeaux ; Department of EECS, Vanderbilt University, Nashville, TN, USA ; Laboratoire LIP, ENS Lyon & University of Tennessee Knoxville, Lyon, France, October 2018, no RR-9211, pp. 1-38.
https://hal.inria.fr/hal-01903592 -
40A. Benoit, A. Cavelan, F. Ciorba, V. Le Fèvre, Y. Robert.
Combining Checkpointing and Replication for Reliable Execution of Linear Workflows with Fail-Stop and Silent Errors, ROMA (Inria Rhône-Alpes / LIP Laboratoire de l’Informatique du Parallélisme) ; LIP - Laboratoire de l’Informatique du Parallélisme, December 2018, pp. 1-32.
https://hal.inria.fr/hal-01955859 -
41A. Benoit, A. Cavelan, F. Ciorba, V. Le Fèvre, Y. Robert.
Combining Checkpointing and Replication for Reliable Execution of Linear Workflows, Inria - Research Centre Grenoble – Rhône-Alpes, February 2018, no RR-9152, pp. 1-36.
https://hal.inria.fr/hal-01714978 -
42A. Benoit, S. Perarnau, L. Pottier, Y. Robert.
A performance model to execute workflows on high-bandwidth memory architectures, ENS Lyon ; Inria Grenoble Rhône-Alpes ; University of Tennessee Knoxville ; Georgia Institute of Technology ; Argonne National Laboratory, April 2018, no RR-9165, pp. 1-28.
https://hal.inria.fr/hal-01767888 -
43G. Bosilca, A. Bouteiller, T. Hérault, V. Le Fèvre, Y. Robert, J. J. Dongarra.
Distributed Termination Detection for HPC Task-Based Environments, Inria - Research Centre Grenoble – Rhône-Alpes, June 2018, no RR-9181, pp. 1-28.
https://hal.inria.fr/hal-01811823 -
44L.-C. Canon, A. Kong Win Chang, Y. Robert, F. Vivien.
Scheduling independent stochastic tasks under deadline and budget constraints, Inria - Research Centre Grenoble – Rhône-Alpes, June 2018, no RR-9178, pp. 1-34.
https://hal.inria.fr/hal-01811885 -
45L.-C. Canon, L. Marchal, B. Simon, F. Vivien.
Online Scheduling of Sequential Task Graphs on Hybrid Platforms, LIP - ENS Lyon, February 2018, no RR-9150.
https://hal.inria.fr/hal-01720064 -
46F. Dufossé, K. Kaya, I. Panagiotas, B. Uçar.
Effective heuristics for matchings in hypergraphs, Inria Grenoble Rhône-Alpes, November 2018, no RR-9224, pp. 1-18.
https://hal.archives-ouvertes.fr/hal-01924180 -
47F. Dufossé, K. Kaya, I. Panagiotas, B. Uçar.
Scaling matrices and counting the perfect matchings in graphs, Inria Grenoble Rhône-Alpes, March 2018, no RR-9161, pp. 1-22.
https://hal.inria.fr/hal-01743802 -
48C. Gou, A. Benoit, M. Chen, L. Marchal, T. Wei.
Reliability-aware energy optimization for throughput-constrained applications on MPSoC, Laboratoire LIP, École Normale Supérieure de Lyon & CNRS & Inria, France ; Shanghai Key Lab. of Trustworthy Computing, East China Normal University, China ; Georgia Institute of Technology, USA, April 2018, no RR-9168, pp. 1-35.
https://hal.inria.fr/hal-01766763 -
49L. Han, V. Le Fèvre, L.-C. Canon, Y. Robert, F. Vivien.
A Generic Approach to Scheduling and Checkpointing Workflows, Inria, April 2018, no RR-9167, pp. 1-29.
https://hal.inria.fr/hal-01766352 -
50J. Herrmann, M. Yusuf Özkaya, B. Uçar, K. Kaya, U. V. Catalyurek.
Acyclic partitioning of large directed acyclic graphs, Inria - Research Centre Grenoble – Rhône-Alpes, March 2018, no RR-9163.
https://hal.inria.fr/hal-01744603 -
51V. Le Fèvre, G. Bosilca, A. Bouteiller, T. Hérault, A. Hori, Y. Robert, J. J. Dongarra.
Do moldable applications perform better on failure-prone HPC platforms?, Inria Grenoble Rhône-Alpes, May 2018, no RR-9174, pp. 1-24.
https://hal.inria.fr/hal-01799498 -
52L. Marchal, B. Simon, F. Vivien.
Limiting the memory footprint when dynamically scheduling DAGs on shared-memory platforms, Inria Grenoble Rhône-Alpes, December 2018, no RR-9231, pp. 1-41.
https://hal.inria.fr/hal-01948462 -
53M. Y. Özkaya, A. Benoit, B. Uçar, J. Herrmann, U. V. Catalyurek.
A scalable clustering-based task scheduler for homogeneous processors using DAG partitioning, Inria Grenoble Rhône-Alpes, June 2018, no RR-9185, pp. 1-30.
https://hal.inria.fr/hal-01817501
-
54Blue Waters Newsletter, December 2012. -
55Blue Waters Resources, 2013.
https://bluewaters.ncsa.illinois.edu/data -
56The BOINC project, 2013.
http://boinc.berkeley.edu/ -
57Final report of the Department of Energy Fault Management Workshop, December 2012.
https://science.energy.gov/~/media/ascr/pdf/program-documents/docs/FaultManagement-wrkshpRpt-v4-final.pdf -
58System Resilience at Extreme Scale: white paper, 2008, DARPA.
https://pdfs.semanticscholar.org/9fcb/154d6afce23cd9951fd7c116b86255d91b5c.pdf -
59Top500 List - November, 2011.
http://www.top500.org/list/2011/11/ -
60Top500 List - November, 2012.
http://www.top500.org/list/2012/11/ -
61M. Amaris, G. Lucarelli, C. Mommessin, D. Trystram.
Generic Algorithms for Scheduling Applications on Hybrid Multi-core Machines, in: Euro-Par 2017: Parallel Processing, 2017, pp. 220–231. -
62I. Assayad, A. Girault, H. Kalla.
Tradeoff exploration between reliability, power consumption and execution time, in: Proceedings of SAFECOMP, the Conf. on Computer Safety, Reliability and Security, Washington, DC, USA, 2011. -
63H. Aydin, Q. Yang.
Energy-aware partitioning for multiprocessor real-time systems, in: IPDPS'03, the IEEE Int. Parallel and Distributed Processing Symposium, 2003, pp. 113–121. -
64N. Bansal, T. Kimbrel, K. Pruhs.
Speed Scaling to Manage Energy and Temperature, in: Journal of the ACM, 2007, vol. 54, no 1, pp. 1 – 39.
http://doi.acm.org/10.1145/1206035.1206038 -
65A. Benoit, L. Marchal, J.-F. Pineau, Y. Robert, F. Vivien.
Scheduling concurrent bag-of-tasks applications on heterogeneous platforms, in: IEEE Transactions on Computers, 2010, vol. 59, no 2, pp. 202-217. -
66S. Blackford, J. Choi, A. Cleary, E. D'Azevedo, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker, R. C. Whaley.
ScaLAPACK Users' Guide, SIAM, 1997. -
67S. Blackford, J. Dongarra.
Installation Guide for LAPACK, LAPACK Working Note, June 1999, no 41, originally released March 1992. -
68R. A. Brualdi.
Notes on the Birkhoff algorithm for doubly stochastic matrices, in: Canadian Mathematical Bulletin, 1982, vol. 25, no 2, pp. 191–199. -
69A. Buttari, J. Langou, J. Kurzak, J. Dongarra.
Parallel tiled QR factorization for multicore architectures, in: Concurrency: Practice and Experience, 2008, vol. 20, no 13, pp. 1573-1590. -
70J.-J. Chen, T.-W. Kuo.
Multiprocessor energy-efficient scheduling for real-time tasks, in: ICPP'05, the Int. Conference on Parallel Processing, 2005, pp. 13–20. -
71S. Donfack, L. Grigori, W. Gropp, L. V. Kale.
Hybrid Static/dynamic Scheduling for Already Optimized Dense Matrix Factorization, in: Parallel Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International, 2012, pp. 496-507.
http://dx.doi.org/10.1109/IPDPS.2012.53 -
72J. Dongarra, J.-F. Pineau, Y. Robert, Z. Shi, F. Vivien.
Revisiting Matrix Product on Master-Worker Platforms, in: International Journal of Foundations of Computer Science, 2008, vol. 19, no 6, pp. 1317-1336. -
73J. Dongarra, J.-F. Pineau, Y. Robert, F. Vivien.
Matrix Product on Heterogeneous Master-Worker Platforms, in: 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Salt Lake City, Utah, February 2008, pp. 53–62. -
74I. S. Duff, J. K. Reid.
The multifrontal solution of indefinite sparse symmetric linear systems, in: ACM Transactions on Mathematical Software, 1983, vol. 9, pp. 302-325. -
75I. S. Duff, J. K. Reid.
The multifrontal solution of unsymmetric sets of linear systems, in: SIAM Journal on Scientific and Statistical Computing, 1984, vol. 5, pp. 633-641. -
76L. Grigori, J. W. Demmel, H. Xiang.
Communication avoiding Gaussian elimination, in: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, Piscataway, NJ, USA, SC '08, IEEE Press, 2008, 29:1 p.
http://dl.acm.org/citation.cfm?id=1413370.1413400 -
77B. Hadri, H. Ltaief, E. Agullo, J. Dongarra.
Tile QR Factorization with Parallel Panel Processing for Multicore Architectures, in: IPDPS'10, the 24th IEEE Int. Parallel and Distributed Processing Symposium, 2010. -
78J. W. H. Liu.
An application of generalized tree pebbling to sparse matrix factorization, in: SIAM Journal on Algebraic and Discrete Methods, 1987, vol. 8, no 3, pp. 375–395. -
79J. W. H. Liu.
The multifrontal method for sparse matrix solution: Theory and Practice, in: SIAM Review, 1992, vol. 34, pp. 82–109. -
80R. Melhem, D. Mossé, E. Elnozahy.
The Interplay of Power Management and Fault Recovery in Real-Time Systems, in: IEEE Transactions on Computers, 2004, vol. 53, no 2, pp. 217-231. -
81A. J. Oliner, R. K. Sahoo, J. E. Moreira, M. Gupta, A. Sivasubramaniam.
Fault-aware job scheduling for BlueGene/L systems, in: IPDPS'04, the IEEE Int. Parallel and Distributed Processing Symposium, 2004, pp. 64–73. -
82G. Quintana-Ortí, E. Quintana-Ortí, R. A. van de Geijn, F. G. V. Zee, E. Chan.
Programming Matrix Algorithms-by-Blocks for Thread-Level Parallelism, in: ACM Transactions on Mathematical Software, 2009, vol. 36, no 3. -
83Y. Robert, F. Vivien.
Algorithmic Issues in Grid Computing, in: Algorithms and Theory of Computation Handbook, Chapman and Hall/CRC Press, 2009. -
84G. Zheng, X. Ni, L. V. Kale.
A scalable double in-memory checkpoint and restart scheme towards exascale, in: Dependable Systems and Networks Workshops (DSN-W), 2012.
http://dx.doi.org/10.1109/DSNW.2012.6264677 -
85D. Zhu, R. Melhem, D. Mossé.
The effects of energy management on reliability in real-time embedded systems, in: Proc. of IEEE/ACM Int. Conf. on Computer-Aided Design (ICCAD), 2004, pp. 35–40.