More than 1000 citations for top 5 papers in Google Scholar
Selected Publications:
FTI: high performance Fault Tolerance Interface for hybrid systems, L. Bautista Gomez; D. Komatitsch, N. Maruyama; S. Tsuboi, F. Cappello, S. Matsuoka, T Nakamura, Proceedings of IEEE/ACM SC11
Modeling and Tolerating Heterogeneous Failures in Large Parallel Systems, E. M. Heien, D. Kondo, A. Gainaru, D. Lapine, B. Kramer, F. Cappello, Proceedings of IEEE/ACM SC11
BlobCR: Efficient Checkpoint-Restart for HPC Applications on IaaS Clouds using Virtual Disk Image Snapshots, B. Nicolae, F. Cappello, Proceedings of IEEE/ACM SC11
Uncoordinated Checkpointing Without Domino Effect for Send-Deterministic Message Passing ApplicationsAmina Guermouche, Thomas Ropars, Elisabeth Brunet, Marc Snir, Franck Cappello, Proceedings of IPDPS 2011
Preventive Migration vs. Preventive Checkpointing for Extreme Scale Supercomputers
Franck Cappello, Henri Casanova, Yves Robert, Parallel Processing Letters 21(2): 111-132 (2011)
On Communication Determinism in Parallel HPC Applications
Franck Cappello, Amina Guermouche, Marc Snir, Proceedings of IEEE ICCCN 2010
Toward Exascale Resilience, Franck Cappello, Al Geist, Bill Gropp, Laxmikant Kale, Bill Kramer, Marc Snir, IJHPCA 23(4): 374-388 (2009)
Fault Tolerance in Petascale/ Exascale Systems: Current Knowledge, Challenges and Research Opportunities, Franck Cappello, INRIA, IJHPCA 23(3): 212-226 (2009)
Grid'5000: a large scale, reconfigurable, controlable and monitorable Grid platform, In IEEE/ACM GRID 2005, 6th International Workshop on Grid Computing, Franck Cappello, et al. [pdf]
MPI versus MPI+OpenMP on the IBM SP for the NAS Benchmarks, ACM/IEEE SC’00 “International Conference for High Performance Computing, Networking, Storage and Analysis”, 2000, Franck Cappello, Daniel Etiemble. [pdf]
MPICH-V: Toward a Scalable Fault Tolerant MPI for Volatile Nodes, ACM/IEEE SC’02 “International Conference for High Performance Computing, Networking, Storage and Analysis”, George Bosilca, Aurelien Bouteiller, Franck Cappello, Samir Djilali, Gilles Fedak, Cecile Germain, Thomas Herault, Pierre Lemarinier, Oleg Lodygensky, Frederic Magniette, Vincent Neri, Anton Selikhov, [pdf]
Franck CAPPELLO
Co-Director of the INRIA-Illinois Joint Laboratory on PetaScale Computing
fci@lri.fr, cappello@illinois.edu
tel: +33 6 70 31 03 39
tel: +1 217 417 8557
Main research area: Resilience & Fault Tolerance at extreme scale
-Investigating determinism in HPC
-Send-determinism and its derivatives
-New hybrid fault tolerant protocols
-Multilevel&diskless checkpointing
-System log analysis
-Root cause finding and fault prediction
-Optimized checkpointing for Clouds
Documents:
-IESP RoadMap on Resilience
-Toward Exascale Resilience
-Fault tolerance for Petascale/Exascale systems
Past project:
MPICH-V Fault Tolerant MPI
Conference organization
Technical Paper co-chair SC 2011
Program Chair HiPC 2010
Program Chair IEEE NCA 2010
Program co-chair IEEE CCGRID’09
Area co-chair IEEE-ACM SC’09
Recent Invited Presentations
1.Keynote IEEE/ACM SC11/ScalA
2.Keynote IEEE IPDPS/DPDNS11
3.Keynote PDP 2011
4.Keynote Intel Exascale Leadership Conference 2011
5.Invited talk HiPC workshop on Reaching Exascale in this Decade 2010
Recent Events: