[1]
|
Julie Langou, Julien Langou, Piotr Luszczek, Jakub Kurzak, Alfredo Buttari, and
Jack Dongarra.
Exploiting the performance of 32 bit floating point arithmetic in
obtaining 64 bit accuracy (revisiting iterative refinement for linear
systems).
In SC '06: Proceedings of the 2006 ACM/IEEE conference on
Supercomputing, page 113, New York, NY, USA, 2006. ACM.
|
[2]
|
Alfredo Buttari, Jack Dongarra, Jakub Kurzak, Piotr Luszczek, and Stanimire
Tomov.
Using mixed precision for sparse matrix computations to enhance the
performance while achieving 64-bit accuracy.
ACM Transactions on Mathematical Software, 34(4), 2008.
|
[3]
|
Gino Bella, Francesco del Citto, Salvatore Filippone, Alfredo Buttari, and
Alessandro de Maio.
FAST-EVP: Parallel high performance computing in engine
applications.
International Journal of Computational Science and Engineering
(IJCSE), 2006.
To appear.
|
[4]
|
J. Demmel et al.
Handbook of Parallel Computing: Models, Algorithms and
Applications, volume 17 of Chapman & Hall/CRC Computer & Information
Science, chapter Prospectus for a Linear Algebra Software Library for Dense
Matrix Problems.
CRC Press, 1 edition, December 2007.
|
[5]
|
Alfredo Buttari.
Software Tools for Sparse Linear Algebra Computations.
PhD thesis, University of Rome Tor Vergata, 2006.
|
[6]
|
A. Buttari, P. D'Ambra, D. di Serafino, and S. Filippone.
Extending PSBLAS to Build Parallel Schwarz
Preconditioners.
In Springer, editor, Applied Parallel Computing. State of the
Art in Scientific Computing: 7th International Conference, PARA 2004, Lyngby,
Denmark, June 20-23, 2004, volume 3732 of Lecture Notes in Computer
Science, pages 593-602, February 2006.
|
[7]
|
Alfredo Buttari, Pasqua D'Ambra, Daniela di Serafino, and Salvatore Filippone.
2LEV-D2P4: a package of high-performance preconditioners for
scientific and engineering applications.
Appl. Algebra Eng., Commun. Comput., 18(3):223-239, 2007.
|
[8]
|
G. Bella, A. Buttari, A. De Maio, F. Del Citto, S. Filippone, and F. Gasperini.
FAST-EVP: an engine simulation tool.
In Springer, editor, High Performance Computing and
Communications. First International Conference, HPCC 2005, Proceedings,
volume 3726 of Lecture Notes in Computer Science, pages 976-986,
September 2005.
|
[9]
|
Alfredo Buttari, Piotr Luszczek, Jakub Kurzak, Jack Dongarra, and George
Bosilca.
SCOP3: A rough guide to scientific computing on the PlayStation
3. Version 0.1.
Technical Report UT-CS-07-595, Innovative Computing Laboratory,
University of Tennessee Knoxville, April 2007.
|
[10]
|
Alfredo Buttari, Jack Dongarra, Jakub Kurzak, Julien Langou, Piotr Luszczek,
and Stanimire Tomov.
The impact of multicore on math software.
In PARA, pages 1-10, 2006.
|
[11]
|
High Performance Computing and Grids in Action, chapter Exploiting Mixed
Precision Floating Point Hardware in Scientific Computations.
2007.
|
[12]
|
James W. Demmel, Jack Dongarra, Beresford Parlett, W. Kahan, Ming Gu, David
Bindel, Yozo Hida, Xiaoye S. Li, Osni A. Marques, E. Jason Riedy, Christof
Vömel, Julien Langou, Piotr Luszczek, Jakub Kurzak, Alfredo Buttari, Julie
Langou, and Stanimire Tomov.
Prospectus for the next LAPACK and ScaLAPACK libraries.
In PARA'06: State-of-the-Art in Scientific and Parallel
Computing, Umeå, Sweden, June 2006. High Performance Computing Center
North (HPC2N) and the Department of Computing Science, Umeå University,
Springer.
|
[13]
|
Jakub Kurzak, Alfredo Buttari, and Jack Dongarra.
Solving Systems of Linear Equations on the CELL Processor
Using Cholesky Factorization.
IEEE Transactions on Parallel and Distributed Systems, 2007.
To appear.
|
[14]
|
Alfredo Buttari, Jakub Kurzak, and Jack Dongarra.
Limitations of the PlayStation 3 for High Performance
Cluster Computing.
Technical Report UT-CS-07-597, Innovative Computing Laboratory,
University of Tennessee Knoxville, April 2007.
LAPACK Working Note 185.
|
[15]
|
Alfredo Buttari, Julien Langou, Jakub Kurzak, and Jack Dongarra.
Parallel Tiled QR Factorization for Multicore
Architectures.
Concurrency and Computation: Practice and Experience, 2007.
To appear. LAPACK Working Note 190.
|
[16]
|
Alfredo Buttari, Julien Langou, Jakub Kurzak, and Jack Dongarra.
A class of parallel tiled linear algebra algorithms for multicore
architectures.
Technical Report UT-CS-07-600, Innovative Computing Laboratory,
University of Tennessee Knoxville, September 2007.
Submitted to Parallel Computing journal. LAPACK
Working Note 191.
|
[17]
|
Alfredo Buttari, Jack Dongarra, Julie Langou, Julien Langou, Piotr Luszczek,
and Jakub Kurzak.
Mixed precision iterative refinement techniques for the solution of
dense linear systems.
Int. J. High Perform. Comput. Appl., 21(4):457-466, 2007.
|
[18]
|
Alfredo Buttari, Victor Eijkhout, Julien Langou, and Salvatore Filippone.
Performance optimization and modeling of blocked sparse kernels.
Int. J. High Perform. Comput. Appl., 21(4):467-484, 2007.
|
[19]
|
Alfredo Buttari, Jack Dongarra, Parry Husbands, Jakub Kurzak, and Katherine
Yelick.
Multithreading for synchronization tolerance in matrix factorization.
In Proceedings of the SciDAC 2007 Conference, Boston,
Massachusetts, 2007. Journal of Physics: Conference Series.
|
[20]
|
A. Buttari and S. Filippone.
PSBLAS-2.0 User's Manual.
University of Rome Tor Vergata, 2005.
|
[21]
|
Jakub Kurzak, Alfredo Buttari, Piotr Luszczek, and Jack Dongarra.
The PlayStation 3 for high performance scientific computing.
To appear in Computing in Science and Engineering.
|