2009 | ||
---|---|---|
73 | Marghoob Mohiyuddin, Mark Hoemmen, James Demmel, Katherine A. Yelick: Minimizing communication in sparse matrix solvers. SC 2009 | |
72 | Grey Ballard, James Demmel, Olga Holtz, Oded Schwartz: Communication-optimal parallel and sequential Cholesky decomposition: extended abstract. SPAA 2009: 245-252 | |
71 | James Demmel, Yozo Hida, E. Jason Riedy, Xiaoye S. Li: Extra-Precise Iterative Refinement for Overdetermined Least Squares Problems. ACM Trans. Math. Softw. 35(4): (2009) | |
70 | Grey Ballard, James Demmel, Olga Holtz, Oded Schwartz: Communication-optimal Parallel and Sequential Cholesky Decomposition CoRR abs/0902.2537: (2009) | |
69 | Grey Ballard, James Demmel, Olga Holtz, Oded Schwartz: Minimizing Communication in Linear Algebra CoRR abs/0905.2485: (2009) | |
68 | Krste Asanovic, Rastislav Bodík, James Demmel, Tony Keaveny, Kurt Keutzer, John Kubiatowicz, Nelson Morgan, David A. Patterson, Koushik Sen, John Wawrzynek, David Wessel, Katherine A. Yelick: A view of the parallel computing landscape. Commun. ACM 52(10): 56-67 (2009) | |
67 | Samuel Williams, Leonid Oliker, Richard W. Vuduc, John Shalf, Katherine A. Yelick, James Demmel: Optimization of sparse matrix-vector multiplication on emerging multicore platforms. Parallel Computing 35(3): 178-194 (2009) | |
66 | James Demmel, Mark Hoemmen, Yozo Hida, E. Jason Riedy: Nonnegative Diagonals and High Performance on Low-Profile Matrices from Householder QR. SIAM J. Scientific Computing 31(4): 2832-2841 (2009) | |
2008 | ||
65 | James Demmel, Mark Hoemmen, Marghoob Mohiyuddin, Katherine A. Yelick: Avoiding communication in sparse matrix computations. IPDPS 2008: 1-12 | |
64 | Laura Grigori, James Demmel, Hua Xiang: Communication avoiding Gaussian elimination. SC 2008: 29 | |
63 | Vasily Volkov, James Demmel: Benchmarking GPUs to tune dense linear algebra. SC 2008: 31 | |
62 | Gary W. Howell, James Demmel, Charles T. Fulton, Sven Hammarling, Karen Marmol: Cache efficient bidiagonalization using BLAS 2.5 operators. ACM Trans. Math. Softw. 34(3): (2008) | |
61 | Osni Marques, Christof Vömel, James Demmel, Beresford N. Parlett: Algorithm 880: A testing infrastructure for symmetric tridiagonal eigensolvers. ACM Trans. Math. Softw. 35(1): (2008) | |
60 | James Demmel, Laura Grigori, Mark Hoemmen, Julien Langou: Communication-avoiding parallel and sequential QR factorizations CoRR abs/0806.2159: (2008) | |
59 | Jiawang Nie, James Demmel, Ming Gu: Global minimization of rational functions and the nearest GCDs. J. Global Optimization 40(4): 697-718 (2008) | |
58 | David Bindel, James Demmel, Mark J. Friedman: Continuation of Invariant Subspaces in Large Bifurcation Problems. SIAM J. Scientific Computing 30(2): 637-656 (2008) | |
57 | James Demmel, Osni Marques, Beresford N. Parlett, Christof Vömel: Performance and Accuracy of LAPACK's Symmetric Tridiagonal Eigensolvers. SIAM J. Scientific Computing 30(3): 1508-1526 (2008) | |
2007 | ||
56 | Sukun Kim, Shamim Pakzad, David E. Culler, James Demmel, Gregory Fenves, Steve Glaser, Martin Turon: Health monitoring of civil infrastructures using wireless sensor networks. IPSN 2007: 254-263 | |
55 | Samuel Williams, Leonid Oliker, Richard W. Vuduc, John Shalf, Katherine A. Yelick, James Demmel: Optimization of sparse matrix-vector multiplication on emerging multicore platforms. SC 2007: 38 | |
54 | Rajesh Nishtala, Richard W. Vuduc, James Demmel, Katherine A. Yelick: When cache blocking of sparse matrix vector multiply works and why. Appl. Algebra Eng. Commun. Comput. 18(3): 297-311 (2007) | |
53 | Rajesh Nishtala, Richard W. Vuduc, James Demmel, Katherine A. Yelick: When cache blocking of sparse matrix vector multiply works and why. Appl. Algebra Eng. Commun. Comput. 18(3): 297-311 (2007) | |
52 | James Demmel, Ioana Dumitriu, Olga Holtz, Plamen Koev: Accurate and Efficient Expression Evaluation and Linear Algebra CoRR abs/0712.4027: (2007) | |
51 | Laura Grigori, James Demmel, Xiaoye S. Li: Parallel Symbolic Factorization for Sparse LU with Static Pivoting. SIAM J. Scientific Computing 29(3): 1289-1314 (2007) | |
2006 | ||
50 | James Demmel, Jack Dongarra, Beresford N. Parlett, William Kahan, Ming Gu, David Bindel, Yozo Hida, Xiaoye S. Li, Osni Marques, E. Jason Riedy, Christof Vömel, Julien Langou, Piotr Luszczek, Jakub Kurzak, Alfredo Buttari, Julie Langou, Stanimire Tomov: Prospectus for the Next LAPACK and ScaLAPACK Libraries. PARA 2006: 11-23 | |
49 | Takahiro Katagiri, Christof Vömel, James Demmel: Automatic Performance Tuning for the Multi-section with Multiple Eigenvalues Method for Symmetric Tridiagonal Eigenproblems. PARA 2006: 938-948 | |
48 | Sukun Kim, Shamim Pakzad, David E. Culler, James Demmel, Gregory Fenves, Steve Glaser, Martin Turon: Wireless sensor networks for structural health monitoring. SenSys 2006: 427-428 | |
47 | James Demmel, Yozo Hida, William Kahan, Xiaoye S. Li, Sonil Mukherjee, E. Jason Riedy: Error bounds from extra-precise iterative refinement. ACM Trans. Math. Softw. 32(2): 325-351 (2006) | |
46 | James Demmel, Ioana Dumitriu, Olga Holtz, Robert Kleinberg: Fast matrix multiplication is stable CoRR abs/math/0603207: (2006) | |
45 | James Demmel, Ioana Dumitriu, Olga Holtz: Fast linear algebra is stable CoRR abs/math/0612264: (2006) | |
44 | Jiawang Nie, James Demmel, Bernd Sturmfels: Minimizing Polynomials via Sum of Squares over the Gradient Ideal. Math. Program. 106(3): 587-606 (2006) | |
2005 | ||
43 | James Demmel, Ioana Dumitriu, Olga Holtz: Toward accurate polynomial evaluation in rounded arithmetic (short report). Algebraic and Numerical Algorithms and Computer-assisted Proofs 2005 | |
42 | David Bindel, James Demmel, Mark J. Friedman, Willy Govaerts, Yuri A. Kuznetsov: Bifurcation Analysis of Large Equilibrium Systems in Matlab. International Conference on Computational Science (1) 2005: 50-57 | |
41 | James Demmel, Ioana Dumitriu, Olga Holtz: Toward accurate polynomial evaluation in rounded arithmetic CoRR abs/math/0508350: (2005) | |
2004 | ||
40 | Benjamin C. Lee, Richard W. Vuduc, James Demmel, Katherine A. Yelick: Performance Models for Evaluation and Automatic Tuning of Symmetric Sparse Matrix-Vector Multiply. ICPP 2004: 169-176 | |
39 | David Bindel, Zhaojun Bai, James Demmel: Model Reduction for RF MEMS Simulation. PARA 2004: 286-295 | |
38 | Eun-Jin Im, Ismail Bustany, Cleve Ashcraft, James Demmel, Katherine A. Yelick: Performance Tuning of Matrix Triple Products Based on Matrix Structure. PARA 2004: 740-746 | |
37 | Richard W. Vuduc, James Demmel, Jeff A. Bilmes: Statistical Models for Empirical Search-Based Performance Tuning. IJHPCA 18(1): 65-94 (2004) | |
2003 | ||
36 | Rich Vuduc, Attila Gyulassy, James Demmel, Katherine A. Yelick: Memory Hierarchy Optimizations and Performance ounds for Sparse A. International Conference on Computational Science 2003: 705-714 | |
35 | Eiji Mizutani, James Demmel: Iterative Scaled Trust-Region Learning in Krylov Subspaces via Pearlmutter's Implicit Sparse Hessian-Vector Multiply. NIPS 2003 | |
34 | Xiaoye S. Li, James Demmel: SuperLU_DIST: A scalable distributed-memory sparse direct solver for unsymmetric linear systems. ACM Trans. Math. Softw. 29(2): 110-140 (2003) | |
33 | Eiji Mizutani, James Demmel: On structure-exploiting trust-region regularized nonlinear least squares algorithms for neural-network learning. Neural Networks 16(5-6): 745-753 (2003) | |
2002 | ||
32 | Rich Vuduc, James Demmel, Katherine A. Yelick, Shoaib Kamil, Rajesh Nishtala, Benjamin C. Lee: Performance optimizations and bounds for sparse matrix-vector multiply. SC 2002: 1-35 | |
31 | Xiaoye S. Li, James Demmel, David H. Bailey, Greg Henry, Yozo Hida, Jimmy Iskandar, William Kahan, Suh Y. Kang, Anil Kapur, Michael C. Martin, Brandon Thompson, Teresa Tung, Daniel J. Yoo: Design, implementation and testing of extended and mixed precision BLAS. ACM Trans. Math. Softw. 28(2): 152-205 (2002) | |
30 | David Bindel, James Demmel, William Kahan, Osni Marques: On computing givens rotations reliably and efficiently. ACM Trans. Math. Softw. 28(2): 206-238 (2002) | |
2001 | ||
29 | Rich Vuduc, James Demmel, Jeff Bilmes: Statistical Models for Automatic Performance Tuning. International Conference on Computational Science (1) 2001: 117-126 | |
28 | L. Anthony Drummond, James Demmel, Carlos R. Mechoso, H. Robinson, Keith Sklower, Joseph A. Spahr: A Data Broker for Distributed Computing Environments. International Conference on Computational Science (1) 2001: 31-40 | |
27 | James Demmel, B. Diament, Gregorio Malajovich: On the Complexity of Computing Error Bounds. Foundations of Computational Mathematics 1(1): 101-125 (2001) | |
2000 | ||
26 | Eiji Mizutani, James Demmel: On Iterative Krylov-Dogleg Trust-Region Steps for Solving Neural Networks Nonlinear Least Squares Problems. NIPS 2000: 605-611 | |
25 | Rich Vuduc, James Demmel: Code Generators for Automatic Tuning of Numerical Kernels: Experiences with FFTW. SAIG 2000: 190-211 | |
1999 | ||
24 | Xiaoye S. Li, James Demmel: A Scalable Sparse Direct Solver Using Static Pivoting. PPSC 1999 | |
23 | James Demmel: Making Sparse Matrix Computations Scalable (Invited Talk Abstract). SPAA 1999: 43 | |
1998 | ||
22 | Joel H. Saltz, Alan Sussman, Susan L. Graham, James Demmel, Scott B. Baden, Jack Dongarra: Programming Tools and Environments. Commun. ACM 41(11): 64-73 (1998) | |
1997 | ||
21 | Jeff Bilmes, Krste Asanovic, Chee-Whye Chin, James Demmel: Optimizing Matrix Multiply Using PHiPAC: A Portable, High-Performance, ANSI C Coding Methodology. International Conference on Supercomputing 1997: 340-347 | |
20 | L. Susan Blackford, Jaeyoung Choi, Andrew J. Cleary, Eduardo F. D'Azevedo, James Demmel, Inderjit S. Dhillon, Jack Dongarra, Sven Hammarling, Greg Henry, Antoine Petitet, Ken Stanley, David W. Walker, R. Clinton Whaley: ScaLAPACK: A Linear Algebra Library for Message-Passing Computers. PPSC 1997 | |
19 | L. Susan Blackford, Andrew J. Cleary, Antoine Petitet, R. Clinton Whaley, James Demmel, Inderjit S. Dhillon, H. Ren, Ken Stanley, Jack Dongarra, Sven Hammarling: Practical Experience in the Numerical Dangers of Heterogeneous Computing. ACM Trans. Math. Softw. 23(2): 133-147 (1997) | |
18 | Soumen Chakrabarti, James Demmel, Katherine A. Yelick: Models and Scheduling Algorithms for Mixed Data and Task Parallel Programs. J. Parallel Distrib. Comput. 47(2): 168-184 (1997) | |
1996 | ||
17 | Andrew J. Cleary, James Demmel, Inderjit S. Dhillon, Jack Dongarra, Sven Hammarling, Antoine Petitet, H. Ren, Ken Stanley, R. Clinton Whaley: Practical Experience in the Dangers of Heterogeneous Computing. PARA 1996: 57-64 | |
1995 | ||
16 | Jaeyoung Choi, James Demmel, Inderjit S. Dhillon, Jack Dongarra, Susan Ostrouchov, Antoine Petitet, Ken Stanley, David W. Walker, R. Clinton Whaley: ScaLAPACK: A Portable Linear Algebra Library for Distributed Memory Computers - Design Issues and Performance. PARA 1995: 95-106 | |
15 | James Demmel, Ken Stanley: The Performance of Finding Eigenvalues and Eigenvaectors of Dense Symmetric Matrices on Distributed Memory Computers. PPSC 1995: 528-533 | |
14 | James Demmel, Sharon Smith: Performance of a Parallel Global Atmospheric Chemical Tracer Model. SC 1995 | |
13 | Soumen Chakrabarti, James Demmel, Katherine A. Yelick: Modeling the Benefits of Mixed Data and Task Parallelism. SPAA 1995: 74-83 | |
12 | Zhaojun Bai, David Day, James Demmel, Jack Dongarra, Ming Gu, Axel Ruhe, Henk A. van der Vorst: Templates for Linear Algebra Problems. Computer Science Today 1995: 115-140 | |
11 | Dinesh Manocha, James Demmel: Algorithms for Intersecting Parametric and Algebraic Curves II: Multiple Intersections. CVGIP: Graphical Model and Image Processing 57(2): 81-100 (1995) | |
1994 | ||
10 | Dinesh Manocha, James Demmel: Algorithms for intersecting parametric and algebraic curves I: simple intersections. ACM Trans. Graph. 13(1): 73-100 (1994) | |
9 | James Demmel, Xiaoye S. Li: Faster Numerical Algorithms via Exception Handling. IEEE Trans. Computers 43(8): 983-992 (1994) | |
1993 | ||
8 | James Demmel, Xiaoye S. Li: Faster numerical algorithms via exception handling. IEEE Symposium on Computer Arithmetic 1993: 234-241 | |
7 | James Demmel, Jack Dongarra, Robert A. van de Geijn, David W. Walker: LAPACK for Distributed Memory Architectures: The Next Generation. PPSC 1993: 323-329 | |
6 | Zhaojun Bai, James Demmel: Design of a Parallel Nonsymmetric Eigenroutine Toolbox, Part I. PPSC 1993: 391-398 | |
5 | Victor Pan, James Demmel: A New Algorithm for the Symmetric Tridiagonal Eigenvalue Problem. J. Complexity 9(3): 387-405 (1993) | |
1991 | ||
4 | James Demmel: LAPACK: A portable linear algebra library for high-performance computers. Concurrency - Practice and Experience 3(6): 655-666 (1991) | |
1990 | ||
3 | Ed Anderson, Zhaojun Bai, Jack Dongarra, A. Greenbaum, A. McKenney, Jeremy Du Croz, Sven Hammarling, James Demmel, Christian H. Bischof, Danny C. Sorensen: LAPACK: a portable linear algebra library for high-performance computers. SC 1990: 2-11 | |
1989 | ||
2 | Zhaojun Bai, James Demmel: On a Block Implementation of Hessenberg Multishift QR Iteration. International Journal of High Speed Computing 1(1): 97-112 (1989) | |
1987 | ||
1 | James Demmel: The geometry of III-conditioning. J. Complexity 3(2): 201-229 (1987) |