Datasets


Here are a number of datasets for regression analysis, CVT basis calculations, K-means analysis, and so on.

The dataset directories include:

  1. burgers, a dataset directory which contains 40 solutions of the Burgers equation at equally spaced times from 0 to 1, with values at 41 equally spaced nodes in [0,1];
  2. case1_flow, a dataset directory which 401 solutions of a flow problem in a channel;
  3. cavity_flow, a dataset directory which contains 500 time steps of Navier-Stokes flow in a driven cavity;
  4. census, a dataset directory which contains US census data;
  5. cities, a dataset directory which contains sets of information about cities and the distances between them;
  6. cvt, a dataset directory which contains examples of Centroidal Voronoi Tessellations;
  7. cvt_mod, a dataset directory which contains examples of Centroidal Voronoi Tessellations on a "logical torus" or "wrap around" unit hypercube;
  8. faure, a dataset directory which contains examples of the Faure quasirandom sequence;
  9. grid, a dataset directory which contains examples of sets of points on a uniform grid, with random holes;
  10. halton, a dataset directory which contains examples of the Halton quasirandom sequence;
  11. hammersley, a dataset directory which contains examples of the Hammersley quasirandom sequence;
  12. hartigan, a dataset directory which contains datasets for testing clustering algorithms;
  13. hbsmc, a dataset directory which contains the Harwell Boeing Sparse Matrix Collection;
  14. hex_grid, a dataset directory which contains datasets of points forming hex grids, either in squares or more general 2D regions;
  15. ihs, a dataset directory which contains examples of the Improved Distributed Hypercube Sampling quasirandom sequence;
  16. inout_flow, a dataset directory which contains 500 time steps of Navier-Stokes flow in a region with specified inflow and outflow;
  17. inout_flow2, a dataset directory which contains 800 time steps of Navier-Stokes flow in a region with specified inflow and outflow;
  18. latin_center, a dataset directory which contains examples of the Latin Center Square quasirandom sequence;
  19. latin_edge, a dataset directory which contains examples of the Latin Edge Square quasirandom sequence;
  20. latin_random, a dataset directory which contains examples of the Latin Random Square quasirandom sequence;
  21. lcvt, a dataset directory which contains examples of "Latinized" Centroidal Voronoi Tessellations;
  22. lcvt_mod, a dataset directory which contains examples of "Latinized" Centroidal Voronoi Tessellations on a logical torus;
  23. lhs, a dataset directory which contains datasets related to Latin Hypercube Sampling;
  24. martinez, a dataset directory which contains datasets for computational statistics;
  25. mds, a dataset directory which contains datasets for multidimensional scaling;
  26. mhd_control, a dataset directory which contains datasets from the control of a magnetic hydrodynamic 2D flow;
  27. mps, a dataset directory which contains linear programming datasets;
  28. mpsc, a dataset directory which contains linear programming datasets in compressed format;
  29. niederreiter2, a dataset directory which contains examples of the Niederreiter quasirandom sequence using a base of 2;
  30. oa, a dataset directory which contains samples of orthogonal arrays, as computed by Art Owen;
  31. pcl, a dataset directory which contains datasets from a gene expression experiment on Arabidopsis;
  32. quadrature_rules, a dataset directory which contains quadrature rules for 1D intervals, 2D rectangles or multidimensional rectangular regions, stored as a file of abscissas, a file of weights, and a file of region limits.
  33. quadrature_rules_chebyshev1, a dataset directory which contains quadrature rules for integration on [-1,+1], using a Gauss-Chebyshev type 1 rule.
  34. quadrature_rules_chebyshev2, a dataset directory which contains quadrature rules for integration on [-1,+1], using a Gauss-Chebyshev type 2 rule.
  35. quadrature_rules_gegenbauer, a dataset directory which contains quadrature rules for integration on [-1,+1], using a Gauss-Gegenbauer rule.
  36. quadrature_rules_gen_hermite, a dataset directory which contains quadrature rules for integration on an infinite interval, using a generalized Gauss-Hermite rule.
  37. quadrature_rules_gen_laguerre, a dataset directory which contains quadrature rules for integration on a semi-infinite interval, using a generalized Gauss-Laguerre rule.
  38. quadrature_rules_halton, a dataset directory which contains quadrature rules for multidimensional unit cubes, based on a Halton quasirandom sequence. stored as a file of abscissas, a file of weights, and a file of region limits.
  39. quadrature_rules_hermite, a dataset directory which contains Gauss-Hermite quadrature rules, for integration on the interval (-oo,+oo), with weight function exp(-x*x).
  40. quadrature_rules_jacobi, a dataset directory which contains Gauss-Jacobi quadrature rules for the interval [-1,+1] with weight function (1-x)^ALPHA * (1+x)^BETA.
  41. quadrature_rules_laguerre, a dataset directory which contains Gauss-Laguerre quadrature rules for integration on the interval [A,+oo), with weight function exp(-x).
  42. quadrature_rules_latin_center, a dataset directory which contains quadrature rules for multidimensional unit cubes, based on centered Latin hypercubes. stored as a file of abscissas, a file of weights, and a file of region limits.
  43. quadrature_rules_legendre, a dataset directory which contains Gauss-Legendre quadrature rules for the interval [-1,+1].
  44. quadrature_rules_tet, a dataset directory which contains quadrature rules for tetrahedrons, stored as a file of abscissas, a file of weights, and a file of vertices.
  45. quadrature_rules_tri, a dataset directory which contains quadrature rules for triangles, stored as a file of abscissas, a file of weights, and a file of vertices.
  46. quadrature_rules_uniform, a dataset directory which contains quadrature rules for multidimensional unit cubes, based on a uniform pseudorandom sequence. stored as a file of abscissas, a file of weights, and a file of region limits.
  47. regression, a dataset directory which contains datasets for testing linear regression;
  48. romero, a dataset directory which contains datasets of Latin Square points used as starting points for CVT calculations;
  49. sample_2d, a dataset directory which contains datasets of points that "sample" the unit square;
  50. sgb, a dataset directory which contains files used as input data for demonstrations and tests of Donald Knuth's Stanford Graph Base.
  51. sobol, a dataset directory which contains samples of the Sobol quasirandom sequence;
  52. spaeth, a dataset directory which contains datasets for cluster analysis;
  53. spaeth2, a dataset directory which contains datasets for cluster analysis;
  54. sparse_grid_cc, a dataset directory which contains multidimensional Smolyak sparse grids based on the 1D Clenshaw Curtis rule;
  55. sparse_grid_f2, a dataset directory which contains multidimensional Smolyak sparse grids based on the 1D Fejer type 2 rule;
  56. sparse_grid_gl, a dataset directory which contains multidimensional Smolyak sparse grids based on the 1D Gauss-Legendre rule;
  57. sparse_grid_gp, a dataset directory which contains multidimensional Smolyak sparse grids based on the 1D Gauss-Patterson rule;
  58. sparse_grid_hermite, a dataset directory which contains multidimensional Smolyak sparse grids based on the 1D Gauss-Hermite rule;
  59. sparse_grid_laguerre, a dataset directory which contains multidimensional Smolyak sparse grids based on the 1D Gauss-Laguerre rule;
  60. sparse_grid_mixed, a dataset directory which contains multidimensional Smolyak sparse grids based on a mixture of 1D rules.
  61. sparse_grid_ncc, a dataset directory which contains multidimensional Smolyak sparse grids based on the 1D Newton Cotes Closed rule;
  62. sparse_grid_nco, a dataset directory which contains multidimensional Smolyak sparse grids based on the 1D Newton Cotes Open rule;
  63. sphere_grid, a dataset directory which contains grids of points, lines, triangles or quadrilaterals on a sphere;
  64. tcell_flow, a dataset directory which contains 500 time steps of Navier-Stokes flow in a T-cell;
  65. test_con, a dataset directory which contains sequences of points that lie on multidimensional curves defined by sets of nonlinear equations;
  66. text, a dataset directory which contains actual "texts", such as the Gettysburg Address;
  67. uniform, a dataset directory which contains examples of a uniform pseudorandom sequence;
  68. van_der_corput, a dataset directory which contains examples of the one-dimensional van der Corput, for various bases;
  69. words, a dataset directory which contains lists of words;

You can go up one level to the main web page.


Last revised on 06 October 2008.