Datasets
Here are a number of datasets for regression analysis, CVT basis
calculations, K-means analysis, and so on.
The dataset directories include:
-
burgers,
a dataset directory which
contains 40 solutions of the Burgers equation at equally
spaced times from 0 to 1, with values
at 41 equally spaced nodes in [0,1];
-
case1_flow,
a dataset directory which
401 solutions of a flow problem in a channel;
-
cavity_flow,
a dataset directory which
contains 500 time steps of Navier-Stokes flow in
a driven cavity;
-
census,
a dataset directory which
contains US census data;
-
cities,
a dataset directory which
contains sets of information about cities and the distances
between them;
-
cvt,
a dataset directory which
contains examples of Centroidal Voronoi Tessellations;
-
cvt_mod,
a dataset directory which
contains examples of Centroidal Voronoi Tessellations
on a "logical torus" or "wrap around" unit hypercube;
-
faure,
a dataset directory which
contains examples of the Faure quasirandom sequence;
-
grid,
a dataset directory which
contains examples of sets of points on a uniform grid, with random holes;
-
halton,
a dataset directory which
contains examples of the Halton quasirandom sequence;
-
hammersley,
a dataset directory which
contains examples of the Hammersley quasirandom sequence;
-
hartigan,
a dataset directory which
contains datasets for testing clustering algorithms;
-
hbsmc,
a dataset directory which
contains the Harwell Boeing Sparse Matrix Collection;
-
hex_grid,
a dataset directory which
contains datasets of points forming hex grids, either in
squares or more general 2D regions;
-
ihs,
a dataset directory which
contains examples of the Improved Distributed Hypercube
Sampling quasirandom sequence;
-
inout_flow,
a dataset directory which
contains 500 time steps of Navier-Stokes flow in a region with
specified inflow and outflow;
-
inout_flow2,
a dataset directory which
contains 800 time steps of Navier-Stokes flow in a region with
specified inflow and outflow;
-
latin_center,
a dataset directory which
contains examples of the Latin Center Square quasirandom sequence;
-
latin_edge,
a dataset directory which
contains examples of the Latin Edge Square quasirandom sequence;
-
latin_random,
a dataset directory which
contains examples of the Latin Random Square quasirandom sequence;
-
lcvt,
a dataset directory which
contains examples of "Latinized" Centroidal Voronoi
Tessellations;
-
lcvt_mod,
a dataset directory which
contains examples of "Latinized" Centroidal Voronoi
Tessellations on a logical torus;
-
lhs,
a dataset directory which
contains datasets related to Latin Hypercube Sampling;
-
martinez,
a dataset directory which
contains datasets for computational statistics;
-
mds,
a dataset directory which
contains datasets for multidimensional scaling;
-
mhd_control,
a dataset directory which
contains datasets from the control of a magnetic
hydrodynamic 2D flow;
-
mps,
a dataset directory which
contains linear programming datasets;
-
mpsc,
a dataset directory which
contains linear programming datasets in compressed format;
-
niederreiter2,
a dataset directory which
contains examples of the Niederreiter quasirandom sequence
using a base of 2;
-
oa,
a dataset directory which
contains samples of orthogonal arrays, as computed by Art Owen;
-
pcl,
a dataset directory which
contains datasets from a gene expression experiment on Arabidopsis;
-
quadrature_rules,
a dataset directory which
contains quadrature rules for 1D intervals,
2D rectangles or multidimensional rectangular regions,
stored as a file of abscissas, a file of weights,
and a file of region limits.
-
quadrature_rules_chebyshev1,
a dataset directory which
contains quadrature rules for integration on [-1,+1],
using a Gauss-Chebyshev type 1 rule.
-
quadrature_rules_chebyshev2,
a dataset directory which
contains quadrature rules for integration on [-1,+1],
using a Gauss-Chebyshev type 2 rule.
-
quadrature_rules_gegenbauer,
a dataset directory which
contains quadrature rules for integration on [-1,+1],
using a Gauss-Gegenbauer rule.
-
quadrature_rules_gen_hermite,
a dataset directory which
contains quadrature rules for integration on an infinite interval,
using a generalized Gauss-Hermite rule.
-
quadrature_rules_gen_laguerre,
a dataset directory which
contains quadrature rules for integration on a semi-infinite interval,
using a generalized Gauss-Laguerre rule.
-
quadrature_rules_halton,
a dataset directory which
contains quadrature rules for multidimensional unit cubes,
based on a Halton quasirandom sequence.
stored as a file of abscissas, a file of weights,
and a file of region limits.
-
quadrature_rules_hermite,
a dataset directory which
contains Gauss-Hermite quadrature rules, for integration
on the interval (-oo,+oo), with weight function exp(-x*x).
-
quadrature_rules_jacobi,
a dataset directory which
contains Gauss-Jacobi quadrature rules for the interval [-1,+1]
with weight function (1-x)^ALPHA * (1+x)^BETA.
-
quadrature_rules_laguerre,
a dataset directory which
contains Gauss-Laguerre quadrature rules for integration on
the interval [A,+oo), with weight function exp(-x).
-
quadrature_rules_latin_center,
a dataset directory which
contains quadrature rules for multidimensional unit cubes,
based on centered Latin hypercubes.
stored as a file of abscissas, a file of weights,
and a file of region limits.
-
quadrature_rules_legendre,
a dataset directory which
contains Gauss-Legendre quadrature rules for the interval [-1,+1].
-
quadrature_rules_tet,
a dataset directory which
contains quadrature rules for tetrahedrons,
stored as a file of abscissas, a file of weights,
and a file of vertices.
-
quadrature_rules_tri,
a dataset directory which
contains quadrature rules for triangles,
stored as a file of abscissas, a file of weights,
and a file of vertices.
-
quadrature_rules_uniform,
a dataset directory which
contains quadrature rules for multidimensional unit cubes,
based on a uniform pseudorandom sequence.
stored as a file of abscissas, a file of weights,
and a file of region limits.
-
regression,
a dataset directory which
contains datasets for testing linear regression;
-
romero,
a dataset directory which
contains datasets of Latin Square points used as starting
points for CVT calculations;
-
sample_2d,
a dataset directory which
contains datasets of points that "sample" the unit square;
-
sgb,
a dataset directory which
contains files used as input data for
demonstrations and tests of Donald Knuth's Stanford Graph Base.
-
sobol,
a dataset directory which
contains samples of the Sobol quasirandom sequence;
-
spaeth,
a dataset directory which
contains datasets for cluster analysis;
-
spaeth2,
a dataset directory which
contains datasets for cluster analysis;
-
sparse_grid_cc,
a dataset directory which
contains multidimensional Smolyak sparse grids
based on the 1D Clenshaw Curtis rule;
-
sparse_grid_f2,
a dataset directory which
contains multidimensional Smolyak sparse grids
based on the 1D Fejer type 2 rule;
-
sparse_grid_gl,
a dataset directory which
contains multidimensional Smolyak sparse grids
based on the 1D Gauss-Legendre rule;
-
sparse_grid_gp,
a dataset directory which
contains multidimensional Smolyak sparse grids
based on the 1D Gauss-Patterson rule;
-
sparse_grid_hermite,
a dataset directory which
contains multidimensional Smolyak sparse grids
based on the 1D Gauss-Hermite rule;
-
sparse_grid_laguerre,
a dataset directory which
contains multidimensional Smolyak sparse grids
based on the 1D Gauss-Laguerre rule;
-
sparse_grid_mixed,
a dataset directory which
contains multidimensional Smolyak sparse grids
based on a mixture of 1D rules.
-
sparse_grid_ncc,
a dataset directory which
contains multidimensional Smolyak sparse grids
based on the 1D Newton Cotes Closed rule;
-
sparse_grid_nco,
a dataset directory which
contains multidimensional Smolyak sparse grids
based on the 1D Newton Cotes Open rule;
-
sphere_grid,
a dataset directory which
contains grids of points, lines, triangles or quadrilaterals on a sphere;
-
tcell_flow,
a dataset directory which
contains 500 time steps of Navier-Stokes flow in a T-cell;
-
test_con,
a dataset directory which
contains sequences of points that lie on multidimensional curves defined by
sets of nonlinear equations;
-
text,
a dataset directory which
contains actual "texts", such as the Gettysburg Address;
-
uniform,
a dataset directory which
contains examples of a uniform pseudorandom sequence;
-
van_der_corput,
a dataset directory which
contains examples of the one-dimensional van der Corput,
for various bases;
-
words,
a dataset directory which
contains lists of words;
You can go up one level to
the main web page.
Last revised on 06 October 2008.