copt.datasets.load_gisette

copt.datasets.load_gisette(subset='full', data_dir='/builder/home/copt_data')

Download and return the covtype dataset.

Properties:

n_samples: 6000 (train) n_features: 5000 density: 22% of nonzero coefficients on train set.

This is the binary classification version of the dataset as found in the LIBSVM dataset project:

Args

standardize: boolean If True, each feature will have zero mean and unit variance.

data_dir: string

Directory from which to read the data. Defaults to $HOME/copt_data/

Returns

scipy.sparse CSR matrix target: numpy array

Labels, only takes values 0 or 1.

Return type

data