copt.datasets.load_madelon

copt.datasets.load_madelon(subset='full', data_dir='/builder/home/copt_data')

Download and return the madelon dataset.

Properties:

n_samples: 2600 n_features: 500

This is the binary classification version of the dataset as found in the LIBSVM dataset project:

Parameters
  • md5_check – bool Whether to do an md5 check on the downloaded files.

  • subset – string Can be one of ‘full’ for full dataset, ‘train’ for only the train set or ‘test’ for only the test set.

  • standardize – boolean If True, each feature will have zero mean and unit variance.

Returns

scipy.sparse CSR

Return data as CSR sparse matrix of shape=(2600, 500).

target: array of shape 2600

Labels, only takes values 0 or 1.

Return type

data

Examples