asreview.models.query.ClusterQuery

class asreview.models.query.ClusterQuery(cluster_size=350, update_interval=200, random_state=None)[source]

Clustering query strategy (cluster).

Use clustering after feature extraction on the dataset. Then the highest probabilities within random clusters are sampled.

Parameters:
  • cluster_size (int) – Size of the clusters to be made. If the size of the clusters is smaller than the size of the pool, fall back to max sampling.

  • update_interval (int) – Update the clustering every x instances.

  • random_state (int, asreview.utils.SeededRandomState) – State/seed of the RNG.

Attributes

default_param

Get the default parameters of the model.

label

name

param

Get the (assigned) parameters of the model.

Methods

full_hyper_space()

hyper_space()

query(X, classifier[, n_instances, ...])

Query method for strategies which use class probabilities.