asreview.review.BaseReview

class asreview.review.BaseReview(as_data, project, model=<asreview.models.classifiers.nb.NaiveBayesClassifier object>, query_model=<asreview.models.query.max.MaxQuery object>, balance_model=<asreview.models.balance.simple.SimpleBalance object>, feature_model=<asreview.models.feature_extraction.tfidf.Tfidf object>, n_papers=None, n_instances=1, stop_if=None, start_idx=[])[source]

Base class for Systematic Review.

Parameters:

as_data (asreview.ASReviewData) – The data object which contains the text, labels, etc.
project (path-like) – Path to the project file.
model (BaseTrainClassifier) – Initialized model to fit the data during active learning. See asreview.models.utils.py for possible models.
query_model (BaseQueryStrategy) – Initialized model to query new instances for review, such as random sampling or max sampling. See asreview.query_strategies.utils.py for query models.
balance_model (BaseBalance) – Initialized model to redistribute the training data during the active learning process. They might either resample or undersample specific papers.
feature_model (BaseFeatureExtraction) – Feature extraction model that converts texts and keywords to feature matrices.
n_instances (int) – Number of papers to query at each step in the active learning process.
stop_if (int) – Number of steps/queries to perform. Set to None for no limit.
start_idx (numpy.ndarray) – Start the simulation/review with these indices. They are assumed to be already labeled. Failing to do so might result bad behaviour.

Attributes

settings

Get an ASReview settings object

Methods

`review`()	Do a full review.
`train`()	Train a new model on the labeled data.