asreview.search.fuzzy_find

asreview.search.fuzzy_find(as_data, keywords, threshold=60, max_return=10, exclude=None)[source]

Find a record using keywords.

It looks for keywords in the title/authors/keywords (for as much is available). Using the diflib package it creates a ranking based on token set matching.

Parameters:
  • as_data (asreview.Dataset) – ASReview data object to search

  • keywords (str) – A string of keywords together, can be a combination.

  • threshold (float) – Don’t return records below this threshold.

  • max_return (int) – Maximum number of records to return.

  • exclude (list, numpy.ndarray) – List of indices that should be excluded in the search. You would put papers that were already labeled here for example.

Returns:

list – Sorted list of indexes that match best the keywords.