asreview.ASReviewData.duplicated
- ASReviewData.duplicated(pid='doi')
Return boolean Series denoting duplicate rows.
Identify duplicates based on titles and abstracts and, if available, on a persistent identifier (PID) such as the Digital Object Identifier (DOI).
- Parameters
pid (string) – Which persistent identifier to use for deduplication. Default is ‘doi’.
- Returns
pandas.Series – Boolean Series indicating, for each row, whether it is a duplicate.
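
The matching rule described above can be illustrated with a minimal plain-Python sketch. It is not ASReview's implementation: the helper `mark_duplicates`, the record layout (dicts with `title`, `abstract`, and a PID key), the text normalization, and the choice to leave the first occurrence unflagged are all assumptions made for illustration. It returns a list of booleans, one per record, in place of a pandas Series.

```python
# Illustrative sketch only; this is NOT ASReview's implementation.
# `mark_duplicates` and the record layout are hypothetical.
def mark_duplicates(records, pid="doi"):
    """Return one boolean per record: True if an earlier record matches it."""
    seen_pids = set()
    seen_texts = set()
    flags = []
    for record in records:
        # Assumption: PIDs are compared case-insensitively, ignoring outer whitespace.
        pid_value = (record.get(pid) or "").strip().lower()
        # Assumption: text is the lowercased title + abstract with whitespace collapsed.
        text = " ".join(
            f"{record.get('title') or ''} {record.get('abstract') or ''}".lower().split()
        )
        # A record is a duplicate if its PID (when present) or its text was seen before.
        is_dup = (pid_value != "" and pid_value in seen_pids) or (
            text != "" and text in seen_texts
        )
        flags.append(is_dup)
        if pid_value:
            seen_pids.add(pid_value)
        if text:
            seen_texts.add(text)
    return flags


records = [
    {"title": "Active learning", "abstract": "Screening.", "doi": "10.1/a"},
    {"title": "Another paper", "abstract": "Different.", "doi": "10.1/A"},  # same DOI, other case
    {"title": "active  learning", "abstract": "screening.", "doi": None},   # same text, no DOI
    {"title": "Unique paper", "abstract": "Nothing alike.", "doi": None},
]
flags = mark_duplicates(records)
print(flags)  # [False, True, True, False]
```

A boolean result of this shape is typically used as a mask: rows flagged `True` are dropped, and the first occurrence of each record is kept.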