Collapses rows that refer to the same publication when results from multiple sources are combined (SciELO and OpenAlex frequently index the same Brazilian pedology papers). The DOI is the primary key when available; a normalised lower-case title is the fallback.
Usage
causal_corpus_deduplicate(corpus, by = c("doi", "title"))
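The matching rule described above can be illustrated with a minimal base-R sketch. This is not the package's implementation: `dedupe_sketch()`, the column names `source`, `doi` and `title`, the DOI prefix stripping, and the toy records are all assumptions made for illustration, and the real function may normalise keys differently.

```r
# Hypothetical helper illustrating "DOI first, normalised title as fallback".
# Assumes `corpus` is a data frame with character columns `doi` and `title`.
dedupe_sketch <- function(corpus) {
  # Normalise DOIs: trim, lower-case, drop a leading resolver URL if present.
  doi <- tolower(trimws(corpus$doi))
  doi <- sub("^https?://(dx\\.)?doi\\.org/", "", doi)
  doi[!is.na(doi) & doi == ""] <- NA_character_

  # Normalise titles: lower-case, collapse runs of non-alphanumerics to one space.
  title <- trimws(gsub("[^[:alnum:]]+", " ", tolower(corpus$title)))

  # One key per row: the DOI when available, otherwise the normalised title.
  key <- ifelse(is.na(doi), paste0("title:", title), paste0("doi:", doi))

  # Rows with neither a DOI nor a usable title are never collapsed.
  no_key <- is.na(doi) & (is.na(title) | title == "")

  # Keep the first occurrence of each key.
  corpus[no_key | !duplicated(key), , drop = FALSE]
}

# Toy records (made up): rows 1-2 share a DOI, rows 3-4 share a title.
corpus <- data.frame(
  source = c("scielo", "openalex", "scielo", "openalex"),
  doi    = c("10.0000/example.1", "https://doi.org/10.0000/EXAMPLE.1", NA, NA),
  title  = c("Soil carbon in Oxisols", "Soil Carbon in Oxisols",
             "Latossolos do Cerrado", "Latossolos  do cerrado."),
  stringsAsFactors = FALSE
)
deduped <- dedupe_sketch(corpus)
```

In this sketch a row that has a DOI is never matched by title, so a DOI-less record and a DOI-bearing record of the same paper would both survive; whether `causal_corpus_deduplicate()` handles that case is not stated here.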