How to Remove and Detect Duplicates in Spreadsheets using Python
Okay, so you might be wondering why the hell you need to remove duplicates in spreadsheets using Python when you can simply click the Remove Duplicates button in Excel.
And you would be justified in asking this question. However, the Remove Duplicates button keeps the FIRST occurrence of each duplicated record and removes all the rest.
What if you want to remove ALL duplicates? Or you want to remove all but the LAST duplicate? Or you just want to know which records are duplicates?
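To make those three questions concrete, here is a rough sketch of how each might look in pandas. It assumes pandas is the library you reach for, and the sample data and variable names are made up for illustration. Both `drop_duplicates` and `duplicated` accept a `keep` argument that covers each case:

```python
import pandas as pd

# Hypothetical sample data standing in for a spreadsheet
# (in practice you might load one with pd.read_excel or pd.read_csv).
df = pd.DataFrame({
    "name": ["Ann", "Bob", "Ann", "Cid", "Bob", "Ann"],
    "city": ["Oslo", "Rome", "Oslo", "Lima", "Rome", "Oslo"],
})

# Same idea as Excel's button: keep the first occurrence, drop the rest.
keep_first = df.drop_duplicates(keep="first")

# Keep only the LAST occurrence of each duplicated record.
keep_last = df.drop_duplicates(keep="last")

# Remove ALL records that have a duplicate, keeping none of them.
drop_all = df.drop_duplicates(keep=False)

# Just flag the duplicates: keep=False marks every copy as True.
dupes = df[df.duplicated(keep=False)]

print(keep_first, keep_last, drop_all, dupes, sep="\n\n")
```

Note that the original row labels are preserved in each result, so you can always trace a record back to its position in the source data.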