Submitted by ross on Thu, 03/03/2011 - 10:38
Today I was tasked with de-duping a csv file that had around 25,000 single entry rows of emails. There were numerous duplicates, and I needed a way to read the file, find out how many duplicates, and then rewrite the file without duplicate entries. I wanted to use python because I tire of php and have little to no experience with any other programming language. Maybe I should have chosen another language to complicate my life, but for time's sake, python became my goto language.