I'm running SPSS Statistics 23 and I'm trying to anonymize a bunch of usernames in a data set of about 500 K records with the following command:
SPSSINC ANON VARIABLES =user_name
/OPTIONS ONETOONE=user_name MAXRVALUE=9999999
If I take a random sample of 5 K users it only takes about one minute to complete this command successfully.
With the full data set this is very slow, it has now been running about 45 minutes with no end in sight (this is my third time trying to run this command, I've rebooted my laptop but no help).
I've presorted the file with the user_name in ascending order.
Is my data set simply too big for this command?