I am working on a data set about voter behavior with a few individual-level (level 1) variables such as age, gender, etc. I have created a household size variable by counting how many people with the same last name live at the same address. Thus, if there are three Schmidts at the exact same address, each was assigned a 3 as the family size.
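For reference, the household size count works roughly like the sketch below. This is only an illustration in plain Python: the field names `last_name` and `address` and the sample records are made up, not the actual column names or data, and the lowercasing and whitespace stripping is an assumption about how "exact same address" might be matched.

```python
from collections import Counter

# Hypothetical sample records; field names and values are illustrative only.
voters = [
    {"last_name": "Schmidt", "address": "1 Main St"},
    {"last_name": "Schmidt", "address": "1 Main St"},
    {"last_name": "Schmidt", "address": "1 Main St"},
    {"last_name": "Meyer", "address": "1 Main St"},
]

def household_key(voter):
    """Identify a household by (last name, address), ignoring case and padding."""
    return (voter["last_name"].strip().lower(), voter["address"].strip().lower())

# Count how many voters share each (last name, address) pair.
household_counts = Counter(household_key(v) for v in voters)

# Assign every voter the size of the household they belong to.
for v in voters:
    v["household_size"] = household_counts[household_key(v)]
```

With these sample records, each of the three Schmidts gets a household size of 3, while Meyer at the same address gets 1.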
Now, I want to create a variable describing the party mix of each household (are the people in a family voting homogeneously?). We have a variable describing which party each person is affiliated with.
I have no idea how to do that, and was hoping I might find some help or inspiration here... The dataset has 4 million registered voters in it, and I am a little lost on how to proceed.
Looking forward to your ideas!