Last week, as we began to brainstorm our attributes and entities, I started a spreadsheet of my initial data collection thoughts and intentions. So far, this is what I have come up with:
I still have many more venues to add to this list. Before I continue, however, I know I need to make some important decisions.
I have to decide whether to split any of these attributes into their own entities, and whether to include the following:
A. a cap on venue size – if a venue is under a certain size, it will not be included in my data set
B. a cutoff for how far back I want to look when including venues that have since closed
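
A rough sketch of how decisions A and B could be applied as filters, in Python. The field names (`capacity`, `closed_year`), the sample venues, and both threshold values are placeholders for illustration only, not decisions that have been made:

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Venue:
    name: str
    capacity: int                      # hypothetical attribute: maximum number of spectators
    closed_year: Optional[int] = None  # None means the venue is still open


# Placeholder thresholds; the real values are still to be decided.
MIN_CAPACITY = 500
EARLIEST_CLOSED_YEAR = 1990


def include_venue(venue: Venue,
                  min_capacity: int = MIN_CAPACITY,
                  earliest_closed_year: int = EARLIEST_CLOSED_YEAR) -> bool:
    """Return True if the venue passes both inclusion rules."""
    # Rule A: drop venues below the size threshold.
    if venue.capacity < min_capacity:
        return False
    # Rule B: drop venues that closed before the cutoff year.
    if venue.closed_year is not None and venue.closed_year < earliest_closed_year:
        return False
    return True


# Made-up sample rows, just to exercise the two rules.
venues = [
    Venue("Example Hall", 2000),
    Venue("Tiny Room", 150),
    Venue("Old Arena", 10000, closed_year=1975),
    Venue("Recent Club", 800, closed_year=2005),
]

kept = [v.name for v in venues if include_venue(v)]
print(kept)
```

Keeping the two thresholds as named values means either one can be changed later without touching the rest of the data set.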
This is a fantastic start! As we move forward, we should think about how you’re going to derive some of this data (e.g., average age of spectator), and perhaps consider some sampling strategies: this is an ambitious list! But congrats: you’re thinking like a database!