A Data Science Central Community
Imagine that someone asks you this weird but serious question in a professional data science context: does size matter when it comes to the risk of breast cancer?
How would you proceed to prove or disprove that breast size is a risk factor, that is, the bigger the breast, the higher the risk? How would you quantify that risk relative to other factors (e.g. smoking)? Assuming it is a risk factor, how would you assess solutions that reduce breast size, in terms of reducing cancer risk, given that these solutions probably have side effects? And how would you identify patients suitable for breast reduction (e.g. via diet, drugs or other procedures)?
The first piece of evidence that size matters is that breast cancer has a much lower incidence in men than in women, though this could be due to reasons other than size (in short, a spurious correlation). What about incidence in other primates, males versus females?
Most importantly, how would you gather data or test this assumption?
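One possible starting point, offered only as a minimal sketch: if you could collect a case-control sample (women with and without breast cancer, each classified as above or below some size threshold), an odds ratio with a Wald confidence interval gives a first, unadjusted measure of association. The function name `odds_ratio_ci`, the binary size threshold, and the counts below are all assumptions made for illustration; the counts are invented placeholders, not real data, and this does nothing to address confounders such as age or body mass.

```python
import math


def odds_ratio_ci(exposed_cases, exposed_controls,
                  unexposed_cases, unexposed_controls, z=1.96):
    """Odds ratio and Wald confidence interval for a 2x2 case-control table.

    "Exposed" here means "above a chosen size threshold" (hypothetical).
    z=1.96 corresponds to an approximate 95% interval.
    """
    a, b, c, d = (exposed_cases, exposed_controls,
                  unexposed_cases, unexposed_controls)
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cell counts must be positive")

    odds_ratio = (a * d) / (b * c)
    # Standard error of log(odds ratio) under the usual Wald approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Placeholder counts, invented purely to exercise the function; NOT real data.
odds, low, high = odds_ratio_ci(30, 70, 20, 80)
print(f"odds ratio = {odds:.2f}, approx. 95% CI = ({low:.2f}, {high:.2f})")
```

An interval that includes 1 would mean the sample cannot distinguish the effect from no association. The same function could be run on a smoking exposure to put the two factors on a comparable scale, though a regression model that adjusts for confounders would be the more defensible next step.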
Finally, another question that has nothing to do with cancer: why are breasts in the human species the biggest among all primates? Is it because we only have two, because we stand upright (we are not four-legged creatures) so that gravity acts differently, or for some other reason? Also, why has size decreased in modern societies? How would you get data and analyze it to answer these questions?