Born of personal excitement and curiosity about data analytics, I've curated this site for students and fellow enthusiasts.
Check out my ABOUT page for a general introduction to this site, but the gist is that you'll find information, links, and an ever-increasing set of data project examples leveraging public health data and open-source programming languages and libraries in R and Python. Machine learning, data preprocessing, predictive modeling, and practical statistics will feature prominently on this site as well.
Here's an outline of the path I'm following on the public health data front, beginning with the United States:
General Population Distributions
Socioeconomic Data
Chronic Disease Burden
Mortality
Oncology Data
Mental Health Dynamics
Education
Diet & Exercise
Cancer Screenings & Immunizations
Medical Treatments (drugs & procedures)
Care Utilization (hospital inpatient & outpatient)
Healthcare Spend
Insurance Distributions
Environment & Pollutants
Crime
You'll also find some of my favorite helpful links at the bottom of every page. Watch for change as I become aware of additional available content.