Co-authored a publication on respiratory disease surveillance, identifying leading indicators of hospitalization trends through prior-week analysis and rank correlation techniques.
Designed and maintained scalable statistical analysis pipelines using PySpark and Spark SQL to analyze seasonal COVID-19, Flu, and RSV hospital admission trends, supporting surveillance and reporting efforts.
Improved data reliability and reporting efficiency by implementing data quality checks for missing and erroneous values, testing hospital data submission APIs, and mentoring team members on Python-based automation for PowerPoint, Word, and Excel report generation.
Role Summary
Co-authored a publication on respiratory disease surveillance, identifying leading indicators of hospitalization trends through prior-week analysis and rank correlation techniques.
Designed and maintained scalable statistical analysis pipelines using PySpark and Spark SQL to analyze seasonal COVID-19, Flu, and RSV hospital admission trends, supporting surveillance and reporting efforts.
Improved data reliability and reporting efficiency by implementing data quality checks for missing and erroneous values, testing hospital data submission APIs, and mentoring team members on Python-based automation for PowerPoint, Word, and Excel report generation.