Explore key insights from the migration of one of the largest U.S. immunization registries to Databricks in this 35-minute conference talk. Learn how SCD Type 2 tables were implemented using Delta Live Tables and change data capture from an Oracle database, for a system that manages over 50 million individuals and nearly one billion records. Discover lessons on understanding the nature of your data, how file structure affects performance, what to consider when sizing compute, and workflow decoupling. Gain practical knowledge on balancing cost against SLAs when crafting a cluster strategy. Hear from Michael Pisarsky, Solution Architect at Mosaic Data Solutions, and Rex Phillips, Strategy Senior Principal at Accenture, as they share their experience and look ahead to future enhancements such as Liquid Clustering and serverless computing for managing vast healthcare datasets efficiently.