ETL

  • Extract
    • get data from multiple sources
  • Data Cleaning
  • Data Transformation
    • to wareshouse format
  • Load
    • sort, summarize, consolidation, views, integrity, indexes, partitions
  • Refresh
    • update data regulary (only propagate changes from the data sources to the warehouse)
    • Integrations should work automatically based on the new data