0.3.0 Databases and storage section updated
Databases and storage
Disk: HDD, SSD and cloud (S3) storage. Disk costs and time to access. File systems (HDFS).
Relational databases, ER diagram, normalized forms. SQL. Other types of databases and NoSQL (key-value, graph, column, vector).
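As a small illustration of the relational topics above, here is a minimal sketch using Python's built-in sqlite3 module. The customer and purchase tables, their columns and the sample rows are all made up for this example; they only show what a normalized two-table schema and a SQL join with aggregation might look like.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a small normalized schema (hypothetical tables)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        -- Customer names live in one place only; purchases refer to them by key.
        CREATE TABLE customer (
            id INTEGER PRIMARY KEY,
            name TEXT NOT NULL
        );
        CREATE TABLE purchase (
            id INTEGER PRIMARY KEY,
            customer_id INTEGER NOT NULL REFERENCES customer(id),
            amount REAL NOT NULL
        );
        INSERT INTO customer (id, name) VALUES (1, 'Ada'), (2, 'Grace');
        INSERT INTO purchase (customer_id, amount) VALUES (1, 10.0), (1, 5.0), (2, 7.5);
        """
    )
    return conn


def total_per_customer(conn):
    """Join the two tables and sum purchase amounts per customer."""
    query = """
        SELECT c.name, SUM(p.amount)
        FROM customer AS c
        JOIN purchase AS p ON p.customer_id = c.id
        GROUP BY c.id, c.name
        ORDER BY c.name
    """
    return conn.execute(query).fetchall()


conn = build_demo_db()
totals = total_per_customer(conn)
print(totals)
```

Keeping the customer name out of the purchase table is the point of normalization here: renaming a customer touches one row, and the join reassembles the combined view on demand.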
Processing large data in parallel: MapReduce, Hadoop (HDFS + YARN + MapReduce), Spark.
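To make the MapReduce idea concrete, here is a single-process sketch of the classic word count in plain Python. It only imitates the map, shuffle and reduce stages in memory; it does not use Hadoop or Spark, and the function names (map_phase, shuffle, reduce_phase, word_count) are chosen for this illustration only.

```python
from collections import defaultdict


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine the values of one key into a single result."""
    return key, sum(values)


def word_count(documents):
    """Run the three stages in sequence over a list of documents."""
    mapped = [pair for doc in documents for pair in map_phase(doc)]
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


counts = word_count(["spark and hadoop", "hadoop and hdfs"])
print(counts)
```

In a real cluster the map and reduce calls would run on different machines and the shuffle would move data over the network; the sketch keeps only the data flow between the stages.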
Cloud providers (AWS, GCP, Azure). New data solution providers (Snowflake, Databricks).
Decoupling storage and compute. Data warehouses and DW architectures. OLAP and OLTP.
Mergers and acquisitions, venture financing and forks.
Extra video: The Ancient Art of Data Management (2023), by a DuckDB co-founder.