Runaway complexity in Big Data And a plan to stop it Nathan Marz - PowerPoint PPT Presentation
Runaway complexity in Big Data And a plan to stop it Nathan Marz @nathanmarz 1 Agenda Common sources of complexity in data systems Design for a fundamentally better data system What is a data system? A system that manages the storage
Batch views are optimized for the queries they serve
Batch views • Batch-writable from MapReduce • Fast random reads • Examples: ElephantDB, Voldemort
Batch view database No random writes required!
Properties All Batch data view Function ElephantDB is only a few thousand lines of code Simple
Properties All Batch data view Function Scalable
Properties All Batch data view Function Highly available
Properties All Batch data view Function Can be heavily optimized (b/c no random writes)
Properties All Batch data view Function Normalized
Recommend
More recommend
Explore More Topics
Stay informed with curated content and fresh updates.