Your Stream is Only Half the Story: Data Requires Context (in Real Time) Struggling to […]
For Data Engineers
Latest technology news for the
future of real-time data engineering.
Star Schema Benchmark: How to Model Multidimensional Data in FeatureBase
Column-oriented databases are no longer enough to power today’s massive analytical workloads at the speed […]
2022 State of Data Practice Report
Introduction At Molecula, we wanted to better understand how data practitioners are handling rapid […]
The 4 Key Requirements of Real-Time Analytics: Latency, Fresh Data, Throughput, and Concurrency
Despite advances in big data and analytics, organizations still struggle with the complexity currently […]
How to Implement Artificial Intelligence: An Introductory Overview
A Crash Course in Geek Speak “Companies like calling their technologies AI. It sounds […]
Market Segmentation at ANY Scale: 350 Million Customer Records in Milliseconds
Customer data footprint reduced from 700GB to <70GB. 350 million records queried in 9 milliseconds. […]
Modeling Data In Molecula FeatureBase to Improve Efficiencies
FeatureBase is optimized for both analytical workloads and statistical computation. This multi-part optimization requires […]
Real-World Results: How to Reduce Queries from 8 Hours to 8 Seconds
How a large manufacturer cut preaggregation to power real-time customer segmentation A 2020 […]
The Data Engineer’s Social Dilemma
What do you do when someone asks, “What do you do?” Whether you love […]
What is Data Aggregation? (And How Can It Be Avoided)
Data Aggregation: An Overview Data aggregation is the process whereby raw data is gathered […]