LinuxCon + CloudOpen + ELC-E Europe 2014 has ended
To Learn More: LinuxCon Europe | CloudOpen Europe | Embedded Linux Conference Europe.

Attendees! Please provide us feedback on the sessions you attend! Click here to submit a brief survey for each session and win a $250 Amazon gift certificate. 

>> Tracing Summit: View the Full Schedule
Back To Schedule
Monday, October 13 • 2:30pm - 3:20pm
Reducing Cost in Big Data Using Statistics & In-Memory Technology - Praveen Rachabattuni, Sigmoid Analytics

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

The world is shifting from private dedicated data center to on-demand compute on the cloud. This shift moves the onus of cost from the hands of IT to the developers. As your data sizes start to rise the computing cost grows linearly with it. In this talk I will show how improving computation speed using Statistical techniques & in-memory technology Apache Spark helped us cut down a customers cost from $1000/TB down to $100/TB on the cloud. I will also show a hands on demo of how to several statistical techniques like HyperLogLog, CountMinSketch & Bloom filters can be applied to solve everyday problems & save as much as 10x in terms of cost & machines on your existing workloads.


Praveen Rachabattuni

Praveen Rachabattuni is a technical team lead at Sigmoid Analytics. His areas of expertise include Real Time Big Data Analytics using open source technologies like Apache Spark, Shark and Pig on Spark. He is currently working with Apache Pig team in contributing Pig on Spark. Has... Read More →

Monday October 13, 2014 2:30pm - 3:20pm CEST
Room 18

Attendees (0)