To Learn More: LinuxCon Europe | CloudOpen Europe | Embedded Linux Conference Europe.


Monday, October 13 • 2:30pm - 3:20pm
Reducing Cost in Big Data Using Statistics & In-Memory Technology - Praveen Rachabattuni, Sigmoid Analytics


The world is shifting from private, dedicated data centers to on-demand compute in the cloud. This shift moves the onus of cost from the hands of IT to the developers. As data sizes rise, computing cost grows linearly with them. In this talk I will show how improving computation speed with statistical techniques and the in-memory technology of Apache Spark helped us cut a customer's cost from $1000/TB to $100/TB on the cloud. I will also give a hands-on demo of how several statistical techniques, such as HyperLogLog, CountMinSketch and Bloom filters, can be applied to solve everyday problems and save as much as 10x in cost and machines on your existing workloads.
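To make two of the techniques named in the abstract concrete, here is a minimal Python sketch of a Bloom filter (approximate set membership) and a Count-Min Sketch (approximate frequency counts). It is not the speaker's code or Spark's implementation. The class names, the `_hashes` helper and the SHA-256 double-hashing scheme are assumptions chosen for illustration. The Bloom filter sizing uses the standard textbook formulas.

```python
import hashlib
import math


def _hashes(item: str, k: int, m: int):
    """Yield k bucket indexes in [0, m) by double hashing one SHA-256 digest.

    Hypothetical helper for this sketch; real libraries use faster hashes.
    """
    digest = hashlib.sha256(item.encode("utf-8")).digest()
    h1 = int.from_bytes(digest[:8], "big")
    h2 = int.from_bytes(digest[8:16], "big") | 1  # force a non-zero step
    for i in range(k):
        yield (h1 + i * h2) % m


class BloomFilter:
    """Approximate set membership: no false negatives, some false positives."""

    def __init__(self, expected_items: int, false_positive_rate: float):
        # Standard sizing: m = -n ln(p) / (ln 2)^2 bits, k = (m / n) ln 2 hashes.
        n, p = expected_items, false_positive_rate
        self.m = max(1, math.ceil(-n * math.log(p) / math.log(2) ** 2))
        self.k = max(1, round(self.m / n * math.log(2)))
        self.bits = bytearray((self.m + 7) // 8)

    def add(self, item: str) -> None:
        for idx in _hashes(item, self.k, self.m):
            self.bits[idx // 8] |= 1 << (idx % 8)

    def __contains__(self, item: str) -> bool:
        return all(
            self.bits[idx // 8] & (1 << (idx % 8))
            for idx in _hashes(item, self.k, self.m)
        )


class CountMinSketch:
    """Approximate counts in fixed memory: estimates never undercount."""

    def __init__(self, width: int, depth: int):
        self.width, self.depth = width, depth
        self.rows = [[0] * width for _ in range(depth)]

    def add(self, item: str, count: int = 1) -> None:
        # One counter per row is incremented for each item.
        for row, idx in zip(self.rows, _hashes(item, self.depth, self.width)):
            row[idx] += count

    def estimate(self, item: str) -> int:
        # Collisions only inflate counters, so the minimum is the best guess.
        return min(
            row[idx]
            for row, idx in zip(self.rows, _hashes(item, self.depth, self.width))
        )


if __name__ == "__main__":
    seen = BloomFilter(expected_items=10_000, false_positive_rate=0.01)
    seen.add("user-42")
    print("user-42" in seen)  # an added item is always reported as present

    hits = CountMinSketch(width=2_000, depth=5)
    for page in ["/home", "/home", "/about"]:
        hits.add(page)
    print(hits.estimate("/home"))  # at least 2; may overcount on collisions
```

Both structures trade a small, bounded error for memory that stays fixed as the data grows. That trade is the general reason such sketches can reduce the machines needed for counting and deduplication workloads.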

Speakers
Praveen Rachabattuni

Praveen Rachabattuni is a technical team lead at Sigmoid Analytics. His areas of expertise include real-time big data analytics using open source technologies such as Apache Spark, Shark, and Pig on Spark. He is currently working with the Apache Pig team, contributing to Pig on Spark. He has worked on building JSON APIs that expose Spark task data for consumption by custom dashboards or tools. Sigmoid Analytics has worked with over 25 customers in the big data space...


Monday October 13, 2014 2:30pm - 3:20pm
Room 18
