Over the last five years, the amount of data in our data centers has exploded, and today there are numerous NoSQL, streaming, and batch systems that promise to scale with your data. These new technologies bring with them a difficult question that needs to be answered: which tool is the best fit for my use case? This session looks at antipatterns in tools such as Hadoop, Spark, Cassandra, and Kafka and discusses picking the wrong tool for the job, how misconfiguration can counter your attempts to scale, and how even simple operations such as counts can break. Attendees will walk away from the session with a solid grasp of the strengths and weaknesses of various big data tools and will have learned which pitfalls to avoid when working with these tools.
Alex Holmes, Software Engineer, Alex Holmes
Alex Holmes is a software engineer, author, speaker and blogger. He has worked on distributed systems since the mid 90’s, and has specialized in building large-scale projects using tools such as Hadoop, Cassandra and Kafka to solve tough big data problems. He is the author of “Hadoop in Practice”, a book published by Manning Publications (http://www.manning.com/holmes2/). He has presented multiple times at JavaOne, and is a JavaOne Rock Star.