Here we go again with another TechBits, this time with the news, what trended on my timeline and what I'm currently reading.


Principles of Reactive Programming @ Coursera finished

Had a good time doing the Principles of Reactive Programming. Managed to score the 20/20 on the last exercise after six days of trying/refactoring/redoing. Learnt conversation Patterns, the management of conversational state between actors and patterns for flow control, routing of messages to pools of actors for resilience or load balancing and acknowledgement of reception to achieve reliable delivery.

edX Introduction to Big Data with Apache Spark started

So I've enrolled in another course, the BerkeleyX: CS100.1x Introduction to Big Data with Apache Spark. I'm somewhat familiar to Spark using Scala and did a few things in my spare time (see my Test Lab here), but the course if free and the topics are quite nice:

  • Learn how to use Apache Spark to perform data analysis
  • How to use parallel programming to explore data sets
  • Apply Log Mining, Textual Entity Recognition and Collaborative Filtering to real world data questions
  • Prepare for the Spark Certified Developer exam

The environment is provided as a Vagrant file and the grader only work against python (the whole course exercises are given in Python).

Currently Reading and Watching

Safari Books Online has proven a reliable source for study and reading, three months later and I'm addicted to it.

  • Mesos: An SDK for Distributed Systems Developers The shift to the cloud is old news. Unfortunately, the pain of developing distributed architectures is not. Apache Mesos handles the hard parts of building distributed systems and lets developers focus on what makes their application special. In this workshop, we will illustrate how to write applications on Mesos by walking through the implementation of an example framework.

  • API Ecosystem with Scala, Scalatra, and Swagger at Netflix At Netflix Engineering's Partner Product Innovation group, we underwent a revamp of the tech stack to make it API-driven. This was to not only help with the expanding list of API consumers, but also to address the evolving streaming business. With Scala, Scalatra, and Swagger, we achieved one of the best architecture for the scale, agility and robustness needed.

  • The Just Enough Math series with Paco Nathan and Allen Day is amazing! Advanced math for business people: "€œjust enough mathâ€" to take advantage of new classes of open source frameworks. Many take college math up to calculus, but never learn how to approach sparse matrices, complex graphs, or supply chain optimizations. This tutorial ties these pieces together into a conceptual whole, with use cases and simple Python code, as a new approach to computational thinking.

  • Statistics Without the Agonizing Pain by John Rauser is a must to see, there are two essential skills for the data scientist: engineering and statistics. A great many data scientists are very strong engineers but feel like impostors when it comes to statistics. In this talk John will argue that the ability to program a computer gives you special access to the deepest and most fundamental ideas in statistics.

  • Zero to Cloud with @NetflixOSS is a three hours workshop to get you working with NetflixOSS tooling in Java.

  • Living Clojure by Carin Meier, I'm still reading it and trying my first steps with a clojure rest API

  • Lean Enteprise by Barry O'Reilly, Jez Humble, Joanne Molesky, is a full fledge great management and administration book, full of quotes from Druker, Jack Welch and all is a practical guide which presents Lean and Agile principles and patterns to help you move fast at scale - not limited to your development team.

Developement and Architecture

Process and People

That's it for today, I hope you all enjoy the reading and keep it posted for more news by following me