Most interesting links of July ’14

Recommended Readings Video: The Unreasonable Effectiveness of Dynamic Typing for Practical Programs – a static-typing zealot turned friend of dynamic typing under the experience of real-world projects and problems shares thoughts about the limits of type systems (f.ex. both energy and torque are measured in N*m yet cannot be combined) and their cost: according toContinue reading “Most interesting links of July ’14”

Most interesting links of September ’13

Recommended Readings Stuff The Internet Says On Scalability For September 13, 2013 – a collection of interesting performance related articles with summaries (via @_dagi) Can you copy a culture? The NUMMI story (audio/transcript) – how the GM factory with the worst workforce has been turned around via a good application of Toyota Production System –Continue reading “Most interesting links of September ’13”

Most interesting links of August ’13

Sorry folks, this month it will be very brief. I have many more great stuff in the queue but haven’t managed to write it down yet. Next month will be heavy 🙂 Recommended Readings Interested in native vs. webapp? Check out Why mobile web apps are slow (mobile browser much slower, not much real improvements,Continue reading “Most interesting links of August ’13”

Most interesting links of June ’13

Recommended Readings Agile, process, SW dev, people etc. Real Options—a Mindset – an intro into the Real Options approach, which has been quite a hot topic and a transformational way of thinking for a number of inspiring people (Dan North, Liz Keogh etc.). “Real Options help us to better make decisions and commitments with three simpleContinue reading “Most interesting links of June ’13”

Most interesting links of April ’13

Recommended Readings The top top article How To Survive a Ground-Up Rewrite Without Losing Your Sanity (recommended by Kent Beck) – sometimes you need to actually rewrite an important part of a system; here we learn about two such rewrites, one which went well and one that failed badly – and what are the importantContinue reading “Most interesting links of April ’13”

Most interesting links of January ’13

Recommended Readings Various Dustin Marx: Significant Software Development Developments of 2012 – Groovy 2.0 with static typing, rise of Git[Hub], NoSQL, mobile development (iOS etc.), Scala and Typesafe stack 2.0, big data, HTML5, security (Java issues etc.), cloud, DevOps. 20 Kick-ass programming quotes – including Bill Gates’ “Measuring programming progress by lines of code isContinue reading “Most interesting links of January ’13”

Most interesting links of December ’12

Recommended Readings Software development Kent Beck: When Worse Is Better: Incrementally Escaping Local Maxima – Kent reintroduces his Sprinting Centipede strategy (“reduce the cost of each change as much as possible so as to enable small changes to be chained together nearly continuously” => “From the outside it is clear that big changes are happening,Continue reading “Most interesting links of December ’12”

Note: Loading Tab-Separated Data In Cascalog

To load all fields from a tab-separated text file in Cascalog we need to use the generic hfs-tap and specify the “scheme” (notice that loading all fields and expecting tab as the separator is the default behavior of TextDelimited): With a custom separator and fields: Hadoop doesn’t manage to load data files from nested sub-directoriesContinue reading “Note: Loading Tab-Separated Data In Cascalog”

Enabling JMX Monitoring for Hadoop And Hive

Hadoop’s NameNode and JobTracker expose interesting metrics and statistics over the JMX. Hive seems not to expose anything intersting but it still might be useful to monitor its JVM or do simpler profiling/sampling on it. Let’s see how to enable JMX and how to access it securely, over SSH.