This post will help you get started using Apache Spark DataFrames with Scala on the MapR Sandbox. The new Spark DataFrames API is designed to make big data processing on tabular data easier. A Spark DataFrame is a distributed collection of data organized into named columns that provides operations to filter, group, or compute aggregates, and can be used with Spark SQL.
on Jun 28, 2015
Redis is good for prototyping, shared memory, messaging, caching and maximum performance. It might be used orthogonally and/or complementary to your SQL relational store, and/or NoSQL document store. For example, Redis might be used to cache dimensional aggregates of relational data for analytical purposes.
It's use-case as "persistent shared memory" is important especially for microservices and...
on Dec 31, 2014
I introduced myself to Konstantin Shvachko after hearing him speak in the Duke's Choice Awards BOF session at JavaOne 2012 (Hadoop was a 2012 award winner). Konstantin happened to be sitting right next to me in the audience, so before we exited the session, I asked him if we could arrange a chat before JavaOne ended.
You've probably heard of Hadoop, but a great many likely don't know much...
on Dec 16, 2012
Do You Want to Get This Message?
by Mark Fisher, Jonas Partner, Marius Bogoevici, and Iwein Fuld, authors of Spring Integration in Action
Spring Integration allows you to selectively process messages and define alternative routes within the system. In this article, based on chapter 6 of Spring Integration in Action, you'll see how you can limit the scope of what your components will handle by...
on Oct 19, 2012
Imagine if, for example, the hypervisors that run EC2 were compromised - imagine almost every business you deal with online compromised, all at once. This is the never-talked-about problem with the cloud - over on my timboudreau.com blog.
on Mar 8, 2015
In this post we will be looking at code for a system designed to integrate all of the devices used to provide surveillance and security to extensive physical premises such as malls, campuses, and industrial parks. The approach I am taking involves the actor paradigm and the Java programming language. The selection of actors for this type of application is based on a number of...
on Nov 30, 2013
Exciting information technologies emerge from new discoveries and re-emerge from past discoveries at a rapid rate. Despite the glamor and curiosity appeal engendered by cutting edge technologies when they debut, after the hype fades and we look at what actually is there in the cold light of reason we find ourselves turning to our pragmatic side and asking:
“Ok, but what advantages do we...
on Oct 15, 2013
I've just finished tuning a client's application where one of the items on the table was to find the source of calls to System.gc(). Using Censum made easy work of understanding the source of the calls. The team I was working with not only missed that these calls to System.gc() where creating havoc with their end users experience, they didn't even realize that something, some where was messing...
on Jan 27, 2015
Web Development Tools
Bunyan is a NodeJS library that rethinks logging in some really useful ways. I wrote a Java port you can use in your applications.
In particular, with Bunyan, logs are JSON - and Bunyan comes with a great filtering and analysis tool.
The Java port uses some innovative techniques to make logging simple and foolproof - in particular, a use of AutoCloseable to make a logging code simple and...
on Jan 18, 2015
While parts of the cloud IT that are still being hyped are changing rapidly, the whole segment is not past the hype yet. The cloud report  that Gartner published 9 months ago still holds true today.
Reports of disillusioned customers of such solutions  are a clear sign that the hype might have reached its peak for most parts of cloud offerings, though. And while many articles discuss the...
on Mar 30, 2014
This post introduces a new Java actor based open source sub-project of the project "Learning Actors in Java". This work will develop a premises guardian system. A premises guardian system is a distributed application whose purpose is to support the physical protection of premises such as office buildings, campuses, apartment complexes, shopping malls, etc. Protecting information...
on Nov 20, 2013
Remote systems cannot be ignored. They are the heart of actor programming. The ability of an actor application to smoothly scale out geographically is a tremendous advantage in meeting the challenges of growth. Today we will learn the basics of setting up for remote interaction between actors. If you wish to set up to try coding a set of remote actor systems and do not as yet have a...
on Nov 10, 2013
Configuration of an actor system affects the way in which the Akka actor libraries and microkernel interact with actors. The visibility of a configuration is limited to the specific actor system with which the configuration has been associated. The effective configuration for any given actor system is a merger of three sources. In Fig. 1 below the precedence of...
on Oct 30, 2013
This week I am presenting a coding of a simple two actor application designed to show actor creation, message passing, and actor termination. This application is called the "Fortune Cookie Application". It is the next state of the art beyond the HelloWorld program. Despite its simplicity this application shows the elements that every actor application uses.
What We Will Be...
on Oct 22, 2013
What happens if you configure a Hadoop-centric scalable big data infrastructure running on the Amazon EC2 cloud, that's programmed to perform some very basic processing on delimited ASCII data records (about 30 characters in each record), and you feed it a single data record? How quickly do you think that massively powerful system would process that teensy, trivial input data set? Milliseconds?...
on Sep 26, 2013
Here are my slides on my "Remoting Retrospective" presentation at JavaOne 2012:
Remoting Retrospective (pdf)
on Oct 26, 2012
The applications have grown together with the man dream to get more and more information, then more application have been emerging and the man had became hostage of the application.
For example we can recognized the evolution between man and machine:
In the first time: Many men to just one machine ( like mainframe's age) then one man to one machine ( in to Personal computer era) and...
on Mar 17, 2013
The Foundations of Mobile First Design
by Matthew Carver, author of The Responsive Web
In this article from The Responsive Web, author Matthew Carver opens up a discussion of how to build a site for a mobile or small-screen device using a graphic design program, such as Photoshop.
Building a site in its smallest iteration first affords the designer a lot of opportunities. By starting in a...
on Nov 14, 2012
From 0 to First Hit with Grails Support
by Glen Smith and Peter Ledbrook, authors of Grails in Action, Second Edition
Grails is a next-generation Java web development framework that generates great developer productivity gains through the confluence of a dynamic language, a Convention over Configuration philosophy, powerfully pragmatic supporting tools, and an agile perspective drawn from the...
on Oct 25, 2012
Nowadays there are too many NOSQL database, with different architecture and data structure. However with much variety they share one thing in common: Only search information from the keys. A good option is uses other service to help the NOSQL databases. In this post will how the Lucene does work together a NOSQL database, join two world in one application.
on Oct 13, 2012