This post will help you get started using Apache Spark DataFrames with Scala on the MapR Sandbox. The new Spark DataFrames API is designed to make big data processing on tabular data easier. A Spark DataFrame is a distributed collection of data organized into named columns that provides operations to filter, group, or compute aggregates, and can be used with Spark SQL.
This tutorial will help you get started with Standalone Spark applications on the MapR Sandbox.
Recommendation engines help narrow your choices to those that best meet your particular needs. In this post, we’re going to take a closer look at how all the different components of a recommendation engine work together. We’re going to use collaborative filtering on movie ratings data to recommend movies.
SQL will become one of the most prolific use cases in the Hadoop ecosystem, according to Forrester Research. Apache Drill is an open source SQL query engine for big data exploration. REST services and clients have emerged as popular technologies on the Internet. Apache HBase is a hugely popular Hadoop NoSQL database.
Backbone.js gives structure to web applications by providing models with key-value binding and custom events, collections with a rich API of enumer
What is FindBugs?
This Pet Catalog app explains a web application that
uses Wicket, JPA, GlassFish and MySQL.
Number 3 in the Top 10 most critical web application security vulnerabilities identified by the Open Web Application Security
OWASP Top 10 number 2: Injection Flaws
This and the next series of blog entries will highlight the
10 most critical web application security vulnerabilities
identified by the Open
Web Application Security Project (OWASP).