Big Data TechCon | April 26-28, 2015 | Boston, MA


FalconStor brings data services to OpenStack Cinder

FalconStor today announced the forthcoming availability of new integrations with OpenStack’s Cinder project. This new software will allow the FalconStor FreeStor intelligent software-defined storage platform to be plugged into the Cinder block storage pool virtualization system.

Oct 26, 2015 1:26:49 PM

Topics: FalconStor

Arun Murthy discusses the future of Hadoop

Founder of Hortonworks (and Big Data TechCon keynote speaker) talks about the future of Hadoop

Arun Murthy is a busy fellow. When he’s not acting as architect at Hortonworks, the Hadoop company he founded, he’s flying around the world giving keynote addresses. This is quite a long way from where he was 10 years ago, working on Hadoop inside Yahoo.

Oct 19, 2015 2:06:21 PM

Topics: Big Data TechCon, YARN, Big Data, Container Tech, Docker

Seven best practices for Big Data management your business will benefit from


Oct 1, 2015 3:22:15 PM

Topics: Big Data

Cask extends Big Data app development platform

Cask Software, creator of an application development platform for Big Data, has updated the platform and expanded beyond Hadoop through a new partnership with Cassandra company DataStax.

In a blog post, CEO Jonathan Gray discussed Cask Hydrator, a new capability built into version 3.2 of the CDAP app platform that enables data ingestion and ETL from a wide variety of sources. He also noted a new integration with Cassandra that takes the platform past its roots in Hadoop, as delivered via partnerships with Cloudera, Hortonworks and MapR.


“As the first example, Cask Hydrator is implemented as an application template for batch and real-time ETL,” wrote Gray. “It defines plug-in APIs for source, transform and sink. You can create instances of an ETL pipeline through JSON configuration. New sources, transforms and sinks can be easily developed as plug-ins in Java.”

The application templates, Gray wrote, “extend the dataset concept of individual data patterns to complete application patterns. Application Templates are based on the concepts of Applications and Plugins. An application can contain any number of programs like Spark, MapReduce, etc., and those programs can define and reference the API of a plug-in.”
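The source, transform and sink pattern Gray describes can be illustrated with a minimal sketch. This is not CDAP's API (Hydrator plug-ins are written in Java, and its real configuration schema is not shown in the post); the registry names, the plug-in functions and the JSON layout below are all hypothetical, chosen only to show how a pipeline instance might be assembled from a JSON configuration.

```python
import json


def list_source(props):
    """Hypothetical source plug-in: yields the records listed in its config."""
    yield from props["records"]


def uppercase_transform(props):
    """Hypothetical transform plug-in: upper-cases one named field."""
    field = props["field"]
    return lambda record: {**record, field: record[field].upper()}


def collecting_sink(props):
    """Hypothetical sink plug-in: returns a list plus a writer that fills it."""
    out = []
    return out, out.append


# Plug-in registries keyed by the names used in the JSON configuration.
# These names are assumptions for illustration, not CDAP identifiers.
SOURCES = {"list": list_source}
TRANSFORMS = {"uppercase": uppercase_transform}
SINKS = {"collect": collecting_sink}


def run_pipeline(config_json):
    """Build a source -> transforms -> sink pipeline from JSON and run it."""
    config = json.loads(config_json)
    source = SOURCES[config["source"]["name"]](config["source"]["properties"])
    transforms = [
        TRANSFORMS[t["name"]](t["properties"]) for t in config["transforms"]
    ]
    out, write = SINKS[config["sink"]["name"]](config["sink"]["properties"])
    for record in source:
        for transform in transforms:
            record = transform(record)
        write(record)
    return out


# Hypothetical configuration layout; the real Hydrator schema may differ.
CONFIG = json.dumps({
    "source": {
        "name": "list",
        "properties": {"records": [{"city": "Boston"}, {"city": "Chicago"}]},
    },
    "transforms": [{"name": "uppercase", "properties": {"field": "city"}}],
    "sink": {"name": "collect", "properties": {}},
})

result = run_pipeline(CONFIG)
```

In this sketch, adding a new source, transform or sink means registering one more function; the pipeline runner and existing configurations stay unchanged.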

In a news release announcing the DataStax partnership, Cask wrote: “Moving forward, the CDAP road map will support rapid development of real-time data applications on DataStax Enterprise… The first phase includes CDAP’s direct support for Cassandra Datasets, providing the usability of CDAP Dataset libraries for Cassandra users and the flexibility for CDAP applications to run against both Apache HBase and Apache Cassandra. The second phase includes integration of Cassandra with CDAP’s open-source transaction engine, Tephra. This will provide scale-out, fault-tolerant, high-throughput transactions on Cassandra and will allow any application developed on CDAP for HBase to be run on Cassandra without changing any code.”

“By extending our platform to integrate with Cassandra, we will enable a broader set of use cases and allow our customers to have more choices,” Gray said in the statement. “Our solution will also bridge the critical gap of governing and operationalizing data between Cassandra and Hadoop.”

Sep 28, 2015 3:23:00 PM

Topics: Big Data, hadoop, Cask, app development

MetaDapper: Data Mapping and Conversion Made Easy With the Right Tools

Data conversion, translation, and mapping are by no means rocket science, but they are by all means tedious. Even a simple data conversion task (e.g., reading a CSV file into a list of class instances) can require a non-trivial amount of code. While all of these tasks share much in common, they are all “just different enough” to require their own data conversion methods.
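As a rough illustration of the CSV example mentioned above, here is a minimal hand-written sketch in Python. It does not use MetaDapper; the `Person` class, its fields and the sample data are assumptions made up for this example. Even in this small case, each column has to be mapped and cast by hand, which is the kind of per-task code the article is referring to.

```python
import csv
import io
from dataclasses import dataclass


@dataclass
class Person:
    # Hypothetical record type; field names are assumptions for illustration.
    name: str
    age: int


def read_people(handle):
    """Convert each CSV row into a Person, casting age from text to int."""
    reader = csv.DictReader(handle)
    return [Person(name=row["name"], age=int(row["age"])) for row in reader]


# In-memory sample data standing in for a real CSV file.
sample = io.StringIO("name,age\nAda,36\nGrace,45\n")
people = read_people(sample)
```

A different file layout or target class would need its own mapping function, which is why such tasks tend to be repetitive but never quite identical.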

Sep 22, 2015 11:35:33 AM

Topics: databases

A Deep Learning Tutorial: From Perceptrons to Deep Networks

In recent years, there’s been a resurgence in the field of Artificial Intelligence. It’s spread beyond the academic world with major players like Google, Microsoft, and Facebook creating their own research teams and making some impressive acquisitions.

Sep 2, 2015 8:58:03 AM

Topics: machine learning

Hortonworks Dives into The World of IoT Big Data


Aug 25, 2015 3:36:29 PM

Topics: Big Data, hadoop

Big Data TechCon Announces Keynote Speaker for Chicago Conference

Arun Murthy, Co-founder and Architect at Hortonworks, to Keynote on the Future of Hadoop

Big Data TechCon today announced the keynote speaker for its November 2-4 conference in Chicago. Hortonworks’ founder and architect, Arun Murthy, will give the keynote address to attendees at the show on November 3.

Aug 19, 2015 11:20:01 AM

Topics: Big Data TechCon

Datapipe and DataStax team up, bring data analytics to the enterprise


Aug 18, 2015 1:32:51 PM

Topics: Big Data

Choosing a Database Security Solution – 5 Things You Must Have

How you choose to secure your database is one of the most important decisions a business owner makes, second perhaps only to the choice of database itself. A robust database security system provides five important elements, which are discussed in the following paragraphs.

Jul 28, 2015 9:05:26 AM

Topics: databases