Details for Government Employees
Big Data TechCon
is sponsored by:


OCTOBER 20, 2014

3 Questions: Gloria Lau Discusses Data Science

Gloria Lau recently moved to Timeful, where she leads a team to thread together data on time management – a project Lau calls “a hugely challenging and important data problem that I am passionate about.” She’s also a consulting faculty at Stanford – still trying to crack legal informatics – and is keynote speaker at Big Data TechCon on Tuesday, October 28. Between it all, she found time to answer three questions for Actuate.”...READ MORE

OCTOBER 16, 2014

Microsoft to add Docker containers to Windows Server

Yesterday, Microsoft announced it would be joining the ranks of Apache, Google, IBM, and Rackspace by building Docker container applications for Windows Server. Microsoft expects to deploy the container technology in a Windows Server update by mid 2015.”...READ MORE

OCTOBER 16, 2014

Public Data Sets Collection: Health

One of the promises of Big Data is that it can be used to improve people’s lives. Improving public health may be one of the most important pillars of this perspective. With these data sets, we can see how often people visit their doctor, how and why diseases spread, and the effects of treatment on a level once only imaginable.”...READ MORE

OCTOBER 8, 2014

Public Data Sets Collection: Urban Life

Life in the “Big City” means Big Data. Cities offer a plethora of insights into human behavior and developing socioeconomic patterns. Here’s a collection of public data sets that explore the different facets of urban life. ”...READ MORE

OCTOBER 7, 2014

Concurrent, Inc. to Present at Upcoming Big Data and Developer Industry Events

Supreet Oberoi, vice president of field engineering, will deliver two sessions at Big Data TechCon, taking place Oct. 27-29 in Burlingame, Calif. At Big Data TechCon, Supreet will deliver a talk on how organizations can increase Hadoop utilization with their data warehouses using Cascading Lingual, an open source project that allows users to utilize existing SQL skills to instantly create and run applications on Hadoop. ”...READ MORE

OCTOBER 2, 2014

Big Data can Create Big Headaches

Data is the big thing in business these days. Big Data. Information that is gathered by the organization, retrieved from a Web service, or collected from social media can create big problems regarding the way your applications perform. More and more enterprises today are looking to leverage the cloud for all the benefits it brings. But these enterprises fall into three buckets, according to Don Tirsell, vice president of worldwide technical alliances at Informatica. ”...READ MORE

SEPTEMBER 29, 2014

Apache Storm has been upgraded to a Top-Level Project -

The Apache Storm project has graduated to a Top-Level Project (TLP), according to the Apache Software Foundation. Storm is an open-source high-performance distributed real-time computation framework for Big Data stream processing. Hadoop clusters and other queuing and database technologies can leverage Storm to process a full range of workloads, from real-time to interactive to batch. The system can be used with any programming language. ”...READ MORE

September 9, 2014

Digital Reasoning announces latest version of Big Data machine-learning platform for enterprise clouds

Synthesys uses natural-language processing along with entity and fact extraction to process structured and unstructured data from enterprise cloud platforms. The Big Data analytics platform then employs its machine-learning algorithms to organize and categorize the enterprise data about customers, employees, and business assets into a knowledge graph and API for analysis and delivery. ”...READ MORE

August 25, 2014

Guest View | Containers: The Enabler of YARN

In the emerging model of the datacenter as a computer, several projects, including YARN, Mesos and more recently Kubernetes, are undertaking the effort of building an operating system for this “new computer.” These new operating systems in turn need the equivalent of the early multiuser time sharing systems to support a multitenant environment of diverse application and user ecosystems over the distributed resources of a datacenter ”...READ MORE

August 25, 2014

Mesa Brings Scalable Data Warehousing to the Next Plateau

In a recently released white paper, Google revealed their new data warehouse system, Mesa. With the ability to hold petabytes of data, field trillions of queries, and update millions of rows of data in a second, Mesa, Google’s near real-time scalable data warehousing solution, promises high availability, reliability, fault tolerance, and scalability for their Internet advertising business. ”...READ MORE

August 25, 2014

Big Data TechCon Welcomes LinkedIn to Technical Program

BZ Media LLC today announced its opening keynote at Big Data TechCon, the how-to technical conference for IT professionals implementing Big Data. Gloria Lau, manager of data science at LinkedIn, will keynote on “Building Data Products: The Right Order of Things.”...READ MORE

August 20, 2014

Espresso Logic brings NoSQL and SQL data together into a single API

One of the biggest problems developers face when building data-driven apps is having to access data from multiple data sources, according to R. Paul Singh, CEO of Espresso Logic. “What we are seeing and hearing is a lot of customers’ data isn’t in SQL only; they also have it in NoSQL databases,” he said. “Having to access and combine data from multiple sources isn’t easy and takes a significant amount of time.””...READ MORE

August 15, 2014

Data Visualization: A Quest for Accuracy

A picture is worth a thousand words, and an accurate bar graph can paint a very clear picture of a data set. Naomi Robbins adheres to that philosophy with a particular emphasis on the word “accurate.” She trains employees of corporations and organizations on the effective presentation of data, and is the author of "Creating More Effective Graphs." We conferenced with Naomi to get her perspective on the growing field of data visualization....READ MORE

August 11, 2014

Adatao plans for Big Data 2.0 to focus on the user

Big Data 1.0 is not easy, it’s not fast and it’s not for everyone, but Big Data 2.0 looks to change that, according to Christopher Nguyen, CEO and cofounder of Adatao. “We see Adatao as the bridge that builds the current Hadoop of the Big Data world, which is Big Data 1.0, to 2.0,” he said. “Just as Web 1.0 focused on technologies and capabilities of the Web, and Web 2.0 switched the focus to the user, we are experiencing the same fundamental shift with Big Data.”. ...READ MORE

August 4, 2014

YARN and the Data Center as an Operating System

A discussion of the next generation evolution of Hadoop

Like many great concepts, it takes time for industries as a whole to eventually understand them. When Dinesh Subhraveti developed the notion of Operating System level virtualization as a part of his Ph.D. from Columbia University, he was almost a decade ahead of the curve. His concept, now known as Docker containers, is a widely used component of modular data centers, or what he refers to as “the data center as an OS” model. Docker containers can also be used within a Hadoop environment, so Dinesh knows the changes to the Hadoop ecosystem since the release of version 2.2 and YARN and shared them with us. ...READ MORE

July 29, 2014

Survey: Hadoop isn’t enough for Big Data developers

The bigger Big Data gets, the more need, it seems, developers have for complex event processing outside of Hadoop. In its 2014 Big Data & Advanced Analytics Survey, market research firm Evans Data found that only 16% of developers said Hadoop batch processing was satisfactory in all use cases. Seventy-one percent of the more than 400 developers surveyed worldwide also expressed a need for real-time complex event processing more than half the time in their applications, and 27% said they use it all the time ...READ MORE

JuLY 27, 2014

Evolutionary Spark

Ben Coverston explains how Apache Spark moves beyond MapReduce

With the announcement that Cloudera, Databricks, IBM, Intel, and MapR Technologies will be backing its standardization, Apache Spark continues to gain a reputation as a reliable analytics tool for Big Data professionals. Spark was originally designed at UC Berkley as an open source data analytics framework with the goal of improving the speed of Hadoop workloads in mind. Ben Coverston validated for us that Spark fulfills its promise of speed and more....READ MORE 

July 1, 2014

MongoDB 2.8 enhancements revealed

For Eliot Horowitz, CTO and cofounder of MongoDB, it’s been all about solving database problems. From his time as cofounder of a website called ShopWiki, where he developed crawling and data extraction algorithms, he found himself spending more time customizing databases to do what he needed than working on his original application...READ MORE


June 27, 2014

Google Moves on from MapReduce with Cloud Dataflow at Google I/O

Over 10 years have passed since Google invented MapReduce, and they admitted at this year’s Google I/O Keynote that they rarely use it anymore. The replacement technology, Cloud Dataflow is a managed service for creating data pipelines that ingest, transform, and analyze massive amounts of data...READ MORE


June 24, 2014

Parse announces open-source Flashback tool at MongoDB World

MongoDB users can now record and replay traffic and requests using Flashback, an open-source tool announced by Parse at MongoDB World in New York City

Flashback is a MongoDB benchmark framework that allows developers to gauge database performance by benchmarking queries. Flashback records the real traffic to the database and replays operations with different strategies.... READ MORE


June 4, 2014

Stream-processing technology is coming to open source

A new technology is coming to open source that promises an integrated, high-quality and consistent streaming analytics application framework. The technology, codenamed “jetStream,” will allow original equipment manufacturers, independent software vendors, system integrators, service providers and developers to create Big Data analytics and streaming apps that address business use cases such as real-time analysis for spam filtering, network intrusion detection and analytics, location analytics, and social media market analysis...READ MORE


June 3, 2014

Hadoop Summit predicts big growth in future

SAN FRANCISCO — For a project that's grown like Apache Hadoop, it's not unusual to find many products and services cropping up around it. But Merv Adrian, research vice president for information management at Gartner, said in the opening keynote to this year’s Hadoop Summit that all of this growth has resulted in a confusing and nebulous space in which the term “Hadoop” increasingly means different things to different people... READ MORE

May 30, 2014

Getting a handle on Hadoop

You can think of it as an ever-inflating pink elephant. It's either got its own space in which to grow, or it'll just end up sucking all the air out of the room. It's always easier to talk about the elephant in the zoo than the elephant in the room, and Hadoop is definitely a zoo-full of complex moving parts that can cause just as much damage as an enraged bull elephant, provided we drag this metaphor into the realm of data...READ MORE


May 30, 2014

Apache Spark turns 1

The Apache Software Foundation (ASF) has announced version 1 of its open-source cluster-computing framework for Big Data analysis, Apache Spark.

"1.0 is a huge milestone for the fast-growing Spark community," said Matei Zaharia, vice president of Apache Spark. “Every contributor and user who's helped bring Spark to this point should feel proud of this release.” READ MORE

May 14, 2014

The future of databases: A chat about managing and scaling ‘agile Big Data’ in the cloud

As Big Data has gotten bigger and bigger, and businesses demand more and more out of their data, traditional database structures just don’t cut it anymore. The traditional single static repository simply isn’t equipped to handle the industry’s rapidly evolving needs...READ MORE


May 1, 2014

What the White House has to say about Big Data

Big Data has taken the technology world by storm, but what is the potential value and implications that could come from Big Data...READ MORE

April 29, 2014

MIT Bitcoin Project aims to create a cryptocurrency ecosystem

Two MIT students, first-year MBA student and Bitcoin Club founder Dan Elitzer, and sophomore Jeremy Rubin, have raised US$500,000 from MIT alumni and the Bitcoin community to launch the MIT Bitcoin Project...READ MORE

APRIL 18, 2014

Big Data Analytics: A Wide Range of Tools, Limited Resources and Market Opportunity for Cloud Solutions

The promise of analytics for telecommunications and media service providers is to enable them with the intelligence necessary to optimize customer relationships, bring costs down, improve customer experience and develop revenue streams (i.e. advertising, OTT players) using information from routers and other network elements, OSS/BSS systems and application services. On this front, service providers have much to learn from digital enterprises, such as Google, Amazon, Netflix and Facebook, which have disrupted traditional business models thanks to their understanding of customer preferences and ability to anticipate customer behavior and personalize customer experience. READ MORE


APRIL 16, 2014

Get out and Learn Something: The Value of Technical Conferences

It’s good to get out of the office and clear your head once in a while. That may mean a walk in the park at lunch, or it may mean going to a seminar or technical conference. With the latter, the benefits can far outweigh the immediate gratification you feel upon leaving the cubicle or home office behind. READ MORE


APRIL 16, 2014

Quick Wrap-up of Big Data TechCon

I’ve been fortunate in my role as an ESG Lab Analyst in that I’ve been able to be hands-on with a mix of companies that range from new, emerging startups to mature, proven companies. Specifically in the “Big Data” space, the exposure to the new products and technology has been just enough that I really want to wrap my head around everything there is to know. My ultimate goal at a high-level is to provide clarity where it’s needed - clarity in a cloudy (pun intended), confusing big data ecosystem. What do I need to fix my problem? Who does what? How do they do it? And the big question for me right now: Who really does what they say they do? READ MORE


APRIL 14, 2014

The True Chasm Yet to be Crossed for Big Data

Every vendor, customer, and channel player has a common problem with the big data market today. They don’t know how to build a complete solution. There are literally hundreds of companies with products and services being positioned as “big data platforms.” I know this is true, I have a list of each. READ MORE


APRIL 11, 2014

Step aside Facebook, graph database use cases go beyond social media

While Facebook may be the poster child for successful use of graph databases right now, social media companies aren't the only ones harnessing big data with the technology. At Big Data TechCon 2014 in Cambridge, Mass., Neo Technology Software Field Engineer Max De Marzi discussed a plethora of organizations benefiting from graph databases. READ MORE


APRIL 9, 2014

Hadoop expands data infrastructure, boosts business intelligence

The big data that companies successfully transform into usable business intelligence (BI) is just the tip of a massive data-iceberg, according to Jonathan Seidman, solutions architect at Cloudera. At Big Data Techcon 2014, Seidman hosted a session called “Extending your data infrastructure with Hadoop,” in which he explained how Hadoop could help the enterprise tap into that potential business intelligence below the water. “That data that’s getting thrown away can have a lot of value but it can be very difficult to fit that data into your data warehouse,” Seidman explained. READ MORE


APRIL 4, 2014

Software engineer builds Facebook graph search with Neo4j database

It took Max De Marzi, Neo Technology software field engineer, one weekend to build his own Facebook graph search with Neo4j. At Big Data Techcon 2014, he explained to developers how they could do the same and more importantly, why they should. With De Marzi’s tools and strategies — namely, Neo4j and Cypher — developers can build a graph search without, as De Marzi put it, querying hundreds of servers and putting in six months of work. READ MORE


March 26, 2014

Big Data TechCon Q&A on NoSQL and the Future of Big Data

Hadoop, and its open-source architecture, continues to grow as a force in the world of Big Data. Big Data TechCon speakers Jonathan Seidman and Mark Grover of Cloudera have years of practical knowledge and real-world experience on open-source projects using Hadoop and they are currently co-authoring a book on Big Data solutions utilizing the popular Apache framework. That’s why Big Data TechCon thought it was a perfect time to interview Jonathan and Mark on the NoSQL landscape and the future of Big Data. READ MORE


March 25, 2014

NoSQL Database Solutions: Which Elixir is Right for You?

The number of NoSQL database options grows daily. In a world with so much diversity, how does one choose the right database for their business? We at Big Data TechCon wanted to make the choice a little easier, so we made an infographic for a side-by-side comparison of some of the most popular solutions out there. For additional background and choices, be sure to read SD Times article "A Guide to NoSQL Offerings."


March 25, 2014

Number of NoSQL options grow

NoSQL databases continue to proliferate as the demand for Big Data solutions grows. While relational databases aren’t going away anytime soon, different data models require different types of solutions. As a result, several types of NoSQL databases have emerged, each with its own pros and cons. READ MORE


March 24, 2014

Top 5 Reasons Your Big Data Should Play Well with Others

Ariel Weisberg, lead architect at VoltDB and Big Data TechCon speaker, knows the importance of managing your data properly. He lent us five tips to take hold of your Big Data and make it play nice. READ MORE


March 24, 2014

The Dos and Don’ts of Getting Started with Big Data

Getting started in the world of Big Data can be intimidating. To help you avoid some of the pitfalls, we had Big Data TechCon speakers Mark Grover and Jonathan Seidman of Cloudera offer their expertise. READ MORE


March 21, 2014

Big Data TechCon Q&A with Alexis Roos

Alexis Roos is a senior solutions architect, focusing on Big Data solutions at Concurrent. He has over 18 years of experience in software and sales engineering, and will be teaching two courses on Cascading at Big Data TechCon. We had a chance to chat with him before the conference for some insight into the power and scalability of Cascading.  READ MORE


March 17, 2014

Big Data: Taking the guesswork out of March Madness

It’s that time of year again. This year’s the year, you tell yourself. You feel good about your bracket. You’ve done your research, picked the right upsets, and you’re confident in your Final Four picks. READ MORE


March 14, 2014

SD Times Blog: EclipseCon kicks off Monday

On Monday, EclipseCon returns to the San Francisco Bay Area after a long hiatus in Virginia. The conference tutorials begin Monday, but we caught up with Eclipse Foundation executive director Mike Milinkovich to discuss his plans for the show, and to get a preview of his keynote address.  READ MORE


March 14, 2014

Spark promotion highlights in-memory excitement

The Apache Foundation has promoted the Spark project from the Apache incubator to the status of top-level project, less than a month after the release of Spark 0.9.0, which the developers said was the largest release of the project yet..  READ MORE


March 10, 2014

Graduate-level course offered for data center system engineering

Southern Methodist University (SMU) will begin offering a master’s degree in data center systems engineering this fall. According to the university, it is the first in the United States to offer the technical depth of a graduate degree in this field.  READ MORE


February 28, 2014 

Spark advances from Apache Incubator to top-level project

Apache Spark is gaining prominence. The Apache Software Foundation (ASF) announced the open-source cluster-computing framework for Big Data analysis has graduated from the Apache Incubator to a top-level project.  READ MORE


February 25, 2014 

IBM to acquire Cloudant

IBM has reached an agreement to acquire Cloudant, a Database-as-a-Service company that stores, analyzes and distributes Web and mobile app data.  READ MORE


February 10, 2014 

Java and its role in modernizing data trapped within legacy applications

While Big Data, the cloud and mobile access grab the lion’s share of IT market attention from pundits and trade reporters, many Fortune 500 corporations and federal and state agencies still depend on legacy business applications written in COBOL, Fortran or other languages that date back more than 50 years.  READ MORE


February 20, 2014 

Big Data TechCon Update

If you are professional journalist, blogger or analyst, we cordially invite you to cover Big Data TechCon Boston, March 31 – April 2. You will be given a free, full-evet passport for access to all sessions, keynotes, special events and lunches in exchange for sharing your coverage of the event. In turn, we will cross-promote your piece across all of our social media platforms, the news section on our sites and emails when applicable. Please see our full press pass policy. We look forward to hearing from you!


February 14, 2014 

Zeichick’s Take: Five phrases you need to know about Big Data

Is Big Data good? Or is it evil?

On one hand, by providing new insights into their customers, and by allowing amalgamation of information from disparate sources, Big Data is enabling tremendous efficiencies at many companies. That’s great! Let’s keep investing. READ MORE


February 12, 2014 

SD Times Blog: Tresata shows evolution of Hadoop at Strata

The big news out of the Strata conference on Big Data is Tresata, a new company that as far as I can tell is the only one building commercial products on top of the new Apache Spark project. And that is precisely why they're the belle of this year's Big Data ball.  READ MORE


February 5, 2014 

DARPA launches public website for its open-source software projects

DARPA, the Defense Advanced Research Projects Agency, has made all its open-source software projects available in one location with the DARPA Open Catalog. READ MORE


February 5, 2014

Code Intelligence: The Big Data we should be paying attention to

Today’s networks, websites, communications, apps and complex IT infrastructures are churning out massive amounts of data, so much so that there were 2.7 billion terabytes of digital content in 2012, according to IDC. That number is predicted to skyrocket to more than 8 billion terabytes by 2015. READ MORE


January 31, 2014

Event Preview – Big Data TechCon 2014

Read why TechnologyAdvice is excited about Big Data TechCon Boston! READ MORE


January 30, 2014

Hadoop is now a general-purpose platform

Apache Hadoop adoption is accelerating among enterprises and advanced computing environments as the project, related projects, and ecosystem continue to expand. While there were valid reasons to avoid the 1.x versions, skeptics are reconsidering since Hadoop 2 (particularly the latest 2.2.0 version) provides a viable choice for a wider range of users and uses. READ MORE


January 28, 2014

Big Data TechCon Update

Do you need help getting approval for Big Data TechCon? Then check out our nine time-tested tactics that will help get the ball rolling. We also made it super convenient for you with a downloadable customized letter template and additional snippets and ideas for getting approval. You can even sweeten the deal more with supporting testimonials, photos, and videos, including a keynote by Hadoop Founder Doug Cutting. We also have plenty of opportunities for you to save more with codes and group discounts!


January 27, 2014

SD Times Blog: Five ways to advance your development career

A shell and a book: the two tools any developer needs to enhance his or her self-worth. Open a fresh shell, crack the spine on that brand new O'Reilly or No Starch book, pop up vi or Emacs, and go to town. Obviously, the equation has also changed to include the millions of websites that can teach you how to re-develop your development skills:,, or even Zed Shaw's “Learn Code the Hard Way. READ MORE


January 24, 2014

White House to tackle Big Data

Since former U.S. National Security Agency (NSA) contractor Edward Snowden brought the agency's secret spying programs to light, there has been much controversy and concern surrounding privacy.

"We have to make some important decisions about how to protect ourselves and sustain our leadership in the world while upholding the civil liberties and privacy protections our ideals and our Constitution require," said President Obama. READ MORE


January 10, 2014

SD Times Blog: MIT launches Big Data online training

The Massachusetts Institute of Technology (MIT) is bringing Big Data training around the world. Beginning in March, MIT will offer a four-week online course, “Tackling the Challenges of Big Data,” aimed at technical professionals and executives worldwide.

Big Data topics will include data collection, storage and processing, analytics, visualization, and a range of real-world applications. Experts from the MIT Computer Science and Artificial Intelligence Laboratory (CSAIL) will be teaching the course. READ MORE


January 7, 2014

Hadoop and NoSQL: Friends, not frenemies

The term Big Data is an all-encompassing phrase that has various subdivisions addressing different needs of the customers. The most common description of Big Data talks about the four V’s: Volume, Velocity, Variety and Veracity. READ MORE