Real-Time Capability of SAP HANA Meets the Scalable Performance of Apache Hadoop

Girish Juneja is CTO of Datacenter Software Division and General Manager of the Big data and Expressway software businesses. Girish has over 21 year experience in building software businesses, at Intel, as an entrepreneur and earlier at Verizon Telecommunications.  Prior to his current position, he was the Director of Application Security & Identity Products in the Software Services Division in SSG. In this role he was responsible for conceptualization, product development, sales & marketing of Intel Expressway software product line for Service Providers, ISVs, OEMs and select end-users. Girish also led the development of Identity software strategy for SSG that led to acquisition of Nordic Edge, development & launch of McAfee Identity Manager & Intel CloudSSO Identity-as-a-Service offering in collaboration with  Girish joined Intel in 2005 with the acquisition of Sarvega, a company he founded. Girish received his MBA from University of Chicago, MS Computer Science from University of Maryland, and a Bachelor’s degree in Electrical & Electronics Engineering from BITS, Pilani, India.

I am very excited to announce the newest agreement between Intel and SAP to bring the value of big data to a growing number of enterprise business analysts. By combining the security, scalability, and manageability of the Intel Distribution for Apache Hadoop with the real-time analytics capabilities of SAP HANA, Intel and SAP are delivering a tightly integrated, highly optimized solution with some unique characteristics.

Intel and SAP who have long collaborated to make SAP HANA run well on Intel Xeon processors are now collaborating on an enterprise solution that combines the big data management framework of Apache Hadoop with the real-time analytic capabilities of SAP HANA.

SAP HANA is capable of processing both transactional and analytical data in real time, especially with optimizations delivered by Intel Xeon E7 servers, such as memory addressability, more cores, larger cache, and enhanced error correction. So what value does Hadoop add to the platform?

After the Intel(r) Distribution for Apache Hadoop* was released in February 2013, the collaboration between Intel and SAP focused on optimizing the entire solution stack to run smarter, faster and more simply.

Hadoop can store and process massive amounts of unstructured data. It provides scalability , which is essential for organizations looking to gain insight from large volumes of data. This cost-effective scalability eliminates limits on data storage, which means enterprises that archive data on Hadoop continue to keep it in their analytic purview. Because its cluster-based architecture allows Hadoop to store even petabytes of data, enterprises can add nodes to the cluster rather than expensive database servers. SAP HANA integrated with the Intel Distribution for Apache Hadoop provides a real-time big data analytics solution. Unlike other offerings, the SAP HANA solution with Intel is optimized top-to-bottom through the stack with Intel technologies.

Just two specific examples can serve to illustrate this. With the Intel Distribution, SAP HANA can not only execute federated queries on diverse data stores, whether transactional or analytical, structured or unstructured, but also do so with data protection mechanisms that protect the confidentiality and integrity of the data through hardware-enhanced encryption. Another example is the roadmap of advanced manageability features that are available in the Intel Manager included in the Intel Distribution. Using the Intel Active Tuner, the Intel Manager automatically configures the Hadoop cluster for the MapReduce application that runs on it, reducing the time to deploy the cluster in production and extract value from the data. No other distribution provides these capabilities. We already have several customers actively testing these unique capabilities of the SAP HANA solution with the Intel Distribution.