Simplifying user-logs management and access in YARN
User logs of Hadoop jobs serve multiple purposes. First and foremost, they can be used to debug issues while running a MapReduce application – correctness problems with the application itself, race...
Modern Healthcare Architectures Built with Hadoop
We have heard plenty in the news lately about healthcare challenges and the difficult choices faced by hospital administrators, technology and pharmaceutical providers, researchers, and clinicians. At...
Fast Search and Analytics on Hadoop with Elasticsearch
Hortonworks customers can now enhance their Hadoop applications with Elasticsearch real-time data exploration, analytics, logging and search features, all designed to help businesses ask better...
A Roadmap for Hadoop and OpenStack Integration
A recent survey conducted by the OpenStack Foundation shows incredible adoption in the enterprise. Cost savings and operational efficiency stand out as the top business motivators that are driving...
Apache Falcon Technical Preview Available Now
We believe the fastest path to innovation is the open community and we work hard to help deliver this innovation from the community to the enterprise. However, this is a two-way street. We are also...
Apache Ambari graduates to Apache Top Level Project!
We are very excited to announce that Apache Ambari has graduated out of Incubator and is now an Apache Top Level Project! Hortonworks introduced Ambari as an Apache Incubator project back in August...
Apache Tez 0.2.0 Released
The Apache Tez team is proud to announce the first release of Apache Tez – version 0.2.0-incubating. Apache Tez is an application framework which allows for a complex directed-acyclic-graph of tasks...
Hadoop Security : Today and Tomorrow
Security is a top agenda item and represents critical requirements for Hadoop projects. Over the years, Hadoop has evolved to address key concerns regarding authentication, authorization, accounting,...
Enterprise Hadoop Market in 2013: Reflections and Directions
2013 was certainly a revealing year for the Enterprise Hadoop market. We witnessed the emergence of the YARN-based architecture of Hadoop 2 and a strong ecosystem embracement that will fuel its next...
Announcing the Technical Preview of Apache Knox Gateway
Just yesterday, we talked about our roadmap for Security in Enterprise Hadoop. At our Security labs page you can see in one place the security roadmap and efforts underway across Hadoop and their...
How To Secure Apache Sqoop Jobs with Oracle Wallet
Apache Sqoop is a tool that transfers data between the Hadoop ecosystem and enterprise data stores. Sqoop does this by providing methods to transfer data to HDFS or Hive (using HCatalog). Oracle...
Storm Technical Preview Available Now!
In October, we announced our intent to include and support Storm as part of Hortonworks Data Platform. With this commitment, we also outlined and proposed an open roadmap to improve the enterprise...
Getting Started Writing YARN Applications
There is a lot of information available on the benefits of Apache YARN, but how do you get started building applications? On December 18 at 9am Pacific Time, Hortonworks will host a webinar and go over...
Hortonworks Data Platform 2.0 on openjdk
Apache Hadoop has always been very fussy about Java versions. It’s a big application running across tens of thousands of processes across thousands of machines in a single datacenter. This makes it...
Downloads for Storm, Falcon, Knox Gateway and Tez
Last week was a busy week for shipping code, so here’s a quick recap on the new stuff to keep you busy over the holiday season. Technical Preview of Storm. This preview includes the latest release of...
Wire Encryption in Hadoop
Encryption is applied to electronic information to ensure its privacy and confidentiality. Typically, we think of protecting data at rest or in motion. Wire Encryption protects the...
Announcing Stinger Phase 3 Technical Preview
As an early Christmas present, we’ve made a technical preview of Stinger Phase 3 available. While just a preview by moniker, the release marks a significant milestone in the transformation of Hadoop...
7 Minutes on Hortonworks Data Platform v2.0
The year is coming to its end. Maybe you’re reading this as you race to check a few more 2013 items off of your to-do list (at work or at home). Or maybe you’ve already got a hot toddy in your hand and...
How To Use Local Repositories with Apache Ambari
The network and security teams at your company do not allow internet access from the machines where you plan to install Hadoop. What do you do? How do you install your Hadoop cluster without having...
Heterogeneous Storages in HDFS
Hadoop has traditionally been used for batch processing of data at large scale. Batch processing applications care more about raw sequential throughput than about low latency, and hence the existing HDFS model...