TY - BOOK AU - Lima P P (93613017) TI - Bigdata security using apache knox KW - INFORMATION SYSTEMS KW - SECURITY AND PRIVACY KW - HADOOP N2 - Today, all the organisations/business enterprise had got a lot of data which was flooded over the years. Exponential growth of this structured/semi-structured/unstructured data is termed as Bigdata. Studies show that a bank generates 65 billion data/year; Facebook generates 500terabytes /day, etc. So, storing and processing this huge amount of data is a hard task. Apache Hadoop is a technology used to handle bigdata. A growing number of companies are using Apache Hadoop technology to store and analyse petabytes of data including web logs, click stream data and social media content to gain better insights about their customers and their business. Adding security to Hadoop is challenging because all the interactions do not follow the classic client-server pattern: the file system is partitioned and distributed requiring authorization checks at multiple points; a submitted batch job is executed at a later time on nodes different from the node on which the client authenticated and submitted the job; job tasks from different users are executed on the same compute node; secondary services such as a workflow system access Hadoop on behalf of users; and the system scales to thousands of servers and tens of thousands of concurrent tasks. In this project we propose ApacheKnox as the efficient way for providing security to Hadoop clusters. Security and Privacy issues of Hadoop(bigdata) are magnified by velocity, volume, and variety of bigdata, such as large-scale cloud infrastructures, diversity of data sources and formats, streaming nature of data acquisition, and high volume inter-cloud migration. Therefore, traditional security mechanisms, which are tailored to securing small-scale static (as opposed to streaming) data, are inadequate. In this project, we highlight bigdata (Hadoop)-specific security – APACHE KNOX. The Apache Knox Gateway provides perimeter security so that the enterprise can confidently extend Hadoop access to more of those new users while also maintaining compliance with enterprise security policies. Knox also simplifies Hadoop security for users who access the cluster data and execute jobs. The Knox community is working on development efforts to focus on extending the reach of Hadoop services to users outside of the cluster, while further enhancing security ER -