 Expertize with the tools in Hadoop Ecosystem including Pig, Hive, HDFS, MapReduce, Sqoop, Spark, Kafka, Yarn, Oozie, and Zookeeper.  Experience in analyzing data using HiveQL, PIG Latin  Experience in designing and developing applications in Spark using Scala  Developed Spark code using Scala and Spark-SQL for faster processing of data.  Indepth understanding of Spark Architecture including Spark Core,Spark SQL,Data Frames and Spark Streaming  Worked on importing and exporting data from different databases like Oracle, Mysql into HDFS and Hive using Sqoop  Experience in collecting and storing stream data like log data in HDFS using Flume  Written Hive and PIG queries for data analysis to meet the business requirements.  Involved in creating tables, partitioning, bucketing of table and creating UDF’s in Hive.  Experience with Hive Queries Performance Tuning.  Well experienced with implementing Join operations using PIG Latin.  Experience with Oozie Workflow Engine to automate and parallelize Hadoop Map/Reduce, Hive and PIG jobs.  Knowledge in NoSQL databases like HBase, MongoDB,Cassandra.
Profile ©