reduceByKey Spark

The reduceByKey function works only on RDDs whose elements are key-value pairs (i.e., RDDs having a tuple or Map as the data element). It is a transformation operation, which means...
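To make the key-value idea concrete, here is a minimal sketch in plain Python that imitates what reduceByKey does to a collection of pairs. It is not Spark itself: the helper name `reduce_by_key` and the sample data are assumptions made up for illustration. In PySpark the equivalent call would be `rdd.reduceByKey(add)` on an RDD of tuples.

```python
from operator import add


def reduce_by_key(pairs, func):
    """Merge the values of each key with func (hypothetical helper, not a Spark API)."""
    merged = {}
    for key, value in pairs:
        # The first value seen for a key is kept as-is; later values are folded in.
        merged[key] = func(merged[key], value) if key in merged else value
    return merged


# Sample (key, value) tuples, invented for this example.
pairs = [("a", 1), ("b", 2), ("a", 3), ("b", 4), ("c", 5)]
totals = reduce_by_key(pairs, add)
print(totals)  # {'a': 4, 'b': 6, 'c': 5}
```

In Spark the merge function should be associative and commutative, because values are combined within each partition first and the partial results are merged afterwards in no guaranteed order.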

Sqoop command to import data from a MySQL database to HDFS

Step 1. Log in to the MySQL database.

cloudera@cloudera-vm:~$ mysql -uroot -p
Enter password:
Welcome to the MySQL monitor. Commands end with ; or \g.
Your MySQL connection id is 54
Server version: 5.1.61-0ubuntu0.10.10.1...

Import all tables from a MySQL database to HDFS

In this post, I will show you how to import all tables from a MySQL database. STEP 1. First, connect to the MySQL database and check the number of tables. cloudera@cloudera-vm:~$ mysql -uroot...

Exclude parameter in Sqoop to exclude tables from import

STEP 1. Check the number of tables in the MySQL database.

mysql> show tables;
+----------------+
| Tables_in_test |
+----------------+
| company        |
| student        |
| transactions   |
+----------------+
3 rows in set...

Sqoop Command with Secure Password

There are different ways to secure the Sqoop password. The following are the different methods for securing the Sqoop password while running a script. 1. Use the --password option with the Sqoop command: $ sqoop import --connect jdbc:mysql://localhost/MYDB...

Exclude parameter in Sqoop to exclude tables from import

The exclude parameter is useful for excluding tables from a Sqoop import operation. It works with the import-all-tables tool: import-all-tables imports all tables except those in the exclude list. STEP...
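As a rough illustration of how the exclude list fits into the command, the sketch below assembles (but does not run) a `sqoop import-all-tables` command line with the `--exclude-tables` option. The helper `build_import_all_tables`, the JDBC URL, the user name, and the choice of excluded tables are assumptions picked for the example; `-P` is used so that Sqoop prompts for the password instead of taking it on the command line.

```python
import shlex


def build_import_all_tables(jdbc_url, username, exclude=()):
    """Build the argument list for `sqoop import-all-tables` (hypothetical helper)."""
    cmd = [
        "sqoop", "import-all-tables",
        "--connect", jdbc_url,
        "--username", username,
        "-P",  # prompt for the password at run time
    ]
    if exclude:
        # --exclude-tables takes a comma-separated list of table names.
        cmd += ["--exclude-tables", ",".join(exclude)]
    return cmd


# Assumed values: a local database named "test" and two tables to skip.
cmd = build_import_all_tables(
    "jdbc:mysql://localhost/test", "root", exclude=["company", "student"]
)
print(shlex.join(cmd))
```

With these assumed values, every table in the database except company and student would be imported once the printed command is actually run.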

Difference between Hadoop and Big Data

I have personally found that many students confuse Hadoop with big data. The two are actually different things. In one sentence, I would say “big data is the problem” and “Hadoop is framework...

Switch your career from Oracle DBA to Hadoop Big Data

I have seen many people who want to switch their career from Oracle DBA to Hadoop, but they are not sure how to start and from where to...