Will sqoop export create duplicates when the number of mappers is higher than the number of blocks in the source hdfs location?
My source HDFS directory has 24 million records, and when I run a Sqoop export to a Postgres table, it somehow creates duplicate records. I have set the number of mappers to 24, and there are 12 blocks in the source location.
Any idea why Sqoop is creating duplicates?
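For context, the export invocation looks roughly like the sketch below (connection string, table, and paths are hypothetical stand-ins); -m is what controls the mapper count:

# hypothetical hosts and paths; -m 24 with only 12 blocks is the setup in question
sqoop export \
  --connect jdbc:postgresql://db-host:5432/mydb \
  --username dbuser \
  --table target_table \
  --export-dir /data/source \
  -m 24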
I am trying to read messages on a Kafka topic, but I am unable to read them. The process gets killed after some time, without reading any messages.
Here is the rebalancing error which I get:
ERROR Error processing message, stopping consumer: (kafka.consumer.ConsoleConsumer$)
kafka.common.ConsumerRebalanceFailedException: topic-1395414642817-47bb4df2 can't rebalance after 4 retries
at kafka.consumer.ZookeeperConsumerConnector$ZKRebalancerListener.syncedRebalance(ZookeeperConsumerConnector.scala:428)
at kafka.consumer.ZookeeperConsumerConnector.kafka$consumer$ZookeeperConsumerConnector$$reinitializeConsumer(ZookeeperConsumerConnector.scala:718)
at kafka.consumer.ZookeeperConsumerConnector$WildcardStreamsHandler.<init>(ZookeeperConsumerConnector.scala:752)
at kafka.consumer.ZookeeperConsumerConnector.createMessageStreamsByFilter(ZookeeperConsumerConnector.scala:142)
at kafka.consumer.ConsoleConsumer$.main(ConsoleConsumer.scala:196)
at ...
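With the old ZooKeeper-based consumer, one commonly suggested mitigation is to give the rebalance more headroom in the consumer properties; the values below are illustrative, not prescriptive:

# consumer.properties -- illustrative values, tune for your cluster
# (a common rule of thumb: rebalance.max.retries * rebalance.backoff.ms
#  should comfortably exceed zookeeper.session.timeout.ms)
rebalance.max.retries=10
rebalance.backoff.ms=2000
zookeeper.session.timeout.ms=10000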
I have the option of using Sqoop or Informatica Big Data Edition to source data into HDFS. The source systems are Teradata and Oracle.
I would like to know which one is better, and the reasoning behind it.
Note: my current utility can already pull data into HDFS using Sqoop, create a Hive staging table, and archive to an external table.
Informatica is the ETL tool used in the organization.
Regards, Sanjeeb
I have recently deployed a Big Data cluster that uses Apache Kafka and ZooKeeper, but I still don't understand their roles in the cluster. When are both required, and for what purpose?
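Concretely, the dependency is visible in the broker's own configuration: a Kafka broker must be pointed at a running ZooKeeper ensemble, which it uses to store cluster metadata (the address below is a placeholder):

# server.properties -- ZooKeeper must be up before the broker starts
zookeeper.connect=zk-host:2181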
I found many options recently, and I am interested in comparing them, primarily by maturity and stability.
Crunch - https://github.com/cloudera/crunch
Scrunch - ...
I am a newbie to Flume and Hadoop. We are developing a BI module where we can store all the logs from different servers in HDFS.
For this I am using Flume, and I have just started trying it out. I successfully created a node, but now I want to set up an HTTP source and a sink that writes incoming HTTP requests to a local file.
Any suggestions?
Thanks in advance.
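A minimal agent configuration along those lines, assuming Flume's built-in http source and file_roll sink (the agent name, port, and directory are placeholders):

# flume-http.conf -- names, port, and paths are illustrative
a1.sources = r1
a1.channels = c1
a1.sinks = k1

# HTTP source: accepts events POSTed as JSON on port 8080
a1.sources.r1.type = http
a1.sources.r1.bind = 0.0.0.0
a1.sources.r1.port = 8080
a1.sources.r1.channels = c1

# in-memory channel buffering events between source and sink
a1.channels.c1.type = memory
a1.channels.c1.capacity = 1000

# file_roll sink: writes events to rolling files in a local directory
a1.sinks.k1.type = file_roll
a1.sinks.k1.sink.directory = /var/log/flume
a1.sinks.k1.channel = c1

Start it with bin/flume-ng agent --conf conf --conf-file flume-http.conf --name a1; the http source's default JSON handler then accepts events such as curl -X POST -d '[{"headers":{},"body":"hello"}]' http://localhost:8080.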
One of the first things I think about when using a new service (such as a non-RDBMS data store or a message queue) is: "How should I structure my data?".
I've read and watched some introductory materials. In particular, take the paper Kafka: a Distributed Messaging System for Log Processing, which writes:
"a Topic is the container with which messages are associated"
"the smallest unit of parallelism is the partition of a topic. This implies that all messages that ... belong to a particular partition of a topic will be consumed by a consumer in a consumer group."
Knowing this, what would be a good example that illustrates how to use topics and partitions? When should something be a topic? When should something be a partition?
As an example, let's say my (Clojure) data looks like:
{:user-id 101 :viewed "/page1.html" :at #inst "2013-04-12T23:20:50.22Z"}
{:user-id 102 :viewed "/page2.html" :at #inst "2013-04-12T23:20:55.50Z"}
Should the topic be based on user-id? viewed? at? What about the ...
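One common pattern, sketched here with made-up names: one topic per event type (say, page-views), with partitions used purely for parallelism and messages keyed by user-id so each user's events stay ordered within a single partition. With 0.8.x-era tooling, creating such a topic looks like:

# hypothetical topic name and counts
bin/kafka-topics.sh --create --zookeeper localhost:2181 \
  --topic page-views --partitions 12 --replication-factor 2

A producer that keys each message by user-id will hash every user onto a fixed partition, preserving per-user order while still spreading load across consumers in a group.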
Is there a way to purge a topic in Kafka?
I pushed a message that was too big into a Kafka topic on my local machine, and now I'm getting an error:
kafka.common.InvalidMessageSizeException: invalid message size
Increasing the fetch.size is not ideal here, because I don't actually want to accept messages that big.
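One widely used workaround, assuming your Kafka version supports per-topic config overrides (the flags moved to kafka-configs.sh in later releases, and the topic name here is a placeholder): temporarily shrink the topic's retention so the broker discards its log segments, then restore it:

# shrink retention so existing segments (including the oversized message) get deleted
bin/kafka-topics.sh --zookeeper localhost:2181 --alter \
  --topic my-topic --config retention.ms=1000
# wait for the cleanup to run, then remove the override
bin/kafka-topics.sh --zookeeper localhost:2181 --alter \
  --topic my-topic --delete-config retention.ms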
I am using hadoop-1.2.1, and the Sqoop version is 1.4.4.
I am trying to run the following command.
sqoop import --connect jdbc:mysql://IP:3306/database_name --table clients --target-dir /data/clients --username root --password-file /sqoop.password -m 1
sqoop.password is a file kept on HDFS at /sqoop.password with permissions 400.
It gives me this error:
Access denied for user 'root'@'IP' (using password: YES)
Can anyone provide a solution for this?
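One frequent culprit worth checking, offered here as an assumption rather than a diagnosis: a trailing newline in the password file makes the database see a different password. Writing the file from stdin with echo -n avoids that:

# write the password with no trailing newline, then lock down permissions
echo -n 'root_password' | hadoop fs -put - /sqoop.password
hadoop fs -chmod 400 /sqoop.password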
Can I import RDBMS table data (the table doesn't have a primary key) into Hive using Sqoop? If yes, can you please give the sqoop import command?
I have tried the general sqoop import command, but it failed.
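For reference: without a primary key, Sqoop cannot choose a split column on its own, so the usual options are a single mapper or an explicit --split-by. A sketch with placeholder connection details:

# option 1: single mapper, no split column needed
sqoop import --connect jdbc:mysql://host:3306/db --username user -P \
  --table mytable --hive-import -m 1

# option 2: parallel import, splitting on a column you choose
sqoop import --connect jdbc:mysql://host:3306/db --username user -P \
  --table mytable --hive-import --split-by some_indexed_column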
I am trying to use Kafka. All configurations are done properly, but when I try to produce a message from the console I keep getting the following error:
WARN Error while fetching metadata with correlation id 39 :
{4-3-16-topic1=LEADER_NOT_AVAILABLE} (org.apache.kafka.clients.NetworkClient)
Kafka version: 2.11-0.9.0.0
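LEADER_NOT_AVAILABLE from the console producer often means the client cannot reach the broker at the address the broker advertises. One hedged check for a 0.9-era broker: set the advertised address in server.properties to a name the client can actually resolve (the hostname below is a placeholder), then restart the broker:

# server.properties -- use a hostname/IP your clients can reach
advertised.host.name=your.broker.hostname
port=9092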
I am exploring a few options to set up Kafka, and I know that ZooKeeper has to be up and running before Kafka can start.
I would like to know how I can find the following.
1) The hostname and port for my ZooKeeper instance. I checked zoo.cfg and could only find clientPort, not the hostname; will the hostname just be the hostname of my box?
2) How to check whether ZooKeeper is up and running. I tried ps -ef | grep "zoo" and could not find anything; maybe I am searching for the wrong keyword?
Any help would be really appreciated.
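A couple of hedged checks, with default ports and placeholder hosts. ZooKeeper answers four-letter commands on its client port, and its JVM's main class is QuorumPeerMain, which jps can find:

# 1) clientPort (2181 by default) lives on whichever host runs ZooKeeper;
#    with no server.N entries in zoo.cfg it runs standalone on that box
echo ruok | nc localhost 2181    # prints "imok" if the server is up
echo stat | nc localhost 2181    # prints version, mode, and connection counts

# 2) the server runs as a JVM, so search by main class or full name
jps -l | grep -i quorum
ps -ef | grep -i [z]ookeeper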
Can anybody explain Apache Flume to me in plain language? I'd appreciate an explanation with a practical example rather than abstract theoretical definitions, so I can understand it better.
What is it used for? At which stage of a Big Data analysis is it used?
And what are the prerequisites for learning it?
Please explain it as you would for a non-technical person.