I have uploaded a directory to a Hadoop cluster that has "," in its name, like "MyDir, Name". When I try to delete this directory using the rmr Hadoop shell command, as follows:
hadoop dfs -rmr hdfs://host:port/Navi/MyDir, Name
I'm getting the following messages:
rmr: cannot remove hdfs://host:port/Navi/MyDir,: No such file or directory.
rmr: cannot remove Name: No such file or directory.
However, I have successfully deleted other directories from the same location using the same command, i.e.
hadoop dfs -rmr hdfs://host:port/dir_path
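What most likely splits the argument here is the space after the comma, not the comma itself: the shell passes "hdfs://host:port/Navi/MyDir," and "Name" as two separate arguments, which matches the two error messages. Quoting the whole path (hadoop dfs -rmr "hdfs://host:port/Navi/MyDir, Name") should keep it as a single argument. As an alternative, a minimal Java sketch using the FileSystem API, which never goes through shell word-splitting (host:port is the same placeholder as in the question):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class DeleteDirWithComma {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // The full path is one Java string, so the space cannot split it here.
        Path dir = new Path("hdfs://host:port/Navi/MyDir, Name");
        FileSystem fs = dir.getFileSystem(conf);
        boolean deleted = fs.delete(dir, true);   // true = recursive, like -rmr
        System.out.println("deleted: " + deleted);
    }
}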
I am migrating my application from Hadoop 1.0.3 to Hadoop 2.2.0, and the Maven build had hadoop-core marked as a dependency. Since hadoop-core is not present for Hadoop 2.2.0, I tried replacing it with hadoop-client and hadoop-common, but I am still getting this error for ant.filter. Can anybody please suggest which artifact to use?
Previous config:
<dependency>
<groupId>org.apache.hadoop</groupId>
<artifactId>hadoop-core</artifactId>
<version>1.0.3</version>
</dependency>
Error:
Failed to execute goal org.apache.maven.plugins:maven-compiler-plugin:3.1:compile (default-compile) on project event: Compilation failure: Compilation failure:
/opt/teamcity/buildAgent/work/c670ebea1992ec2f/event/src/main/java/com/intel/event/EventContext.java: package org.apache.tools.ant.filters does not exist
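The compiler error is about org.apache.tools.ant.filters, i.e. an Ant class that the project's own source imports; hadoop-core 1.x presumably brought Ant onto the compile classpath transitively, and hadoop-client/hadoop-common 2.2.0 do not. A hedged sketch of the dependency block, with hadoop-client for the Hadoop 2.2.0 client classes plus an explicit Ant dependency (the Ant version shown is an assumption, not taken from the project):
<dependency>
<groupId>org.apache.hadoop</groupId>
<artifactId>hadoop-client</artifactId>
<version>2.2.0</version>
</dependency>
<dependency>
<groupId>org.apache.ant</groupId>
<artifactId>ant</artifactId>
<version>1.9.4</version>
</dependency>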
I am installing Hadoop on my laptop. SSH works fine, but I cannot start Hadoop.
munichong@GrindPad:~$ ssh localhost
Welcome to Ubuntu 12.10 (GNU/Linux 3.5.0-25-generic x86_64)
* Documentation: https://help.ubuntu.com/
0 packages can be updated.
0 updates are security updates.
Last login: Mon Mar 4 00:01:36 2013 from localhost
munichong@GrindPad:~$ /usr/sbin/start-dfs.sh
chown: changing ownership of `/var/log/hadoop/root': Operation not permitted
starting namenode, logging to /var/log/hadoop/root/hadoop-munichong-namenode-GrindPad.out
/usr/sbin/hadoop-daemon.sh: line 136: /var/run/hadoop/hadoop-munichong-namenode.pid: Permission denied
usr/sbin/hadoop-daemon.sh: line 135: /var/log/hadoop/root/hadoop-munichong-namenode-GrindPad.out: Permission denied
head: cannot open `/var/log/hadoop/root/hadoop-munichong-namenode-GrindPad.out' for reading: No such file or directory
localhost: chown: changing ownership of `/var/log/hadoop/root': Operation not permitted
localhost: starting datanode, logging to...
In Hadoop, when do reduce tasks start? Do they start after a certain percentage (threshold) of mappers complete? If so, is this threshold fixed? What kind of threshold is typically used?
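For context on what "start" means here: reducers can be scheduled before all maps finish, but they only begin the copy/shuffle phase early; the reduce() calls themselves run only once every map task has completed. The fraction of finished maps that triggers reducer launch is configurable and defaults to about 0.05. A minimal sketch of setting it from a driver (0.80 is just an example value; the older property name was mapred.reduce.slowstart.completed.maps):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;

public class SlowstartExample {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Fraction of map tasks that must finish before reduce tasks are scheduled.
        conf.setFloat("mapreduce.job.reduce.slowstart.completedmaps", 0.80f);
        Job job = Job.getInstance(conf, "slowstart-demo");
        // ... mapper, reducer and input/output paths would be set here (omitted) ...
        System.out.println(job.getConfiguration().get("mapreduce.job.reduce.slowstart.completedmaps"));
    }
}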
Hi, I can't resolve my problem when running Hadoop with start-all.sh:
rochdi@127:~$ start-all.sh
/usr/local/hadoop/bin/hadoop-daemon.sh: line 62: [: localhost: integer expression expected
starting namenode, logging to /usr/local/hadoop/libexec/../logs/hadoop-rochdi-namenode-127.0.0.1
localhost: /usr/local/hadoop/bin/hadoop-daemon.sh: line 62: [: localhost: integer expression expected
localhost: starting datanode, logging to /usr/local/hadoop/libexec/../logs/hadoop-rochdi-datanode-127.0.0.1
localhost: /usr/local/hadoop/bin/hadoop-daemon.sh: line 62: [: localhost: integer expression expected
localhost: starting secondarynamenode, logging to /usr/local/hadoop/libexec/../logs/hadoop-rochdi-secondarynamenode-127.0.0.1
/usr/local/hadoop/bin/hadoop-daemon.sh: line 62: [: localhost: integer expression expected
starting jobtracker, logging to /usr/local/hadoop/libexec/../logs/hadoop-rochdi-jobtracker-127.0.0.1
localhost: /usr/local/hadoop/bin/hadoop-daemon.sh: line 62: [: localhost: integer expression...
I've recently started getting into data analysis and I've learned quite a bit over the last year (at the moment, pretty much exclusively using Python). I feel the next step is to begin training myself in MapReduce/Hadoop. I have no formal computer science training, however, and so I often don't quite understand the jargon that is used when people write about Hadoop, hence my question here.
What I am hoping for is a top-level overview of Hadoop (unless there is something else I should be using?) and perhaps a recommendation for some sort of tutorial/textbook.
If, for example, I want to parallelise a neural network which I have written in Python, where would I start? Is there a relatively standard method for implementing Hadoop with an algorithm or is each solution very problem specific?
The Apache wiki page describes Hadoop as "a framework for running applications on large cluster built of commodity hardware". But what does that mean? I've heard the term "Hadoop Cluster" and I know that Hadoop is Java...
I have an option of using Sqoop or Informatica Big Data Edition to source data into HDFS. The source systems are Teradata and Oracle.
I would like to know which one is better and the reasoning behind it.
Note: My current utility is able to pull data into HDFS using Sqoop, create a Hive staging table, and archive an external table.
Informatica is the ETL tool used in the organization.
Regards Sanjeeb
I want to set up a Hadoop cluster in pseudo-distributed mode. I managed to perform all the setup steps, including starting a NameNode, DataNode, JobTracker and a TaskTracker on my machine.
Then I tried to run some example programs and hit the java.net.ConnectException: Connection refused error. I stepped back to the very first steps of running some operations in standalone mode and faced the same problem.
I even triple-checked all the installation steps and have no idea how to fix it. (I am new to Hadoop and a beginner Ubuntu user, so I kindly ask you to take that into account when providing any guide or tip.)
This is the error output I keep receiving:
hduser@marta-komputer:/usr/local/hadoop$ bin/hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-2.6.0.jar grep input output 'dfs+'
15/02/22 18:23:04 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
15/02/22 18:23:04 INFO client.RMProxy:...
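Not the poster's code, but a small diagnostic sketch that can help narrow this down: "Connection refused" here normally means the client resolved an address for the NameNode (or ResourceManager) and nothing was listening on it, typically because the daemon is not running or fs.defaultFS / /etc/hosts point at the wrong host or port. This assumes core-site.xml is on the classpath:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;

public class CheckDefaultFs {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();   // loads core-site.xml from the classpath
        // The URI printed here must match the address the NameNode actually listens on
        // (compare with the output of `jps` and `netstat -ltnp` on the same machine).
        System.out.println("fs.defaultFS = " + conf.get("fs.defaultFS"));
        FileSystem fs = FileSystem.get(conf);       // throws ConnectException if nothing is listening
        System.out.println("connected to: " + fs.getUri());
    }
}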
I am new to Hadoop, and want to know what the differences are between hadoop-common, hadoop-core and hadoop-client.
By the way, for a given class, how do I know which Maven artifact contains it? For example, which one contains org.apache.hadoop.io.Text?
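As a hedged pointer rather than a definitive answer: in the Hadoop 2.x artifacts, org.apache.hadoop.io.Text ships in hadoop-common, and hadoop-client is essentially an aggregator that depends on hadoop-common, hadoop-hdfs and the MapReduce client jars, so either dependency makes Text available. A general trick for the "which artifact contains class X" question is to search the fully qualified class name on search.maven.org, or simply ask the JVM where a class was loaded from:

import org.apache.hadoop.io.Text;

public class TextCheck {
    public static void main(String[] args) {
        Text t = new Text("hello");
        // Prints the jar this class was loaded from, which identifies the artifact.
        System.out.println(Text.class.getProtectionDomain().getCodeSource().getLocation());
        System.out.println(t);
    }
}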
I'm trying to understand the relationship between the number of cores and the number of executors when running a Spark job on YARN.
The test environment is as follows:
Number of data nodes: 3
Data node machine spec:
CPU: Core i7-4790 (# of cores: 4, # of threads: 8)
RAM: 32GB (8GB x 4)
HDD: 8TB (2TB x 4)
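Given the hardware above, a common (hedged) rule of thumb is to leave a core and some memory on each node for the OS and Hadoop daemons, give each executor roughly 4-5 cores, and size executor memory to what remains after YARN and off-heap overheads; the concrete numbers below are illustrative assumptions, not a definitive recommendation. The same settings map to the --num-executors, --executor-cores and --executor-memory flags of spark-submit.

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaSparkContext;

public class ExecutorSizingSketch {
    public static void main(String[] args) {
        SparkConf conf = new SparkConf()
                .setAppName("executor-sizing-sketch")
                .set("spark.executor.instances", "3")   // e.g. one executor per data node
                .set("spark.executor.cores", "5")       // cores each executor may use
                .set("spark.executor.memory", "19g");   // heap per executor
        JavaSparkContext sc = new JavaSparkContext(conf);
        System.out.println("defaultParallelism = " + sc.defaultParallelism());
        sc.stop();
    }
}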
I am trying to implement a sample word count program using Hadoop. I have downloaded and installed Hadoop 2.0.0. I want to do this sample program using Eclipse, because I think later in my real project I will have to use Eclipse.
I am not able to find Hadoop-related jar files like hadoop-core.jar and the other required jar files. I searched in all the folders of Hadoop 2.0 but couldn't find those files. The same files are available in the 1.0 version of Hadoop but not in the 2.0 version. I would like to know where I can get these files.
I am not able to find much information about the 2.0 version. Please help.
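Two pointers, hedged rather than definitive: in the Hadoop 2.x tarballs the single hadoop-core.jar no longer exists; the jars are split across the share/hadoop/common, share/hadoop/hdfs, share/hadoop/mapreduce and share/hadoop/yarn directories (plus their lib subfolders), and when building in Eclipse it is usually easier to depend on org.apache.hadoop:hadoop-client from Maven instead of adding jars by hand. Against those libraries, a word count mapper is a small sketch like this (not the poster's code):

import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

public class WordCountMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
    private static final IntWritable ONE = new IntWritable(1);
    private final Text word = new Text();

    @Override
    protected void map(LongWritable key, Text value, Context context)
            throws IOException, InterruptedException {
        for (String token : value.toString().split("\\s+")) {
            if (!token.isEmpty()) {
                word.set(token);
                context.write(word, ONE);   // emit (word, 1) for every token in the line
            }
        }
    }
}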
I am getting the following error while starting the NameNode for the latest hadoop-2.2 release. I didn't find a winutils exe file in the Hadoop bin folder. I tried the commands below:
$ bin/hdfs namenode -format
$ sbin/yarn-daemon.sh start resourcemanager
ERROR util.Shell (Shell.java:getWinUtilsPath(303)) - Failed to locate the winutils binary in the hadoop binary path
java.io.IOException: Could not locate executable null\bin\winutils.exe in the Hadoop binaries.
at org.apache.hadoop.util.Shell.getQualifiedBinPath(Shell.java:278)
at org.apache.hadoop.util.Shell.getWinUtilsPath(Shell.java:300)
at org.apache.hadoop.util.Shell.<clinit>(Shell.java:293)
at org.apache.hadoop.util.StringUtils.<clinit>(StringUtils.java:76)
at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager.main(ResourceManager.java:863)
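For what it's worth (a hedged note, not a guaranteed fix): the Apache Hadoop 2.2 download does not ship winutils.exe, and the "Could not locate executable null\bin\winutils.exe" message means neither the HADOOP_HOME environment variable nor the hadoop.home.dir system property is set, so the path is literally built from null. The usual workaround is to obtain or build winutils.exe for your Hadoop version, put it under %HADOOP_HOME%\bin, and set HADOOP_HOME before starting the daemons. When launching a Hadoop client from your own JVM, the property can also be set in code (C:\hadoop below is an assumed location):

public class WinutilsHomeCheck {
    public static void main(String[] args) {
        // Point hadoop.home.dir at a directory that actually contains bin\winutils.exe.
        System.setProperty("hadoop.home.dir", "C:\\hadoop");
        System.out.println("hadoop.home.dir = " + System.getProperty("hadoop.home.dir"));
        System.out.println("HADOOP_HOME     = " + System.getenv("HADOOP_HOME"));
    }
}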
In many real-life situations where you apply MapReduce, the final algorithm ends up being several MapReduce steps, i.e. Map1, Reduce1, Map2, Reduce2, and so on.
So the output of the last reduce is needed as the input for the next map.
The intermediate data is something you (in general) do not want to keep once the pipeline has completed successfully. Also, because this intermediate data is in general some data structure (like a 'map' or a 'set'), you don't want to put too much effort into writing and reading these key-value pairs.
What is the recommended way of doing that in Hadoop?
Is there a (simple) example that shows how to handle this intermediate data in the correct way, including the cleanup afterward?
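One straightforward pattern (a sketch with assumed paths and omitted job classes, not the only recommended way) is to write each step's output to a dedicated intermediate directory, feed that directory to the next job, and delete it once the final job has succeeded. SequenceFileOutputFormat/SequenceFileInputFormat is a convenient intermediate format because it stores the key-value pairs in binary form without any hand-written parsing.

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class TwoStepDriver {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        Path input = new Path("/data/in");            // assumed paths
        Path intermediate = new Path("/data/tmp-step1");
        Path output = new Path("/data/out");

        Job job1 = Job.getInstance(conf, "step1");
        // job1.setMapperClass(Map1.class); job1.setReducerClass(Reduce1.class); ... (omitted)
        FileInputFormat.addInputPath(job1, input);
        FileOutputFormat.setOutputPath(job1, intermediate);
        if (!job1.waitForCompletion(true)) System.exit(1);

        Job job2 = Job.getInstance(conf, "step2");
        // job2.setMapperClass(Map2.class); job2.setReducerClass(Reduce2.class); ... (omitted)
        FileInputFormat.addInputPath(job2, intermediate);
        FileOutputFormat.setOutputPath(job2, output);
        boolean ok = job2.waitForCompletion(true);

        // Clean up the intermediate data only after the whole pipeline has finished.
        FileSystem.get(conf).delete(intermediate, true);
        System.exit(ok ? 0 : 1);
    }
}

For longer pipelines, the JobControl class or an external workflow tool such as Oozie can manage the same dependency chain instead of a hand-written driver.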
I would like to read a CSV in Spark, convert it to a DataFrame, and store it in HDFS with df.registerTempTable("table_name"). I have tried:
scala> val df = sqlContext.load("hdfs:///csv/file/dir/file.csv")
Error which I got:
java.lang.RuntimeException: hdfs:///csv/file/dir/file.csv is not a Parquet file. expected magic number at tail but found
at parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:418)
at org.apache.spark.sql.parquet.ParquetRelation2$MetadataCache$$anonfun$refresh$6.apply(newParquet.scala:277)
at org.apache.spark.sql.parquet.ParquetRelation2$MetadataCache$$anonfun$refresh$6.apply(newParquet.scala:276)
at scala.collection.parallel.mutable.ParArray$Map.leaf(ParArray.scala:658)
at scala.collection.parallel.Task$$anonfun$tryLeaf$1.apply$mcV$sp(Tasks.scala:54)
at scala.collection.parallel.Task$$anonfun$tryLeaf$1.apply(Tasks.scala:53)
at scala.collection.parallel.Task$$anonfun$tryLeaf$1.apply(Tasks.scala:53)
at...
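A hedged explanation and workaround: sqlContext.load(path) with no explicit source falls back to the default data source, which is Parquet, hence the "expected magic number" error on a CSV file. Two common ways around it are the spark-csv package (com.databricks:spark-csv, loaded with format "com.databricks.spark.csv") or parsing the text file by hand and applying a schema, as in this Spark 1.x Java sketch (the two string columns are an assumption about the file):

import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.sql.DataFrame;
import org.apache.spark.sql.Row;
import org.apache.spark.sql.RowFactory;
import org.apache.spark.sql.SQLContext;
import org.apache.spark.sql.types.DataTypes;
import org.apache.spark.sql.types.StructField;
import org.apache.spark.sql.types.StructType;

public class CsvToDataFrame {
    public static void main(String[] args) {
        JavaSparkContext sc = new JavaSparkContext(new SparkConf().setAppName("csv-to-df"));
        SQLContext sqlContext = new SQLContext(sc);

        // Split each line on commas; real CSVs with quoting need a proper parser or spark-csv.
        JavaRDD<Row> rows = sc.textFile("hdfs:///csv/file/dir/file.csv")
                .map(line -> {
                    String[] fields = line.split(",", -1);
                    return RowFactory.create(fields[0], fields[1]);
                });
        StructType schema = DataTypes.createStructType(new StructField[] {
                DataTypes.createStructField("col1", DataTypes.StringType, true),
                DataTypes.createStructField("col2", DataTypes.StringType, true)
        });
        DataFrame df = sqlContext.createDataFrame(rows, schema);
        df.registerTempTable("table_name");   // same temp-table name as in the question
        sc.stop();
    }
}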
I tried installing Hadoop following this http://hadoop.apache.org/common/docs/stable/single_node_setup.html document. When I tried executing this:
bin/hadoop jar hadoop-examples-*.jar grep input output 'dfs+'
I am getting the following exception:
java.lang.OutOfMemoryError: Java heap space
Please suggest a solution so that I can try out the example. The entire exception is listed below. I am new to Hadoop and might have done something dumb. Any suggestion will be highly appreciated.
anuj@anuj-VPCEA13EN:~/hadoop$ bin/hadoop jar hadoop-examples-*.jar grep input output 'dfs+'
11/12/11 17:38:22 INFO util.NativeCodeLoader: Loaded the native-hadoop library
11/12/11 17:38:22 INFO mapred.FileInputFormat: Total input paths to process : 7
11/12/11 17:38:22 INFO mapred.JobClient: Running job: job_local_0001
11/12/11 17:38:22 INFO util.ProcessTree: setsid exited with exit code 0
11/12/11 17:38:22 INFO mapred.Task: Using ResourceCalculatorPlugin :...
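A hedged note on where the heap is set, since two different JVMs can be involved: the log shows job_local_0001, i.e. the LocalJobRunner, where map and reduce tasks run inside the client JVM, so the relevant knob is the client heap (HADOOP_HEAPSIZE, or HADOOP_CLIENT_OPTS with an -Xmx value, in conf/hadoop-env.sh). On a real or pseudo-distributed cluster, the child task JVMs take their flags from mapred.child.java.opts instead; a minimal Hadoop 1.x sketch of setting that (1024m is an illustrative value):

import org.apache.hadoop.mapred.JobConf;

public class TaskHeapExample {
    public static void main(String[] args) {
        JobConf conf = new JobConf();
        // Heap for each spawned map/reduce task JVM (not used by the LocalJobRunner,
        // which runs tasks in-process).
        conf.set("mapred.child.java.opts", "-Xmx1024m");
        System.out.println(conf.get("mapred.child.java.opts"));
    }
}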
I am trying to run a simple NaiveBayesClassifier using Hadoop and am getting this error:
Exception in thread "main" java.io.IOException: No FileSystem for scheme: file
at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:1375)
at org.apache.hadoop.fs.FileSystem.access$200(FileSystem.java:66)
at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:1390)
at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:196)
at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:95)
at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:180)
at org.apache.hadoop.fs.Path.getFileSystem(Path.java:175)
at org.apache.mahout.classifier.naivebayes.NaiveBayesModel.materialize(NaiveBayesModel.java:100)
Code :
Configuration configuration = new Configuration();
NaiveBayesModel model = NaiveBayesModel.materialize(new Path(modelPath), configuration);// error in this line..
modelPath is pointing to the NaiveBayes.bin file, and the configuration object is printing...
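A hedged diagnosis that fits this stack trace: when hadoop-commons and hadoop-hdfs are merged into one fat jar, their META-INF/services/org.apache.hadoop.fs.FileSystem files can overwrite each other, so the registration for the "file" scheme is lost and FileSystem.get() fails exactly like this. Two usual remedies are merging the service files when shading (for Maven, the shade plugin's ServicesResourceTransformer) or naming the implementations explicitly on the Configuration, as in this sketch:

import org.apache.hadoop.conf.Configuration;

public class FileSystemSchemeWorkaround {
    public static Configuration withExplicitFileSystems() {
        Configuration configuration = new Configuration();
        // Re-register the schemes by hand in case the service files were clobbered.
        configuration.set("fs.file.impl", org.apache.hadoop.fs.LocalFileSystem.class.getName());
        configuration.set("fs.hdfs.impl", org.apache.hadoop.hdfs.DistributedFileSystem.class.getName());
        return configuration;
    }
}

Passing such a Configuration to NaiveBayesModel.materialize(new Path(modelPath), configuration) is then the same call as in the question; if hadoop-hdfs is not on the classpath at all, the fs.hdfs.impl line will not resolve, and that missing dependency is the thing to fix instead.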
I am looking for some guidance and tips on understanding what it would take to do a reasonable Hadoop proof of concept in the cloud. I am a complete noob to the Big Data analytics world and would be more than happy for any suggestions you might have based on your experience.
Are "hadoop fs" and "hdfs dfs" supposed to be equal?
But why do the "hadoop fs" commands show the HDFS files, while the "hdfs dfs" commands show the local files?
Here is the Hadoop version information:
Hadoop 2.0.0-mr1-cdh4.2.1 Subversion git://ubuntu-slave07.jenkins.cloudera.com/var/lib/jenkins/workspace/CDH4.2.1-Packaging-MR1/build/cdh4/mr1/2.0.0-mr1-cdh4.2.1/source -r Compiled by jenkins on Mon Apr 22 10:48:26 PDT 2013
Nathan Marz in his book "Big Data" describes how to maintain files of data in HDFS and how to optimize file sizes to be as near the native HDFS block size as possible, using his Pail library running on top of MapReduce.
Is it possible to achieve the same result in Google Cloud Storage?
Can I use Google Cloud Dataflow instead of MapReduce for this purpose?
I installed Spark using the AWS EC2 guide and I can launch the program fine using the bin/pyspark script to get to the Spark prompt, and can also do the Quick Start guide successfully.
However, I cannot for the life of me figure out how to stop all of the verbose INFO logging after each command. I have tried nearly every possible scenario in the code below (commenting out, setting to OFF) in my log4j.properties file in the conf folder where I launch the application from, as well as on each node, and nothing does anything. I still get the INFO statements printing after executing each statement.
I am very confused about how this is supposed to work.
# Set everything to be logged to the console
log4j.rootCategory=INFO, console
log4j.appender.console=org.apache.log4j.ConsoleAppender
log4j.appender.console.target=System.err
log4j.appender.console.layout=org.apache.log4j.PatternLayout...
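A hedged suggestion: raising the root threshold from INFO to WARN is usually all that is needed, and it must be done in the conf directory of the Spark installation that actually launches the driver (copy log4j.properties.template to log4j.properties if only the template exists). The appender lines below repeat the standard console appender settings from the question; the ConversionPattern is one common console layout, not something taken from the poster's file.

# Raise the root threshold so routine INFO chatter is suppressed (WARN and ERROR still show)
log4j.rootCategory=WARN, console
log4j.appender.console=org.apache.log4j.ConsoleAppender
log4j.appender.console.target=System.err
log4j.appender.console.layout=org.apache.log4j.PatternLayout
log4j.appender.console.layout.ConversionPattern=%d{yy/MM/dd HH:mm:ss} %p %c{1}: %m%n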
I'm currently configuring Hadoop on a server running CentOS. When I run start-dfs.sh or stop-dfs.sh, I get the following error:
WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
I'm running Hadoop 2.2.0. Doing a search online brought up this link: http://balanceandbreath.blogspot.ca/2013/01/utilnativecodeloader-unable-to-load.html
However, the contents of the /native/ directory on Hadoop 2.x appear to be different, so I am not sure what to do. I've also added these two environment variables in hadoop-env.sh:
export HADOOP_OPTS="$HADOOP_OPTS -Djava.library.path=/usr/local/hadoop/lib/"
export HADOOP_COMMON_LIB_NATIVE_DIR="/usr/local/hadoop/lib/native/"
Any ideas?
I am currently starting a project titled "Cloud computing for time series mining algorithms using Hadoop". The data I have is HDF files totalling over a terabyte. As far as I know, Hadoop expects text files as input for further processing (map-reduce tasks). So one option is to convert all my .hdf files to text files, which is going to take a lot of time.
The other option is to find a way to use raw HDF files in map-reduce programs. So far I have not been successful in finding any Java code which reads HDF files and extracts data from them. If somebody has a better idea of how to work with HDF files, I would really appreciate such help.
Thanks Ayush
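Hadoop is not limited to text input; any format works as long as there is an InputFormat for it. Since .hdf is a binary container that should not be split mid-file, one common pattern (a sketch, not a complete solution) is a whole-file input format: each file becomes a single record of raw bytes, and the mapper parses those bytes with an HDF library (for example the HDF-Java bindings, not shown here). The driver would wire it in with job.setInputFormatClass(WholeFileInputFormat.class).

import java.io.IOException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.BytesWritable;
import org.apache.hadoop.io.IOUtils;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.mapreduce.InputSplit;
import org.apache.hadoop.mapreduce.JobContext;
import org.apache.hadoop.mapreduce.RecordReader;
import org.apache.hadoop.mapreduce.TaskAttemptContext;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.input.FileSplit;

public class WholeFileInputFormat extends FileInputFormat<NullWritable, BytesWritable> {

    @Override
    protected boolean isSplitable(JobContext context, Path file) {
        return false;                       // never split an .hdf file across mappers
    }

    @Override
    public RecordReader<NullWritable, BytesWritable> createRecordReader(
            InputSplit split, TaskAttemptContext context) {
        return new RecordReader<NullWritable, BytesWritable>() {
            private final BytesWritable value = new BytesWritable();
            private FileSplit fileSplit;
            private Configuration conf;
            private boolean processed = false;

            @Override
            public void initialize(InputSplit s, TaskAttemptContext ctx) {
                fileSplit = (FileSplit) s;
                conf = ctx.getConfiguration();
            }

            @Override
            public boolean nextKeyValue() throws IOException {
                if (processed) return false;
                // Read the whole file into one value; the mapper parses it as HDF.
                byte[] contents = new byte[(int) fileSplit.getLength()];
                Path file = fileSplit.getPath();
                FileSystem fs = file.getFileSystem(conf);
                try (FSDataInputStream in = fs.open(file)) {
                    IOUtils.readFully(in, contents, 0, contents.length);
                }
                value.set(contents, 0, contents.length);
                processed = true;
                return true;
            }

            @Override public NullWritable getCurrentKey() { return NullWritable.get(); }
            @Override public BytesWritable getCurrentValue() { return value; }
            @Override public float getProgress() { return processed ? 1.0f : 0.0f; }
            @Override public void close() { }
        };
    }
}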
I am confused as to where Talend and Apache Spark fit in the big data ecosystem, as both Apache Spark and Talend can be used for ETL.
Could someone please explain this with an example?