It seems like R is really designed to handle datasets that it can pull entirely into memory. What R packages are recommended for signal processing and machine learning on very large datasets that cannot be pulled into memory?
If R is simply the wrong way to do this, I am open to other robust free suggestions (e.g., scipy, if there is some nice way to handle very large datasets).
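For reference, since scipy is mentioned as an acceptable route: below is a minimal sketch of out-of-core signal processing, streaming a larger-than-RAM signal through a scipy filter in chunks via numpy.memmap. The file names, dtype, and filter design are illustrative assumptions, not a definitive pipeline.

# Sketch: stream a signal too large for RAM through scipy chunk by chunk.
# File names, dtype, and the filter itself are illustrative assumptions.
import numpy as np
from scipy.signal import butter, sosfilt, sosfilt_zi

sig = np.memmap("huge_signal.f32", dtype=np.float32, mode="r")  # on-disk array
sos = butter(4, 0.1, output="sos")       # 4th-order low-pass, normalized freq
zi = sosfilt_zi(sos) * sig[0]            # carry filter state across chunks

chunk = 1_000_000
out = np.memmap("filtered.f32", dtype=np.float32, mode="w+", shape=sig.shape)
for start in range(0, len(sig), chunk):
    block = sig[start:start + chunk]
    y, zi = sosfilt(sos, block, zi=zi)   # stateful filtering per chunk
    out[start:start + chunk] = y
out.flush()

Only one chunk is ever resident in memory, so the same pattern scales to arbitrarily large files.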
I never got a chance to work on Impala and have just started reading about it, but I have one basic question I am not clear about: Impala has its own daemons, so does it also have its own execution engine, or does it run on MapReduce or another execution engine? Thanks in advance.
From the official Hive documentation: "Hive aims to provide acceptable (but not optimal) latency for interactive data browsing, queries over small data sets or test queries."
I'm not an expert on database architecture, and I would like to know whether there is an alternative when the assumption above does not hold, that is, when queries are made over a big data set.
I am trying to understand what would be the best big data solution for reporting purposes.
Currently I have narrowed it down to HBase vs. Hive.
The use case is that we have hundreds of terabytes of data across hundreds of different files. The data is live and gets updated all the time. We need to provide the most efficient way to do reporting. We have dozens of different report pages, where each report consists of different types of numeric and graph data. For instance (the first of these is sketched as a query after the list):
Show all users who logged in to the system in the last hour and whose origin is the US.
Show a graph of games from most played to least played.
From all users in the system, show the percentage of paying vs. non-paying users.
For a given user, show their entire history: how many games they played, what kinds of games, and what their score was in each game.
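For illustration, a hedged sketch of how the first report above might be issued against Hive from Python using PyHive; the logins table and its user_id, country, and login_time columns are hypothetical names, not part of any actual schema here.

# Sketch: report 1 ("users logged in within the last hour from the US")
# as a HiveQL query via PyHive. Table and column names are hypothetical.
from pyhive import hive

conn = hive.connect(host="hive-server", port=10000)  # HiveServer2 placeholder
cur = conn.cursor()
cur.execute("""
    SELECT user_id
    FROM logins
    WHERE country = 'US'
      AND login_time >= from_unixtime(unix_timestamp() - 3600)
""")
for (user_id,) in cur.fetchall():
    print(user_id)

Whether a full-scan query like this is fast enough on live data is exactly the Hive-vs-HBase trade-off the question is about.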
The way I see it, there are 3 solutions:
Store all data in Hadoop and do the queries in Hive. This might work, but I am not sure about the performance. How will it perform when the...
I have recently deployed a big data cluster, in which I used Apache Kafka and ZooKeeper, but I still don't understand their usage in the cluster. When are both required, and for what purpose?
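For orientation, a minimal sketch of the two roles using the kafka-python client: applications publish to and read from the Kafka brokers, while ZooKeeper coordinates the brokers themselves (controller election, cluster metadata) in classic Kafka deployments. The broker address and topic name below are placeholders.

# Sketch: a minimal Kafka producer/consumer pair using kafka-python.
# Broker address and topic name are placeholders.
from kafka import KafkaProducer, KafkaConsumer

producer = KafkaProducer(bootstrap_servers="localhost:9092")
producer.send("events", b"user 42 logged in")  # publish to the 'events' topic
producer.flush()

consumer = KafkaConsumer("events", bootstrap_servers="localhost:9092",
                         auto_offset_reset="earliest",
                         consumer_timeout_ms=10000)
for msg in consumer:
    # Clients talk only to brokers; ZooKeeper works behind the scenes.
    print(msg.value)
    break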
I'm using google-bigquery on the Chicago crime dataset. I want to find the most frequent crime type from the primary_type column for each distinct block. To do so, I came up with the following standard SQL. Data: since the Chicago crime data is rather big, there is an official website where you can preview the dataset: crime data on Google Cloud. My current standard SQL:
SELECT primary_type, block, COUNT(*) AS count
FROM `bigquery-public-data.chicago_crime.crime`
HAVING COUNT(*) = (
    SELECT MAX(count)
    FROM (
        SELECT primary_type, COUNT(*) AS count
        FROM `bigquery-public-data.chicago_crime.crime`
        GROUP BY primary_type, block
    ) `bigquery-public-data.chicago_crime.crime`
)
The problem with my query above is that it currently has an error, and to me the query is quite inefficient even if I fix the error. How can I fix and optimize it? How to work with regex in standard SQL: to count the most frequent type for each block, including both North and South, I have to deal with regex; for example, for 033XX S WOOD ST, I should...
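For what it's worth, a hedged sketch of one standard way to express "top crime type per block" without the self-join, using a ROW_NUMBER() window function and the google-cloud-bigquery Python client. It assumes credentials are already configured and is a sketch, not a verified fix.

# Sketch: most frequent primary_type per block via ROW_NUMBER().
# Assumes google-cloud-bigquery is installed and credentials are set up.
from google.cloud import bigquery

sql = """
SELECT block, primary_type, cnt
FROM (
  SELECT block, primary_type, COUNT(*) AS cnt,
         ROW_NUMBER() OVER (PARTITION BY block
                            ORDER BY COUNT(*) DESC) AS rn
  FROM `bigquery-public-data.chicago_crime.crime`
  GROUP BY block, primary_type
)
WHERE rn = 1
"""

client = bigquery.Client()
for row in client.query(sql).result():
    print(row.block, row.primary_type, row.cnt)

The table is scanned once and ranked per block, avoiding the second full aggregation in the original query.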
I have a question about how to filter relevant records from a large data set of financial transactions. We use an Oracle 11g database, and one of the requirements is to produce various end-of-day reports with all sorts of criteria.
The relevant tables look roughly like this:
trade_metadata 18m rows, 10 GB
trade_economics 18m rows, 15 GB
business_event 18m rows, 11 GB
trade_business_event_link 18m rows, 3 GB
One of our reports is now taking ages to run (> 5 hours). The underlying proc has been optimized time and again, but new criteria keep getting added, so we start struggling again. The proc is pretty standard: join all the tables and apply a host of WHERE clauses (20 at the last count).
I was wondering if I have a problem large enough to consider big data solutions, to get rid of this optimize-the-query game every few months. In any case, the volumes are only going up. I have read up a bit about Hadoop + HBase, Cassandra, Apache Pig, etc., but being very new to this...
I can't wrap my head around the basic theoretical concept of 'Operational and Analytical Big Data'.
As I understand it:
Operational Big Data: the branch where we perform read/write operations on big data using specially designed databases (NoSQL). Somewhat similar to ETL in an RDBMS.
Analytical Big Data: the branch where we analyse data in retrospect and draw predictions using techniques like MPP and MapReduce. Somewhat similar to reporting in an RDBMS.
(Please feel free to correct wherever I'm wrong, it's just my understanding.)
So, as I understand it, Hadoop is used for Analytical Big Data, where we just process data for analysis but don't tamper with the original data, and hence it is not an ideal choice for ETL. But recently I came across this article, which advocates using Hadoop for ETL: https://www.datanami.com/2014/09/01/five-steps-to-running-etl-on-hadoop-for-web-companies/
I need to delete about 2 million rows from my PG database. I have a list of IDs that I need to delete. However, any way I try to do this is taking days.
I tried putting them in a table and doing it in batches of 100. Four days later, this is still running, with only 297,268 rows deleted. (I select 100 IDs from an ID table, delete rows WHERE id IN that list, then delete those 100 from the IDs table.)
I tried:
DELETE FROM tbl WHERE id IN (SELECT * FROM ids)
That's taking forever, too. It's hard to gauge how long, since I can't see its progress until it's done, but the query was still running after 2 days.
I'm just looking for the most effective way to delete from a table when I know the specific IDs to delete, and there are millions of them.
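A hedged sketch of one pattern that is often much faster than row-at-a-time batches: a single join-based delete, here issued via psycopg2. The tbl and ids names come from the question; the connection string is a placeholder.

# Sketch: delete by joining against the ids table in one statement,
# assuming PostgreSQL and the tbl/ids names from the question.
import psycopg2

conn = psycopg2.connect("dbname=mydb user=me")  # placeholder DSN
with conn, conn.cursor() as cur:
    # A join-based delete lets the planner hash the id list once
    # instead of re-scanning it for every batch.
    cur.execute("DELETE FROM tbl USING ids WHERE tbl.id = ids.id")

If this is still slow, the usual culprits are per-row trigger or foreign-key checks on the target table rather than the delete itself.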
Once the data reaches millions of rows, computation becomes expensive. I want to compute in real time, so I don't want to use MapReduce.
How can I quickly count the number of rows?
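If avoiding MapReduce is the goal, one common in-memory alternative is Spark. A minimal sketch of a distributed row count with PySpark, assuming the data sits at a hypothetical Parquet path:

# Sketch: counting rows without MapReduce, using Spark's in-memory engine.
# The Parquet path is a placeholder; any Spark-readable source works.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("row-count").getOrCreate()
df = spark.read.parquet("hdfs:///data/events")  # hypothetical path
print(df.count())  # distributed count, executed in memory
spark.stop()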
I am a relatively new user to Hadoop (using version 2.4.1). I installed Hadoop on my first node without a hitch, but I can't seem to get the ResourceManager to start on my second node.

I cleared up some "shared library" problems by adding this to yarn-env.sh and hadoop-env.sh:

export HADOOP_HOME="/usr/local/hadoop"
export HADOOP_OPTS="-Djava.library.path=$HADOOP_HOME/lib"

I also added this to hadoop-env.sh, based on the advice of this post at Hortonworks (http://hortonworks.com/community/forums/topic/hdfs-tmp-dir-issue/):

export HADOOP_COMMON_LIB_NATIVE_DIR=${HADOOP_PREFIX}/lib/native

That cleared up all of my error messages; when I run /sbin/start-yarn.sh I get this:

starting yarn daemons
starting resourcemanager, logging to /usr/local/hadoop/logs/yarn-hduser-resourcemanager-HdNode.out
localhost: starting nodemanager, logging to /usr/local/hadoop/logs/yarn-hduser-nodemanager-HdNode.out

The only problem is, jps says that the ResourceManager isn't running. What's going on here?
Can anybody explain Apache Flume to me in plain language? I'd appreciate an explanation with a practical example instead of abstract theoretical definitions, so I can understand it better.
What is it used for? At which stage of a big data analysis is it used?
And what are the prerequisites for learning it?
Please explain it as you would for a non-technical person.
I'm curious if anyone can point to some successful extract, transform, load (ETL) automation libraries, papers, or use cases for somewhat inhomogeneous data.
I would be interested to see any existing libraries dealing with scalable ETL solutions. Ideally these would be capable of ingesting 1-5 petabytes of data containing 50 billion records from 100 inhomogeneous data sets in tens or hundreds of hours, running on 4196 cores (256 i2.8xlarge AWS machines). I really do mean ideally, as I would be interested to hear about a system with 10% of this functionality to help reduce our team's ETL load.
Otherwise, I would be interested to see any books or review articles on the subject or high quality research papers. I have done a literature review and have only found lower quality conference proceedings with dubious claims.
I've seen a few commercial products advertised, but again, these make dubious claims without much evidence of their efficacy.
The datasets are rectangular and can take the form of fixed...
I have a multinode Hadoop cluster set up with two nodes (one master node and one slave node), each with 8 GB RAM.
I have also configured Hive on the master node. Everything is up and working.
NodeManager and DataNode are running on the slave node.
ResourceManager, NameNode, and SecondaryNameNode are running on the master node.
I am able to access the Hive terminal as well, but I am not able to drop a database with the drop database databaseName; command. It shows no error, but it has been stuck for more than an hour. Three tables have size 10000 × 20. I thought these might be causing the speed issues, so I wanted to delete the database, but since the drop database command doesn't work, is there any way to do it directly by deleting files?
I have tried to access the directory given by hive.metastore.warehouse.dir to delete the database directly, but this directory is completely empty.
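One thing worth ruling out, as a hedged sketch only: if the database still contains tables, Hive requires CASCADE to drop it. The statement below can also be typed directly at the Hive prompt; here it is issued via PyHive, with the host name as a placeholder and databaseName taken from the question (this assumes HiveServer2 is running).

# Sketch: dropping a non-empty Hive database with CASCADE via PyHive.
# Host and port are placeholders; databaseName is from the question.
from pyhive import hive

conn = hive.connect(host="master-node", port=10000)
cur = conn.cursor()
# CASCADE drops the database together with all of its tables.
cur.execute("DROP DATABASE IF EXISTS databaseName CASCADE")
cur.close()
conn.close()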
Similar slow behavior can be observed with other Hive commands as well. I am just able to run one...
Could someone please let me know what is meant by 'Big Data implementation over Cloud'?
I have been using Amazon S3 to store data and querying it with Hive, which I read is one such cloud implementation. I would like to know what exactly this means and all the possible ways to implement it.
Thanks