I am using https://github.com/databricks/spark-csv and I am trying to write a single CSV file, but I am not able to: it creates a folder instead.
I need a Scala function that takes parameters such as the path and file name and writes that CSV file.
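Not part of the question, but a common workaround is to coalesce the DataFrame to a single partition, write it to a temporary directory, and then rename the lone part file Hadoop produces. A rough sketch, assuming a Spark 1.x SQLContext with spark-csv on the classpath; writeSingleCsv, destPath, and the temporary-directory convention are all illustrative:

import org.apache.hadoop.fs.{FileSystem, Path}
import org.apache.spark.sql.DataFrame

// Sketch: write `df` as exactly one CSV file at `destPath`.
// Assumes spark-csv is on the classpath.
def writeSingleCsv(df: DataFrame, destPath: String): Unit = {
  val tmpDir = destPath + "_tmp"

  // coalesce(1) forces a single output partition, hence a single part file
  df.coalesce(1)
    .write
    .format("com.databricks.spark.csv")
    .option("header", "true")
    .save(tmpDir)

  val fs = FileSystem.get(df.sqlContext.sparkContext.hadoopConfiguration)
  // pick up the lone part-* file and move it to the requested file name
  val partFile = fs.globStatus(new Path(tmpDir + "/part-*"))(0).getPath
  fs.rename(partFile, new Path(destPath))
  fs.delete(new Path(tmpDir), true)
}

Note that coalesce(1) funnels all the data through one task, so this only makes sense when the result comfortably fits on a single executor.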
True ... it has been discussed quite a lot.
However, there is a lot of ambiguity, and some of the answers provided ... including duplicating jar references in the jars/executor/driver configuration or options.
The ambiguous and/or omitted details
The following ambiguous, unclear, and/or omitted details should be clarified for each option:
- How the ClassPath is affected:
  - Driver
  - Executor (for running tasks)
  - Both
  - Not at all
- Separation character: comma, colon, semicolon
- Whether provided files are automatically distributed:
  - for the tasks (to each executor)
  - for the remote driver (if run in cluster mode)
- Type of URI accepted: local file, HDFS, HTTP, etc.
- If copied into a common location, where that location is (HDFS, local?)
The options it affects:
- --jars
- SparkContext.addJar(...) method
- SparkContext.addFile(...) method
- --conf spark.driver.extraClassPath=... or --driver-class-path ...
- --conf spark.driver.extraLibraryPath=..., or --driver-library-path ...
- --conf spark.executor.extraClassPath=...
- --conf spark.executor.extraLibraryPath=...
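For orientation only (not part of the original question), here is a sketch of the programmatic counterparts of a few of these options. The jar and file paths are placeholders; note that spark.driver.extraClassPath usually has to be set before the driver JVM starts (via spark-submit or spark-defaults.conf), so only the executor-side key is shown being set in code.

import org.apache.spark.{SparkConf, SparkContext}

// Sketch only: the paths below are illustrative placeholders.
val conf = new SparkConf()
  .setAppName("classpath-demo")
  // prepended to each executor's classpath (same key as --conf spark.executor.extraClassPath=...)
  .set("spark.executor.extraClassPath", "/opt/libs/my-lib.jar")

val sc = new SparkContext(conf)

// Ships the jar to executors and puts it on their classpath (similar to --jars)
sc.addJar("/opt/libs/my-lib.jar")

// Distributes an arbitrary file to every node; it does NOT modify any classpath
sc.addFile("hdfs:///data/lookup-table.csv")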
I would like to read a CSV in Spark, convert it to a DataFrame, and store it in HDFS with df.registerTempTable("table_name"). I have tried:
scala> val df = sqlContext.load("hdfs:///csv/file/dir/file.csv")
The error I got:
java.lang.RuntimeException: hdfs:///csv/file/dir/file.csv is not a Parquet file. expected magic number at tail but found
at parquet.hadoop.ParquetFileReader.readFooter(ParquetFileReader.java:418)
at org.apache.spark.sql.parquet.ParquetRelation2$MetadataCache$$anonfun$refresh$6.apply(newParquet.scala:277)
at org.apache.spark.sql.parquet.ParquetRelation2$MetadataCache$$anonfun$refresh$6.apply(newParquet.scala:276)
at scala.collection.parallel.mutable.ParArray$Map.leaf(ParArray.scala:658)
at scala.collection.parallel.Task$$anonfun$tryLeaf$1.apply$mcV$sp(Tasks.scala:54)
at scala.collection.parallel.Task$$anonfun$tryLeaf$1.apply(Tasks.scala:53)
at scala.collection.parallel.Task$$anonfun$tryLeaf$1.apply(Tasks.scala:53)
...
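For context (not part of the question): sqlContext.load(path) with no data source specified falls back to the default source, which is Parquet, hence the "magic number" error. A hedged sketch of reading the file through the spark-csv package instead, assuming Spark 1.4+ and the package on the classpath; the header/inferSchema options are assumptions:

// Sketch: read the CSV explicitly through spark-csv so the Parquet default is not used.
val df = sqlContext.read
  .format("com.databricks.spark.csv")
  .option("header", "true")       // assumption: the file has a header row
  .option("inferSchema", "true")  // assumption: let spark-csv guess column types
  .load("hdfs:///csv/file/dir/file.csv")

df.registerTempTable("table_name")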
Assume df1 and df2 are two DataFrames in Apache Spark, computed using two different mechanisms, e.g., Spark SQL vs. the Scala/Java/Python API.

Is there an idiomatic way to determine whether the two data frames are equivalent (equal, isomorphic), where equivalence is determined by the data (column names and column values for each row) being identical save for the ordering of rows and columns?

The motivation for the question is that there are often many ways to compute some big data result, each with its own trade-offs. As one explores these trade-offs, it is important to maintain correctness, and hence the need to check for equivalence/equality on a meaningful test data set.
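One possible shape of such a check (a sketch, not an idiom taken from the question): compare the sorted column names, then compare row multisets by pairing each distinct row with its occurrence count, since except has set semantics. sameData and row_count are made-up names:

import org.apache.spark.sql.DataFrame
import org.apache.spark.sql.functions.{count, lit}

// Sketch: order-insensitive equality on column names and row contents.
// Duplicate rows are handled by comparing (row, occurrence count) pairs.
def sameData(df1: DataFrame, df2: DataFrame): Boolean = {
  val cols1 = df1.columns.sorted
  val cols2 = df2.columns.sorted

  // project to a common column order, then count occurrences of each row
  def normalized(df: DataFrame, cols: Array[String]) =
    df.select(cols.head, cols.tail: _*)
      .groupBy(cols.head, cols.tail: _*)
      .agg(count(lit(1)).as("row_count"))

  cols1.sameElements(cols2) && {
    val n1 = normalized(df1, cols1)
    val n2 = normalized(df2, cols1)
    // equal iff neither side has (row, count) pairs the other lacks
    n1.except(n2).count() == 0 && n2.except(n1).count() == 0
  }
}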
I installed Spark using the AWS EC2 guide, and I can launch the program fine using the bin/pyspark script to get to the Spark prompt, and can also complete the Quick Start guide successfully.

However, I cannot for the life of me figure out how to stop all of the verbose INFO logging after each command.

I have tried nearly every possible scenario in the code below (commenting out, setting to OFF) in my log4j.properties file in the conf folder where I launch the application from, as well as on each node, and nothing does anything. I still get the INFO logging statements printing after executing each statement.

I am very confused about how this is supposed to work.
# Set everything to be logged to the console
log4j.rootCategory=INFO, console
log4j.appender.console=org.apache.log4j.ConsoleAppender
log4j.appender.console.target=System.err
log4j.appender.console.layout=org.apache.log4j.PatternLayout
...
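A hedged aside, not part of the question: two usual remedies are editing conf/log4j.properties so the root category reads log4j.rootCategory=WARN, console, or (on Spark 1.4 and later) lowering the level programmatically from the running shell, as sketched below:

// Sketch: lower Spark's console log level from the running context.
// Assumes Spark 1.4+ where SparkContext.setLogLevel exists; `sc` is the
// context the shell already created (pyspark exposes the same call).
sc.setLogLevel("WARN")   // accepted levels include ALL, DEBUG, INFO, WARN, ERROR, OFF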
Hi :) I have this code in Spark/Scala that partitions big data (more than 50 GB) by category into CSV files.
df.write
  .mode(SaveMode.Overwrite)
  .partitionBy("CATEGORY_ID")
  .format("csv")
  .option("header", "true")
  .option("sep", "|")
  .option("quoteAll", true)
  .csv("output/inventory_backup")
The DataFrame df is the result of aggregations on data imported from a CSV file:
df.groupBy("PRODUCT_ID","LOC_ID","DAY_ID")
.agg(
functions.sum("ASSORTED_STOCK_UNIT").as("ASSORTED_STOCK_UNIT_sum"),
functions.sum("SOLID_STOCK_UNIT").as("SOLID_STOCK_UNIT_sum")
)
I would like to tune the performance of this program. Through the Spark UI, I was able to see that the performance bottleneck occurs at the stage that exports the data into CSV files.
More details: I'm using an instance with 16 cores and 120 GB of RAM.
Do you guys have any ideas on how to tune the performance? (It currently takes more than 17 minutes.) Any help will be much appreciated. Thank you.
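Not part of the question, but one commonly tried adjustment is sketched below: repartition by the same column used in partitionBy before writing, so that each CATEGORY_ID directory is written by far fewer tasks and small files. Whether it helps depends on the number of categories and their skew; this is an assumption to test, not a guaranteed fix.

import org.apache.spark.sql.SaveMode
import org.apache.spark.sql.functions.col

// Sketch: cluster the rows by the partition column first, so each
// CATEGORY_ID directory is produced by far fewer tasks / files.
df.repartition(col("CATEGORY_ID"))
  .write
  .mode(SaveMode.Overwrite)
  .partitionBy("CATEGORY_ID")
  .option("header", "true")
  .option("sep", "|")
  .option("quoteAll", true)
  .csv("output/inventory_backup")   // .csv(...) already implies the csv format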
I'm getting strange behavior when calling a function outside of a closure:

- when the function is in an object, everything works
- when the function is in a class, I get:
Task not serializable: java.io.NotSerializableException: testing
The problem is that I need my code in a class and not an object. Any idea why this is happening? Is a Scala object serializable by default?
This is a working code example:
object working extends App {
  val list = List(1, 2, 3)
  val rddList = Spark.ctx.parallelize(list)

  // calling the function outside the closure
  val after = rddList.map(someFunc(_))

  def someFunc(a: Int) = a + 1

  after.collect().map(println(_))
}
This is the non-working example:

object NOTworking extends App {
  new testing().doIT
}

// adding extends Serializable won't help
class testing {
  val list = List(1, 2, 3)
  val rddList = Spark.ctx.parallelize(list)

  def doIT = {
    // again calling the function someFunc
    val after = rddList.map(someFunc(_))
    // this will crash (Spark is lazy, so the failure surfaces at the action)
    after.collect().map(println(_))
  }

  def someFunc(a: Int) = a + 1
}
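A hedged sketch of one common fix (an assumption, not quoted from the question): keep the mapped function out of the class instance, e.g., in a standalone object, so the task closure no longer needs to serialize the testing instance. The Helpers name is made up; Spark.ctx is kept from the question.

// Sketch: move someFunc into an object so the task closure captures
// only the function, not the (non-serializable) `testing` instance.
object Helpers extends Serializable {
  def someFunc(a: Int): Int = a + 1
}

class testing {
  val list = List(1, 2, 3)
  val rddList = Spark.ctx.parallelize(list)

  def doIT = {
    // referring to Helpers.someFunc keeps `this` out of the serialized closure
    val after = rddList.map(Helpers.someFunc)
    after.collect().foreach(println)
  }
}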
I found many options recently, and I am interested in comparing them, primarily by maturity and stability.

Crunch - https://github.com/cloudera/crunch
Scrunch - ...
How can I convert an RDD (org.apache.spark.rdd.RDD) to a DataFrame (org.apache.spark.sql.DataFrame)? I converted a DataFrame to an RDD using .rdd. After processing it, I want it back in a DataFrame. How can I do this?
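A minimal sketch of one way back, assuming a SQLContext is in scope and that the processing keeps the original column layout; originalDf, processed, and the filter step are placeholders:

import org.apache.spark.sql.Row
import org.apache.spark.sql.types.StructType

// Sketch: original DataFrame -> RDD[Row] -> processed RDD[Row] -> DataFrame again.
// Assumes the processing preserves the column layout described by `schema`.
val schema: StructType = originalDf.schema
val rdd = originalDf.rdd                       // RDD[Row]
val processed = rdd.filter(_ != null)          // placeholder for the real processing
val backToDf = sqlContext.createDataFrame(processed, schema)

// Alternative, when the RDD holds tuples or case classes:
// import sqlContext.implicits._
// val df2 = someRdd.toDF("colA", "colB")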